BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] (458 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 954 bits (2465), Expect = 0.0, Method: Compositional matrix adjust. Identities = 458/458 (100%), Positives = 458/458 (100%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA Sbjct: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV Sbjct: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ Sbjct: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL Sbjct: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT Sbjct: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR Sbjct: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS Sbjct: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK Sbjct: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 324 bits (831), Expect = 1e-86, Method: Compositional matrix adjust. Identities = 189/477 (39%), Positives = 294/477 (61%), Gaps = 33/477 (6%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 MVF+ F K+L K G FF+I+ALL+PV+ V G+L+D+VRW YY ++L QA TA Sbjct: 1 MVFNKSLFFNFKRLKKCYNGSFFVISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ASV L+ S+E+ S S + I++YL+ N + +L NF + + + I++ T V Sbjct: 61 ALSASVQLLNSVEDKSKEKALSSVLGENNIKQYLLNNLKISLYNNFGEMDSQRIIQHTKV 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM-GIKSWLIQTKAEAETVS-RSYHKEHGVS 178 + RK + + + S Y+L LNP SLF ++ IKSW I T EAE S ++YHKE GVS Sbjct: 121 NIYNRKGTHIINVYSHYNLPLNPFSLFFMNLINIKSWPITTVGEAEVTSKKNYHKEEGVS 180 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLN--CFGQPADRTVKSYSSQNGKVGI-RDEK------- 228 +QW+ID S SM G ++ CFG + +KS + K+GI R+E Sbjct: 181 VQWLIDDSGSM--------GSIIDRACFGS---KQLKSQYNVGSKIGIVRNENADTSDSF 229 Query: 229 --LSPYMVSCNKSLYYMLYPGPL--DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + +VSC++SLYY+L + D L E++ + S ++ I+K++LVRDALA+ I+ Sbjct: 230 YPIVGELVSCDRSLYYVLNDKKILEDDDLEEKNLDNHS--QYYIRKRYLVRDALATFIKR 287 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++KIDN+ D +RM +FN+R+ +WG+ K + V + ++E +T I+ +Q Sbjct: 288 VRKIDNLKDKLRMSFMYFNERIDHYFPMTWGI-KEFKQEVSSHYKRKHENTATDIHPILQ 346 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIAICNKAKSQGIRIM 401 AY+ + S NED+ H+ KN++E KK+IVLLTDG E + + IC+ AK +GI+I Sbjct: 347 EAYNKLHSKNEDDEHKKKNSVEVKKFIVLLTDGAQNEGVHSVDSVLKICDAAKEEGIKIF 406 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 TI++SV+ ++++KA FLS CASP+ FFEA +LN IF++ IG+ IFER+++I + Sbjct: 407 TISYSVDSSERKKANDFLSRCASPDKFFEAYDADKLNMIFKEHIGDAIFERLVKIRR 463 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 118/458 (25%), Positives = 194/458 (42%), Gaps = 86/458 (18%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G+F I+TA++ PV+LGV G+ + V + L++AA +A + + L + S Sbjct: 19 GNFGIMTAIMAPVLLGVAGVAIQVGDMMLSKQQLQEAADSAALATATALANGTIQTSQ-- 76 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM----NPRKSAYQVVLSS 135 E +NF N+ V D T+V + + + ++YQV +S Sbjct: 77 -----------AEAFAQNFVAGQMANYVQSGV-DFKSGTSVNVQTSTSGKSTSYQVTVSP 124 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 YDL +NPL ++++G K+ + T TV + +S+ +D S SM + Sbjct: 125 SYDLTVNPL---MQAVGFKTQHLST--SGTTVGGHSQTQGSISMFLALDKSGSMGE---- 175 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 A TV + D+ Y CN L+Y S + Sbjct: 176 ------------ATATVNA-----------DDPTESYTYDCN--LHY--------NSKNN 202 Query: 256 EHFVD--SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + D + S + K ++ A ++ + D + VR GA ++ + + + Sbjct: 203 KWVYDKCTGSRTNYYTKIEALKIAAGNLFGQLNSADPNAEYVRTGAVSYDINQYTPSNLA 262 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRMKNNLEA 367 WG + + A+ N G T + AM TAY ++ + N ED H++K Sbjct: 263 WGTAGVTSYVN---ALQAN--GGTNSSGAMSTAYSSLTAKNAAGNDAEDSAHKLKTGQTP 317 Query: 368 KKYIVLLTDGENTQDNEEG-------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 KKYIV +TDG+N D+ G A C+ AKS+GI I TIAF Q Y Sbjct: 318 KKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPAGGQTLLHY--- 374 Query: 421 NCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 CAS +S +F+A +L F+ IG + ++ R+T Sbjct: 375 -CASDDSHYFQAEKMEDLLAAFK-AIGAKASAQMTRLT 410 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 116/456 (25%), Positives = 182/456 (39%), Gaps = 83/456 (18%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G+F I+TA+L PV+LG GM + V + L++AA +A + + L + S Sbjct: 19 GNFGIMTAILAPVLLGAAGMAIQVGDMLISKQQLQEAADSAALATATALANGTIQTSQ-- 76 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA----YQVVLSS 135 E RNF N+ V DI T V + S YQV +S Sbjct: 77 -----------AEAFARNFVAGQMANYLQSGV-DIKSATGVTVQTNTSGNSTSYQVTVSP 124 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 YDL +NPL ++++G + + T T+ + +S+ +D S SM D Sbjct: 125 SYDLTVNPL---MQAVGFTTQHLST--SGTTIGGHSQTQGSISMYLALDKSGSM---GED 176 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + TV ++ Y CN G + Sbjct: 177 TA-------------TVN-----------EEDPTESYTYDCN---------GHYNKKGKW 203 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + S + K ++ A ++ + D VR GA ++ + + +WG Sbjct: 204 IYDTCTGSRANYYTKIEALKMAAGNLFGQLSSADPNAQYVRTGAVSYDIVQYTPSALAWG 263 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRMKNNLEAKK 369 V T+ G T + AM TAY ++ + N ED H++K KK Sbjct: 264 TSG-----VSTYVNALQAGGGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKTGQTPKK 318 Query: 370 YIVLLTDGENTQDNEEG-------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 YIV +TDG+N D+ G A C+ AKS+GI I TIAF E + L C Sbjct: 319 YIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMA----PEGGQALLHYC 374 Query: 423 ASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 AS +S +F+A +L F+ IG + ++ R+T Sbjct: 375 ASDDSHYFQAEKMEDLLAAFK-AIGAKASSQLTRLT 409 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 117/462 (25%), Positives = 191/462 (41%), Gaps = 90/462 (19%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G+F I+TA+L PV+LG G+ + V + L++AA +A + + L + S Sbjct: 19 GNFGIMTAILAPVLLGAAGLAIQVGDMLLSKQQLQEAADSAALATATALGNGTIQTSQ-- 76 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM---NPRKSA-YQVVLSS 135 E RNF N+ V DI TAV + N KSA YQV ++ Sbjct: 77 -----------AEAFARNFVAGQMANYLQNGV-DIKNATAVNVQTSNSGKSASYQVTVTP 124 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKA---EAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 YDL +NPL ++++G + + T + + + + + VS+ +D S SM Sbjct: 125 SYDLTVNPL---MQAVGFSTQHLSTSSTTVSGPSQTPGSNSQGSVSMFLALDKSGSM--- 178 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 G P + TV +D+ + CN P Sbjct: 179 ------------GDPTE-TVN-----------KDQPTETFTYDCN-------------PH 201 Query: 253 LSEE----HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L+++ + + S + K ++ A ++ + D VR GA ++ + Sbjct: 202 LNKKGKWVYDTCTGSRTNYYTKIEALKMAAGNLFGQLTSADPDAQYVRTGAVSYDIDQYT 261 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRMK 362 + +WG V ++ G T + AM TAY ++ + N ED H++K Sbjct: 262 PSTLAWGTSG-----VSSYVNALQAGGGTNSSGAMGTAYSSLTAKNAAGNDAEDAAHKLK 316 Query: 363 NNLEAKKYIVLLTDGENTQDNEEG-------IAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 KKYIV +TDG+N D+ G A C+ AKS+GI I TIAF Q Sbjct: 317 TGQIPKKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPPGGQALL 376 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 +Y S+ A +F+A +L F+ IG + ++ R+T Sbjct: 377 QYCASDAA---HYFQAEQMEDLLAAFK-AIGAKASAQLTRLT 414 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 111/467 (23%), Positives = 200/467 (42%), Gaps = 81/467 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI-------ITAS 65 ++++ G+F ++TAL+ P++L VGG+ VDV ++ L+ A A ++ + Sbjct: 13 RMLRDRGGNFGMMTALVAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVSDA 72 Query: 66 VPLIQSLEEVSSR-------AKNSFTFPKQ--KIEEYLIRNFENNLKKNFTDREVRDIVR 116 P I+ ++++ + A + P + I N ++ + +V Sbjct: 73 RPDIEEAKDLARKFLKTQAAAATASDLPDEGPSIGARGGGNADDEVPATPRWEDVNATEI 132 Query: 117 D-TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 D TA + ++QV +++++ L N ++ R +G +S I+T++ AE+ + S ++ Sbjct: 133 DITATPNGAKGKSFQVTVANKHLLQFNAMT---RLLGPESIEIETRSTAESATES---KN 186 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 +S+ V+D S SM K+ + GK P Sbjct: 187 ALSMYLVLDRSGSM---------------------AWKTNTINTGKAKC------PNYTE 219 Query: 236 CNKSLYYML-YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 N S Y L GP + K ++ A+ ++ + D + Sbjct: 220 ANWSKYPDLKATGPC-----------------YVTKIDALKTAVGDLLAQLVTADPESAY 262 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR GA +N + S SWG + AI G TA +A +TA+ + ++ Sbjct: 263 VRTGAISYNSAQDAASSLSWGTRGAAGYVDALVAI-----GGTASGNAFKTAFQKVTNAA 317 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA--ICNKAKSQGIRIMTIAFSVNKTQQ 412 ED H KN KYIV +TDGEN N++ + C+ AK+ ++I ++AF Sbjct: 318 EDSEHGAKNGQVPTKYIVFMTDGENNHANDDTVTRQWCDTAKASKVQIYSVAFMA----P 373 Query: 413 EKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ + L +CAS +S +FEA +L F+ IG V R+TK Sbjct: 374 DRGQKLLKSCASSSSHYFEAEEASDLVAAFK-AIGERAAASVSRLTK 419 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 60/178 (33%), Positives = 92/178 (51%), Gaps = 16/178 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND-AM 343 IK I +VN+ VR G F+ +++ +WGV + I + GST + + Sbjct: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI------FGSTTKSTPGL 259 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ---DNEEGIAICNKAKSQGIRI 400 + AY+ I + E H K + + KKYI+ LTDGEN+ DN+E + CN+AK +G + Sbjct: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 I Q E A FL NCASP+ F+ ++ +L+ F RIG E+ ++ I K Sbjct: 320 YAIG-----VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL-RIGKEMVKQRILYNK 371 Score = 40.8 bits (94), Expect = 0.42, Method: Compositional matrix adjust. Identities = 41/179 (22%), Positives = 86/179 (48%), Gaps = 9/179 (5%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII-TASVPLIQSLEEV 75 +C G I+TA+L+PV+ V G++++ + + L +++ TA+ L Q Sbjct: 12 NCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNN 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE--MNPRKSAYQVVL 133 + KN F++ + I+ +F N L++N +++ +I R T++ ++ + Y + Sbjct: 72 GKKQKNDFSY--RIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 SRY++ + + + L+ T +V S + G+ + V+D S SM D+ Sbjct: 130 VSRYEMPFIFCTFPWCANSSHAPLLIT----SSVKISSKSDIGLDMMMVLDVSLSMNDH 184 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 114/461 (24%), Positives = 191/461 (41%), Gaps = 92/461 (19%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV----PLIQSLEEV 75 G+F I+TA+L PV+LG GM + V + L++AA +A + + IQ+ E Sbjct: 19 GNFGIMTAILAPVLLGAAGMAIQVGDMLLSKQQLQEAADSAALATATALANGTIQTTEA- 77 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM----NPRKSAYQV 131 E RNF N+ + DI T+V + + + ++YQV Sbjct: 78 ----------------EAFARNFVAGQMANYL-QSGTDIKSTTSVNVQTTTSGKSTSYQV 120 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML- 190 +S Y L +NPL ++++G + + T T+ + +S+ +D S SM Sbjct: 121 TVSPAYVLTVNPL---MQAVGFTTQHLST--SGTTIGGHSQTQGSISMFLALDKSGSMGE 175 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 D +E P + + Y+++N K + D+ + +++ YY Sbjct: 176 DTATVNEESPTESYTYDCNL---HYNTKNNKW-VYDK------CTGSRTNYY-------- 217 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 K ++ A ++ + D VR GA ++ + Sbjct: 218 ------------------TKIEALKMAAGNLFSQLNSADPNAQYVRTGAVSYDINQYAPS 259 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRMKNN 364 S +WG+ T V ++ G T + AM TAY ++ + N E+ H+ K Sbjct: 260 SLAWGI-----TGVSSYVNALQANGGTNSSGAMNTAYTSLTAKNAAGNDVENSAHQQKTG 314 Query: 365 LEAKKYIVLLTDGENTQDNEEGIA-------ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 KKYIV +TDG+N D G + C+ AKS+GI I TIAF Q Y Sbjct: 315 QVPKKYIVFMTDGDNNNDPSGGRSYDTATKKTCDDAKSKGIEIYTIAFMAPAGGQALLHY 374 Query: 418 FLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 CAS +S +F+A +L F+ IG + ++ R+T Sbjct: 375 ----CASDDSHYFQAEKMEDLLAAFQ-AIGAKASAQLTRLT 410 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 115/454 (25%), Positives = 181/454 (39%), Gaps = 88/454 (19%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 TA+L PV+LG GM V V + L++AA +A + + L + S Sbjct: 1 TAILAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGKIQTS--------- 51 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA----YQVVLSSRYDLLL 141 + E Y RNF N+ V DI T V + S YQV +S YDL + Sbjct: 52 ---EAEAY-ARNFVAGQMANYLQSGV-DIKSATGVSVQTNTSGNSTSYQVTVSPSYDLTV 106 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG---VSIQWVIDFSRSML-DYQRDSE 197 NPL ++++G + + T T+ + + G +S+ +D S SM D +E Sbjct: 107 NPL---MQAVGFTTQHLST--SGTTIGGGHSQTQGQGSISMYLALDKSGSMGEDTATVNE 161 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 P + P + P+ K ++ Sbjct: 162 EDPTESYTYPCN---------------------PHYNRKGKEVWDTC------------- 187 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + S + K ++ A ++ + D VR GA ++ + S +WG Sbjct: 188 ---TGSRANYYTKIEALKMAAGNLFAQLSGADPNAQYVRTGAVSYDIVQYAPSSLAWGAI 244 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRMKNNLEAKKYI 371 V ++ G T + AM TAY ++ + N ED H++K+ +KYI Sbjct: 245 G-----VSSYVNALQAGGGTNSSGAMSTAYLSLTAKNAAGNDAEDSAHKLKSGQIPQKYI 299 Query: 372 VLLTDGENTQDNEEG-------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 V +TDG+N D+ G A C+ AKS+GI I TIAF Q +Y CAS Sbjct: 300 VFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPPGGQALLQY----CAS 355 Query: 425 PNS-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 S +F+A +L F+ IG + +V R+T Sbjct: 356 DASHYFQAEKMEDLFAAFK-AIGAKASTQVTRLT 388 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 96/380 (25%), Positives = 150/380 (39%), Gaps = 70/380 (18%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA----YQVVLSSRYDLLLNPLSLFLRSM 151 RNF N+ V DI T V + S YQV +S YDL +NPL ++++ Sbjct: 1 RNFVAGQMANYLQSGV-DIKSATGVTVQTNTSGNSTSYQVTVSPSYDLTVNPL---MQAV 56 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 G + + T T+ + +S+ +D S SM D+ T Sbjct: 57 GFTTQHLST--SGTTIGGHSQTQGSISMYLALDKSGSM---GEDTA-------------T 98 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 V ++ Y CN G + + + S + K Sbjct: 99 VN-----------EEDPTESYTYDCN---------GHYNKKGKWIYDTCTGSRANYYTKI 138 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 ++ A ++ + D VR GA ++ + + +WG V T+ Sbjct: 139 EALKMAAGNLFGQLSSADPNAQYVRTGAVSYDIVQYTPSALAWGTSG-----VSTYVNAL 193 Query: 332 NEMGSTAINDAMQTAYDTIISSN------EDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T + AM TAY ++ + N ED H++K KKYIV +TDG+N D+ Sbjct: 194 QAGGGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKTGQTPKKYIVFMTDGDNNDDSSG 253 Query: 386 G-------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHEL 437 G A C+ AKS+GI I TIAF E + L CAS +S +F+A +L Sbjct: 254 GRSYDTLTKATCDTAKSKGIEIYTIAFMA----PEGGQALLHYCASDDSHYFQAEKMEDL 309 Query: 438 NKIFRDRIGNEIFERVIRIT 457 F+ IG + ++ R+T Sbjct: 310 LAAFK-AIGAKASSQLTRLT 328 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 110/476 (23%), Positives = 204/476 (42%), Gaps = 99/476 (20%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI-------ITASV 66 +++ G+F ++TAL+ P++L VGG+ VDV ++ L+ A A ++ + Sbjct: 3 MLRDRGGNFGMMTALIAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVSDAR 62 Query: 67 PLIQSLEEVSSR-------AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 P I+ + ++ + A +S P + + + ++ D ++V Sbjct: 63 PDIEEAKAIARKFLKTQMAATSSADVPGEAVGTMAA---AGSTAPSWDDVNTSEVV---I 116 Query: 120 VEM--NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 VE + ++QV +++++ L N ++ R +G +S ++T++ A++ + S ++ + Sbjct: 117 VETPNGTKGKSFQVSVANKHLLQFNAMT---RLLGKESIELETRSTADSATES---KNAI 170 Query: 178 SIQWVIDFSRSMLDYQRDS--EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 S+ V+D S SM ++ D+ +P R + +S G+ +R SP Sbjct: 171 SMYLVLDRSGSMA-WKTDTVDTSRP---------RCINWTASNWGESNVR--ATSP---- 214 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 C +VD K ++ A+ + + K+D N+ + Sbjct: 215 C---------------------YVD---------KITTLKSAVDKLFTPLAKMDPGNEYL 244 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R GA +NDR +WG + A G T + A A + ++ E Sbjct: 245 RAGAASYNDRQDRASKLTWGTKNASAHVQGLDAT-----GGTDSSSAFAAAVEELLLDGE 299 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI------------AICNKAKSQGIRIMTI 403 +E H KN +KYIV +TDGENT N + A C AK+ GI I T+ Sbjct: 300 NEAHLAKNGQTPEKYIVFMTDGENTSYNGKTSPRDLEKADSVTKAACTTAKNNGIAIFTV 359 Query: 404 AFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 AF ++ + L CA SP+ + EA+ L F ++IG + + R+TK Sbjct: 360 AFMA----PQRGKDLLKACATSPDHYKEADDAAALVSEF-EKIGQKAAAMIARLTK 410 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 112/451 (24%), Positives = 192/451 (42%), Gaps = 67/451 (14%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +LI TG+F I+TA+ +PV+ G+ VDV + L+QA A + + L Sbjct: 13 QLIHDRTGNFGILTAIAIPVVAATAGVAVDVTNMTVSNSQLQQATDAAALATATALANGN 72 Query: 73 EEVSSRAKNSFTFPKQKIEEYLI--RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 S+ + + F ++ YL N + LK T T+ + ++Y Sbjct: 73 ATTSNAQQLATQFVTGQMSNYLSGDTNTADALKAGTT-------ANVTSATNSSGGTSYT 125 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V +++ YD+ +N +S L GIK+ + + + + S + K+ +S++ +D S SML Sbjct: 126 VAVNASYDMSVNGMSQLL---GIKTMHVSAASTSTSGSAAAAKQAALSMEIALDKSGSML 182 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN D + KS + Y N Y YP Sbjct: 183 ----------LNT--DVIDTSQKS--------------CTQYYTEGN---YLYQYPKAKS 213 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P IKK ++ A+ +++ + D + VR A ++ V S Sbjct: 214 PCY--------------IKKIAALKTAVGTLLDQLDSADPKSQYVRTAAIAWSSEVDSSS 259 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + +WG ++ N G T + M AY + +S+E K N +K Sbjct: 260 ALAWGTTTTRSNVISGL----NANGGTESSAPMALAYKNVSASSEATAQAAKGNTTFQKI 315 Query: 371 IVLLTDGEN--TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNS 427 IVL+TDGEN T + + +A C AK G+ I ++AF ++ + L NCA SP++ Sbjct: 316 IVLMTDGENNATSSDTKTLATCKAAKDAGVLIYSVAFMA----PDRGQTLLKNCASSPSN 371 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +F+A +L F+ IGN+ +++ +TK Sbjct: 372 YFDAQQMSDLIAAFK-TIGNQASKQITLLTK 401 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 108/462 (23%), Positives = 190/462 (41%), Gaps = 85/462 (18%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L++ G+F ++TA+L+PV +GV G+ +D AL+ + A + A+ + + Sbjct: 41 RLLRHSGGNFGMMTAVLLPVSIGVAGLAMDATEMVQSRSALQSSVDAAALAAASAMSNGM 100 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E + A + +F ++ + R+ + T E V+ T ++N ++Y V Sbjct: 101 SEADAIAL-AKSFLSSQLANTMARDENTSSVDQITQAEPDISVKTT--QVNSSSTSYDVE 157 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQT--KAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 L+ Y + +NPLS R +G ++ ++ KA+A T + E +S+ V+D S SM Sbjct: 158 LTGSYTITMNPLS---RVLGWETVTLKAYGKAQAATTA----SESPLSMYLVLDRSGSMN 210 Query: 191 D-----YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 D Y + +G T SY SC K Sbjct: 211 DETATTYTGTCTKTTTSGYGWNKKTTTTSY-------------------SCTK------- 244 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 ++ SL+ + A + +KK D ++ VR GA +N Sbjct: 245 -----------NYTKIESLKLAV----------ADLAAQLKKADPNSEYVRTGADSYNAS 283 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV--HRMKN 363 + + SWG ++ T+ + G T A+ AY + +SN+ E+ H + + Sbjct: 284 ADTAQAMSWGTANVV-----TYVNALSATGGTDARGALSAAYSALQTSNKTEITAHNVSS 338 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +YIV +TDGE T ++ + C K+ GI+I T+AF Sbjct: 339 VSKIGRYIVFMTDGEMTGNSSSWSSSIDSAVRSQCTSIKADGIQIYTVAF----MAPANG 394 Query: 416 RYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRI 456 + LS CAS S ++EA L F + IG + R+ Sbjct: 395 KSLLSACASDASHYYEATDAASLVAAFGE-IGKKATSTSTRL 435 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 62/210 (29%), Positives = 100/210 (47%), Gaps = 21/210 (10%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 LD S S E F DSS I K + ++ +++ +K I +VN+ V+ G F++++ Sbjct: 178 LDVSRSMESFFDSS-----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 Query: 309 DPSFSWGVHKLIRTI--VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 WGV L R I + F + N ++ AY+ I H + Sbjct: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNS------TPGLKYAYNQIFDMQGMRQHCNTEDAN 286 Query: 367 AKKYIVLLTDGEN--TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 KK IV +TDGEN T+++++ + CN+AK +G + I V ++ + FL CAS Sbjct: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-----FLRACAS 341 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 PNSF+ + H + F IG +I + I Sbjct: 342 PNSFYLVENPHSMYDAF-SHIGKDIVTKRI 370 Score = 38.9 bits (89), Expect = 1.8, Method: Compositional matrix adjust. Identities = 29/134 (21%), Positives = 60/134 (44%), Gaps = 23/134 (17%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE------ 73 G I+TA+ +P++ V GM+++V + + L +++ A+ ++ Sbjct: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 Query: 74 ----EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM--NPRKS 127 ++ R KN++ +F N L+ N ++ DIVR T++++ P+ Sbjct: 77 LKGGDILCRIKNTWNM-----------SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 Query: 128 AYQVVLSSRYDLLL 141 Y + SRY + L Sbjct: 126 GYSISAISRYKIPL 139 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 55/195 (28%), Positives = 94/195 (48%), Gaps = 13/195 (6%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 H K ++ A+ ++ + +D + VR GA +N + + WG + + + K Sbjct: 215 HYYTKIEALKLAVGTLTGELDAVDPEKEYVRTGAVSYNIEMQKAKALDWGTAHVTKYVNK 274 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN--TQDN 383 A D + G +A +TAY+ + + ED+ H K KYIV +TDG+N T + Sbjct: 275 LTATDGTDSG-----EAFKTAYNKLADAAEDKAHVDKTGQVPTKYIVFMTDGDNNYTSAD 329 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFR 442 E C+KA+ +++ TIAF + + LS CA+ P ++F A L K F+ Sbjct: 330 TETKTWCDKARDAKMQVYTIAFMA----PARGQALLSYCATAPGNYFPAGDMTALLKAFK 385 Query: 443 DRIGNEIFERVIRIT 457 + IG + +V R+T Sbjct: 386 E-IGMKASNQVTRLT 399 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 63/212 (29%), Positives = 98/212 (46%), Gaps = 26/212 (12%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS-FSWGVHKL 319 + S H K ++ A ++ + D + VR GA + D V PS +WG+ Sbjct: 68 TGSRAHYYTKIEALKIAAGNLFSQLNSADPNAEYVRTGAVSY-DLVEYTPSKLAWGI--- 123 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRMKNNLEAKKYIVL 373 T V ++ G T + A+ TAY ++ + N ED H++K KKYIV Sbjct: 124 --TAVTSYVNALESGGGTNSSGAVNTAYTSLTAKNAAGNDAEDAAHKLKTGQLPKKYIVF 181 Query: 374 LTDGENTQDNEEG-------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +TDG+N D+ G A C+ AK++GI TIAF E + L CAS + Sbjct: 182 MTDGDNNDDSRGGRSYDTLTKATCDTAKAKGIETYTIAFMA----PEGGQALLHYCASDD 237 Query: 427 S-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + +F+A +L F+ IG + +V R+T Sbjct: 238 AHYFQAEKMEDLLAAFK-AIGAKASAQVTRLT 268 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 54/193 (27%), Positives = 92/193 (47%), Gaps = 13/193 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 KK ++++ A+ +I ++ D VR+GA+ +N ++ W K K F Sbjct: 181 KKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNSKLTGSTKLRWNPGK-----TKEFV 235 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T DA AY + E+ H K+ KK+IV +TDG+N + + Sbjct: 236 DALPASGGTDSTDAFDWAYTAVTHKRENNTHDAKSGQVPKKFIVFMTDGDNNYSSADSST 295 Query: 389 --ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 +C+ AK GI + T+AF+ + + LS CAS FF+A ++ +L + F++ I Sbjct: 296 KHLCDDAKDDGIEVYTVAFAA----PNRGKQLLSYCASTEEHFFDAQNSAQLIEAFKN-I 350 Query: 446 GNEIFERVIRITK 458 G + V R+T+ Sbjct: 351 GYAASKVVSRLTE 363 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 51/167 (30%), Positives = 78/167 (46%), Gaps = 13/167 (7%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A +N + + +WG + A G TA A +TAY +I++ Sbjct: 257 VRTAAISYNSVQDTAGTLAWGTSGAAAYVNALVAT-----GGTASAGAFKTAYQKVIAAT 311 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA--ICNKAKSQGIRIMTIAFSVNKTQQ 412 E+ H KN KY+V +TDGEN N++ + C+ AK+ + I ++AF Sbjct: 312 ENTAHAAKNGQVPSKYMVFMTDGENNYANDDTVTKQWCDTAKANKVEIYSVAFMA----P 367 Query: 413 EKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 E+ + L CAS +S +FEA +L F+ IG V R+TK Sbjct: 368 ERGQALLKYCASSSSHYFEAEEVTDLVAAFK-AIGERAAAVVSRLTK 413 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 92/438 (21%), Positives = 173/438 (39%), Gaps = 95/438 (21%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + +G+F ++TA+L+PV+LGV G +++ ++ + + Sbjct: 11 RRFLADTSGNFGMMTAILLPVLLGVAGAGMELAN---------------VMQVKADMQNT 55 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENN-----LKKNFTDREVRDIVRDTAVEM---- 122 + + A + K+ + I+ N ++KN T E ++ +++ + Sbjct: 56 ADSAALAAATEARLREGKLSDEQIKEIAKNFIAAQMEKNLTAEEKIELEKNSPTRVTTTE 115 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 N R Y V + ++ + LNP+ F+ G K+ + A++ + +K +S+ Sbjct: 116 NARGKTYAVETTIKHQIQLNPMLGFI---GAKTLDLSVTGTAKS---TINKGAPISMYLA 169 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D S SM + +C ++Y+S N K SP V+ SL Sbjct: 170 LDRSGSMSFKTDTVDTTKTSC---------QNYTSDNWSKYPNLAKTSPCYVNKAASL-- 218 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID---NVN---DTVR 296 + A+ ++ ++ K D VN + VR Sbjct: 219 --------------------------------KTAVGFLVATLNKADPTYTVNGGSELVR 246 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 GA+ + S WG + + K I E G T ++ AY+ + +N D Sbjct: 247 TGASVYTHETYVAQSIGWGTSGVTSYVDKQ--IPEFPSGGTDARSSLNAAYNALKKANPD 304 Query: 357 EV--HRMKNNLEAKKYIVLLTDGE--------NTQDNEEGIAICNKAKSQGIRIMTIAFS 406 E H+ K + ++YIVL+TDGE N+ ++ C AK GI+I ++AF Sbjct: 305 EARYHKEKGSESFERYIVLMTDGEMTGNSAAWNSSIDQSVRTTCETAKKDGIKIFSVAFM 364 Query: 407 VNKTQQEKARYFLSNCAS 424 +K + L CAS Sbjct: 365 A----PDKGKSLLQYCAS 378 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 101/454 (22%), Positives = 195/454 (42%), Gaps = 69/454 (15%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++F FY KK I S +F II AL + L + G L+ V+ W Y +++++ A AI+ Sbjct: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + ++ +L + R ++ K+ + + R +N++K++ + + +T ++ Sbjct: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA--VFYNTEIQ--- 119 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 +V SSR + + + + ++ + Y+ + S + + Sbjct: 120 -----NIVNSSRISM---------------THMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 F +L+ QR Q + F R ++G R L +V + S++ + Sbjct: 160 FIEHLLN-QR--YNQKIVSFIPALLRI---------EMGERPIFLIELVVDLSGSMHCAM 207 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFF 302 P D V+S+ + K+ + +++AL + SI + +V + V MG + Sbjct: 208 NSDPED--------VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 RV + SWG K+ + + + +D + T AM+ AY + S D+ Sbjct: 260 TTRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTS---DKKRSFF 314 Query: 363 NN----------LEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 N L +K+I+ LTDGEN + N I IC+KAK I+I+TI+ + + Sbjct: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 Query: 411 QQEKARYFLSNC-ASPNSFFEANSTHELNKIFRD 443 Q L C +SP + + L +F++ Sbjct: 375 GQR----LLKTCVSSPEYHYNVVNADSLIHVFQN 404 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 96/434 (22%), Positives = 179/434 (41%), Gaps = 85/434 (19%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++ + G+F ++TA+L+PV+LG G +++ + L+ + Sbjct: 10 TRRFLADTGGNFGMMTAILLPVLLGFAGAGMELANVMQVKADLQN---------TADSAA 60 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM----NPRK 126 +R K ++I+E + ++K T+ E + + +++ V + + R Sbjct: 61 LAAATEARLKEG-ALTDEQIKEIAKAFIASQMEKTLTEEEKKALEKNSPVNIGTTDDARG 119 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 Y + + Y + LNPL F G K+ + A T + +K +S+ V+D S Sbjct: 120 KTYTIQTTINYQMQLNPLLGFF---GAKTLDL---AATGTAVSTVNKGAPISMYLVLDRS 173 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM ++ D+ LN K S QN Y V S YP Sbjct: 174 GSM-SFKTDT----LN---------TKKTSCQN------------YTVDNWGS-----YP 202 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV------NDTVRMGAT 300 + ++S +V K L + A+ ++ ++ K D ++ VR GA+ Sbjct: 203 N-----------LKNTSPCYVNKATSL-KTAVGYLVATLNKADPTYTANGGSELVRTGAS 250 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN--EDEV 358 + + +WG + + K I E G T ++ AY+ + +N E + Sbjct: 251 VYTHETYAAQPITWGTSSVATYVDK--QIPEFPSGGTDARSSLNAAYNALKKANTVEAKE 308 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIA--------ICNKAKSQGIRIMTIAFSVNKT 410 H+ K + ++YIVL+TDGE T ++ + C+ AK GI+I ++AF Sbjct: 309 HKDKKSESFERYIVLMTDGEMTGNSSSWSSSIDQTVRNTCDTAKKDGIKIFSVAFMA--- 365 Query: 411 QQEKARYFLSNCAS 424 +K + L +CAS Sbjct: 366 -PDKGKSLLQHCAS 378 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 97/476 (20%), Positives = 187/476 (39%), Gaps = 119/476 (25%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++++ G+F I+TA++MPV+LG G+ +D + + L++A +A + A+ L Sbjct: 11 RRMLGDRGGNFGIMTAIMMPVLLGAAGLAIDYSNMALSKRELQEATDSAALAAATALASG 70 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE-VRDIVRDTAVEMNPRKSA-- 128 ++ A E + ++F + N+ D + + I T+V+++ +A Sbjct: 71 AASTTADA------------EAIAKDFVSGQMANYVDTDAISSIKAGTSVDIDVSATATS 118 Query: 129 --YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 Y+V +++ Y + P F+ +G K+ I A T S + + +S++ V+D Sbjct: 119 KSYKVTVATSYGIAATP---FMSVLGYKTLNI--GASTSTSSGTSDTKTALSMELVLD-- 171 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 Q+G +G + + Y Sbjct: 172 -------------------------------QSGSMGEKTTTCATY-------------- 186 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + + + + K ++ A ++ ++ D + VR GA +N+ + Sbjct: 187 -------------NGKNCKTYVTKIDALKKAADALFDALDTADPDHSLVRTGAYSYNNGL 233 Query: 307 I---------SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN--- 354 I S +WG T+ G T + M+ A +I ++ Sbjct: 234 IYNSQKTQIKSMSGMAWGT-----ATTATYVSGITASGGTDATEPMRQATLSIAKASDGS 288 Query: 355 --EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI----------AICNKAKSQGIRIMT 402 E + H +K N +YI+L+TDGE T + G+ C+ K+ GI+I T Sbjct: 289 DVETQAHAVKGNTIVSRYIILMTDGEMTGNT--GVWQSSFDQNVRNQCDATKTAGIKIFT 346 Query: 403 IAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 +AF +K + L CASP ++EA + +L F I E + V +T Sbjct: 347 VAFMA----PDKGKQLLQYCASPGGNYYEAETMEKLVASFT-SIAKEATKAVTLLT 397 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 54/161 (33%), Positives = 83/161 (51%), Gaps = 16/161 (9%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 DT +G T + RV + SWG K+ + IV+ ID N +G T AM+ AY + S Sbjct: 257 DTHYIGLTGYTTRVEKNIEPSWGTGKVRKYIVE--EIDVNMLGQTDSTPAMKKAYQILTS 314 Query: 353 SNE-----DEVH-RMK-NNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTI 403 + + +H R+K L +K+++ LTDGEN + + + I IC KAK I+I+TI Sbjct: 315 DKKRNFIRNILHKRIKIPPLPFQKFLIFLTDGENNDPKSDVKTIKICEKAKKNSIKILTI 374 Query: 404 AFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRD 443 S+N + K L C S P ++ T L ++F+D Sbjct: 375 --SINASANGKR--LLKKCVSAPEYYYNVVDTGSLLRVFQD 411 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 45/171 (26%), Positives = 86/171 (50%), Gaps = 11/171 (6%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A+ S+I+ ++ +D + R+GA ++ +W + +R+ V + Sbjct: 258 LKKAVRSLIKELQTVDPDDQFTRLGAYAYHWYYAGKKELTWNKNS-VRSWVNSLPAS--- 313 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A+Q A + +++++E H KN E +I+ +TDG + N C A Sbjct: 314 -GGTRAAPAIQKAKNDLLTNSELNAHINKNEQEPDLFILYMTDGIDGDPNWAKRE-CTSA 371 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRD 443 K+ GI I T+AF + R L CA+ ++ +++A + +ELNK+F+D Sbjct: 372 KNAGITIYTVAFKAPAS----GRNLLKACATSDAHYYDAKNANELNKVFKD 418 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 63.2 bits (152), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 99/457 (21%), Positives = 186/457 (40%), Gaps = 89/457 (19%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K+ +G F I++A ++ V L ++D+ + ++ ++ + AII+ + V Sbjct: 18 KNKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAIISGCSIV------V 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN---PRKSAYQVV 132 S N +++I + + +N N+ +NF I+ + + + K Y++ Sbjct: 72 SDPKINDLNPQEERIRDVIKKNAYVNMVQNFPAEHAAYIIENANISFSKDLTNKYEYKIT 131 Query: 133 LSSRYDL--------LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + +++ L L P ++ I + +IQ ++ + S++ V+D Sbjct: 132 MEAKHQLSGKNFILGFLMP-NVITHISSISTGIIQKPSDKK----------AFSVEMVLD 180 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S SMLD ++S C YS N K KS Y L Sbjct: 181 CSGSMLDSMQES------CDLSSGRGGYYFYSKNNNKP---------------KSKIYAL 219 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 D FV+ L+++ +++ +I + R+G FN Sbjct: 220 KTASSD-------FVN------------LIQET----VQTFPQI-----SARIGLITFNH 251 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS-SNEDEVHRMKN 363 ++ D S + + +TI + G T M AY+ + + NE + H + + Sbjct: 252 YIMQDSKLSNNFNVIKKTISRM-----KPKGGTDTFLPMNAAYEYLNNIPNETKAHNISD 306 Query: 364 NLEAKKYIVLLTDGENTQDNEE--GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 N+ K+YI+L+TDGEN + + I +C+ A+ GI I +I +N + Sbjct: 307 NVPLKRYIILMTDGENNHPSYDLKTINVCDNARKNGIIIYSIF--LNYYEYTDGYELARK 364 Query: 422 CASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 CAS FF AN+T L F+ I + I ++ +RI Sbjct: 365 CASSEKHFFYANNTKALLDSFKS-IAHAIQDKAVRIA 400 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 59.7 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 43/121 (35%), Positives = 61/121 (50%), Gaps = 13/121 (10%) Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-------ICNKAKSQG 397 TA + + ED H++K KKYIV +TDG+N D+ G + C+ AKS+G Sbjct: 4 TAKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKG 63 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRI 456 I I TIAF Q Y CAS +S +F+A +L F + IG + +V R+ Sbjct: 64 IEIYTIAFMAPAGGQALLHY----CASDDSHYFQAEKMEDLLAAF-EAIGAKSAAQVTRL 118 Query: 457 T 457 T Sbjct: 119 T 119 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 94/446 (21%), Positives = 193/446 (43%), Gaps = 66/446 (14%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QSLEEVSSRAKNS 82 +TA+++ V +D+ Y + ++ A A+++ ++ +++++ +++ + Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 T K++I+++L + + +++N D + + T + NP Q + S+ + Sbjct: 61 STIFKKQIKKHLKQG--SYIRENAGDIAQKAQINITKDKNNP----LQYIAESKAQYEIP 114 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY----QRDSEG 198 +LFL+ + I S L + + + +SI V+D SRSM D D+ Sbjct: 115 TENLFLKGL-IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 N + P KS+ S+N KS Y P P + + Sbjct: 174 MTSNKYLLPPPPK-KSFWSKN----------------TTKSKYA---PAPAPANRKIDVL 213 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD--PSFSWGV 316 ++S+ + + S+ ++I++ N+ +VR+G +N ++ + S + Sbjct: 214 IESAG------------NLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNL 259 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ VK+ N +T AM AY + + E H + KK+++ +TD Sbjct: 260 NE-----VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS-HNTIGSTRLKKFVIFITD 313 Query: 377 GENT-----QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFE 430 GEN+ Q+ + IC ++ G++I ++A S Q+ L C S FF Sbjct: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD----LLRKCTDSSGQFFA 369 Query: 431 ANSTHELNKIFRDRIGNEIFERVIRI 456 N + EL + F D+I ++I E+ +RI Sbjct: 370 VNDSRELLESF-DKITDKIQEQSVRI 394 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 33/117 (28%), Positives = 58/117 (49%), Gaps = 14/117 (11%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAICNK 392 G T I DA++ +Y+ + S D K+ IVL++DGE + ++ IA N+ Sbjct: 451 GGTNIEDALRVSYERLEGSGSD-----------KRIIVLMSDGEANEGLVGDDLIAYANE 499 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K G+ I T+ F + + + + + + ASP +E + +L F D IG++I Sbjct: 500 IKDDGVTIYTLGFFQSVSDKAECQRVMEGIASPGCHYEVDDASQLRYFFGD-IGDDI 555 >gi|304312669|ref|YP_003812267.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] gi|301798402|emb|CBL46626.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] Length = 347 Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 40/164 (24%), Positives = 69/164 (42%), Gaps = 30/164 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ HK +RT++ I G TAI DA+ A Sbjct: 138 RIGLILFGTQAYLQTPLTFD-HKTVRTLLNESRIGIAG-GQTAIGDAIGLA--------- 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + R+KN+ K ++LLTDG NT + + A QG++I T+ ++ + Sbjct: 187 --LKRLKNHKTGSKVLILLTDGANTAGSVSPVQAAELAARQGMKIYTVGVGADEMRIPGV 244 Query: 416 RYFLSNCASPNS-----------------FFEANSTHELNKIFR 442 F S +P++ +F A +T EL +I++ Sbjct: 245 LGFGSQIVNPSADLDEVTMKKIASLTGAQYFRARNTDELRRIYQ 288 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust. Identities = 41/168 (24%), Positives = 73/168 (43%), Gaps = 20/168 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + ++ ++ +K+ N + R G+ FND V G+ ++ I K A Sbjct: 192 EVAKKSIRKMLEDFRKVPNYANVFRTGSVGFNDMVQFPMPLKRGLKRIYNDIKKYRA--- 248 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAI 389 GST M+ A++ + + +D R KK ++ LTDGEN N + I + Sbjct: 249 --FGSTNSYVGMKYAWEQLYGNPQDTKDR-------KKIVIFLTDGENMIINATRKTIEL 299 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 CN K + I +IA +V+ + L C+S + + A+ L Sbjct: 300 CNDMKKKKAVIYSIALAVDNKE------VLQGCSSSGNVYAADDAQSL 341 >gi|114705525|ref|ZP_01438428.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] gi|114538371|gb|EAU41492.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] Length = 461 Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust. Identities = 37/115 (32%), Positives = 54/115 (46%), Gaps = 26/115 (22%) Query: 363 NNLEAKKYIVLLTDGENT--------------------QDNEEGIAICNKAKSQGIRIMT 402 NN +K +VL+TDG NT Q + + I+ICN K GI I T Sbjct: 351 NNKLPRKALVLMTDGANTMVFNSSDGRHRNARSGTEVAQSDRDTISICNNIKRSGIEIFT 410 Query: 403 IAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFE-RVIR 455 + F VN + A L CA+ +F+A S EL+ F RI + + + R+I+ Sbjct: 411 VGFMVNSSS---ALDLLKECATDGEHYFDATSPEELHSAF-GRIADGLTQIRLIQ 461 >gi|308511201|ref|XP_003117783.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] gi|308238429|gb|EFO82381.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] Length = 566 Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust. Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 12/107 (11%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T N A++T D + + +AK +VL TDG + +D EG K Sbjct: 466 GTTFTNGALKTMAD---------LFQKSKRADAKLKVVLFTDGYSAEDTSEG---AEALK 513 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 SQG+ + T+ S K+ + SPN FF A+ EL+K F Sbjct: 514 SQGVVVYTVGISTEKSTGLNMKELHGMATSPNHFFNASDFVELSKNF 560 >gi|329848522|ref|ZP_08263550.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] gi|328843585|gb|EGF93154.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] Length = 486 Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust. Identities = 40/114 (35%), Positives = 57/114 (50%), Gaps = 25/114 (21%) Query: 363 NNLEAKKYIVLLTDGENT--QDNEEGIA----------------ICNKAKSQGIRIMTIA 404 NN E +K IVL+TDG NT + GIA +C+ AKS+ I I TI Sbjct: 378 NNKEPRKTIVLMTDGANTLYANTSGGIAVANATQVAVTYSDQIRVCDYAKSKKIEIYTIG 437 Query: 405 FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F V KA L CA+ +F+A S+ +L K F + IG ++ + +R+T Sbjct: 438 FDVT---DSKALSTLKACATDAQHYFDAKSSADLIKAF-ETIGGKLSK--VRLT 485 >gi|260912478|ref|ZP_05919014.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] gi|260633397|gb|EEX51551.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] Length = 332 Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust. Identities = 42/177 (23%), Positives = 72/177 (40%), Gaps = 40/177 (22%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ T+ G F + +D + + + +RT + + ++ TAI + A Sbjct: 127 DNIGLTIFAGEAFTQCPMTTDHTSLLNMLQTVRTDIAAKGLIQD---GTAIGMGLANA-- 181 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R+K++ K ++LLTDG N + + N AKS GIR+ TI N Sbjct: 182 ---------VSRLKDSKAKSKVVILLTDGSNNMGDLSPMTSANIAKSLGIRVYTIGVGTN 232 Query: 409 KTQQEKARYFLS----------------------NCASPNSFFEANSTHELNKIFRD 443 K ARY + ++ +F+ A + EL +I++D Sbjct: 233 KV----ARYPMPVAGGVQYVNMPVEIDTKVLKDIAASTDGNFYRATNNQELKQIYKD 285 >gi|126731914|ref|ZP_01747718.1| BatB protein, putative [Sagittula stellata E-37] gi|126707741|gb|EBA06803.1| BatB protein, putative [Sagittula stellata E-37] Length = 323 Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust. Identities = 43/163 (26%), Positives = 68/163 (41%), Gaps = 30/163 (18%) Query: 293 DTVRMGATFFNDRVISDPSFSW---GVHKLIRTIVKTFAIDENEMG----STAINDAMQT 345 D R+G F+DR ++ V KL+ D+ ++G TAI DA+ Sbjct: 133 DGDRLGLVLFSDRAYLQAPLTFDREAVRKLL---------DQAQVGLTGQKTAIGDAIAV 183 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + V R+K+ E + +VLLTDG N + + A GIRI TI Sbjct: 184 S-----------VKRLKDRPEDGRVLVLLTDGANNEGVMSPDKAADLAAKLGIRIYTIGV 232 Query: 406 SVNKTQQEKARYFLSNC-ASPNSFFEANSTHELNKIFR--DRI 445 +++ R A+ ++F A L +I+R DR+ Sbjct: 233 GSARSRDLDERTLRQIADATGGAYFRATDVQGLAQIYRAIDRL 275 >gi|288928458|ref|ZP_06422305.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331292|gb|EFC69876.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] Length = 332 Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust. Identities = 44/177 (24%), Positives = 73/177 (41%), Gaps = 40/177 (22%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ T+ G F + +D + + + +RT + + ++ TAI + A Sbjct: 127 DNIGLTIFAGEAFTQCPMTTDHTSLLNMLQTVRTDIAAKGLIQD---GTAIGMGLANA-- 181 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R+K++ K ++LLTDG N + + N AKS GIR+ TI N Sbjct: 182 ---------VSRLKDSKAKSKVVILLTDGSNNMGDLSPMTSANIAKSLGIRVYTIGVGTN 232 Query: 409 KTQQEKARY--------------------FLSNCASP--NSFFEANSTHELNKIFRD 443 K ARY L + A+ +F+ A + EL +I++D Sbjct: 233 KV----ARYPMPVAGGVQYVNMPVEIDTKVLKDIAATTDGNFYRATNNQELKQIYKD 285 >gi|78484419|ref|YP_390344.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362705|gb|ABB40670.1| Type A von Willebrand factor-like [Thiomicrospira crunogena XCL-2] Length = 349 Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust. Identities = 45/183 (24%), Positives = 75/183 (40%), Gaps = 33/183 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 D L +V +K RMG F + ++ ++ V+T ++E E+G Sbjct: 128 DRLTAVKSVVKNFIQKRQGDRMGLVVFGSQAFLQSPLTYDLNT-----VETL-LNETEIG 181 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 +TAI DA+ A + + N E K ++LLTDG NT + + Sbjct: 182 MAGNNTAIGDAIGIA-----------LKHLHQNSEKKAVLILLTDGSNTAGAVQPLDAAK 230 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNC------------ASPNSFFEANSTHELNK 439 +A+ G++I TI N+ A F N + FF A T++LN+ Sbjct: 231 QAQEMGLKIYTIGIGQNQATGLDAFIFGPNRNMDTTTLQKIAELTQGRFFMAKDTNQLNE 290 Query: 440 IFR 442 I++ Sbjct: 291 IYQ 293 >gi|212635209|ref|YP_002311734.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556693|gb|ACJ29147.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 360 Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust. Identities = 42/155 (27%), Positives = 69/155 (44%), Gaps = 21/155 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 R+G F D F+ + +R + DE+ + ST + DA+ A + Sbjct: 157 RLGLILFGDAAYLQAPFTADLASWLRLL------DESRVAMAGQSTHVGDALGLAIKVMS 210 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S DE+ + N K ++LLTDG +T + + A +GIR+ IA +T Sbjct: 211 S---DEIKSSQKN----KVVLLLTDGNDTDSSVPPLEAAKIAAKKGIRVHVIAIGDPQTV 263 Query: 412 QEKA--RYFLSNCAS--PNSFFEANSTHELNKIFR 442 E+A + A+ F+A ST ELNK+++ Sbjct: 264 GEQAMDMEVIEGVAALTGGKAFKAISTQELNKVYQ 298 >gi|225377140|ref|ZP_03754361.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] gi|225211045|gb|EEG93399.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] Length = 1406 Score = 46.6 bits (109), Expect = 0.009, Method: Compositional matrix adjust. Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 5/78 (6%) Query: 368 KKYIVLLTDGENTQDNE--EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 KKY++L +DGE + N+ E A K K G ++T+ +N E A + AS Sbjct: 913 KKYVILFSDGEPSDSNDKMETEASAVKLKEAGYTVITVGLGLN---NETATWLGEKVASA 969 Query: 426 NSFFEANSTHELNKIFRD 443 F A++ ELNKIF++ Sbjct: 970 GCAFTADTAEELNKIFQN 987 >gi|304382530|ref|ZP_07365025.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] gi|304336361|gb|EFM02602.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] Length = 332 Score = 46.6 bits (109), Expect = 0.009, Method: Compositional matrix adjust. Identities = 52/205 (25%), Positives = 85/205 (41%), Gaps = 35/205 (17%) Query: 259 VDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V +S L +K + +D A I S + DN+ T+ G F + +D + Sbjct: 96 VSTSMLAEDLKPNRMEAAKDVAAEFI-SGRPNDNIGLTIFAGEAFTQCPMTTDHQSLLNL 154 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + +RT + + E+ TA+ + A V R+K++ K ++LLTD Sbjct: 155 LQNVRTDLSARGLIED---GTAVGMGLANA-----------VSRLKDSKAKSKVVILLTD 200 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLS 420 G N + + + N AKS GIR+ TI NK E LS Sbjct: 201 GSNNRGDLSPMTSANIAKSLGIRVYTIGVGTNKVAPYPMPVAGGIQYVNIPVEIDTKTLS 260 Query: 421 NCASP--NSFFEANSTHELNKIFRD 443 AS +F+ A + +EL +I++D Sbjct: 261 GIASVTHGNFYRATNNNELKQIYKD 285 >gi|167946540|ref|ZP_02533614.1| BatB protein, putative [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 345 Score = 46.2 bits (108), Expect = 0.012, Method: Compositional matrix adjust. Identities = 40/136 (29%), Positives = 61/136 (44%), Gaps = 24/136 (17%) Query: 280 SVIRSI--KKIDN-VNDTVRMGATFFNDR--VISDPSFSW-GVHKLIRTIVKTFAIDENE 333 SV++ I K +D V D R+G F D V+S +F +H+L+ IV T A Sbjct: 124 SVLKGIMGKFVDGRVGD--RIGLIIFGDTSYVLSPLTFDRNAIHQLLDGIVPTLA----- 176 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G TAI D + + +++ E + ++L+TDG+N + A Sbjct: 177 GGGTAIGDGIGLG-----------IKKLRERPEGSRVLILVTDGKNETGTIPPLKAAQLA 225 Query: 394 KSQGIRIMTIAFSVNK 409 K +GIRI TI K Sbjct: 226 KQEGIRIYTIGVGSTK 241 >gi|268316013|ref|YP_003289732.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333547|gb|ACY47344.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 329 Score = 45.4 bits (106), Expect = 0.017, Method: Compositional matrix adjust. Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 12/90 (13%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ + T+++ + E G TAI A+ TA ++R+KN+ K I+LLTD Sbjct: 148 YRFLLTMLQRLQVGRLEDG-TAIGTAIATA-----------INRLKNSEARSKVIILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 G+N + + + A+ GIRI TI S Sbjct: 196 GQNNRGEIDPLTAAELARQAGIRIYTIGLS 225 >gi|327313515|ref|YP_004328952.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326945266|gb|AEA21151.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 318 Score = 45.1 bits (105), Expect = 0.023, Method: Compositional matrix adjust. Identities = 46/181 (25%), Positives = 79/181 (43%), Gaps = 18/181 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V + +V +AS S + DN+ T+ G F + D + + +RT + T Sbjct: 105 VYPNRMVVAKEVASEFISGRPNDNIGLTIFAGEAFTQCPMTLDHAALLNLLHGVRTDLVT 164 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + ++ TAI + A V R+K++ K ++LLTDG N + Sbjct: 165 SGLMQD---GTAIGMGLANA-----------VSRLKDSKAKSKIVILLTDGSNNAGSISP 210 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFR 442 + A+ GIRI TI F ++ A + L + A + F+ A S EL++I++ Sbjct: 211 MTAAAIARKFGIRIYTIGFGKETGEEIGAIDYKTLQDIAVSTNGEFYRAQSQAELSRIYQ 270 Query: 443 D 443 D Sbjct: 271 D 271 >gi|225028486|ref|ZP_03717678.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] gi|224954191|gb|EEG35400.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] Length = 538 Score = 45.1 bits (105), Expect = 0.026, Method: Compositional matrix adjust. Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 5/83 (6%) Query: 368 KKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA--RYFLSNCA 423 KK IVL++DG T +D EE I K K QG+ I T+ F N T++ KA +Y + A Sbjct: 318 KKLIVLMSDGLPTLGKDGEELIKYAEKIKDQGVLIYTLGFFQN-TEEYKAEGQYLMEKIA 376 Query: 424 SPNSFFEANSTHELNKIFRDRIG 446 S +E +S+ +L F D G Sbjct: 377 SEGCHYEVSSSEDLVFFFEDVAG 399 >gi|192360615|ref|YP_001982630.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686780|gb|ACE84458.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 318 Score = 44.7 bits (104), Expect = 0.029, Method: Compositional matrix adjust. Identities = 42/150 (28%), Positives = 67/150 (44%), Gaps = 19/150 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ V + +++ + E +TAI DA+ + Sbjct: 134 RLGLVLFGTQAFLQAPLTFDVKTVQEMLIEAESGYAGE--ATAIGDAIALS--------- 182 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK-AKSQGIRIMTIAFSVNKTQQEK 414 + R++ AK+ I+LLTDGENT E GIA A +I TIAFS +E Sbjct: 183 --IKRLREQPNAKRVIILLTDGENTA-GELGIATATDLAVKANTKIYTIAFS--PYDREV 237 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + + A + FF A +T +L +I R Sbjct: 238 DSHSMQQIAEQTGGEFFRARNTRDLEEIHR 267 >gi|281420095|ref|ZP_06251094.1| BatA protein [Prevotella copri DSM 18205] gi|281405895|gb|EFB36575.1| BatA protein [Prevotella copri DSM 18205] Length = 332 Score = 44.7 bits (104), Expect = 0.030, Method: Compositional matrix adjust. Identities = 32/106 (30%), Positives = 47/106 (44%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K ++LLTDG N + AKS GIR+ TI NK Sbjct: 180 NAVSRLKDSKSKSKVVILLTDGSNNMGEISPMTAAEIAKSYGIRVYTIGVGTNKVAPYPM 239 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E LS+ A + +F+ A + +EL KI+RD Sbjct: 240 PVAGGVQYVNIPVEIDTKTLSDIAQTTDGNFYRATNNNELKKIYRD 285 >gi|325860278|ref|ZP_08173400.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482157|gb|EGC85168.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 318 Score = 44.7 bits (104), Expect = 0.033, Method: Compositional matrix adjust. Identities = 45/181 (24%), Positives = 79/181 (43%), Gaps = 18/181 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V + +V +AS S + DN+ T+ G F + D + + +RT + T Sbjct: 105 VYPNRMVVAKEVASEFISGRPNDNIGLTIFAGEAFTQCPMTLDHAALLNLLHGVRTDLVT 164 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + ++ TAI + A V R+K++ K ++LLTDG N + Sbjct: 165 SGLMQD---GTAIGMGLANA-----------VSRLKDSKAKSKIVILLTDGSNNAGSISP 210 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFR 442 + A+ GIR+ TI F ++ A + L + A + F+ A S EL++I++ Sbjct: 211 MTAAAIARKFGIRVYTIGFGKETGEEIGAIDYKTLQDIAVSTNGEFYRAQSQAELSRIYQ 270 Query: 443 D 443 D Sbjct: 271 D 271 >gi|294673502|ref|YP_003574118.1| BatA protein [Prevotella ruminicola 23] gi|294472594|gb|ADE81983.1| putative BatA protein [Prevotella ruminicola 23] Length = 332 Score = 44.3 bits (103), Expect = 0.037, Method: Compositional matrix adjust. Identities = 44/173 (25%), Positives = 71/173 (41%), Gaps = 32/173 (18%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ + G +F + +D + + + +RT + + E+ TAI + A Sbjct: 127 DNIGLAIFAGESFTQCPMTTDHASLLNLLQNVRTDIAARGLIED---GTAIGMGLANA-- 181 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R+K++ K ++LLTDG N + + AKS GIR+ TI N Sbjct: 182 ---------VSRLKDSKAKSKVVILLTDGSNNRGDISPSTAAEIAKSLGIRVYTIGVGTN 232 Query: 409 KTQQ----------------EKARYFLSNCAS--PNSFFEANSTHELNKIFRD 443 K E LS AS F+ A +T+EL KI+++ Sbjct: 233 KVAPYPMPVAGGVQYVNVPVEIDTKTLSEIASITEGDFYRATNTNELRKIYKE 285 >gi|254786433|ref|YP_003073862.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687231|gb|ACR14495.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 347 Score = 44.3 bits (103), Expect = 0.040, Method: Compositional matrix adjust. Identities = 35/145 (24%), Positives = 60/145 (41%), Gaps = 34/145 (23%) Query: 321 RTIVKTFAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 R V T +DE ++G TAI DA+ A + R++ +++ ++LLTD Sbjct: 156 RKTVSTL-LDEAQLGFAGEQTAIGDAVGLA-----------IKRLRERPASQRVLILLTD 203 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY-FLSNCASPNS-------- 427 G NT + AK GI+I T+ ++ +Q + S +P+S Sbjct: 204 GANTAGEVAPRQAADLAKQAGIKIYTVGVGADQMEQRMGLFGGFSRTVNPSSDLDEDTLR 263 Query: 428 ---------FFEANSTHELNKIFRD 443 +F A + EL I+ + Sbjct: 264 YMAETTGGLYFRARNPQELQAIYEE 288 >gi|254525166|ref|ZP_05137221.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] gi|219722757|gb|EED41282.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] Length = 334 Score = 44.3 bits (103), Expect = 0.043, Method: Compositional matrix adjust. Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 24/118 (20%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ A V R+++ E ++ ++LLTDG + E + A+++ Sbjct: 183 TAIGDAIGLA-----------VKRLRSQPEGQRVLILLTDGVSNAGVLEPLRAAEVARAE 231 Query: 397 GIRIMTIAF-----------SVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 G+RI T+AF S++ Q L A + FF A T +L I+ Sbjct: 232 GVRIHTVAFGGDGSMRVFGISISADQDPVDEATLKKIAGMTGGQFFRARDTAQLAGIY 289 >gi|288925756|ref|ZP_06419687.1| BatA protein [Prevotella buccae D17] gi|315608294|ref|ZP_07883284.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] gi|288337411|gb|EFC75766.1| BatA protein [Prevotella buccae D17] gi|315250075|gb|EFU30074.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] Length = 332 Score = 44.3 bits (103), Expect = 0.045, Method: Compositional matrix adjust. Identities = 42/173 (24%), Positives = 70/173 (40%), Gaps = 32/173 (18%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ T+ G F + +D H + T+++ D G IND Sbjct: 127 DNIGLTIFAGEAFTQCPMTTD-------HASLLTLLQDVRTDMATRG--LINDGT----- 172 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 I + V R+K++ + ++LLTDG N + + AKS GIR+ TIA + Sbjct: 173 AIGMGLANAVSRLKDSKTKSRVVILLTDGANNAGDISPLTAAQMAKSLGIRVYTIAVGTS 232 Query: 409 KTQQEKARY-----FLSNCA-------------SPNSFFEANSTHELNKIFRD 443 K ++S A + +F+ AN+T +L +I+ D Sbjct: 233 KVAPYPIEVGGRVQYISRPADIDTKTLREIAAVTEGNFYSANNTAQLKQIYHD 285 >gi|260592520|ref|ZP_05857978.1| BatA protein [Prevotella veroralis F0319] gi|260535566|gb|EEX18183.1| BatA protein [Prevotella veroralis F0319] Length = 318 Score = 43.9 bits (102), Expect = 0.052, Method: Compositional matrix adjust. Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 4/92 (4%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + V R+K++ K I+LLTDG N + + + AK GIRI TI A Sbjct: 180 NSVSRLKDSKAKSKVIILLTDGSNNVGSISPMTAASIAKKYGIRIYTIGLGKESEGDLGA 239 Query: 416 RYF--LSNCA--SPNSFFEANSTHELNKIFRD 443 + L N A + F+ A S EL+KI++D Sbjct: 240 IDYKTLQNIAVSTNGEFYRAQSQAELSKIYQD 271 >gi|188578240|ref|YP_001915169.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522692|gb|ACD60637.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 335 Score = 43.9 bits (102), Expect = 0.053, Method: Compositional matrix adjust. Identities = 34/125 (27%), Positives = 48/125 (38%), Gaps = 37/125 (29%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLDPLKAAELAKAE 231 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP--------------------NSFFEANSTHE 436 G+RI TIAF Y L P FF A T E Sbjct: 232 GVRIYTIAFGGGGG------YSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEE 285 Query: 437 LNKIF 441 L I+ Sbjct: 286 LAGIY 290 >gi|289667993|ref|ZP_06489068.1| hypothetical protein XcampmN_05693 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 310 Score = 43.9 bits (102), Expect = 0.057, Method: Compositional matrix adjust. Identities = 32/121 (26%), Positives = 48/121 (39%), Gaps = 29/121 (23%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + AK++ Sbjct: 158 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 206 Query: 397 GIRIMTIAFSV----------------NKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 G+R+ TIAF + +E R FF A T EL I Sbjct: 207 GVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTG--GRFFRARDTEELAGI 264 Query: 441 F 441 + Sbjct: 265 Y 265 >gi|166713250|ref|ZP_02244457.1| hypothetical protein Xoryp_17865 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 335 Score = 43.9 bits (102), Expect = 0.060, Method: Compositional matrix adjust. Identities = 34/125 (27%), Positives = 48/125 (38%), Gaps = 37/125 (29%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLDPLKAAELAKAE 231 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP--------------------NSFFEANSTHE 436 G+RI TIAF Y L P FF A T E Sbjct: 232 GVRIHTIAFGGGGG------YSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEE 285 Query: 437 LNKIF 441 L I+ Sbjct: 286 LAGIY 290 >gi|58580793|ref|YP_199809.1| hypothetical protein XOO1170 [Xanthomonas oryzae pv. oryzae KACC10331] gi|58425387|gb|AAW74424.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 335 Score = 43.5 bits (101), Expect = 0.067, Method: Compositional matrix adjust. Identities = 24/69 (34%), Positives = 37/69 (53%), Gaps = 11/69 (15%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLDPLKAAELAKAE 231 Query: 397 GIRIMTIAF 405 G+RI TIAF Sbjct: 232 GVRIHTIAF 240 >gi|190575666|ref|YP_001973511.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] gi|190013588|emb|CAQ47223.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] Length = 334 Score = 43.5 bits (101), Expect = 0.070, Method: Compositional matrix adjust. Identities = 32/118 (27%), Positives = 51/118 (43%), Gaps = 24/118 (20%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ A V R+++ E ++ ++LLTDG + E + A+++ Sbjct: 183 TAIGDAIGLA-----------VKRLRSQPEGQRVLILLTDGVSNAGVLEPLRAAEVAQAE 231 Query: 397 GIRIMTIAFS-----------VNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIF 441 G+RI T+AF ++ Q L AS FF A T +L I+ Sbjct: 232 GVRIHTVAFGGDGSMRFLGIPISADQDPVDEATLKKIASLTGGQFFRARDTAQLAGIY 289 >gi|284046352|ref|YP_003396692.1| hypothetical protein Cwoe_4905 [Conexibacter woesei DSM 14684] gi|283950573|gb|ADB53317.1| conserved hypothetical protein [Conexibacter woesei DSM 14684] Length = 317 Score = 43.5 bits (101), Expect = 0.072, Method: Compositional matrix adjust. Identities = 36/124 (29%), Positives = 56/124 (45%), Gaps = 23/124 (18%) Query: 288 IDNVNDTVRMGATFFNDRVISD----PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 +D + +R+G T F+D + D P++ H LIR ++ D G TA DA+ Sbjct: 119 LDQLPPGIRVGVTTFSD--VPDGTQTPTYD---HDLIRRTIEAQIAD----GGTATGDAL 169 Query: 344 QTAYDTIISSNEDEVHRMKNNLE-AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 Q A DT+ R++ N E +VLL+DG T + + + A I I T Sbjct: 170 QVALDTL--------ERLEQNGERTPAAMVLLSDGATTTGRDP-VMVARAAGEARIPIYT 220 Query: 403 IAFS 406 +A Sbjct: 221 VALG 224 >gi|21232653|ref|NP_638570.1| hypothetical protein XCC3224 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767265|ref|YP_242027.1| hypothetical protein XC_0933 [Xanthomonas campestris pv. campestris str. 8004] gi|21114459|gb|AAM42494.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572597|gb|AAY48007.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 335 Score = 43.5 bits (101), Expect = 0.074, Method: Compositional matrix adjust. Identities = 33/125 (26%), Positives = 49/125 (39%), Gaps = 37/125 (29%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQRQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 231 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP--------------------NSFFEANSTHE 436 G+R+ TIAF + + Y L P FF A T E Sbjct: 232 GVRVHTIAFGGSGS------YSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRARDTEE 285 Query: 437 LNKIF 441 L I+ Sbjct: 286 LAGIY 290 >gi|119383876|ref|YP_914932.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119373643|gb|ABL69236.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 282 Score = 43.5 bits (101), Expect = 0.078, Method: Compositional matrix adjust. Identities = 31/115 (26%), Positives = 51/115 (44%), Gaps = 21/115 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 R+G F DR ++ V + R A+DE ++G +TAI+D + A Sbjct: 126 RIGLVIFGDRAYFAQPLTFDVDAVAR------AVDEAQIGISGRATAISDGLGLA----- 174 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + R+ + + +VL++DG +T N + + A GIRI TIA Sbjct: 175 ------MKRLAASEAPTRVVVLMSDGVDTSGNVQAVDAARLAAGHGIRIHTIALG 223 >gi|325927915|ref|ZP_08189139.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] gi|325541755|gb|EGD13273.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] Length = 338 Score = 43.1 bits (100), Expect = 0.089, Method: Compositional matrix adjust. Identities = 33/125 (26%), Positives = 48/125 (38%), Gaps = 37/125 (29%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGALNPLKAAELAKAE 231 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP--------------------NSFFEANSTHE 436 G+R+ TIAF + Y L P FF A T E Sbjct: 232 GVRVHTIAFGGSGG------YSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEE 285 Query: 437 LNKIF 441 L I+ Sbjct: 286 LAGIY 290 >gi|325268973|ref|ZP_08135594.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] gi|324988594|gb|EGC20556.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] Length = 318 Score = 43.1 bits (100), Expect = 0.090, Method: Compositional matrix adjust. Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 4/92 (4%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + V R++++ K ++LLTDG N + + AK GIR+ TI F ++ A Sbjct: 180 NAVSRLQDSKAKSKIVILLTDGSNNVGSISPMTAAAIAKKFGIRVYTIGFGRETGEEIGA 239 Query: 416 RYF--LSNCA--SPNSFFEANSTHELNKIFRD 443 + L N A + F+ A S EL++I++D Sbjct: 240 IDYRALQNIAVSTNGEFYRAQSQAELSRIYQD 271 >gi|88704964|ref|ZP_01102676.1| conserved hypothetical protein [Congregibacter litoralis KT71] gi|88700659|gb|EAQ97766.1| conserved hypothetical protein [Congregibacter litoralis KT71] Length = 344 Score = 43.1 bits (100), Expect = 0.094, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 50/116 (43%), Gaps = 19/116 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKT---FAIDENEMGSTAINDAMQTAYDTIIS 352 R+G F R S+ + + R +++ FA E TAI DA+ A Sbjct: 129 RLGLILFGSRAYLQSPLSFDIQTVQRFLLEAQIGFAGQE-----TAIGDAIGLA------ 177 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R++ + + +VLLTDG++T + + N A G+RI TI + Sbjct: 178 -----VKRLQERPASSRVLVLLTDGQDTASTVDPLEAANLAADLGVRIYTIGIGAD 228 >gi|261880541|ref|ZP_06006968.1| BatA protein [Prevotella bergensis DSM 17361] gi|270332764|gb|EFA43550.1| BatA protein [Prevotella bergensis DSM 17361] Length = 332 Score = 43.1 bits (100), Expect = 0.10, Method: Compositional matrix adjust. Identities = 30/106 (28%), Positives = 46/106 (43%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K + K ++LLTDG N + + A+S GIR+ TI NK Sbjct: 180 NAVSRLKGSKAKSKVVILLTDGSNNMGDISPLTAAQIARSLGIRVYTIGVGTNKVAPYPM 239 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E LS+ A + +F+ A + EL +I+RD Sbjct: 240 PVAGGVQYVNMPVEIDTKTLSDIAAITEGNFYRATNNRELKQIYRD 285 >gi|289662175|ref|ZP_06483756.1| hypothetical protein XcampvN_03493 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 335 Score = 43.1 bits (100), Expect = 0.10, Method: Compositional matrix adjust. Identities = 32/121 (26%), Positives = 48/121 (39%), Gaps = 29/121 (23%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 231 Query: 397 GIRIMTIAFSV----------------NKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 G+R+ TIAF + +E R FF A T EL I Sbjct: 232 GVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTG--GRFFRARDTEELAGI 289 Query: 441 F 441 + Sbjct: 290 Y 290 >gi|323345325|ref|ZP_08085548.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] gi|323093439|gb|EFZ36017.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] Length = 332 Score = 42.7 bits (99), Expect = 0.11, Method: Compositional matrix adjust. Identities = 43/177 (24%), Positives = 72/177 (40%), Gaps = 40/177 (22%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ T+ G F + +D + + + +RT + + ++ TA+ + A Sbjct: 127 DNIGLTIFAGEAFTQCPMTTDHASLLNLLQNVRTDIAARGLIQD---GTAVGMGLANA-- 181 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R+K++ K ++LLTDG N + + AKS GIR+ TI N Sbjct: 182 ---------VSRLKDSKAKSKVVILLTDGSNNMGDLSPMTSAQIAKSLGIRVYTIGVGTN 232 Query: 409 KTQQEKARY--------------------FLSNCASP--NSFFEANSTHELNKIFRD 443 K ARY LS+ A+ +F+ A + EL +I+ D Sbjct: 233 KV----ARYPMPVTGGIQYVNIPVEIDTKTLSDIAATTDGNFYRATNNRELKQIYND 285 >gi|194367004|ref|YP_002029614.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] gi|194349808|gb|ACF52931.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] Length = 334 Score = 42.7 bits (99), Expect = 0.12, Method: Compositional matrix adjust. Identities = 30/118 (25%), Positives = 52/118 (44%), Gaps = 24/118 (20%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ A V R+++ E ++ ++LLTDG + E + A+++ Sbjct: 183 TAIGDAIGLA-----------VKRLRSQPEGQRVLILLTDGVSNAGVLEPLRAAEVARAE 231 Query: 397 GIRIMTIAF-------------SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 G+RI T+AF S ++ ++A + FF A T +L I+ Sbjct: 232 GVRIHTVAFGGDGSMRLFGIPISADQDPVDEATLKKIATMTGGQFFRARDTAQLAGIY 289 >gi|254443725|ref|ZP_05057201.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258033|gb|EDY82341.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 339 Score = 42.7 bits (99), Expect = 0.12, Method: Compositional matrix adjust. Identities = 42/155 (27%), Positives = 67/155 (43%), Gaps = 18/155 (11%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 +E++FVD R + V+ L++ I N + R+G F R + + Sbjct: 100 AEDYFVD----RKRSNRLQAVKPVLSAFI-------NRRENDRIGLIAFAGRAYTVAPLT 148 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + HK + + I E G TAI D++ A ++ ++ + +IVL Sbjct: 149 FD-HKWLARQTERLQIGLIEDG-TAIGDSLAVATSRLLEGAKERAGEREG-----AFIVL 201 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 LTDGENT + + AK GIR+ TIA N Sbjct: 202 LTDGENTAGMMDPMEGATLAKDAGIRVYTIAAGKN 236 >gi|21244101|ref|NP_643683.1| hypothetical protein XAC3376 [Xanthomonas axonopodis pv. citri str. 306] gi|21109728|gb|AAM38219.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 323 Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust. Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 11/70 (15%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + AK++ Sbjct: 171 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 219 Query: 397 GIRIMTIAFS 406 G+R+ TIAF Sbjct: 220 GVRVHTIAFG 229 >gi|188990358|ref|YP_001902368.1| hypothetical protein xccb100_0962 [Xanthomonas campestris pv. campestris str. B100] gi|167732118|emb|CAP50310.1| putative membrane protein [Xanthomonas campestris pv. campestris] Length = 335 Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust. Identities = 33/125 (26%), Positives = 48/125 (38%), Gaps = 37/125 (29%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ ++ +VLLTDG NT + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQRHGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 231 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP--------------------NSFFEANSTHE 436 G+R+ TIAF + + Y L P FF A T E Sbjct: 232 GVRVHTIAFGGSGS------YSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRARDTEE 285 Query: 437 LNKIF 441 L I+ Sbjct: 286 LAGIY 290 >gi|325922265|ref|ZP_08184046.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] gi|325547218|gb|EGD18291.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] Length = 335 Score = 42.4 bits (98), Expect = 0.15, Method: Compositional matrix adjust. Identities = 33/125 (26%), Positives = 48/125 (38%), Gaps = 37/125 (29%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 231 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP--------------------NSFFEANSTHE 436 G+R+ TIAF + Y L P FF A T E Sbjct: 232 GVRVHTIAFGGSGG------YSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRARDTEE 285 Query: 437 LNKIF 441 L I+ Sbjct: 286 LAGIY 290 >gi|87200512|ref|YP_497769.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] gi|87136193|gb|ABD26935.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] Length = 631 Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust. Identities = 25/71 (35%), Positives = 37/71 (52%), Gaps = 9/71 (12%) Query: 376 DGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 DG QD + A+C+ AK++G R+ IAF+ + LS CAS +S F A Sbjct: 555 DGVTDQDARHTLRFRALCDAAKAKGFRVWVIAFASDLNDD------LSYCASASSTFPAT 608 Query: 433 STHELNKIFRD 443 + ELN F++ Sbjct: 609 NATELNTAFQE 619 >gi|254514588|ref|ZP_05126649.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] gi|219676831|gb|EED33196.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] Length = 347 Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust. Identities = 30/116 (25%), Positives = 50/116 (43%), Gaps = 19/116 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKT---FAIDENEMGSTAINDAMQTAYDTIIS 352 R+G F R S+ + + R ++++ FA E TAI DA+ A Sbjct: 137 RLGLILFGSRAYLQSPLSFDIQTVQRFLLESQIGFAGQE-----TAIGDAIGLA------ 185 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R++ + ++LLTDG++T + + N A G+RI TI + Sbjct: 186 -----VKRLQERPATSRVLILLTDGQDTASTVDPLEAANLAADLGVRIYTIGIGAD 236 >gi|325917650|ref|ZP_08179844.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] gi|325536114|gb|EGD07916.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] Length = 335 Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust. Identities = 33/125 (26%), Positives = 48/125 (38%), Gaps = 37/125 (29%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 231 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP--------------------NSFFEANSTHE 436 G+R+ TIAF + Y L P FF A T E Sbjct: 232 GVRVHTIAFGGSGG------YSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRARDTEE 285 Query: 437 LNKIF 441 L I+ Sbjct: 286 LAGIY 290 >gi|226326038|ref|ZP_03801556.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] gi|225205580|gb|EEG87934.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] Length = 275 Score = 42.0 bits (97), Expect = 0.18, Method: Compositional matrix adjust. Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 7/113 (6%) Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEA----KKYIVLLTDG-ENTQDNEEGIAICNKAKS 395 +AM+T+ T + D +++ NN + K+Y+VLLTDG N ++ NK KS Sbjct: 130 NAMETSGGTHQNEGLDRAYKILNNDQNTSNLKRYVVLLTDGCPNGVTYDQITTSINKIKS 189 Query: 396 QGIRIMTIAFSVNKTQQ--EKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 +++T+ +++T + A+ +L A N + AN LN IF +G Sbjct: 190 TNTKLITVGVGLDETNTGLKAAKDYLQANADDNMAYNANDASHLNTIFTQILG 242 >gi|329894014|ref|ZP_08270022.1| BatA [gamma proteobacterium IMCC3088] gi|328923357|gb|EGG30676.1| BatA [gamma proteobacterium IMCC3088] Length = 339 Score = 42.0 bits (97), Expect = 0.19, Method: Compositional matrix adjust. Identities = 44/188 (23%), Positives = 75/188 (39%), Gaps = 46/188 (24%) Query: 283 RSIKKIDNVNDTV----------RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 +++++ID V D R+G F R S R VK F + E Sbjct: 113 QAVRRIDAVRDIGAEFIERREGDRVGLILFGSRAYMQSPLS-----FDRDTVKQF-LSEA 166 Query: 333 EMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ++G TAI DA+ A V R+++ + + ++LLTDG++T + + + Sbjct: 167 QIGFAGSETAIGDALGLA-----------VKRLRDKEDGDRVVILLTDGQDTASSVDPLD 215 Query: 389 ICNKAKSQGIRIMTIAFSVNK-------------TQQEKARYFLSNCA--SPNSFFEANS 433 A + G+++ TI ++ E LS A + +F A S Sbjct: 216 ATALAANYGVKVYTIGIGADEMLVPSLFGNRRVNPSAELDEETLSAMAESTGGRYFRARS 275 Query: 434 THELNKIF 441 EL KI+ Sbjct: 276 PDELAKIY 283 >gi|294627092|ref|ZP_06705680.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292598525|gb|EFF42674.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 451 Score = 42.0 bits (97), Expect = 0.20, Method: Compositional matrix adjust. Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 11/70 (15%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ V R++ + ++ +VLLTDG NT + AK++ Sbjct: 296 TAIGDAIAL-----------SVKRLREQKQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 344 Query: 397 GIRIMTIAFS 406 G+R+ TIAF Sbjct: 345 GVRVHTIAFG 354 >gi|104780882|ref|YP_607380.1| hypothetical protein PSEEN1727 [Pseudomonas entomophila L48] gi|95109869|emb|CAK14574.1| conserved hypothetical protein; Willebrand factor type A domain protein [Pseudomonas entomophila L48] Length = 358 Score = 42.0 bits (97), Expect = 0.20, Method: Compositional matrix adjust. Identities = 34/138 (24%), Positives = 54/138 (39%), Gaps = 30/138 (21%) Query: 321 RTIVKTFAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 R V+TF +DE ++G +TAI DA+ A V R++ + ++L+TD Sbjct: 155 RRTVRTF-LDEAQIGIAGKNTAIGDAIGLA-----------VKRLRQRPAQSRVLILITD 202 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA------------- 423 G N + A +G+RI TI N N + Sbjct: 203 GANNGGQIHPLTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEASLKEIAG 262 Query: 424 -SPNSFFEANSTHELNKI 440 + ++F A+ ELN I Sbjct: 263 ITHGTYFRAHDGAELNAI 280 >gi|317502942|ref|ZP_07961034.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] gi|315665941|gb|EFV05516.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] Length = 332 Score = 42.0 bits (97), Expect = 0.21, Method: Compositional matrix adjust. Identities = 41/173 (23%), Positives = 70/173 (40%), Gaps = 32/173 (18%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ T+ G F + +D + + + +RT + + + TAI + A Sbjct: 127 DNIGLTIFAGEAFTQCPMTTDHASLINMLRSVRTDIAARGLISD---GTAIGMGLANA-- 181 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R+K++ K ++LLTDG N + + AKS GIR+ TI N Sbjct: 182 ---------VSRLKDSKAKSKVVILLTDGSNNMGDISPLTSAQIAKSLGIRVYTIGVGTN 232 Query: 409 KTQQ----------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 K E L N A + +++ A S ++L +I++D Sbjct: 233 KVAPYPMPVAGGVQYVNIPVEIDSKTLKNIAETTDGNYYRATSNNQLKQIYKD 285 >gi|311746225|ref|ZP_07720010.1| BatA protein [Algoriphagus sp. PR1] gi|126576455|gb|EAZ80733.1| BatA protein [Algoriphagus sp. PR1] Length = 347 Score = 42.0 bits (97), Expect = 0.21, Method: Compositional matrix adjust. Identities = 37/145 (25%), Positives = 64/145 (44%), Gaps = 20/145 (13%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +KL+ +++ + + E TAI A+ +A +RMK + A K ++LL+D Sbjct: 167 YKLLTDLIQDISFNMMEAKGTAIGSAIASA-----------TNRMKESESASKVLILLSD 215 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE+ N + + A + I+I TIA + +F S+ + + E Sbjct: 216 GESNAGNVDPLFAAQLASALDIKIYTIAVGKDGMVPYGTDFF-GRPQMVESYLDETNLRE 274 Query: 437 LNKI-----FRDRIG---NEIFERV 453 + KI FR G N IF+R+ Sbjct: 275 IAKIGNGEFFRASDGGTLNNIFDRI 299 >gi|217978613|ref|YP_002362760.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217503989|gb|ACK51398.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 325 Score = 42.0 bits (97), Expect = 0.21, Method: Compositional matrix adjust. Identities = 45/190 (23%), Positives = 77/190 (40%), Gaps = 28/190 (14%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G R SP + + + + L L S++ E FV H + ++ A++I Sbjct: 80 GPRTIAASPAQPASGRDIVFAL---DLSGSMAAEDFVLDG---HAASRIDALKRVGAALI 133 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAIND 341 + D R+G F +R + S+ V + RT+ + I +G STAI + Sbjct: 134 KR-----RTGD--RIGLVIFAERAYAAAPLSFDVDAVSRTLAE---IPLGLVGHSTAIGE 183 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A + R+ + + IVLL+DG N + + A + G++I Sbjct: 184 GLGLA-----------LKRLTESKAPSRVIVLLSDGANDAGTTDPTGVAELANNLGVKIY 232 Query: 402 TIAFSVNKTQ 411 TI V TQ Sbjct: 233 TIGLGVVDTQ 242 >gi|294664114|ref|ZP_06729507.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292606114|gb|EFF49372.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 451 Score = 42.0 bits (97), Expect = 0.22, Method: Compositional matrix adjust. Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 11/70 (15%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ V R++ + ++ +VLLTDG NT + AK++ Sbjct: 296 TAIGDAIAL-----------SVKRLREQKQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 344 Query: 397 GIRIMTIAFS 406 G+R+ TIAF Sbjct: 345 GVRVHTIAFG 354 >gi|78049050|ref|YP_365225.1| hypothetical protein XCV3494 [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037480|emb|CAJ25225.1| putative membrane protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 451 Score = 42.0 bits (97), Expect = 0.23, Method: Compositional matrix adjust. Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 11/70 (15%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + AK++ Sbjct: 296 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 344 Query: 397 GIRIMTIAFS 406 G+R+ TIAF Sbjct: 345 GVRVHTIAFG 354 >gi|114571147|ref|YP_757827.1| hypothetical protein Mmar10_2603 [Maricaulis maris MCS10] gi|114341609|gb|ABI66889.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 520 Score = 42.0 bits (97), Expect = 0.23, Method: Compositional matrix adjust. Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 4/55 (7%) Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIF 441 A C A+S GIR+ TI F VN + R + NCAS P+ +F++ S+ L F Sbjct: 455 AACAYARSLGIRVYTITFQVNSS---STRSLMQNCASNPSLYFDSPSSEALEDAF 506 >gi|84386025|ref|ZP_00989055.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] gi|84379341|gb|EAP96194.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] Length = 345 Score = 42.0 bits (97), Expect = 0.23, Method: Compositional matrix adjust. Identities = 47/212 (22%), Positives = 82/212 (38%), Gaps = 34/212 (16%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L S+SEE F D +H D L + +++ + R+G F D Sbjct: 102 LSGSMSEEDFADKKGNKH---------DRLTIAKQVLREFAAQREHDRLGLILFADSAYV 152 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F+ ++ V +++ E+G A A I+ E E R + Sbjct: 153 QAPFTEDIN------VWQSLLEDVELGYAGFKTAFGDAIGLSIAVFEQEQSR-------Q 199 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-------N 421 + ++LLTDG++T + A G++I TIA T K RY + + Sbjct: 200 RVMILLTDGDDTSSKMPPVKAAEIAAKYGVKIYTIAIGDPST---KGRYKMDLPTLEKVS 256 Query: 422 CASPNSFFEANSTHELNKIFR--DRIGNEIFE 451 A+ F A +L++ + D++ + FE Sbjct: 257 AATGGQMFHAMDRKQLDQAYATIDQLEQQEFE 288 >gi|167032571|ref|YP_001667802.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166859059|gb|ABY97466.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 358 Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust. Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 16/93 (17%) Query: 321 RTIVKTFAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 R V+TF +DE ++G +TAI DA+ A V R++ + +VL+TD Sbjct: 155 RRTVRTF-LDEAKIGIAGKNTAIGDAIGLA-----------VKRLRQRPAQSRVLVLITD 202 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 G N + A +G+RI TI N Sbjct: 203 GANNGGQIHPLTAARLAAQEGVRIYTIGIGANP 235 >gi|325273881|ref|ZP_08140055.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] gi|324100983|gb|EGB98655.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] Length = 311 Score = 41.6 bits (96), Expect = 0.24, Method: Compositional matrix adjust. Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 16/92 (17%) Query: 321 RTIVKTFAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 R V+TF +DE ++G +TAI DA+ A V R++ + +VL+TD Sbjct: 108 RRTVRTF-LDEAQIGIAGKNTAIGDAIGLA-----------VKRLRQRPAQSRVLVLITD 155 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 G N + A +G+RI TI N Sbjct: 156 GANNGGQIHPLTAARLAAQEGVRIYTIGIGAN 187 >gi|148975506|ref|ZP_01812377.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] gi|145964934|gb|EDK30185.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] Length = 357 Score = 41.6 bits (96), Expect = 0.26, Method: Compositional matrix adjust. Identities = 43/169 (25%), Positives = 74/169 (43%), Gaps = 14/169 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L S++E+ F + I + ++ LA ++ +K D R+G F D Sbjct: 109 LSGSMAEQDFTSKQGKK--ISRLQATKEVLADFAKT-RKGD------RLGLILFGDAAFV 159 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNED-EVHRMKNNLE 366 F+ + + +T D G ST + DA+ A S + + +N++E Sbjct: 160 QTPFTADQDVWLELLNQT---DVAMAGQSTHLGDAIGLAIKVFEQSGKQMSAEQAQNDIE 216 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +K +++LTDG +T E I AK++G+RI IA +T E A Sbjct: 217 REKVVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVA 265 >gi|85712923|ref|ZP_01043963.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] gi|85693229|gb|EAQ31187.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] Length = 328 Score = 41.2 bits (95), Expect = 0.32, Method: Compositional matrix adjust. Identities = 36/133 (27%), Positives = 55/133 (41%), Gaps = 30/133 (22%) Query: 329 IDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 +DE+ +G TAI DA+ A V R K + + +VLLTDG+NT N Sbjct: 158 LDESVLGLVGERTAIGDAIALA-----------VKRFKGKQQTNRVLVLLTDGQNTAGNL 206 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFF 429 AK+ +RI IA + ++ + N A + +F Sbjct: 207 SPEQALELAKAYDVRIYPIAVGAEEVVVDSVFGRRKVNPSRDLDVPLMQNLADETGGEYF 266 Query: 430 EANSTHELNKIFR 442 A ST EL +I++ Sbjct: 267 RARSTEELERIYQ 279 >gi|312883763|ref|ZP_07743482.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] gi|309368512|gb|EFP96045.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] Length = 396 Score = 41.2 bits (95), Expect = 0.33, Method: Compositional matrix adjust. Identities = 26/89 (29%), Positives = 46/89 (51%), Gaps = 8/89 (8%) Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQ---DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 EV +N + ++ +VL+TDG + D+ +C++AK GI + + F VN ++ E Sbjct: 295 EVGPNRNVDKVERKLVLMTDGNDYGRYFDDLINAGLCDRAKDYGIALNFVGFGVNGSRLE 354 Query: 414 KARYFLSNCA-SPNSFFEANSTHELNKIF 441 + + CA P F A+ T +L+ F Sbjct: 355 Q----FTRCAVDPKGVFSASDTQDLDHYF 379 >gi|254459074|ref|ZP_05072497.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084345|gb|EDZ61634.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 279 Score = 41.2 bits (95), Expect = 0.33, Method: Compositional matrix adjust. Identities = 30/84 (35%), Positives = 41/84 (48%), Gaps = 1/84 (1%) Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 V K++ K IVLL+DGE+ + AK QGI+I TIA NK + ++A Sbjct: 153 VRSFKHSKAKTKVIVLLSDGEHNSGSVSPKEATELAKEQGIKIYTIAMG-NKGEADEALL 211 Query: 418 FLSNCASPNSFFEANSTHELNKIF 441 S FF A+S EL I+ Sbjct: 212 ETIAKDSNGEFFSASSAKELKNIY 235 >gi|167752252|ref|ZP_02424379.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] gi|167660493|gb|EDS04623.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] Length = 328 Score = 41.2 bits (95), Expect = 0.35, Method: Compositional matrix adjust. Identities = 25/86 (29%), Positives = 37/86 (43%), Gaps = 11/86 (12%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + + TA ++R++ + K I+LLTDGEN + + A+ Sbjct: 166 GTAIGNGLATA-----------INRLRESNAKSKVIILLTDGENNRGEIAPLTAAEIARD 214 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSN 421 QGIR+ TI T F N Sbjct: 215 QGIRVYTIGVGTRGTAPYPTVDFFGN 240 >gi|114778216|ref|ZP_01453088.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] gi|114551463|gb|EAU54018.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] Length = 355 Score = 41.2 bits (95), Expect = 0.38, Method: Compositional matrix adjust. Identities = 34/134 (25%), Positives = 54/134 (40%), Gaps = 28/134 (20%) Query: 329 IDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 +DE +G +TAI DA+ A + SN D K ++ ++LLTDG NT Sbjct: 179 LDEAAVGLAGKATAIGDAIGLAVKRLEQSNRD-----KRIASKEQVLILLTDGVNTAGQL 233 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS----------------- 427 A G+ I TI + + +F + +P++ Sbjct: 234 SAPQAAELAAEHGLTIYTIGIGADAMTVQS--FFGTQRVNPSADLDEKMLTDIATKTGGR 291 Query: 428 FFEANSTHELNKIF 441 +F A+ T EL KI+ Sbjct: 292 YFRAHDTQELQKIY 305 >gi|310820889|ref|YP_003953247.1| hypothetical protein STAUR_3632 [Stigmatella aurantiaca DW4/3-1] gi|309393961|gb|ADO71420.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 763 Score = 40.8 bits (94), Expect = 0.41, Method: Compositional matrix adjust. Identities = 46/181 (25%), Positives = 77/181 (42%), Gaps = 41/181 (22%) Query: 224 IRDEKLSPYMVSCNKSLY--YMLYPGPLDPSLSEE-----HFVDSSSLRHVIKKKHLVRD 276 I D ++ Y ++ N+ L Y+ P P + + D+S H + + RD Sbjct: 404 ISDPRMVVYDLAANRQLVRAYLEEEPPPKPKRMKRTSVRVYVCDASGSMHGARAR--FRD 461 Query: 277 ALASVIRSIKKIDNVNDTVRMGA-------TFFND------RVISDPSFSWGVHKLIRTI 323 A+ I +++N+ R G +FFND RV + SW + KL R Sbjct: 462 AII-----IAELNNLRAKARQGLPFDPLYFSFFNDAPTELARVDTAAGASWQIEKLFR-- 514 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T I+ A+ +A+D+I S+ ++ A+ +VL+TDGE+ D Sbjct: 515 ------QSPAEGQTDISLALISAFDSIRSAQ------GRDPYLARATVVLITDGEDRVDQ 562 Query: 384 E 384 E Sbjct: 563 E 563 >gi|103487755|ref|YP_617316.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] gi|98977832|gb|ABF53983.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] Length = 666 Score = 40.8 bits (94), Expect = 0.43, Method: Compositional matrix adjust. Identities = 41/141 (29%), Positives = 60/141 (42%), Gaps = 21/141 (14%) Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 D WG L T + FA DEN +TA ND + + I+ + + NL + Sbjct: 527 DAGMVWGARLLSPTGL--FA-DEN---ATAPNDRPISRH--IVFMTDGAMAPNMGNLTFQ 578 Query: 369 KYIVLLTDGENTQD-------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 Y L+ T D N +C A+ +GI I ++F V L+N Sbjct: 579 GYEFLMHRVGGTSDSDLRDRHNNRFTQLCRAARQRGITIWVVSFGVGSNDS------LNN 632 Query: 422 CASPNSFFEANSTHELNKIFR 442 CAS FEA++ ELN+ F+ Sbjct: 633 CASSGQAFEADNAAELNEQFQ 653 >gi|84622723|ref|YP_450095.1| hypothetical protein XOO_1066 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84366663|dbj|BAE67821.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 335 Score = 40.8 bits (94), Expect = 0.47, Method: Compositional matrix adjust. Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 11/68 (16%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + + AK++ Sbjct: 183 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLDPLKAAELAKAE 231 Query: 397 GIRIMTIA 404 G+RI TIA Sbjct: 232 GVRIHTIA 239 >gi|302346571|ref|YP_003814869.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302150280|gb|ADK96541.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 318 Score = 40.8 bits (94), Expect = 0.47, Method: Compositional matrix adjust. Identities = 31/98 (31%), Positives = 46/98 (46%), Gaps = 16/98 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------- 408 + V R+K++ K ++LLTDG N + + AK GIR+ TI Sbjct: 180 NSVSRLKDSKAKSKVVILLTDGSNNVGSISPMTAATIAKKFGIRVYTIGLGRETGEDIGA 239 Query: 409 ---KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 KT Q+ A L+N F+ A S EL+KI++D Sbjct: 240 IDYKTLQDIA--VLTN----GEFYRAQSQAELSKIYQD 271 >gi|303235701|ref|ZP_07322308.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484148|gb|EFL47136.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 322 Score = 40.8 bits (94), Expect = 0.49, Method: Compositional matrix adjust. Identities = 39/169 (23%), Positives = 68/169 (40%), Gaps = 34/169 (20%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ T+ G F + +D + + L+ I +++ TAI + A Sbjct: 127 DNIGLTIFAGEAFTQCPMTTDHA---SLLNLLAGIRADLSVNHLIQDGTAIGMGLANA-- 181 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R+K+ E K ++LLTDG N + + + A+ G+R+ TI + Sbjct: 182 ---------VGRLKDVKEGSKVVILLTDGSNNVGDISPLTAASIARKFGVRVYTIGLGTD 232 Query: 409 --------------KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 KT Q+ A + F+ A S EL++I++D Sbjct: 233 GKDIQGRPVGEIDYKTLQDIA------MQTDGEFYRAQSRAELSQIYKD 275 >gi|288802180|ref|ZP_06407620.1| BatA protein [Prevotella melaninogenica D18] gi|288335147|gb|EFC73582.1| BatA protein [Prevotella melaninogenica D18] Length = 318 Score = 40.8 bits (94), Expect = 0.49, Method: Compositional matrix adjust. Identities = 31/98 (31%), Positives = 46/98 (46%), Gaps = 16/98 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------- 408 + V R+K++ K ++LLTDG N + + AK GIR+ TI Sbjct: 180 NSVSRLKDSKAKSKVVILLTDGSNNVGSISPMTAATIAKKFGIRVYTIGLGRETGEDIGA 239 Query: 409 ---KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 KT Q+ A L+N F+ A S EL+KI++D Sbjct: 240 IDYKTLQDIA--VLTN----GEFYRAQSQAELSKIYQD 271 >gi|156408065|ref|XP_001641677.1| predicted protein [Nematostella vectensis] gi|156228817|gb|EDO49614.1| predicted protein [Nematostella vectensis] Length = 1418 Score = 40.4 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 34/111 (30%), Positives = 52/111 (46%), Gaps = 19/111 (17%) Query: 335 GSTAINDAMQTAYDTII---SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T ++DA Q AYD I S ++ H++K + LLTDG+ N + I Sbjct: 89 GMTNMHDAFQAAYDIIFGKYSGHKRPTHQVKTA------VFLLTDGQWNW-NGDPWPIAK 141 Query: 392 KAKSQGIRIMTIAFS--VNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + K +GI I TI + VN L + ASPN++F N + ++ Sbjct: 142 RLKDRGIEIFTIGVTNGVNVNT-------LRSLASPNNYFHYNDFTQFREL 185 >gi|237737388|ref|ZP_04567869.1| BatA protein [Fusobacterium mortiferum ATCC 9817] gi|229421250|gb|EEO36297.1| BatA protein [Fusobacterium mortiferum ATCC 9817] Length = 319 Score = 40.4 bits (93), Expect = 0.60, Method: Compositional matrix adjust. Identities = 29/98 (29%), Positives = 41/98 (41%), Gaps = 14/98 (14%) Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ------ 411 ++R+K++ K I+L+TDGEN + AK GI+I TI + Q Sbjct: 173 LNRLKDSEAKSKVIILMTDGENNSGEMSPMGASEIAKELGIKIYTIGIGAREIQIRVPFG 232 Query: 412 ------QEKARYFLSNCASPNS--FFEANSTHELNKIF 441 E L N AS +F A S E +IF Sbjct: 233 HTTVKNTELDENLLKNIASTTGGEYFRAGSEKEFQEIF 270 >gi|291514853|emb|CBK64063.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 328 Score = 40.0 bits (92), Expect = 0.71, Method: Compositional matrix adjust. Identities = 31/115 (26%), Positives = 45/115 (39%), Gaps = 13/115 (11%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + + TA ++R++ + K I+LLTDG N Q + AK+ Sbjct: 166 GTAIGNGLATA-----------INRLRESDAKSKVIILLTDGVNNQGQIAPMTAAEIAKA 214 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 QGIR+ TI A N N E + ++ K D G F Sbjct: 215 QGIRVYTIGVGTEGMAPYPAIDMFGNLTFVNQKVEID--EKVLKAISDMTGGRYF 267 >gi|95928343|ref|ZP_01311091.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95135614|gb|EAT17265.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 329 Score = 40.0 bits (92), Expect = 0.71, Method: Compositional matrix adjust. Identities = 28/123 (22%), Positives = 49/123 (39%), Gaps = 30/123 (24%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ A V R++ + +AK ++LLTDG + + + + A + Sbjct: 174 TAIGDAIGLA-----------VKRLRKDPQAKNVLILLTDGASNSGSLDPLKAAKLAAQR 222 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP-----------------NSFFEANSTHELNK 439 G+++ TI E +F +P +F A T EL + Sbjct: 223 GLKVYTIGIGAEAV--EVGSFFFKRTVNPSLDLDEKTLKAIAETTGGRYFRARDTEELAQ 280 Query: 440 IFR 442 I++ Sbjct: 281 IYQ 283 >gi|297560911|ref|YP_003679885.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296845359|gb|ADH67379.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 315 Score = 40.0 bits (92), Expect = 0.72, Method: Compositional matrix adjust. Identities = 39/134 (29%), Positives = 63/134 (47%), Gaps = 19/134 (14%) Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +I+ ++G TAI + + + ++I S +ED + IVLL+DGENT + Sbjct: 153 SIENLQLGPGTAIGEGVFASLESISSFDED-----ADVDPPPSAIVLLSDGENTSGRDIS 207 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCASPNS--FFEANSTH 435 A+ A+ Q + + TIAF E Y L AS F+EA S Sbjct: 208 QAVAMAAE-QEVPVSTIAFGTGAAMIEIDGYQVPADIDKEALRGLASDTGGHFYEAESET 266 Query: 436 ELNKIFRDRIGNEI 449 EL++++ D IG+ + Sbjct: 267 ELDEVYED-IGSSL 279 >gi|254481548|ref|ZP_05094792.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038176|gb|EEB78839.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 345 Score = 40.0 bits (92), Expect = 0.72, Method: Compositional matrix adjust. Identities = 49/193 (25%), Positives = 79/193 (40%), Gaps = 36/193 (18%) Query: 274 VRDALASVIRSIKKIDN----VNDTVRMGATFF--NDRVISDPSFSWGVHKLIRTIVKTF 327 V D L S IR++K++ + + R+G F N V S SF VK F Sbjct: 110 VSDELVSRIRAVKQVGSRFIEQREGDRLGLILFGSNAYVQSPLSFD-------TATVKRF 162 Query: 328 AIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + E ++G TAI DA+ A V R+K + ++LL+DG++T + Sbjct: 163 LL-EAQIGFAGQDTAIGDAIGLA-----------VKRLKERPAENRVLILLSDGKDTASS 210 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--ARYFLSNCASPNSFFEANSTHELNKI- 440 + + A GIRI TI + F + +P++ + ++ KI Sbjct: 211 VQPLNAAKLAADLGIRIYTIGIGADSLTMPGLFGSSFGARQVNPSAELDEAGLQQIAKIT 270 Query: 441 ----FRDRIGNEI 449 FR R E+ Sbjct: 271 DGKYFRARNPEEL 283 >gi|149248918|ref|XP_001528813.1| hypothetical protein LELG_05791 [Lodderomyces elongisporus NRRL YB-4239] gi|146453354|gb|EDK47610.1| hypothetical protein LELG_05791 [Lodderomyces elongisporus NRRL YB-4239] Length = 1326 Score = 40.0 bits (92), Expect = 0.76, Method: Compositional matrix adjust. Identities = 29/110 (26%), Positives = 48/110 (43%), Gaps = 4/110 (3%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK--KYIVLLTDGENTQDNEEGI 387 D +G + Q + I N D+ ++ ++ K K+ V E+ Q N+ Sbjct: 308 DATPVGQILLGLPFQNEHKLSIGFNNDDKRELRFKVDGKVLKFPVQQVGRESFQVNQYPE 367 Query: 388 AICNKAKS--QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 CN+A S Q ++T +F+ +K E+ RYF+S CA F H Sbjct: 368 IQCNEATSANQLEELLTTSFNGSKATAEEMRYFISKCADVTDVFYVKGGH 417 >gi|325919992|ref|ZP_08181973.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] gi|325549526|gb|EGD20399.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] Length = 142 Score = 40.0 bits (92), Expect = 0.78, Method: Composition-based stats. Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 11/69 (15%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ + V R++ + ++ +VLLTDG NT + AK++ Sbjct: 73 TAIGDAIALS-----------VKRLREQKQGQRVVVLLTDGVNTAGVLNPLKAAELAKAE 121 Query: 397 GIRIMTIAF 405 G+R+ TIAF Sbjct: 122 GVRVHTIAF 130 >gi|223558081|gb|ACM91085.1| aerotolerance protein BatA [uncultured bacterium Rlip1] Length = 332 Score = 40.0 bits (92), Expect = 0.78, Method: Compositional matrix adjust. Identities = 43/168 (25%), Positives = 67/168 (39%), Gaps = 35/168 (20%) Query: 296 RMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 RMG F+ + + GV + +K ID+ TAI D + TA Sbjct: 132 RMGLVIFSGETFTQVPLTTDHGVMLNMLAEMKNGLIDDG----TAIGDGLATA------- 180 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT--- 410 + R+K++ K ++LLTDG N + + AK GIR+ TI T Sbjct: 181 ----ISRLKDSEAISKVVILLTDGMNNAGSVDPYTAAEIAKLYGIRVYTIGVGSYGTAPY 236 Query: 411 ----------QQEKARY---FLSNCAS--PNSFFEANSTHELNKIFRD 443 QQ K L++ AS +F A S +L++I+ + Sbjct: 237 PVQTPFGTQIQQMKVEIDEKLLASVASMTGGKYFRATSNQKLDEIYEE 284 >gi|308050346|ref|YP_003913912.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] gi|307632536|gb|ADN76838.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] Length = 322 Score = 40.0 bits (92), Expect = 0.82, Method: Compositional matrix adjust. Identities = 49/200 (24%), Positives = 77/200 (38%), Gaps = 47/200 (23%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 V+ + +VR L+ I D R+G F D+ ++ R V Sbjct: 106 RVVDRFTMVRHVLSDFIER-------RDGDRLGLILFADQAYLQAPLTFD-----RFAVA 153 Query: 326 TFAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 F +DE +G TAI DA+ V R + ++ + +VLLTDGEN Sbjct: 154 RF-LDEAVLGLVGQQTAIGDAIALG-----------VKRFNDLEQSSRVLVLLTDGENNA 201 Query: 382 DNEEGIAICNKAKSQGIRIMTIAF--------------SVNKT----QQEKARYFLSNCA 423 + A+ G+++ TI +VN + Q EK+ LS Sbjct: 202 GRFTPAQAVSLARQSGVKLYTIGIGSAEIRRRGLLGTRTVNPSSDLDQAEKSFIQLSEST 261 Query: 424 SPNSFFEANSTHELNKIFRD 443 +F A ST EL I+++ Sbjct: 262 G-GRYFRARSTEELESIYQE 280 >gi|149236387|ref|XP_001524071.1| hypothetical protein LELG_04884 [Lodderomyces elongisporus NRRL YB-4239] gi|146452447|gb|EDK46703.1| hypothetical protein LELG_04884 [Lodderomyces elongisporus NRRL YB-4239] Length = 1345 Score = 39.7 bits (91), Expect = 1.00, Method: Compositional matrix adjust. Identities = 29/110 (26%), Positives = 48/110 (43%), Gaps = 4/110 (3%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK--KYIVLLTDGENTQDNEEGI 387 D +G + Q + I N D+ ++ ++ K K+ V E+ Q N+ Sbjct: 327 DATPVGQILLGLPFQNEHKLSIGFNNDDKRELRFKVDGKVLKFPVQQVGRESFQVNQYPE 386 Query: 388 AICNKAKS--QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 CN+A S Q ++T +F+ +K E+ RYF+S CA F H Sbjct: 387 IQCNEATSANQLEELLTTSFNGSKATAEEMRYFISKCADVTDVFYVKGGH 436 >gi|90021389|ref|YP_527216.1| BatB protein [Saccharophagus degradans 2-40] gi|89950989|gb|ABD81004.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 341 Score = 39.7 bits (91), Expect = 1.0, Method: Compositional matrix adjust. Identities = 33/145 (22%), Positives = 61/145 (42%), Gaps = 36/145 (24%) Query: 321 RTIVKTFAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 RT VK + E+++G +TAI DA+ + + R+++ + ++LLTD Sbjct: 156 RTTVKQLLV-ESQIGFAGPNTAIGDAIGLS-----------IKRLRDRPAENRVVILLTD 203 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------------KTQQEKARY 417 G+NT + AK G+++ TI N + E Sbjct: 204 GQNTAGEVSPRQAADLAKQSGVKVYTIGVGANEMIVSDGFFGNFQRKINPSRDLDEDTLT 263 Query: 418 FLSNCASPNSFFEANSTHELNKIFR 442 +++ +F A+S ELN+I++ Sbjct: 264 YIAETTG-GRYFRAHSPQELNQIYQ 287 >gi|21675084|ref|NP_663149.1| hypothetical protein CT2278 [Chlorobium tepidum TLS] gi|21648324|gb|AAM73491.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 332 Score = 39.7 bits (91), Expect = 1.0, Method: Compositional matrix adjust. Identities = 40/138 (28%), Positives = 56/138 (40%), Gaps = 22/138 (15%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H+++ + +T A + TAI A+ TA +R+K + +K +VL+TD Sbjct: 159 HEVLGRLAETVAPGFFDEPGTAIGTAILTA-----------TNRLKASSSKEKALVLITD 207 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTI-------AF--SVNKTQQEKARYFLSNCA--SP 425 GEN A + GIRI T+ AF + N K R L A S Sbjct: 208 GENNAGEVTPETAARLAANYGIRIYTVFAGKEARAFENTSNTALNRKGRSELETVARISG 267 Query: 426 NSFFEANSTHELNKIFRD 443 F A L K FRD Sbjct: 268 GRMFSAGDVFGLMKSFRD 285 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 39.7 bits (91), Expect = 1.1, Method: Compositional matrix adjust. Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 16/91 (17%) Query: 363 NNLEAKKYIVLLTDGENTQD---------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + + KKY++++TDG NTQ+ N C AK+QGI + + ++ Sbjct: 423 GDTDVKKYMIIVTDGANTQNRWSTSNSAINARTALACTAAKAQGITLFVVRV------ED 476 Query: 414 KARYFLSNCASPNS-FFEANSTHELNKIFRD 443 L CAS +S +++ + +L K +D Sbjct: 477 GDSSLLEACASQSSYYYDLSQASDLTKTMQD 507 >gi|323493494|ref|ZP_08098616.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] gi|323312317|gb|EGA65459.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] Length = 393 Score = 39.7 bits (91), Expect = 1.1, Method: Compositional matrix adjust. Identities = 30/101 (29%), Positives = 48/101 (47%), Gaps = 15/101 (14%) Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEE-GIAICNKAKSQGIRI 400 Q+A+ + N+D V R +VL+TDG +N+ +E +C +AK GI + Sbjct: 288 QSAWRVGPNRNQDNVQRK---------LVLMTDGMDDNSHLDELINAGLCTRAKDLGIEL 338 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I F V + E+ F S + F AN+T +L+ F Sbjct: 339 NFIGFGVQSWRLEQ---FTRCAGSAGAVFSANNTQDLDDYF 376 >gi|239995770|ref|ZP_04716294.1| von Willebrand factor, type A [Alteromonas macleodii ATCC 27126] Length = 358 Score = 39.7 bits (91), Expect = 1.1, Method: Compositional matrix adjust. Identities = 33/121 (27%), Positives = 49/121 (40%), Gaps = 26/121 (21%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI DA+ A V R E+ ++LLTDG+NT N A S+ Sbjct: 172 TAIGDAIGLA-----------VKRFDEREESNNVLILLTDGQNTAGNITPEQAKELAISK 220 Query: 397 GIRIMTIAFSVNK-------------TQQEKARYFLSNCASPNS--FFEANSTHELNKIF 441 G+++ TI +K QE L+N A+ +F A + EL I+ Sbjct: 221 GVKVYTIGVGADKMLIQSFFGSRQINPSQELDEGMLTNIATSTGGQYFRARNAQELQAIY 280 Query: 442 R 442 + Sbjct: 281 Q 281 >gi|306821351|ref|ZP_07454960.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304550638|gb|EFM38620.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 467 Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust. Identities = 34/111 (30%), Positives = 50/111 (45%), Gaps = 21/111 (18%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK--KYIVLLTDGENTQDNEEGIAICNK 392 G T I A+ AYD + + NN + K K+++LLTDG+ E I Sbjct: 98 GGTNIGRAVSIAYDLF--------NNLDNNRKEKYPKFLILLTDGDGDYSEEYTIL---- 145 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 AK GI+I TI +++ L + A + +F A +LNKIF Sbjct: 146 AKKAGIKIYTIGLGNGVSEK-----LLKDIAKGTDGEYFHAKDASKLNKIF 191 >gi|170751925|ref|YP_001758185.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] gi|170658447|gb|ACB27502.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] Length = 568 Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust. Identities = 39/141 (27%), Positives = 60/141 (42%), Gaps = 25/141 (17%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAID--ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 +++D + SWG + T FA N G+T N + TAY ++ + + + Sbjct: 436 LMTDGTNSWGTNSSAPTGSLYFAAGYFRNANGTTP-NPRLTTAYQNTNIADGNTARKALD 494 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA-RYFLSNC 422 L A+ C K+ I I TI FSV + A + L NC Sbjct: 495 ALTAEA--------------------CANTKAVNISIYTIGFSVPTDPIDSAGQTLLRNC 534 Query: 423 AS-PNSFFEANSTHELNKIFR 442 AS P+ F+ ANS+ +L K F+ Sbjct: 535 ASSPDQFYLANSSDDLIKAFK 555 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust. Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 16/83 (19%) Query: 369 KYIVLLTDGENTQD---------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 K IV LTDG+NT++ N ICN K+ GI++ ++ E + Sbjct: 346 KVIVFLTDGDNTRNRWSNNSNTINARTTLICNNIKAAGIKVYSVRV------IEGNATLI 399 Query: 420 SNCAS-PNSFFEANSTHELNKIF 441 NCA+ P ++ + EL +F Sbjct: 400 RNCATEPGMYYSVTTASELTSVF 422 >gi|307565272|ref|ZP_07627765.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307345941|gb|EFN91285.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 318 Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust. Identities = 46/183 (25%), Positives = 73/183 (39%), Gaps = 22/183 (12%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT--IV 324 VI + V +AS S + DN+ T+ G F + +D + + +RT +V Sbjct: 105 VIPNRLEVAKEVASDFISGRPNDNIGLTIFAGEAFTQCPLTTDHASLINLLNSVRTDLVV 164 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K D +G IN V R+K++ K ++LLTDG N + Sbjct: 165 KGLIQDGTAIGMGLIN----------------AVGRLKSSKAKSKVVILLTDGSNNVGSI 208 Query: 385 EGIAICNKAKSQGIRIMTIAFSV--NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + AK IR+ TI N + L A + F+ A S EL++I Sbjct: 209 SPMTAAEIAKKFNIRVYTIGLGTEQNNGYSDIDYTTLRQIANVTNGKFYSAQSQTELSQI 268 Query: 441 FRD 443 ++D Sbjct: 269 YKD 271 >gi|326795817|ref|YP_004313637.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546581|gb|ADZ91801.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 337 Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust. Identities = 33/134 (24%), Positives = 54/134 (40%), Gaps = 30/134 (22%) Query: 329 IDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 I E ++G TAI DA+ + ++ KK ++L+TDG NT Sbjct: 165 IQETQIGFAGEKTAIGDAIGLG-----------IKQLSELPSDKKVLILMTDGANTAGRV 213 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQ-------------QEKARYFLSNCAS--PNSFF 429 + N A QG+ I TI ++ + ++ L N AS ++ Sbjct: 214 SPLQAANFAAEQGVTIHTIGIGADEMEVQGFFGPQTVNPSEDLDEALLENVASLTGGKYY 273 Query: 430 EANSTHELNKIFRD 443 A ST +L +I+ D Sbjct: 274 RAKSTSDLEEIYGD 287 >gi|313885991|ref|ZP_07819729.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924521|gb|EFR35292.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 326 Score = 39.3 bits (90), Expect = 1.4, Method: Compositional matrix adjust. Identities = 39/135 (28%), Positives = 60/135 (44%), Gaps = 20/135 (14%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 RD +AS + + + DN+ V G +F + D H +I+ +++T I + E Sbjct: 112 ARD-VASEMIAARPNDNIGLVVFAGESFTLCPLTVD-------HNVIQQMLETTEIGQLE 163 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G TAI + TA +T+ S+ K I+LLTDG N + A Sbjct: 164 DG-TAIGLGLATAINTLRGSDNK-----------SKVIILLTDGSNNAGDITPSMAAELA 211 Query: 394 KSQGIRIMTIAFSVN 408 + GIRI T+A N Sbjct: 212 QQYGIRIYTVAAGTN 226 >gi|167763116|ref|ZP_02435243.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] gi|167699456|gb|EDS16035.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] Length = 327 Score = 39.3 bits (90), Expect = 1.4, Method: Compositional matrix adjust. Identities = 42/152 (27%), Positives = 63/152 (41%), Gaps = 22/152 (14%) Query: 259 VDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V +S L +K L +D A I + DNV T+ G +F + D Sbjct: 96 VSTSMLAEDLKPNRLEAAKDVAAEFING-RPNDNVGITLFAGESFTQCPLTVD------- 147 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ ++K E G TA+ + A V R+K++ K I+LLTD Sbjct: 148 HAVLLNLIKDVKCGLIEDG-TAVGMGIANA-----------VTRLKDSKAKSKVIILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 G N + + + AKS GIR+ TI N Sbjct: 196 GTNNRGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|284040938|ref|YP_003390868.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283820231|gb|ADB42069.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 359 Score = 39.3 bits (90), Expect = 1.5, Method: Compositional matrix adjust. Identities = 34/117 (29%), Positives = 51/117 (43%), Gaps = 11/117 (9%) Query: 336 STAINDAMQTAYDTI-----ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 TAI DA+ + + SS+ + + E K I+LL+DG+NT N + I Sbjct: 192 GTAIGDALARCINRMRDRPAASSDTTQAKTEQWKSERSKVIILLSDGDNTAGNLDPITAA 251 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARY----FLSNCAS--PNSFFEANSTHELNKIF 441 + AK+ I+I TIA +A L A+ SFF A + L +F Sbjct: 252 SLAKAFNIKIYTIAVGQPVASASEASTVDEGILKKIATIGKGSFFRAVDSGRLKTVF 308 >gi|197099226|ref|NP_001126843.1| inter-alpha-trypsin inhibitor heavy chain H4 [Pongo abelii] gi|55732844|emb|CAH93116.1| hypothetical protein [Pongo abelii] Length = 896 Score = 39.3 bits (90), Expect = 1.5, Method: Compositional matrix adjust. Identities = 35/113 (30%), Positives = 56/113 (49%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 KK R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 KKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPDG--SVSLIILLTDGDPT 389 >gi|320158179|ref|YP_004190557.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933491|gb|ADV88354.1| BatA [Vibrio vulnificus MO6-24/O] Length = 323 Score = 39.3 bits (90), Expect = 1.5, Method: Compositional matrix adjust. Identities = 28/123 (22%), Positives = 48/123 (39%), Gaps = 30/123 (24%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI D + A T + S + + ++LL+DG NT + I N AK Sbjct: 170 TAIGDGLALATKTFVDSEAPQ-----------RVVILLSDGSNTAGTLDPIEAANIAKKY 218 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSN-----------------CASPNSFFEANSTHELNK 439 G++I TI + + E ++F++ + +F A EL Sbjct: 219 GVKIYTIG--IGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRARDAQELQA 276 Query: 440 IFR 442 I++ Sbjct: 277 IYQ 279 >gi|332534652|ref|ZP_08410484.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] gi|332035932|gb|EGI72413.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] Length = 328 Score = 38.9 bits (89), Expect = 1.6, Method: Compositional matrix adjust. Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 21/113 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 R+G F D + + ++T+ K + E ++G +TAI DA+ + Sbjct: 132 RLGLILFGDTAFLQTPLT----RDVKTVSKMLS--EAQIGLVGRATAIGDALGLS----- 180 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 V R N E+ + +VLLTDG+NT N A+ +GI++ TI Sbjct: 181 ------VKRFANKDESNRIVVLLTDGQNTAGNLNPEDALLLAREEGIKVYTIG 227 >gi|163801617|ref|ZP_02195515.1| hypothetical protein 1103602000597_AND4_09192 [Vibrio sp. AND4] gi|159174534|gb|EDP59336.1| hypothetical protein AND4_09192 [Vibrio sp. AND4] Length = 367 Score = 38.9 bits (89), Expect = 1.6, Method: Compositional matrix adjust. Identities = 52/209 (24%), Positives = 88/209 (42%), Gaps = 21/209 (10%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L S++E+ F +S I + + ++ L+ +++ +K D R+G F D Sbjct: 114 LSGSMAEQDF--TSKAGENISRLNAAKEVLSDFVKT-RKGD------RLGLILFGDAAFV 164 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F+ + + +T + ST + DAM A I E R+ + Sbjct: 165 QTPFTPDQKVWLELLNQTDVAMAGQ--STHLGDAMGLA----IKVFEQSKSRIGVEENKE 218 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS----NCAS 424 K ++LTDG +T E I AK++G+RI IA +T E A + S Sbjct: 219 KVAIVLTDGNDTGSFVEPIEAAKVAKAKGVRIHVIAMGDPQTLGEAALDMKTIRRIAKES 278 Query: 425 PNSFFEANSTHELNKIFRD--RIGNEIFE 451 FEA + EL K + D R+ +++E Sbjct: 279 GGKAFEAMNRDELAKAYDDIGRLEPQLYE 307 >gi|37676036|ref|NP_936432.1| hypothetical protein VVA0376 [Vibrio vulnificus YJ016] gi|37200576|dbj|BAC96402.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 323 Score = 38.9 bits (89), Expect = 1.6, Method: Compositional matrix adjust. Identities = 28/123 (22%), Positives = 48/123 (39%), Gaps = 30/123 (24%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI D + A T + S + + ++LL+DG NT + I N AK Sbjct: 170 TAIGDGLALATKTFVDSEAPQ-----------RVVILLSDGSNTAGTLDPIEAANIAKKY 218 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSN-----------------CASPNSFFEANSTHELNK 439 G++I TI + + E ++F++ + +F A EL Sbjct: 219 GVKIYTIG--IGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRARDAQELQT 276 Query: 440 IFR 442 I++ Sbjct: 277 IYQ 279 >gi|88858061|ref|ZP_01132703.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] gi|88819678|gb|EAR29491.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] Length = 328 Score = 38.9 bits (89), Expect = 1.7, Method: Compositional matrix adjust. Identities = 32/116 (27%), Positives = 54/116 (46%), Gaps = 27/116 (23%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 R+G F D + + + T+ K ++E ++G +TAI DA+ A Sbjct: 132 RLGLILFGDTAFLQTPLT----RDLNTVSKM--LEEAQIGLVGRATAIGDALGLA----- 180 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIA 404 V R ++ + +VLLTDGENT N EE + + A+ +GI++ T+ Sbjct: 181 ------VKRFSQKQDSNRILVLLTDGENTAGNLAPEEALLL---AREEGIKVYTVG 227 >gi|312793553|ref|YP_004026476.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180693|gb|ADQ40863.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 726 Score = 38.9 bits (89), Expect = 1.7, Method: Compositional matrix adjust. Identities = 35/118 (29%), Positives = 56/118 (47%), Gaps = 21/118 (17%) Query: 328 AIDE-NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 AID + G T I + ++ A +IS + D+ R+K I+LLTDGE DN Sbjct: 100 AIDRIDSWGGTNIAEGIRIANHQLISQSSDD--RIK-------VIILLTDGEGYYDNN-- 148 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFR 442 + +AK+ GI I TI + + L N A+ +F +S +L ++F+ Sbjct: 149 --LTTEAKNNGITIYTIGLGTSVDEN-----LLRNIATQTGGMYFPVSSASQLPQVFK 199 >gi|224370036|ref|YP_002604200.1| hypothetical protein HRM2_29490 [Desulfobacterium autotrophicum HRM2] gi|223692753|gb|ACN16036.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 598 Score = 38.9 bits (89), Expect = 1.7, Method: Compositional matrix adjust. Identities = 32/111 (28%), Positives = 47/111 (42%), Gaps = 14/111 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R G F R I + H + D +G T + A++TA + E Sbjct: 129 RAGLVAFAGRAILQCPLTLD-HSAFNLFLNALEPDYLPVGGTDLGGAIETALNGF----E 183 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 EV E++K I+L+TDGENT + I + KA QG++I I Sbjct: 184 KEV-------ESEKAIILITDGENTTGDS--IEMAKKAADQGVKIFCIGVG 225 >gi|332299342|ref|YP_004441263.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] gi|332176405|gb|AEE12095.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] Length = 326 Score = 38.9 bits (89), Expect = 1.7, Method: Compositional matrix adjust. Identities = 39/135 (28%), Positives = 60/135 (44%), Gaps = 20/135 (14%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 RD +AS + + + DN+ V G +F + D H +I+ +++T I + E Sbjct: 112 ARD-VASEMIAARPNDNIGLVVFAGESFTLCPLTVD-------HNVIQQMLETTEIGQLE 163 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G TAI + TA +T+ S+ K I+LLTDG N + A Sbjct: 164 DG-TAIGLGLATAINTLRGSDNK-----------SKVIILLTDGSNNAGDITPSMAAELA 211 Query: 394 KSQGIRIMTIAFSVN 408 + GIRI T+A N Sbjct: 212 QQYGIRIYTVAAGTN 226 >gi|329963581|ref|ZP_08301060.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528570|gb|EGF55541.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 327 Score = 38.9 bits (89), Expect = 1.8, Method: Compositional matrix adjust. Identities = 43/152 (28%), Positives = 63/152 (41%), Gaps = 22/152 (14%) Query: 259 VDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V +S L +K L +D A I + DN+ T+ G +F + D Sbjct: 96 VSTSMLAEDLKPNRLEAAKDVAAEFING-RPNDNIGITLFAGESFTQCPLTVD------- 147 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ ++K E G TAI + A V R+K++ K I+LLTD Sbjct: 148 HAVLLNLLKDMKCGLIEDG-TAIGMGIANA-----------VTRLKDSKAKSKVIILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 G N + + + AKS GIRI TI N Sbjct: 196 GVNNKGDISPLTAAEIAKSFGIRIYTIGVGTN 227 >gi|53713710|ref|YP_099702.1| hypothetical protein BF2419 [Bacteroides fragilis YCH46] gi|60681981|ref|YP_212125.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|253565658|ref|ZP_04843113.1| BatA [Bacteroides sp. 3_2_5] gi|265764034|ref|ZP_06092602.1| BatA [Bacteroides sp. 2_1_16] gi|4838138|gb|AAD30858.1|AF116251_1 BatA [Bacteroides fragilis] gi|52216575|dbj|BAD49168.1| conserved hypothetical protein BatA [Bacteroides fragilis YCH46] gi|60493415|emb|CAH08201.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|251945937|gb|EES86344.1| BatA [Bacteroides sp. 3_2_5] gi|263256642|gb|EEZ27988.1| BatA [Bacteroides sp. 2_1_16] gi|301163419|emb|CBW22970.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 327 Score = 38.9 bits (89), Expect = 1.8, Method: Compositional matrix adjust. Identities = 41/152 (26%), Positives = 63/152 (41%), Gaps = 22/152 (14%) Query: 259 VDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V +S L +K L +D A I + DN+ T+ G +F + D Sbjct: 96 VSTSMLAEDLKPNRLEAAKDVAAEFING-RPNDNIGITLFAGESFTQCPLTVD------- 147 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ + + D E G TA+ + A V R+K++ K I+LLTD Sbjct: 148 HAVLLNLFQGIQCDIIEDG-TAVGMGIANA-----------VTRLKDSKAKSKVIILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 G N + + + AKS GIR+ TI N Sbjct: 196 GTNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|150005795|ref|YP_001300539.1| hypothetical protein BVU_3288 [Bacteroides vulgatus ATCC 8482] gi|149934219|gb|ABR40917.1| conserved hypothetical protein BatA [Bacteroides vulgatus ATCC 8482] Length = 332 Score = 38.9 bits (89), Expect = 1.9, Method: Compositional matrix adjust. Identities = 21/55 (38%), Positives = 29/55 (52%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + V R+K++ K I+LLTDG N + + + AK GIRI TI N T Sbjct: 180 NAVTRLKDSKAKSKVIILLTDGSNNRGDISPLTAAEIAKQFGIRIYTIGVGTNGT 234 >gi|254882023|ref|ZP_05254733.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294776174|ref|ZP_06741663.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|319640969|ref|ZP_07995677.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] gi|254834816|gb|EET15125.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294449997|gb|EFG18508.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|317387414|gb|EFV68285.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] Length = 332 Score = 38.9 bits (89), Expect = 1.9, Method: Compositional matrix adjust. Identities = 21/55 (38%), Positives = 29/55 (52%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + V R+K++ K I+LLTDG N + + + AK GIRI TI N T Sbjct: 180 NAVTRLKDSKAKSKVIILLTDGSNNRGDISPLTAAEIAKQFGIRIYTIGVGTNGT 234 >gi|27367909|ref|NP_763436.1| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|27359482|gb|AAO08426.1| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 323 Score = 38.9 bits (89), Expect = 1.9, Method: Compositional matrix adjust. Identities = 28/123 (22%), Positives = 48/123 (39%), Gaps = 30/123 (24%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI D + A T + S + + ++LL+DG NT + I N AK Sbjct: 170 TAIGDGLALATKTFVDSEAPQ-----------RVVILLSDGSNTAGTLDPIEAANIAKKY 218 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSN-----------------CASPNSFFEANSTHELNK 439 G++I TI + + E ++F++ + +F A EL Sbjct: 219 GVKIYTIG--IGAGEMEVKQFFMTRKVNTSADLDEKTLTKVATMTGGQYFRARDAQELQT 276 Query: 440 IFR 442 I++ Sbjct: 277 IYQ 279 >gi|329954838|ref|ZP_08295855.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526942|gb|EGF53953.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 327 Score = 38.9 bits (89), Expect = 2.0, Method: Compositional matrix adjust. Identities = 42/152 (27%), Positives = 62/152 (40%), Gaps = 22/152 (14%) Query: 259 VDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V +S L +K L +D A I + DNV T+ G +F + D Sbjct: 96 VSTSMLAEDLKPNRLEAAKDVAAEFING-RPNDNVGITLFAGESFTQCPLTVD------- 147 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ ++K E G TA+ + A V R+K++ K I+LLTD Sbjct: 148 HAVLLNLIKDVKCGLIEDG-TAVGMGIANA-----------VTRLKDSKAKSKVIILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 G N + + AKS GIR+ TI N Sbjct: 196 GTNNRGEISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 38.9 bits (89), Expect = 2.0, Method: Compositional matrix adjust. Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 16/83 (19%) Query: 369 KYIVLLTDGENTQD---------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 KY++++TDGENTQ+ N + C AK GI + TI E L Sbjct: 480 KYMIVITDGENTQNRWSTSASTINARTLLACQAAKDLGITVYTIRV------MEGNSDML 533 Query: 420 SNCAS-PNSFFEANSTHELNKIF 441 +CAS P F++ ++ +L Sbjct: 534 KSCASRPEYFYDVTASSQLTSTL 556 >gi|189501234|ref|YP_001960704.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496675|gb|ACE05223.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 331 Score = 38.9 bits (89), Expect = 2.0, Method: Compositional matrix adjust. Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 11/88 (12%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H+L+ +V+ ++D TAI A+ ++ +N R++ ++ ++ ++LLTD Sbjct: 156 HRLLGMLVRQVSVDAISDKGTAIGSAI------LVGTN-----RLRASVSKERVLLLLTD 204 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIA 404 GE+ + A+S+GIRI I Sbjct: 205 GEHNSGEVGPVTASEIAQSEGIRIYVIG 232 >gi|299140484|ref|ZP_07033622.1| BatA protein [Prevotella oris C735] gi|298577450|gb|EFI49318.1| BatA protein [Prevotella oris C735] Length = 332 Score = 38.5 bits (88), Expect = 2.0, Method: Compositional matrix adjust. Identities = 19/54 (35%), Positives = 29/54 (53%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + V R+K++ K ++LLTDG N + + A+S GIR+ TI NK Sbjct: 180 NAVSRLKDSKAKSKVVILLTDGSNNMGDISPMTSAQIARSFGIRVYTIGIGTNK 233 >gi|212693197|ref|ZP_03301325.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|237709939|ref|ZP_04540420.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725394|ref|ZP_04555875.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753591|ref|ZP_06088946.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664302|gb|EEB24874.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|229436081|gb|EEO46158.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456032|gb|EEO61753.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235305|gb|EEZ20829.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 332 Score = 38.5 bits (88), Expect = 2.0, Method: Compositional matrix adjust. Identities = 21/55 (38%), Positives = 29/55 (52%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + V R+K++ K I+LLTDG N + + + AK GIRI TI N T Sbjct: 180 NAVTRLKDSKAKSKVIILLTDGSNNRGDISPLTAAEIAKQFGIRIYTIGVGTNGT 234 >gi|288800165|ref|ZP_06405624.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333413|gb|EFC71892.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] Length = 323 Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust. Identities = 31/120 (25%), Positives = 51/120 (42%), Gaps = 14/120 (11%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ T+ G F + +D + + + +RT + + + + TA+ + A Sbjct: 118 DNIGLTIFAGEAFTQCPMTTDHASLLNLLQGVRTDIASRGLIAD---GTAVGMGLANA-- 172 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R+K + K I+LLTDG N + + AKS GIR+ TI N Sbjct: 173 ---------VSRLKESKAKSKVIILLTDGSNNMGDISPLTAAQIAKSLGIRVYTIGVGTN 223 >gi|255009407|ref|ZP_05281533.1| aerotolerance-related membrane protein [Bacteroides fragilis 3_1_12] gi|313147166|ref|ZP_07809359.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] gi|313135933|gb|EFR53293.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] Length = 327 Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust. Identities = 41/152 (26%), Positives = 63/152 (41%), Gaps = 22/152 (14%) Query: 259 VDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V +S L +K L +D A I + DN+ T+ G +F + D Sbjct: 96 VSTSMLAEDLKPNRLEAAKDVAAEFING-RPNDNIGITLFAGESFTQCPLTVD------- 147 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ + + D E G TA+ + A V R+K++ K I+LLTD Sbjct: 148 HAVLLNLFQGIKCDIIEDG-TAVGMGIANA-----------VTRLKDSKAKSKVIILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 G N + + + AKS GIR+ TI N Sbjct: 196 GTNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|281423276|ref|ZP_06254189.1| BatA protein [Prevotella oris F0302] gi|281402612|gb|EFB33443.1| BatA protein [Prevotella oris F0302] Length = 332 Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust. Identities = 19/54 (35%), Positives = 29/54 (53%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + V R+K++ K ++LLTDG N + + A+S GIR+ TI NK Sbjct: 180 NAVSRLKDSKAKSKVVILLTDGSNNMGDISPMTSAQIARSFGIRVYTIGIGTNK 233 >gi|270296687|ref|ZP_06202886.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270272674|gb|EFA18537.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 327 Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust. Identities = 42/152 (27%), Positives = 63/152 (41%), Gaps = 22/152 (14%) Query: 259 VDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V +S L +K L +D A I + DN+ T+ G +F + D Sbjct: 96 VSTSMLAEDLKPNRLEAAKDVAAEFING-RPNDNIGITLFAGESFTQCPLTVD------- 147 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ ++K E G TAI + A V R+K++ K I+LLTD Sbjct: 148 HAVLLNLLKDMKCGLIEDG-TAIGMGIANA-----------VTRLKDSKAKSKVIILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 G N + + + AKS GIR+ TI N Sbjct: 196 GVNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|218131126|ref|ZP_03459930.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|317476996|ref|ZP_07936238.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986646|gb|EEC52980.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|316906789|gb|EFV28501.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 327 Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust. Identities = 41/152 (26%), Positives = 63/152 (41%), Gaps = 22/152 (14%) Query: 259 VDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V +S L +K L +D A I + DN+ T+ G +F + D Sbjct: 96 VSTSMLAEDLKPNRLEAAKDVAAEFING-RPNDNIGITLFAGESFTQCPLTVD------- 147 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ ++K E G TA+ + A V R+K++ K I+LLTD Sbjct: 148 HAVLLNLIKDVKCGLIEDG-TAVGMGIANA-----------VTRLKDSKAKSKVIILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 G N + + + AKS GIR+ TI N Sbjct: 196 GTNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|282877523|ref|ZP_06286341.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300347|gb|EFA92698.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 332 Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust. Identities = 41/173 (23%), Positives = 72/173 (41%), Gaps = 32/173 (18%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ T+ G +F + +D + + + +RT + + + TA+ + A Sbjct: 127 DNIGLTIFAGESFTQCPMTTDHASLLNLLRNVRTDIAARGLISD---GTAVGMGLANA-- 181 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V R+K++ K ++L+TDG N + + A+S GIR+ TI N Sbjct: 182 ---------VSRLKDSKAKSKVVILITDGSNNMGDISPMTSAQIAQSLGIRVYTIGVGTN 232 Query: 409 K----------TQQ------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 K T Q E LS+ A + +F+ A + EL +I+ D Sbjct: 233 KVAPYPMNVGGTTQYVNIPVEIDSKTLSDIAAVTEGNFYRATNNKELKQIYND 285 >gi|218458490|ref|ZP_03498581.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 220 Score = 38.5 bits (88), Expect = 2.2, Method: Compositional matrix adjust. Identities = 45/165 (27%), Positives = 74/165 (44%), Gaps = 23/165 (13%) Query: 29 LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQ 88 + PV+LG GM V V + L++AA +A + + L + S Sbjct: 1 MAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGKIQTS------------ 48 Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM----NPRKSAYQVVLSSRYDLLLNPL 144 + E Y RNF N+ V DI T+V + + + ++YQV +S YDL +NPL Sbjct: 49 EAEAY-ARNFVAGQMANYLQSGV-DIKGGTSVNVQTSTSGKSTSYQVTVSPSYDLSVNPL 106 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++++G K+ + T TV + +S+ +D S SM Sbjct: 107 ---MQAVGFKTQHLST--SGTTVGGHSQTQGSISMFLALDKSGSM 146 >gi|160889563|ref|ZP_02070566.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|317480055|ref|ZP_07939167.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861080|gb|EDO54511.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|316903797|gb|EFV25639.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 327 Score = 38.5 bits (88), Expect = 2.2, Method: Compositional matrix adjust. Identities = 42/152 (27%), Positives = 63/152 (41%), Gaps = 22/152 (14%) Query: 259 VDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V +S L +K L +D A I + DN+ T+ G +F + D Sbjct: 96 VSTSMLAEDLKPNRLEAAKDVAAEFING-RPNDNIGITLFAGESFTQCPLTVD------- 147 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ ++K E G TAI + A V R+K++ K I+LLTD Sbjct: 148 HAVLLNLLKDMKCGLIEDG-TAIGMGIANA-----------VTRLKDSKAKSKVIILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 G N + + + AKS GIR+ TI N Sbjct: 196 GVNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|85716351|ref|ZP_01047324.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] gi|85696867|gb|EAQ34752.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] Length = 542 Score = 38.5 bits (88), Expect = 2.2, Method: Compositional matrix adjust. Identities = 28/90 (31%), Positives = 39/90 (43%), Gaps = 16/90 (17%) Query: 368 KKYIVLLTDGENTQD--------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + YIVLL+DG NTQ+ + +C K K G+ I TI VN ++ Sbjct: 443 QDYIVLLSDGLNTQNRWYSCPPSGPCPTIDARQALLCQKVKDSGVTIFTI--QVNVGSKD 500 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRD 443 L NCAS +F S E F++ Sbjct: 501 PLSQVLQNCASDGNFQMITSATETADAFQN 530 >gi|312878233|ref|ZP_07738157.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311794982|gb|EFR11387.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 1221 Score = 38.5 bits (88), Expect = 2.5, Method: Compositional matrix adjust. Identities = 35/118 (29%), Positives = 56/118 (47%), Gaps = 21/118 (17%) Query: 328 AIDE-NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 AID + G T I + ++ A +IS + D+ R+K I+LLTDGE DN Sbjct: 595 AIDRIDSWGGTNIAEGIRIANHQLISQSSDD--RIK-------VIILLTDGEGYYDNN-- 643 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFR 442 + +AK+ GI I TI + + L N A+ +F +S +L ++F+ Sbjct: 644 --LTTEAKNNGITIYTIGLGTSVDEN-----LLRNIATQTGGMYFPVSSASQLPQVFK 694 >gi|307294184|ref|ZP_07574028.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] gi|306880335|gb|EFN11552.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] Length = 417 Score = 38.5 bits (88), Expect = 2.5, Method: Compositional matrix adjust. Identities = 12/45 (26%), Positives = 29/45 (64%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA 57 +L++ TG+ ++ A MP+++G G+ D V+W+ ++ +++ A Sbjct: 8 RLLRDRTGNVLMMAAACMPILIGAAGLATDTVQWTLWKRQVQRQA 52 >gi|87310694|ref|ZP_01092822.1| BatA [Blastopirellula marina DSM 3645] gi|87286675|gb|EAQ78581.1| BatA [Blastopirellula marina DSM 3645] Length = 355 Score = 38.5 bits (88), Expect = 2.6, Method: Compositional matrix adjust. Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 5/69 (7%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI DA+ A + + + + R K +++K I+LLTDGEN + E I A++ Sbjct: 176 GTAIGDAISLAVEKLNALDA----RRKEKIQSK-IIILLTDGENNAGDLEPIQAAELAQT 230 Query: 396 QGIRIMTIA 404 GI++ TI Sbjct: 231 MGIKVYTIG 239 >gi|59712029|ref|YP_204805.1| von Willebrand factor type A domain-containing protein [Vibrio fischeri ES114] gi|59480130|gb|AAW85917.1| von Willebrand factor type A domain protein [Vibrio fischeri ES114] Length = 356 Score = 38.1 bits (87), Expect = 2.7, Method: Compositional matrix adjust. Identities = 48/172 (27%), Positives = 76/172 (44%), Gaps = 15/172 (8%) Query: 249 LDPSLSEEHFV--DSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFND 304 L S++E+ FV SS L V K L ++ LA +++ +K D R+G F D Sbjct: 106 LSGSMAEQDFVSKQSSDLGAVKKISRLEATKEVLADFVKT-RKGD------RLGLILFGD 158 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKN 363 F+ + + +T D G ST + DA+ A S+ED+ +N Sbjct: 159 AAFVQTPFTADQSVWLELLNQT---DVAMAGQSTHLGDAIGLAIKVFEQSSEDKASAEEN 215 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +K ++LTDG +T E I A ++ +RI IA +T E+A Sbjct: 216 AKPREKVAIVLTDGNDTGSYVEPIDAAKVAAAKDVRIHMIAMGDPRTVGEQA 267 >gi|29346317|ref|NP_809820.1| aerotolerance protein BatA [Bacteroides thetaiotaomicron VPI-5482] gi|29338212|gb|AAO76014.1| BatA [Bacteroides thetaiotaomicron VPI-5482] Length = 327 Score = 38.1 bits (87), Expect = 2.7, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 29/53 (54%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|253568262|ref|ZP_04845673.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298385671|ref|ZP_06995229.1| BatA protein [Bacteroides sp. 1_1_14] gi|251842335|gb|EES70415.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298261812|gb|EFI04678.1| BatA protein [Bacteroides sp. 1_1_14] Length = 327 Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 29/53 (54%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|332879552|ref|ZP_08447247.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682518|gb|EGJ55420.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 332 Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 28/53 (52%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + AKS GIR+ TI N Sbjct: 180 NAVSRLKDSKAKSKVIILLTDGTNNAGDISPLTAAEIAKSFGIRVYTIGVGTN 232 >gi|325297739|ref|YP_004257656.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317292|gb|ADY35183.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 332 Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust. Identities = 19/55 (34%), Positives = 29/55 (52%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + R+K++ K I+LLTDG N + + + AK GIR+ TI N T Sbjct: 180 NSISRLKDSKAKSKVIILLTDGSNNRGDISPLTAAEIAKQFGIRVYTIGVGTNGT 234 >gi|114571146|ref|YP_757826.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] gi|114341608|gb|ABI66888.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] Length = 500 Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust. Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 4/60 (6%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIF 441 N+ A C AKS GIR+ TI F VN + R + +CA+ P +F++ S L F Sbjct: 430 NDRTEAACEYAKSLGIRVYTITFQVNSS---STRDMMRDCATHPTLYFDSPSDDALRSAF 486 >gi|281350503|gb|EFB26087.1| hypothetical protein PANDA_008525 [Ailuropoda melanoleuca] Length = 961 Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust. Identities = 47/201 (23%), Positives = 87/201 (43%), Gaps = 24/201 (11%) Query: 258 FVDSSSLRHVI---KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-S 313 VDSS +I K+K V D+L+ + + + ++ +++ A F+ V DPSF S Sbjct: 48 IVDSSESSKIILFDKQKDFV-DSLSDRVFQLTPVRSLKYDIKLAALQFSSSVQIDPSFSS 106 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 W + + VK+ +N Q + SN + + + + K +L Sbjct: 107 WKDLQTFKQRVKS------------MNFIGQGTFSYYAISNATGLLKREGRKDGVKVALL 154 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 +TDG + N + +I A++ GI +TI S + K R + +S ++ Sbjct: 155 MTDGIDHPKNPDVQSISEDARTAGILFITIGLS-TVVNEAKLRLISGDSSSEPIPLLSDP 213 Query: 434 THELNKIFRDRIGNEIFERVI 454 T D+I + +F+RV+ Sbjct: 214 T------LVDKIRDRLFKRVL 228 >gi|189465623|ref|ZP_03014408.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] gi|189437897|gb|EDV06882.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] Length = 327 Score = 38.1 bits (87), Expect = 2.9, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 29/53 (54%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|153806291|ref|ZP_01958959.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] gi|149130968|gb|EDM22174.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] Length = 327 Score = 38.1 bits (87), Expect = 3.0, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 29/53 (54%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|255693880|ref|ZP_05417555.1| BatA protein [Bacteroides finegoldii DSM 17565] gi|260620309|gb|EEX43180.1| BatA protein [Bacteroides finegoldii DSM 17565] Length = 327 Score = 38.1 bits (87), Expect = 3.0, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 29/53 (54%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPMTAAEIAKSFGIRVYTIGVGTN 227 >gi|224539999|ref|ZP_03680538.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] gi|224518389|gb|EEF87494.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] Length = 327 Score = 38.1 bits (87), Expect = 3.0, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 29/53 (54%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|330995094|ref|ZP_08319011.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576670|gb|EGG58173.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 332 Score = 38.1 bits (87), Expect = 3.0, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 28/53 (52%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + AKS GIR+ TI N Sbjct: 180 NAVSRLKDSKAKSKVIILLTDGTNNAGDISPLTAAEIAKSFGIRVYTIGVGTN 232 >gi|189461337|ref|ZP_03010122.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] gi|189431866|gb|EDV00851.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] Length = 332 Score = 38.1 bits (87), Expect = 3.0, Method: Compositional matrix adjust. Identities = 26/106 (24%), Positives = 46/106 (43%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + + R+K++ K I+LLTDG N + + + AK GIR+ TI N T Sbjct: 180 NAISRLKDSKAKSKVIILLTDGSNNRGDISPLTAAEIAKQFGIRVYTIGVGTNGTAPYPM 239 Query: 416 RYF----------------LSNCASP--NSFFEANSTHELNKIFRD 443 + + L+ A ++F A S +L +++R+ Sbjct: 240 QTYAGVQYVNVPVEIDEQTLTQIAGTTNGNYFRATSNSKLEEVYRE 285 >gi|331006778|ref|ZP_08330044.1| BatA [gamma proteobacterium IMCC1989] gi|330419396|gb|EGG93796.1| BatA [gamma proteobacterium IMCC1989] Length = 364 Score = 38.1 bits (87), Expect = 3.1, Method: Compositional matrix adjust. Identities = 39/144 (27%), Positives = 57/144 (39%), Gaps = 43/144 (29%) Query: 329 IDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAK-------KYIVLLTDG 377 + E ++G TAI DA+ + V R+KN A K I+LLTDG Sbjct: 166 LQEAQLGFAGKDTAIGDAIGLS-----------VKRLKNQSSASSAKPSNSKVIILLTDG 214 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN--------------CA 423 ENT E + A+ G +I T+ ++ R F N A Sbjct: 215 ENTAGEVEPLQAAKLAEKIGAKIYTVGIGADEM---IVRGFFGNRRVNPSASLDEETLTA 271 Query: 424 SPNS----FFEANSTHELNKIFRD 443 N+ +F A +T ELN I+ + Sbjct: 272 IANTTGGLYFRARNTQELNNIYSE 295 >gi|51476525|emb|CAH18248.1| hypothetical protein [Homo sapiens] Length = 637 Score = 38.1 bits (87), Expect = 3.3, Method: Compositional matrix adjust. Identities = 40/144 (27%), Positives = 66/144 (45%), Gaps = 15/144 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPTVGETNPR 396 Query: 388 AICN---KAKSQGIRIMTIAFSVN 408 +I N +A S G + + F + Sbjct: 397 SIQNNVREAVSGGYSLFCLGFGFD 420 >gi|237716505|ref|ZP_04546986.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262408103|ref|ZP_06084651.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645097|ref|ZP_06722823.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809498|ref|ZP_06768201.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|298484179|ref|ZP_07002345.1| BatA protein [Bacteroides sp. D22] gi|229444152|gb|EEO49943.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262354911|gb|EEZ04003.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639603|gb|EFF57895.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443316|gb|EFG12080.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|295084189|emb|CBK65712.1| von Willebrand factor type A domain. [Bacteroides xylanisolvens XB1A] gi|298269683|gb|EFI11278.1| BatA protein [Bacteroides sp. D22] Length = 327 Score = 37.7 bits (86), Expect = 3.4, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 29/53 (54%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPMTAAEIAKSFGIRVYTIGVGTN 227 >gi|160882770|ref|ZP_02063773.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|237720676|ref|ZP_04551157.1| BatA [Bacteroides sp. 2_2_4] gi|260170239|ref|ZP_05756651.1| aerotolerance protein BatA [Bacteroides sp. D2] gi|293373990|ref|ZP_06620331.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145608|ref|ZP_07038676.1| BatA protein [Bacteroides sp. 3_1_23] gi|315918602|ref|ZP_07914842.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111794|gb|EDO13539.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|229449511|gb|EEO55302.1| BatA [Bacteroides sp. 2_2_4] gi|292631066|gb|EFF49703.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516099|gb|EFI39980.1| BatA protein [Bacteroides sp. 3_1_23] gi|313692477|gb|EFS29312.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 327 Score = 37.7 bits (86), Expect = 3.6, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 29/53 (54%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPMTAAEIAKSFGIRVYTIGVGTN 227 >gi|313203640|ref|YP_004042297.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442956|gb|ADQ79312.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 327 Score = 37.7 bits (86), Expect = 3.7, Method: Compositional matrix adjust. Identities = 23/67 (34%), Positives = 32/67 (47%) Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 +N+ M I + V+R+K+ K I+LLTDG N + I AK+ G Sbjct: 157 GVNNGMIEDGTAIGLGLANAVNRIKDGKSKSKVIILLTDGSNNSGDIAPITAAEIAKTFG 216 Query: 398 IRIMTIA 404 IRI TI Sbjct: 217 IRIYTIG 223 >gi|118357659|ref|XP_001012078.1| hypothetical protein TTHERM_00986260 [Tetrahymena thermophila] gi|89293845|gb|EAR91833.1| hypothetical protein TTHERM_00986260 [Tetrahymena thermophila SB210] Length = 630 Score = 37.7 bits (86), Expect = 3.8, Method: Compositional matrix adjust. Identities = 51/194 (26%), Positives = 77/194 (39%), Gaps = 46/194 (23%) Query: 87 KQKIEEYLIRNFENNLKKNFT---DREVRDIVRDTAVEMNPRKSAY-----QVVLSSRY- 137 KQ+IE Y I KN T ++++ I ++N Y Q+VL S++ Sbjct: 423 KQEIENYQIAILNPANVKNLTIILGQQLQYIKSINLFQLNMTAGNYYETDFQLVLFSQFF 482 Query: 138 ------DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 DLLLN R I L YHK+ +S ++ S S++D Sbjct: 483 NEYLTLDLLLN-----FRKCKIGEIL-------------YHKQGYISCDQCVEGSYSLVD 524 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQN-------GKVGIRDEKLSPYMVSCNKSLYYML 244 + SE + C P + K YS+Q + + D + YM+ CN+S Y + Sbjct: 525 PYQKSENGLVQCQKCPIEYAKKCYSNQIILNQHFWRESNLTDNIFTCYMLGCNESAYNSI 584 Query: 245 ------YPGPLDPS 252 Y GPL S Sbjct: 585 NGCIKGYIGPLCNS 598 >gi|7770149|gb|AAF69610.1|AF119917_18 PRO1851 [Homo sapiens] Length = 644 Score = 37.7 bits (86), Expect = 4.0, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 18 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 70 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 71 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 119 >gi|315649824|ref|ZP_07902907.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315274798|gb|EFU38179.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 1316 Score = 37.7 bits (86), Expect = 4.2, Method: Compositional matrix adjust. Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 14/75 (18%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE----GIAIC 390 G TA DA++ A + +++ HR +A+ IVLLTDG+ T+ N + Sbjct: 141 GGTATGDAIKKARELLVN------HRP----DAQPVIVLLTDGDATEPNGNAYNYALTNS 190 Query: 391 NKAKSQGIRIMTIAF 405 N+AK +GI TIA Sbjct: 191 NEAKQEGIVFYTIAL 205 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 37.7 bits (86), Expect = 4.2, Method: Compositional matrix adjust. Identities = 26/84 (30%), Positives = 45/84 (53%), Gaps = 18/84 (21%) Query: 366 EAKKYIVLLTDGENTQ----------DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +A+K+++++TDG+NT+ D +A C AK++GI + T+ K Q + Sbjct: 319 KARKFMIVVTDGDNTKSFTSWSASVIDKRTALA-CENAKAKGITVYTV-----KIIQGNS 372 Query: 416 RYFLSNCAS-PNSFFEANSTHELN 438 L CAS P F++ S ++LN Sbjct: 373 N-MLRKCASAPEYFYDLTSANQLN 395 >gi|310796149|gb|EFQ31610.1| fungal specific transcription factor domain-containing protein [Glomerella graminicola M1.001] Length = 848 Score = 37.7 bits (86), Expect = 4.3, Method: Compositional matrix adjust. Identities = 44/171 (25%), Positives = 81/171 (47%), Gaps = 21/171 (12%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID--------- 289 S+ M Y LDP L+ HF D S + H++ K+ +++ A SV IK Sbjct: 640 SMVIMRYLDELDPLLA--HFQDVSDMYHMVLKRDILQAAF-SVCFYIKAATESHVTGNTL 696 Query: 290 NVNDTVRMGATFFNDRV-ISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAIND--AMQT 345 + TV M ++ V S +F+ W +LI+T+ KT + +G + D A+ Sbjct: 697 GIASTVAMSPDPQDEGVAFSTETFALWSPSRLIKTVEKTLDLLVRNVGGNDLKDIVALAV 756 Query: 346 AYDTIISSNEDE-VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 ++ ++N+++ +H ++ L++ VL +T N E +++ N A S Sbjct: 757 VLSSVQTANQEQKLHSIRLRLQS----VLDACKASTNTNPEKMSLQNPADS 803 >gi|198274642|ref|ZP_03207174.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] gi|198272089|gb|EDY96358.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] Length = 332 Score = 37.7 bits (86), Expect = 4.3, Method: Compositional matrix adjust. Identities = 26/106 (24%), Positives = 46/106 (43%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + V R+K++ K ++LLTDG N + + + AK GIR+ TI N T Sbjct: 180 NAVSRLKDSKAKSKVVILLTDGVNNRGDISPLTAAEIAKQFGIRVYTIGVGTNGTAPYPM 239 Query: 416 RYF----------------LSNCASP--NSFFEANSTHELNKIFRD 443 + + +S A ++F A S +L +++R+ Sbjct: 240 QTYAGVQYVQMPVEIDEQTMSQIAGTTNGNYFRATSNTKLKEVYRE 285 >gi|260773718|ref|ZP_05882633.1| formate efflux transporter [Vibrio metschnikovii CIP 69.14] gi|260610679|gb|EEX35883.1| formate efflux transporter [Vibrio metschnikovii CIP 69.14] Length = 484 Score = 37.7 bits (86), Expect = 4.3, Method: Compositional matrix adjust. Identities = 25/57 (43%), Positives = 35/57 (61%), Gaps = 4/57 (7%) Query: 25 ITALLMPVMLG--VGG-MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 ITA L+PV LG VGG +LV + WS Y +AA + IT + PL QS++E + + Sbjct: 253 ITANLIPVTLGNIVGGAVLVGLANWSIYRRPHLKAAHVSTITKTTPL-QSVKETTMK 308 >gi|319902109|ref|YP_004161837.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417140|gb|ADV44251.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 327 Score = 37.7 bits (86), Expect = 4.3, Method: Compositional matrix adjust. Identities = 20/53 (37%), Positives = 29/53 (54%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGVNNKGDISPLTAAEIAKSFGIRVYTIGVGTN 227 >gi|189912860|ref|YP_001964749.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913185|ref|YP_001964414.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777536|gb|ABZ95836.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781253|gb|ABZ99550.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 317 Score = 37.7 bits (86), Expect = 4.3, Method: Compositional matrix adjust. Identities = 41/157 (26%), Positives = 66/157 (42%), Gaps = 20/157 (12%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D + V GA + + SD + + +LI T + D E G TA+ DA+ Sbjct: 132 DRIGIVVFAGAAYLQSPLSSD---RFALDELI---AGTSSEDIEEQG-TAVGDAL----- 179 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 ++SS +R+KN+ K I+LLTDG + + K+ GI++ I Sbjct: 180 -VLSS-----YRLKNSEAKSKVIILLTDGVSNTGKLDPDTAAYTTKTMGIKVYCIGIGKE 233 Query: 409 KTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRD 443 + Q E L +S + FF A S L + + Sbjct: 234 EGQYEINYESLQKISSNTNGKFFRAESPEVLESVLNE 270 >gi|297581617|ref|ZP_06943539.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] gi|297534024|gb|EFH72863.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] Length = 467 Score = 37.7 bits (86), Expect = 4.4, Method: Compositional matrix adjust. Identities = 21/70 (30%), Positives = 41/70 (58%), Gaps = 11/70 (15%) Query: 368 KKYIVLLTDGENTQD----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 +K ++L TDG + D + + +A+C + K QGI I++I F+ N++Q + +CA Sbjct: 384 RKVMLLFTDGNHLVDVAKRDRKQVALCREMKKQGIEIISIDFN-NRSQ------VMKSCA 436 Query: 424 SPNSFFEANS 433 S ++ A++ Sbjct: 437 SAGQYYIADN 446 >gi|224024929|ref|ZP_03643295.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] gi|224018165|gb|EEF76163.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] Length = 332 Score = 37.7 bits (86), Expect = 4.4, Method: Compositional matrix adjust. Identities = 20/55 (36%), Positives = 29/55 (52%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + V R+K++ K I+LLTDG N + + + AK GIR+ TI N T Sbjct: 180 NAVSRLKDSKAKSKVIILLTDGVNNRGDISPLTAAEIAKQFGIRVYTIGVGTNGT 234 >gi|119585669|gb|EAW65265.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_b [Homo sapiens] Length = 914 Score = 37.4 bits (85), Expect = 4.6, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 389 >gi|193214188|ref|YP_001995387.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193087665|gb|ACF12940.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 340 Score = 37.4 bits (85), Expect = 4.6, Method: Compositional matrix adjust. Identities = 26/109 (23%), Positives = 46/109 (42%), Gaps = 12/109 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ + + + ++L+ + E TAI A+ TA Sbjct: 139 RIGLVVFSGKSFTQCPLTLD-YRLLTNFISELKAGTIEEDGTAIGTAIATA--------- 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 +R++ + K I+LLTDG+N E + A + GI+I T+ Sbjct: 189 --TNRLRESTAKSKVIILLTDGQNNAGEIEPVTAAELAAALGIKIYTVG 235 >gi|149176865|ref|ZP_01855475.1| BatA [Planctomyces maris DSM 8797] gi|148844302|gb|EDL58655.1| BatA [Planctomyces maris DSM 8797] Length = 356 Score = 37.4 bits (85), Expect = 4.6, Method: Compositional matrix adjust. Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 5/69 (7%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI DA+ A + + + + ++K+ K I+LLTDGEN E I A++ Sbjct: 176 GTAIGDAISLAVEKLNALDARRDEKVKS-----KVIILLTDGENNAGEVEPIQAAELAET 230 Query: 396 QGIRIMTIA 404 GI++ TI Sbjct: 231 LGIKVYTIG 239 >gi|242075518|ref|XP_002447695.1| hypothetical protein SORBIDRAFT_06g013460 [Sorghum bicolor] gi|241938878|gb|EES12023.1| hypothetical protein SORBIDRAFT_06g013460 [Sorghum bicolor] Length = 740 Score = 37.4 bits (85), Expect = 4.8, Method: Compositional matrix adjust. Identities = 31/131 (23%), Positives = 62/131 (47%), Gaps = 3/131 (2%) Query: 59 TAIITASVPLIQSL--EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 T IT P Q L EE + +N+ +F + + ++N+ +L K +T ++ R Sbjct: 83 TYAITDVSPTGQPLAPEEALPKYRNAISFCVRDFLDITVKNWAEDLVKAYTMKQCAVSFR 142 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS-YHKEH 175 + EMN + + + L+S+Y++ + FL ++L + +A ++ R+ YH Sbjct: 143 NWRSEMNVKYAKTGMDLTSKYNITKGQWAHFLEQRNDPNFLARCEANSQLAKRNKYHHHL 202 Query: 176 GVSIQWVIDFS 186 G +Q F+ Sbjct: 203 GTGMQQKGKFT 213 >gi|31542984|ref|NP_002209.2| inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor [Homo sapiens] gi|229463048|sp|Q14624|ITIH4_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Plasma kallikrein sensitive glycoprotein 120; Short=Gp120; Short=PK-120; Contains: RecName: Full=70 kDa inter-alpha-trypsin inhibitor heavy chain H4; Contains: RecName: Full=35 kDa inter-alpha-trypsin inhibitor heavy chain H4; Flags: Precursor gi|1402590|dbj|BAA07536.1| PK-120 precursor [Homo sapiens] Length = 930 Score = 37.4 bits (85), Expect = 4.9, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 389 >gi|221042206|dbj|BAH12780.1| unnamed protein product [Homo sapiens] Length = 888 Score = 37.4 bits (85), Expect = 5.0, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 276 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 328 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 329 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 377 >gi|282879637|ref|ZP_06288368.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306585|gb|EFA98614.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 332 Score = 37.4 bits (85), Expect = 5.1, Method: Compositional matrix adjust. Identities = 28/106 (26%), Positives = 46/106 (43%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K ++LLTDG N + + AKS IR+ TI NK Sbjct: 180 NAVSRLKDSKTKSKVVILLTDGSNNMGDISPMTSAQIAKSLDIRVYTIGVGTNKVAPYPM 239 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E LS+ A + +F+ A + +L +I++D Sbjct: 240 SVGGGTQYINIPVEIDSKTLSDIAAVTEGNFYRATNNQQLKQIYKD 285 >gi|219517748|gb|AAI36393.1| ITIH4 protein [Homo sapiens] Length = 935 Score = 37.4 bits (85), Expect = 5.1, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 389 >gi|119585668|gb|EAW65264.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_a [Homo sapiens] Length = 930 Score = 37.4 bits (85), Expect = 5.2, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 389 >gi|4096840|gb|AAD05198.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Homo sapiens] Length = 930 Score = 37.4 bits (85), Expect = 5.2, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 389 >gi|187950343|gb|AAI36394.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Homo sapiens] Length = 930 Score = 37.4 bits (85), Expect = 5.2, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 389 >gi|62088356|dbj|BAD92625.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) variant [Homo sapiens] Length = 699 Score = 37.4 bits (85), Expect = 5.2, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 273 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 325 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 326 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 374 >gi|1483187|dbj|BAA07602.1| inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) [Homo sapiens] Length = 930 Score = 37.4 bits (85), Expect = 5.2, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 389 >gi|262050538|ref|NP_001159921.1| inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 precursor [Homo sapiens] Length = 900 Score = 37.4 bits (85), Expect = 5.3, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 389 >gi|308175402|ref|YP_003922107.1| hypothetical protein BAMF_3511 [Bacillus amyloliquefaciens DSM 7] gi|307608266|emb|CBI44637.1| conserved hypothetical protein YwmC [Bacillus amyloliquefaciens DSM 7] gi|328555380|gb|AEB25872.1| hypothetical protein BAMTA208_18610 [Bacillus amyloliquefaciens TA208] gi|328913751|gb|AEB65347.1| hypothetical protein LL3_03821 [Bacillus amyloliquefaciens LL3] Length = 229 Score = 37.4 bits (85), Expect = 5.5, Method: Compositional matrix adjust. Identities = 33/128 (25%), Positives = 55/128 (42%), Gaps = 10/128 (7%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 GV+ R ++F N +G T + D S N VH++ K + LL Sbjct: 104 GVYGFQRFDKQSFLNSLNGIGPTGWTPIAKALEDAKASFNG--VHKL-----GSKSVYLL 156 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 TDGE T + I + + Q I++ I F N+ + A ++EA+S Sbjct: 157 TDGEETCGGDP-IKTAKELRKQHIKVNVIGFDFNEGFNGQLHAIAG--AGGGKYYEAHSQ 213 Query: 435 HELNKIFR 442 ++N+IF+ Sbjct: 214 KDMNRIFK 221 >gi|197334600|ref|YP_002156233.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197316090|gb|ACH65537.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 356 Score = 37.4 bits (85), Expect = 5.6, Method: Compositional matrix adjust. Identities = 46/173 (26%), Positives = 78/173 (45%), Gaps = 17/173 (9%) Query: 249 LDPSLSEEHFV-----DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 L S++E+ FV DS +++ I + ++ LA +++ +K D R+G F Sbjct: 106 LSGSMAEQDFVSKQSSDSGTVKK-ISRLEATKEVLADFVKT-RKGD------RLGLILFG 157 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMK 362 D F+ + + +T D G ST + DA+ A S+ED+ + Sbjct: 158 DAAFVQTPFTADQSVWLELLNQT---DVAMAGQSTHLGDAIGLAIKVFEQSSEDKASAEE 214 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 N +K ++LTDG +T E I A ++G+RI IA +T E+A Sbjct: 215 NAKPREKVAIVLTDGNDTGSYVEPIDAAKVAAAKGVRIHMIAMGDPRTVGEQA 267 >gi|332162963|ref|YP_004299540.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325667193|gb|ADZ43837.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] Length = 457 Score = 37.4 bits (85), Expect = 5.8, Method: Compositional matrix adjust. Identities = 28/101 (27%), Positives = 52/101 (51%), Gaps = 15/101 (14%) Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAI---------CNKAKSQGI 398 I+S+N ++ KN KK +++L+DG +T D + +GI I C + K I Sbjct: 336 ILSANNIFKNKAKNG--HKKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDI 393 Query: 399 RI--MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 ++ + IA+S + + E C ++++EA++ HEL Sbjct: 394 QMAFIAIAYSPDNNKNEPYHINWKKCVGEDNYYEAHNAHEL 434 >gi|330862285|emb|CBX72446.1| hypothetical protein YEW_HH31780 [Yersinia enterocolitica W22703] Length = 457 Score = 37.4 bits (85), Expect = 5.8, Method: Compositional matrix adjust. Identities = 28/101 (27%), Positives = 52/101 (51%), Gaps = 15/101 (14%) Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAI---------CNKAKSQGI 398 I+S+N ++ KN KK +++L+DG +T D + +GI I C + K I Sbjct: 336 ILSANNIFKNKAKNG--HKKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDI 393 Query: 399 RI--MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 ++ + IA+S + + E C ++++EA++ HEL Sbjct: 394 QMAFIAIAYSPDNNKNEPYHINWKKCVGEDNYYEAHNAHEL 434 >gi|327542237|gb|EGF28726.1| BatA aerotolerance operon protein [Rhodopirellula baltica WH47] Length = 345 Score = 37.0 bits (84), Expect = 6.0, Method: Compositional matrix adjust. Identities = 29/128 (22%), Positives = 54/128 (42%), Gaps = 25/128 (19%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI DA+ + + + + + + ++++ K ++LLTDGENT + I A++ Sbjct: 165 GTAIGDAIALSVEKLNALDARQERKVQS-----KILILLTDGENTAGELDPIQAAELAET 219 Query: 396 QGIRIMTIAFSVNK----------TQQEKARYFLSNC----------ASPNSFFEANSTH 435 GI+I I T +++ Y N + +F A T Sbjct: 220 LGIKIYAIGVGTKGKAPVPVRDPFTGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTD 279 Query: 436 ELNKIFRD 443 L+ I+R+ Sbjct: 280 SLDAIYRE 287 >gi|86134839|ref|ZP_01053421.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821702|gb|EAQ42849.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 336 Score = 37.0 bits (84), Expect = 6.3, Method: Compositional matrix adjust. Identities = 19/51 (37%), Positives = 27/51 (52%) Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 V+R+K++ K I+LLTDG N N + AK GI++ TI N Sbjct: 183 VNRLKDSKAKSKVIILLTDGVNNAGNIDPTTATELAKELGIKVYTIGIGTN 233 >gi|318604213|emb|CBY25711.1| protein TadG, associated with Flp pilus assembly [Yersinia enterocolitica subsp. palearctica Y11] Length = 457 Score = 37.0 bits (84), Expect = 6.4, Method: Compositional matrix adjust. Identities = 28/101 (27%), Positives = 52/101 (51%), Gaps = 15/101 (14%) Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAI---------CNKAKSQGI 398 I+S+N ++ KN KK +++L+DG +T D + +GI I C + K I Sbjct: 336 ILSANNLFKNKAKNG--HKKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDI 393 Query: 399 RI--MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 ++ + IA+S + + E C ++++EA++ HEL Sbjct: 394 QMAFIAIAYSPDNNKNEPYHINWKKCVGEDNYYEAHNAHEL 434 >gi|149187720|ref|ZP_01866017.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] gi|148838600|gb|EDL55540.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] Length = 340 Score = 37.0 bits (84), Expect = 6.7, Method: Compositional matrix adjust. Identities = 44/203 (21%), Positives = 83/203 (40%), Gaps = 33/203 (16%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L S+SE F+D+S +D L +V ++ + R+G F D Sbjct: 102 LSGSMSERDFLDNSGKE---------QDRLTAVKSVLETFAAKREGDRLGLILFGDSAYL 152 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNN 364 F+ H+ + +D+ ++G ST + DA+ T I + E++ Sbjct: 153 QSPFT-ADHEAWLAL-----LDQAQVGMAGESTHLGDAVGLTIKTYIDNPENQT------ 200 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-----L 419 +K ++LTDG +T I A++ GIR+ +A T ++A F + Sbjct: 201 --VEKVAIILTDGNDTDSLVPPIDAAKVAQAYGIRLYIVAMGSPNTTGDQAIDFSTIETM 258 Query: 420 SNCASPNSFFEANSTHELNKIFR 442 + +F A S +L+ +++ Sbjct: 259 ATVTGGQAFL-AMSQEDLDAVYQ 280 >gi|221044732|dbj|BAH14043.1| unnamed protein product [Homo sapiens] Length = 560 Score = 37.0 bits (84), Expect = 6.9, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 200 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 252 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 253 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 301 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 37.0 bits (84), Expect = 7.0, Method: Compositional matrix adjust. Identities = 35/119 (29%), Positives = 56/119 (47%), Gaps = 23/119 (19%) Query: 328 AIDE-NEMGSTAINDAMQTAYDTIIS-SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 AID + G T I + ++ A +IS S+ED + K I+LLTDGE DN Sbjct: 561 AIDRIDSWGGTNIAEGIRIANQQLISRSSEDRI----------KVIILLTDGEGYYDNN- 609 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFR 442 + +AK+ GI I TI + + L + A+ +F +S +L ++F+ Sbjct: 610 ---LTTEAKNNGITIYTIGLGTSVDEN-----LLRDIATQTGGMYFPVSSASQLPQVFK 660 >gi|53802845|ref|YP_115363.1| PKD domain-containing protein [Methylococcus capsulatus str. Bath] gi|53756606|gb|AAU90897.1| PKD domain protein [Methylococcus capsulatus str. Bath] Length = 833 Score = 37.0 bits (84), Expect = 7.2, Method: Compositional matrix adjust. Identities = 35/122 (28%), Positives = 54/122 (44%), Gaps = 13/122 (10%) Query: 304 DRVISDPSFSWGV-HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 DR + DPS W HK+ R ++K E+G +A + A I+S +DE R Sbjct: 700 DRAVVDPSRLWAPDHKMRRILLK-------ELGESAQSPASNAMTVKILSVMQDEPVRRI 752 Query: 363 NNLEAKKYIVLL--TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 N+ ++ V++ G+ + E K K+ R+ TI FSV TQQ + Sbjct: 753 NSADSSPDAVIVHKATGDKVRLRAE---RAGKKKNGNGRVYTIGFSVTDTQQNVCEGTVK 809 Query: 421 NC 422 C Sbjct: 810 VC 811 >gi|221042220|dbj|BAH12787.1| unnamed protein product [Homo sapiens] Length = 648 Score = 37.0 bits (84), Expect = 7.4, Method: Compositional matrix adjust. Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-RMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K R+AL ++ + D N V AT + ++ P+ + V+K ++F Sbjct: 288 RKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLV--PASAENVNK-----ARSF 340 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A +G T INDAM A + SSN++E R+ + I+LLTDG+ T Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEG--SVSLIILLTDGDPT 389 >gi|312622403|ref|YP_004024016.1| von willebrand factor type a [Caldicellulosiruptor kronotskyensis 2002] gi|312202870|gb|ADQ46197.1| von Willebrand factor type A [Caldicellulosiruptor kronotskyensis 2002] Length = 1166 Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust. Identities = 35/119 (29%), Positives = 56/119 (47%), Gaps = 23/119 (19%) Query: 328 AIDE-NEMGSTAINDAMQTAYDTIIS-SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 AID + G T I + ++ A +IS S+ED + K I+LLTDGE DN Sbjct: 561 AIDRIDSWGGTNIAEGIRIANQQLISLSSEDRI----------KVIILLTDGEGYYDNN- 609 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFR 442 + +AK+ GI I TI + + L + A+ +F +S +L ++F+ Sbjct: 610 ---LTTEAKNNGITIYTIGLGTSVDEN-----LLRDIATQTGGMYFPVSSASQLPQVFK 660 >gi|253584083|ref|ZP_04861281.1| BatA protein [Fusobacterium varium ATCC 27725] gi|251834655|gb|EES63218.1| BatA protein [Fusobacterium varium ATCC 27725] Length = 319 Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust. Identities = 26/96 (27%), Positives = 44/96 (45%), Gaps = 12/96 (12%) Query: 317 HKLIRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 H +I+ + + +D+ + TAI + A ++R+K++ K I+LLT Sbjct: 142 HNVIKEMTRKLTVDDITSNTRTAIGMGIGVA-----------LNRLKDSEAKSKVIILLT 190 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 DGEN A + AK GI+I TI + + Sbjct: 191 DGENNSGEMSPSAAADIAKELGIKIYTIGIGAKEIK 226 >gi|218781310|ref|YP_002432628.1| hypothetical protein Dalk_3472 [Desulfatibacillum alkenivorans AK-01] gi|218762694|gb|ACL05160.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length = 308 Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust. Identities = 36/152 (23%), Positives = 64/152 (42%), Gaps = 17/152 (11%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 DT R+G F D + + L+ +++ I TAI DA+ A Sbjct: 128 DTDRIGLVVFGDYAFTQAPLTLD-KGLLLNLIENLRIGMAGR-KTAIGDALGVAG----- 180 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 R+K+ K ++LL+DGENT + + GI+I TI T+Q Sbjct: 181 ------KRIKDIPAMSKVVILLSDGENTAGDMTPQGAAEALAALGIKIYTIGMG---TEQ 231 Query: 413 EKARYFLSNCA-SPNSFFEANSTHELNKIFRD 443 ++ A ++ A++T +L+ I+++ Sbjct: 232 AGSKELAQIAAIGQGKYYHASNTEQLDSIYKE 263 >gi|86145196|ref|ZP_01063527.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] gi|85836773|gb|EAQ54893.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] Length = 359 Score = 36.6 bits (83), Expect = 8.0, Method: Compositional matrix adjust. Identities = 41/154 (26%), Positives = 65/154 (42%), Gaps = 11/154 (7%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSN 354 R+G F D F+ + + +T D G ST + DA+ A S Sbjct: 147 RLGLILFGDAAFVQTPFTADQDVWLELLNQT---DVAMAGQSTHLGDAIGLATKVFEQSE 203 Query: 355 EDEVHRMKNNLEA---KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + ++++A +K +++LTDG +T E I AK++G+RI IA +T Sbjct: 204 KQSAAVQDSSIDANVKEKVVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQTV 263 Query: 412 QEKARYF--LSNCA--SPNSFFEANSTHELNKIF 441 E A + A S FEA + EL K + Sbjct: 264 GEVALDMETIKRVAQESGGEAFEALNRDELTKAY 297 >gi|86357991|ref|YP_469883.1| hypothetical protein RHE_CH02376 [Rhizobium etli CFN 42] gi|86282093|gb|ABC91156.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 427 Score = 36.6 bits (83), Expect = 8.5, Method: Compositional matrix adjust. Identities = 43/152 (28%), Positives = 66/152 (43%), Gaps = 12/152 (7%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I + +G+F I+TALLM ++G GM VDV HAL Q A + S+ Sbjct: 8 FISNRSGNFGIMTALLMVPLMGAAGMAVDVA------HALSLRTQ-LYAAADAAAVGSIA 60 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 E S + T L + N+ + T E+ DI D +++ K+A + L Sbjct: 61 EKSGAVAAAMTMNGNGTVS-LGKTDARNIFMSQTSGELTDIHIDLGIDVT--KTANK--L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 +S+ + F+R G S +I A AE Sbjct: 116 NSQVSFTATVPTTFMRIFGRDSIIISGTATAE 147 >gi|294054315|ref|YP_003547973.1| hypothetical protein Caka_0779 [Coraliomargarita akajimensis DSM 45221] gi|293613648|gb|ADE53803.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 345 Score = 36.6 bits (83), Expect = 8.7, Method: Compositional matrix adjust. Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 11/88 (12%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ V+ + + + TAI A+ + V+R++++ + ++LLTD Sbjct: 154 HDWLKKNVQRLELGDINLSGTAIGTALGAS-----------VNRLRDHESRSRIVILLTD 202 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIA 404 GEN ++ AKS +++ TIA Sbjct: 203 GENNSGTLSPLSAAEAAKSLNVKVYTIA 230 >gi|84385834|ref|ZP_00988864.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] gi|84379150|gb|EAP96003.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] Length = 359 Score = 36.6 bits (83), Expect = 9.0, Method: Compositional matrix adjust. Identities = 41/154 (26%), Positives = 65/154 (42%), Gaps = 11/154 (7%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSN 354 R+G F D F+ + + +T D G ST + DA+ A S Sbjct: 147 RLGLILFGDAAFVQTPFTADQDVWLELLNQT---DVAMAGQSTHLGDAIGLAIKVFEQSE 203 Query: 355 EDEVHRMKNNLEA---KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + ++++A +K +++LTDG +T E I AK++G+RI IA +T Sbjct: 204 KQSAAVQDSSVDANEKEKVVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQTV 263 Query: 412 QEKARYF--LSNCA--SPNSFFEANSTHELNKIF 441 E A + A S FEA + EL K + Sbjct: 264 GEVALDMETIKRVAQESGGEAFEALNRDELTKAY 297 >gi|257469959|ref|ZP_05634051.1| hypothetical protein FulcA4_11506 [Fusobacterium ulcerans ATCC 49185] gi|317064188|ref|ZP_07928673.1| BatA protein [Fusobacterium ulcerans ATCC 49185] gi|313689864|gb|EFS26699.1| BatA protein [Fusobacterium ulcerans ATCC 49185] Length = 319 Score = 36.6 bits (83), Expect = 9.6, Method: Compositional matrix adjust. Identities = 19/54 (35%), Positives = 29/54 (53%) Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++R+K++ K I+LLTDGEN A + AK GI+I TI + + Sbjct: 173 LNRLKDSEAKSKVIILLTDGENNSGEMSPSAAADIAKELGIKIYTIGIGAKEIK 226 Searching..................................................done Results from round 2 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 529 bits (1362), Expect = e-148, Method: Composition-based stats. Identities = 458/458 (100%), Positives = 458/458 (100%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA Sbjct: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV Sbjct: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ Sbjct: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL Sbjct: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT Sbjct: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR Sbjct: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS Sbjct: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK Sbjct: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 406 bits (1043), Expect = e-111, Method: Composition-based stats. Identities = 181/466 (38%), Positives = 281/466 (60%), Gaps = 11/466 (2%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 MVF+ F K+L K G FF+I+ALL+PV+ V G+L+D+VRW YY ++L QA TA Sbjct: 1 MVFNKSLFFNFKRLKKCYNGSFFVISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ASV L+ S+E+ S S + I++YL+ N + +L NF + + + I++ T V Sbjct: 61 ALSASVQLLNSVEDKSKEKALSSVLGENNIKQYLLNNLKISLYNNFGEMDSQRIIQHTKV 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM-GIKSWLIQTKAEAETVS-RSYHKEHGVS 178 + RK + + + S Y+L LNP SLF ++ IKSW I T EAE S ++YHKE GVS Sbjct: 121 NIYNRKGTHIINVYSHYNLPLNPFSLFFMNLINIKSWPITTVGEAEVTSKKNYHKEEGVS 180 Query: 179 IQWVIDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 +QW+ID S SM R G + ++N + +VSC+ Sbjct: 181 VQWLIDDSGSMGSIIDRACFGSKQLKSQYNVGSKIGIVRNENADTSDSFYPIVGELVSCD 240 Query: 238 KSLYYMLYPGP--LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 +SLYY+L D L E++ + S ++ I+K++LVRDALA+ I+ ++KIDN+ D + Sbjct: 241 RSLYYVLNDKKILEDDDLEEKNLDNHS--QYYIRKRYLVRDALATFIKRVRKIDNLKDKL 298 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 RM +FN+R+ +WG+ K + V + ++E +T I+ +Q AY+ + S NE Sbjct: 299 RMSFMYFNERIDHYFPMTWGI-KEFKQEVSSHYKRKHENTATDIHPILQEAYNKLHSKNE 357 Query: 356 DEVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 D+ H+ KN++E KK+IVLLTDG E + + IC+ AK +GI+I TI++SV+ +++ Sbjct: 358 DDEHKKKNSVEVKKFIVLLTDGAQNEGVHSVDSVLKICDAAKEEGIKIFTISYSVDSSER 417 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +KA FLS CASP+ FFEA +LN IF++ IG+ IFER+++I + Sbjct: 418 KKANDFLSRCASPDKFFEAYDADKLNMIFKEHIGDAIFERLVKIRR 463 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 353 bits (904), Expect = 5e-95, Method: Composition-based stats. Identities = 111/464 (23%), Positives = 182/464 (39%), Gaps = 83/464 (17%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +++ G+F I+TA+L PV+LG GM + V + L++AA +A + + L Sbjct: 11 RRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLISKQQLQEAADSAALATATALANG 70 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR----KS 127 + S E RNF N+ V DI T V + + Sbjct: 71 TIQTSQ-------------AEAFARNFVAGQMANYLQSGV-DIKSATGVTVQTNTSGNST 116 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 +YQV +S YDL +NPL ++++G + + T T+ + +S+ +D S Sbjct: 117 SYQVTVSPSYDLTVNPL---MQAVGFTTQHLST--SGTTIGGHSQTQGSISMYLALDKSG 171 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + ++ Y CN G Sbjct: 172 SMGEDTATVNE---------------------------EDPTESYTYDCN---------G 195 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + + S + K ++ A ++ + D VR GA ++ Sbjct: 196 HYNKKGKWIYDTCTGSRANYYTKIEALKMAAGNLFGQLSSADPNAQYVRTGAVSYDIVQY 255 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRM 361 + + +WG V T+ G T + AM TAY ++ + N ED H++ Sbjct: 256 TPSALAWGTSG-----VSTYVNALQAGGGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKL 310 Query: 362 KNNLEAKKYIVLLTDGENTQDNEE-------GIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K KKYIV +TDG+N D+ A C+ AKS+GI I TIAF E Sbjct: 311 KTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMA----PEG 366 Query: 415 ARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + L CAS +S +F+A +L F+ IG + ++ R+T Sbjct: 367 GQALLHYCASDDSHYFQAEKMEDLLAAFK-AIGAKASSQLTRLT 409 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 350 bits (897), Expect = 3e-94, Method: Composition-based stats. Identities = 108/467 (23%), Positives = 184/467 (39%), Gaps = 84/467 (17%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +++ G+F I+TA+L PV+LG G+ + V + L++AA +A + + L Sbjct: 11 RRMFSDRGGNFGIMTAILAPVLLGAAGLAIQVGDMLLSKQQLQEAADSAALATATALGNG 70 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP----RKS 127 + S E RNF N+ V DI TAV + + + Sbjct: 71 TIQTSQ-------------AEAFARNFVAGQMANYLQNGV-DIKNATAVNVQTSNSGKSA 116 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE---AETVSRSYHKEHGVSIQWVID 184 +YQV ++ YDL +NPL ++++G + + T + + + + + VS+ +D Sbjct: 117 SYQVTVTPSYDLTVNPL---MQAVGFSTQHLSTSSTTVSGPSQTPGSNSQGSVSMFLALD 173 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S SM D +D+ + CN Sbjct: 174 KSGSMGDPTETVN---------------------------KDQPTETFTYDCN------- 199 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 L+ + + S + K ++ A ++ + D VR GA ++ Sbjct: 200 --PHLNKKGKWVYDTCTGSRTNYYTKIEALKMAAGNLFGQLTSADPDAQYVRTGAVSYDI 257 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEV 358 + + +WG V ++ G T + AM TAY ++ + N ED Sbjct: 258 DQYTPSTLAWGTSG-----VSSYVNALQAGGGTNSSGAMGTAYSSLTAKNAAGNDAEDAA 312 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEE-------GIAICNKAKSQGIRIMTIAFSVNKTQ 411 H++K KKYIV +TDG+N D+ A C+ AKS+GI I TIAF Sbjct: 313 HKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPP-- 370 Query: 412 QEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + L CAS +F+A +L F+ IG + ++ R+T Sbjct: 371 --GGQALLQYCASDAAHYFQAEQMEDLLAAFK-AIGAKASAQLTRLT 414 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 341 bits (873), Expect = 2e-91, Method: Composition-based stats. Identities = 109/464 (23%), Positives = 186/464 (40%), Gaps = 82/464 (17%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +++ G+F I+TA++ PV+LGV G+ + V + L++AA +A + + L Sbjct: 11 RRMFSDRGGNFGIMTAIMAPVLLGVAGVAIQVGDMMLSKQQLQEAADSAALATATALANG 70 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP----RKS 127 + S E +NF N+ V D T+V + + + Sbjct: 71 TIQTSQ-------------AEAFAQNFVAGQMANYVQSGV-DFKSGTSVNVQTSTSGKST 116 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 +YQV +S YDL +NPL ++++G K+ + T TV + +S+ +D S Sbjct: 117 SYQVTVSPSYDLTVNPL---MQAVGFKTQHLST--SGTTVGGHSQTQGSISMFLALDKSG 171 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + D+ Y CN L+Y Sbjct: 172 SMGEATATVNA---------------------------DDPTESYTYDCN--LHYN---- 198 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + + S + K ++ A ++ + D + VR GA ++ Sbjct: 199 --SKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLFGQLNSADPNAEYVRTGAVSYDINQY 256 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRM 361 + + +WG + + G T + AM TAY ++ + N ED H++ Sbjct: 257 TPSNLAWGTAGVTSYV-----NALQANGGTNSSGAMSTAYSSLTAKNAAGNDAEDSAHKL 311 Query: 362 KNNLEAKKYIVLLTDGENTQDNEE-------GIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K KKYIV +TDG+N D+ A C+ AKS+GI I TIAF Sbjct: 312 KTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMA----PAG 367 Query: 415 ARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + L CAS +S +F+A +L F+ IG + ++ R+T Sbjct: 368 GQTLLHYCASDDSHYFQAEKMEDLLAAFK-AIGAKASAQMTRLT 410 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 338 bits (865), Expect = 1e-90, Method: Composition-based stats. Identities = 106/464 (22%), Positives = 181/464 (39%), Gaps = 82/464 (17%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +++ G+F I+TA+L PV+LG GM + V + L++AA +A + + L Sbjct: 11 RRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSKQQLQEAADSAALATATALANG 70 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP----RKS 127 + + E RNF N+ DI T+V + + + Sbjct: 71 TIQTTE-------------AEAFARNFVAGQMANYLQSGT-DIKSTTSVNVQTTTSGKST 116 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 +YQV +S Y L +NPL ++++G + + T T+ + +S+ +D S Sbjct: 117 SYQVTVSPAYVLTVNPL---MQAVGFTTQHLST--SGTTIGGHSQTQGSISMFLALDKSG 171 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + + Y CN L+Y Sbjct: 172 SMGEDTATVNE---------------------------ESPTESYTYDCN--LHYNTK-- 200 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + + S + K ++ A ++ + D VR GA ++ Sbjct: 201 ----NNKWVYDKCTGSRTNYYTKIEALKMAAGNLFSQLNSADPNAQYVRTGAVSYDINQY 256 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRM 361 + S +WG+ V ++ G T + AM TAY ++ + N E+ H+ Sbjct: 257 APSSLAWGITG-----VSSYVNALQANGGTNSSGAMNTAYTSLTAKNAAGNDVENSAHQQ 311 Query: 362 KNNLEAKKYIVLLTDGENTQD-------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K KKYIV +TDG+N D + C+ AKS+GI I TIAF Sbjct: 312 KTGQVPKKYIVFMTDGDNNNDPSGGRSYDTATKKTCDDAKSKGIEIYTIAFMA----PAG 367 Query: 415 ARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + L CAS +S +F+A +L F+ IG + ++ R+T Sbjct: 368 GQALLHYCASDDSHYFQAEKMEDLLAAFQ-AIGAKASAQLTRLT 410 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 320 bits (818), Expect = 4e-85, Method: Composition-based stats. Identities = 107/468 (22%), Positives = 199/468 (42%), Gaps = 79/468 (16%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK-------QAAQTAIIT 63 ++++ G+F ++TAL+ P++L VGG+ VDV ++ L+ AA +A+++ Sbjct: 11 FMRMLRDRGGNFGMMTALVAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVS 70 Query: 64 ASVPLIQSLEEVSSR-------AKNSFTFPKQ--KIEEYLIRNFENNLKKNFTDREVRDI 114 + P I+ ++++ + A + P + I N ++ + +V Sbjct: 71 DARPDIEEAKDLARKFLKTQAAAATASDLPDEGPSIGARGGGNADDEVPATPRWEDVNAT 130 Query: 115 -VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + TA + ++QV +++++ L N ++ R +G +S I+T++ AE+ + S Sbjct: 131 EIDITATPNGAKGKSFQVTVANKHLLQFNAMT---RLLGPESIEIETRSTAESATES--- 184 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 ++ +S+ V+D S SM C P Sbjct: 185 KNALSMYLVLDRSGSMAWKTNTINTGKAKC---------------------------PNY 217 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 N S Y L ++ + K ++ A+ ++ + D + Sbjct: 218 TEANWSKYPDLK----------------ATGPCYVTKIDALKTAVGDLLAQLVTADPESA 261 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 VR GA +N + S SWG + AI G TA +A +TA+ + ++ Sbjct: 262 YVRTGAISYNSAQDAASSLSWGTRGAAGYVDALVAI-----GGTASGNAFKTAFQKVTNA 316 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI--AICNKAKSQGIRIMTIAFSVNKTQ 411 ED H KN KYIV +TDGEN N++ + C+ AK+ ++I ++AF Sbjct: 317 AEDSEHGAKNGQVPTKYIVFMTDGENNHANDDTVTRQWCDTAKASKVQIYSVAFMA---- 372 Query: 412 QEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ + L +CAS +S +FEA +L F+ IG V R+TK Sbjct: 373 PDRGQKLLKSCASSSSHYFEAEEASDLVAAFK-AIGERAAASVSRLTK 419 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 318 bits (814), Expect = 1e-84, Method: Composition-based stats. Identities = 100/453 (22%), Positives = 185/453 (40%), Gaps = 72/453 (15%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++++ G+F I+TA+++PV+ G GM + V + L++AA +A + + L Sbjct: 12 RRMLGDRGGNFGIMTAIVLPVLFGAAGMAIQVGDLLLSKQQLQEAADSAALATATALANG 71 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM----NPRKS 127 + S E R+F N+ + DI T V++ + + + Sbjct: 72 TIQTSQ-------------AEAFARDFVAGQMANYLQSGI-DIKSTTGVDVRTTTSGKST 117 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 +YQV +S Y++ +NPL ++++G + + T S + + VS+ V+D S Sbjct: 118 SYQVTVSPDYNIAVNPL---MQTIGFTTQ--NISTSSTTTSGNSQTQGSVSMFLVLDRSG 172 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + + Y C++ Y Sbjct: 173 SMGEDTATVNAS---------------------------DPTEEYNYDCSEKDRYGNVTK 205 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + H K ++ A+ ++ + +D + VR GA +N + Sbjct: 206 KK---------TCTDTRPHYYTKIEALKLAVGTLTGELDAVDPEKEYVRTGAVSYNIEMQ 256 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + WG + + + K A D T +A +TAY+ + + ED+ H K Sbjct: 257 KAKALDWGTAHVTKYVNKLTATD-----GTDSGEAFKTAYNKLADAAEDKAHVDKTGQVP 311 Query: 368 KKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS- 424 KYIV +TDG+N + E C+KA+ +++ TIAF + + LS CA+ Sbjct: 312 TKYIVFMTDGDNNYTSADTETKTWCDKARDAKMQVYTIAFMA----PARGQALLSYCATA 367 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 P ++F A L K F++ IG + +V R+T Sbjct: 368 PGNYFPAGDMTALLKAFKE-IGMKASNQVTRLT 399 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 313 bits (800), Expect = 5e-83, Method: Composition-based stats. Identities = 108/452 (23%), Positives = 172/452 (38%), Gaps = 84/452 (18%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 TA+L PV+LG GM V V + L++AA +A + + L + S Sbjct: 1 TAILAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGKIQTSE-------- 52 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR----KSAYQVVLSSRYDLLL 141 E RNF N+ V DI T V + ++YQV +S YDL + Sbjct: 53 -----AEAYARNFVAGQMANYLQSGV-DIKSATGVSVQTNTSGNSTSYQVTVSPSYDLTV 106 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRS-YHKEHGVSIQWVIDFSRSML-DYQRDSEGQ 199 NPL ++++G + + T S + +S+ +D S SM D +E Sbjct: 107 NPL---MQAVGFTTQHLSTSGTTIGGGHSQTQGQGSISMYLALDKSGSMGEDTATVNEED 163 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 P + P + + E Sbjct: 164 PTESYTYPCNP-------------------------------------HYNRKGKEVWDT 186 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + S + K ++ A ++ + D VR GA ++ + S +WG Sbjct: 187 CTGSRANYYTKIEALKMAAGNLFAQLSGADPNAQYVRTGAVSYDIVQYAPSSLAWGAIG- 245 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRMKNNLEAKKYIVL 373 V ++ G T + AM TAY ++ + N ED H++K+ +KYIV Sbjct: 246 ----VSSYVNALQAGGGTNSSGAMSTAYLSLTAKNAAGNDAEDSAHKLKSGQIPQKYIVF 301 Query: 374 LTDGENTQDNEE-------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP- 425 +TDG+N D+ A C+ AKS+GI I TIAF + L CAS Sbjct: 302 MTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPP----GGQALLQYCASDA 357 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + +F+A +L F+ IG + +V R+T Sbjct: 358 SHYFQAEKMEDLFAAFK-AIGAKASTQVTRLT 388 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 283 bits (723), Expect = 5e-74, Method: Composition-based stats. Identities = 91/380 (23%), Positives = 147/380 (38%), Gaps = 70/380 (18%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPR----KSAYQVVLSSRYDLLLNPLSLFLRSM 151 RNF N+ V DI T V + ++YQV +S YDL +NPL ++++ Sbjct: 1 RNFVAGQMANYLQSGV-DIKSATGVTVQTNTSGNSTSYQVTVSPSYDLTVNPL---MQAV 56 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 G + + T T+ + +S+ +D S SM + Sbjct: 57 GFTTQHLST--SGTTIGGHSQTQGSISMYLALDKSGSMGEDTATVNE------------- 101 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 ++ Y CN G + + + S + K Sbjct: 102 --------------EDPTESYTYDCN---------GHYNKKGKWIYDTCTGSRANYYTKI 138 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 ++ A ++ + D VR GA ++ + + +WG V T+ Sbjct: 139 EALKMAAGNLFGQLSSADPNAQYVRTGAVSYDIVQYTPSALAWGTSG-----VSTYVNAL 193 Query: 332 NEMGSTAINDAMQTAYDTIISSN------EDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T + AM TAY ++ + N ED H++K KKYIV +TDG+N D+ Sbjct: 194 QAGGGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKTGQTPKKYIVFMTDGDNNDDSSG 253 Query: 386 -------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHEL 437 A C+ AKS+GI I TIAF E + L CAS +S +F+A +L Sbjct: 254 GRSYDTLTKATCDTAKSKGIEIYTIAFMA----PEGGQALLHYCASDDSHYFQAEKMEDL 309 Query: 438 NKIFRDRIGNEIFERVIRIT 457 F+ IG + ++ R+T Sbjct: 310 LAAFK-AIGAKASSQLTRLT 328 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 283 bits (722), Expect = 6e-74, Method: Composition-based stats. Identities = 103/472 (21%), Positives = 193/472 (40%), Gaps = 91/472 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI-------ITASV 66 +++ G+F ++TAL+ P++L VGG+ VDV ++ L+ A A ++ + Sbjct: 3 MLRDRGGNFGMMTALIAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVSDAR 62 Query: 67 PLIQSLEEVSSR-------AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 P I+ + ++ + A +S P + + + ++ D ++V Sbjct: 63 PDIEEAKAIARKFLKTQMAATSSADVPGEAVGTMAA---AGSTAPSWDDVNTSEVV-IVE 118 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + ++QV +++++ L N ++ R +G +S ++T++ A++ + S ++ +S+ Sbjct: 119 TPNGTKGKSFQVSVANKHLLQFNAMT---RLLGKESIELETRSTADSATES---KNAISM 172 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 V+D S SM + R + +S G+ +R SP V Sbjct: 173 YLVLDRSGSMAWKTDTVDTSR--------PRCINWTASNWGESNVR--ATSPCYVD---- 218 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 K ++ A+ + + K+D N+ +R GA Sbjct: 219 ------------------------------KITTLKSAVDKLFTPLAKMDPGNEYLRAGA 248 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +NDR +WG + G T + A A + ++ E+E H Sbjct: 249 ASYNDRQDRASKLTWGTKNA-----SAHVQGLDATGGTDSSSAFAAAVEELLLDGENEAH 303 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEE------------GIAICNKAKSQGIRIMTIAFSV 407 KN +KYIV +TDGENT N + A C AK+ GI I T+AF Sbjct: 304 LAKNGQTPEKYIVFMTDGENTSYNGKTSPRDLEKADSVTKAACTTAKNNGIAIFTVAFMA 363 Query: 408 NKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ + L CA SP+ + EA+ L F ++IG + + R+TK Sbjct: 364 ----PQRGKDLLKACATSPDHYKEADDAAALVSEF-EKIGQKAAAMIARLTK 410 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 280 bits (716), Expect = 3e-73, Method: Composition-based stats. Identities = 96/451 (21%), Positives = 190/451 (42%), Gaps = 63/451 (13%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++F FY KK I S +F II AL + L + G L+ V+ W Y +++++ A AI+ Sbjct: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + ++ +L + R ++ K+ + + R +N++K++ + + +T ++ Sbjct: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA--VFYNTEIQN-- 120 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 +V SSR + + + + ++ + Y+ + S + + Sbjct: 121 ------IVNSSRISM---------------THMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 F +L+ + Q + F R ++G R L +V + S++ + Sbjct: 160 FIEHLLNQR---YNQKIVSFIPALLRI---------EMGERPIFLIELVVDLSGSMHCAM 207 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P D V+S+ + K +++AL + SI + +V + V MG + Sbjct: 208 NSDPED--------VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE----- 357 RV + SWG K+ + + + +D + T AM+ AY + S + Sbjct: 260 TTRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 Query: 358 --VHRMKNNLEAKKYIVLLTDGENTQ--DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 +L +K+I+ LTDGEN N I IC+KAK I+I+TI+ + + Sbjct: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN--- 374 Query: 414 KARYFLSNC-ASPNSFFEANSTHELNKIFRD 443 + L C +SP + + L +F++ Sbjct: 375 -GQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 271 bits (693), Expect = 1e-70, Method: Composition-based stats. Identities = 91/467 (19%), Positives = 181/467 (38%), Gaps = 87/467 (18%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + +G+F ++TA+L+PV+LGV G +++ + ++ + Sbjct: 11 RRFLADTSGNFGMMTAILLPVLLGVAGAGMELANVMQVKADMQN----------TADSAA 60 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM----NPRKS 127 L + ++I+E ++KN T E ++ +++ + N R Sbjct: 61 LAAATEARLREGKLSDEQIKEIAKNFIAAQMEKNLTAEEKIELEKNSPTRVTTTENARGK 120 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y V + ++ + LNP+ F +G K+ + A++ +K +S+ +D S Sbjct: 121 TYAVETTIKHQIQLNPMLGF---IGAKTLDLSVTGTAKSTI---NKGAPISMYLALDRSG 174 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + +C ++Y+S N K SP V+ S Sbjct: 175 SMSFKTDTVDTTKTSC---------QNYTSDNWSKYPNLAKTSPCYVNKAAS-------- 217 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV------NDTVRMGATF 301 ++ A+ ++ ++ K D ++ VR GA+ Sbjct: 218 --------------------------LKTAVGFLVATLNKADPTYTVNGGSELVRTGASV 251 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV--H 359 + S WG + + K I E G T ++ AY+ + +N DE H Sbjct: 252 YTHETYVAQSIGWGTSGVTSYVDKQ--IPEFPSGGTDARSSLNAAYNALKKANPDEARYH 309 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEG--------IAICNKAKSQGIRIMTIAFSVNKTQ 411 + K + ++YIVL+TDGE T ++ C AK GI+I ++AF Sbjct: 310 KEKGSESFERYIVLMTDGEMTGNSAAWNSSIDQSVRTTCETAKKDGIKIFSVAFMA---- 365 Query: 412 QEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 +K + L CAS ++++ + ++ F + I + + +T Sbjct: 366 PDKGKSLLQYCASSADNYYAPENMEQIVTAFGE-IARKAAGSIATLT 411 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 269 bits (688), Expect = 5e-70, Method: Composition-based stats. Identities = 88/468 (18%), Positives = 182/468 (38%), Gaps = 87/468 (18%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++ + G+F ++TA+L+PV+LG G +++ + L+ Sbjct: 10 TRRFLADTGGNFGMMTAILLPVLLGFAGAGMELANVMQVKADLQN----------TADSA 59 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP----RK 126 +L + ++I+E + ++K T+ E + + +++ V + R Sbjct: 60 ALAAATEARLKEGALTDEQIKEIAKAFIASQMEKTLTEEEKKALEKNSPVNIGTTDDARG 119 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 Y + + Y + LNPL F G K+ + A + +K +S+ V+D S Sbjct: 120 KTYTIQTTINYQMQLNPLLGFF---GAKTLDLAATGTAVSTV---NKGAPISMYLVLDRS 173 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM + +C ++Y+ N + SP V+ Sbjct: 174 GSMSFKTDTLNTKKTSC---------QNYTVDNWGSYPNLKNTSPCYVN----------- 213 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV------NDTVRMGAT 300 K ++ A+ ++ ++ K D ++ VR GA+ Sbjct: 214 -----------------------KATSLKTAVGYLVATLNKADPTYTANGGSELVRTGAS 250 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN--EDEV 358 + + +WG + + K I E G T ++ AY+ + +N E + Sbjct: 251 VYTHETYAAQPITWGTSSVATYVDKQ--IPEFPSGGTDARSSLNAAYNALKKANTVEAKE 308 Query: 359 HRMKNNLEAKKYIVLLTDGENTQD--------NEEGIAICNKAKSQGIRIMTIAFSVNKT 410 H+ K + ++YIVL+TDGE T + ++ C+ AK GI+I ++AF Sbjct: 309 HKDKKSESFERYIVLMTDGEMTGNSSSWSSSIDQTVRNTCDTAKKDGIKIFSVAFMA--- 365 Query: 411 QQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 +K + L +CAS ++++ + ++ F + I + + +T Sbjct: 366 -PDKGKSLLQHCASSLDNYYAPENMEQIVTAFGE-IARKAAGSLATLT 411 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 259 bits (661), Expect = 7e-67, Method: Composition-based stats. Identities = 94/463 (20%), Positives = 177/463 (38%), Gaps = 75/463 (16%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI-------IT 63 ++K G+F ++TA+ P++L GG+ +D+ ++ L+ A A ++ Sbjct: 11 FITMLKDRGGNFGMMTAVAAPLLLAAGGVSIDMANMLMTKNQLQDATDAAALAAASALVS 70 Query: 64 ASVPLIQSLEEVSSRAKNS-----FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 P I + +E++ + + T + + ++ D ++ T Sbjct: 71 DEQPDIAAAKEIARKFLKTQAGGTTTPDAPADSGEGASSGAASSTPDWDDVNTLEV-NIT 129 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + +QV + ++ N ++ R +G S ++ + AE+ + S ++ +S Sbjct: 130 ETPNGTKGKIFQVTVINKRVTEFNAMT---RLLGTDSIELEASSTAESATES---KNALS 183 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 + V+D S SM +C P N Sbjct: 184 MYLVLDRSGSMAWKTNTINAAKKSC---------------------------PNYTESNW 216 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S Y L+ +S + K ++ A+ ++ + D VR Sbjct: 217 SRYPNLW----------------ASSPCYVTKIDALKTAVTDLLAQLLVADPDQIYVRTA 260 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 A +N + + +WG + G TA A +TAY +I++ E+ Sbjct: 261 AISYNSVQDTAGTLAWGTSGAAAYV-----NALVATGGTASAGAFKTAYQKVIAATENTA 315 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGI--AICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 H KN KY+V +TDGEN N++ + C+ AK+ + I ++AF E+ + Sbjct: 316 HAAKNGQVPSKYMVFMTDGENNYANDDTVTKQWCDTAKANKVEIYSVAFMA----PERGQ 371 Query: 417 YFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L CAS +S +FEA +L F+ IG V R+TK Sbjct: 372 ALLKYCASSSSHYFEAEEVTDLVAAFK-AIGERAAAVVSRLTK 413 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 257 bits (656), Expect = 2e-66, Method: Composition-based stats. Identities = 94/449 (20%), Positives = 190/449 (42%), Gaps = 70/449 (15%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ--SLEEVSSRAKNS 82 +TA+++ V +D+ Y + ++ A A+++ ++ ++++ +++ + Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 T K++I+++L + + +++N D + + T + NP Q + S+ + Sbjct: 61 STIFKKQIKKHLKQG--SYIRENAGDIAQKAQINITKDKNNP----LQYIAESKAQYEIP 114 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY----QRDSEG 198 +LFL+ + I S L + + + +SI V+D SRSM D D+ Sbjct: 115 TENLFLKGL-IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 N + P KS+ S+N KS Y P P + Sbjct: 174 MTSNKYLLPPPPK-KSFWSKN----------------TTKSKYA---PAPAPANR----- 208 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDTVRMGATFFNDRVISD--PSFSW 314 K ++ ++ +++ SI+K + N +VR+G +N ++ + S Sbjct: 209 -----------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 +++ VK+ N +T AM AY + + E H + KK+++ + Sbjct: 258 NLNE-----VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS-HNTIGSTRLKKFVIFI 311 Query: 375 TDGENT-----QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSF 428 TDGEN+ Q+ + IC ++ G++I ++A S + + L C S F Sbjct: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----EGQDLLRKCTDSSGQF 367 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRIT 457 F N + EL + F D+I ++I E+ +RI Sbjct: 368 FAVNDSRELLESF-DKITDKIQEQSVRIA 395 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 251 bits (639), Expect = 2e-64, Method: Composition-based stats. Identities = 97/455 (21%), Positives = 179/455 (39%), Gaps = 71/455 (15%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L++ G+F ++TA+L+PV +GV G+ +D AL+ + A + A+ + + Sbjct: 41 RLLRHSGGNFGMMTAVLLPVSIGVAGLAMDATEMVQSRSALQSSVDAAALAAASAMSNGM 100 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E + + +F ++ + R+ + T E V+ T ++N ++Y V Sbjct: 101 SE-ADAIALAKSFLSSQLANTMARDENTSSVDQITQAEPDISVKTT--QVNSSSTSYDVE 157 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L+ Y + +NPLS R +G ++ ++ +A+ + + E +S+ V+D S SM D Sbjct: 158 LTGSYTITMNPLS---RVLGWETVTLKAYGKAQAATTAS--ESPLSMYLVLDRSGSMNDE 212 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + K + Sbjct: 213 TATTYTGTCTKTTTSGYGWNKKTT------------------------------------ 236 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 S S K ++ A+A + +KK D ++ VR GA +N + + Sbjct: 237 ------TTSYSCTKNYTKIESLKLAVADLAAQLKKADPNSEYVRTGADSYNASADTAQAM 290 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN--EDEVHRMKNNLEAKKY 370 SWG ++ + + G T A+ AY + +SN E H + + + +Y Sbjct: 291 SWGTANVVTYV-----NALSATGGTDARGALSAAYSALQTSNKTEITAHNVSSVSKIGRY 345 Query: 371 IVLLTDGENTQD--------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 IV +TDGE T + + + C K+ GI+I T+AF + LS C Sbjct: 346 IVFMTDGEMTGNSSSWSSSIDSAVRSQCTSIKADGIQIYTVAFMA----PANGKSLLSAC 401 Query: 423 ASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 AS + ++EA L F + IG + R+ Sbjct: 402 ASDASHYYEATDAASLVAAFGE-IGKKATSTSTRL 435 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 244 bits (623), Expect = 2e-62, Method: Composition-based stats. Identities = 102/449 (22%), Positives = 186/449 (41%), Gaps = 63/449 (14%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +LI TG+F I+TA+ +PV+ G+ VDV + L+QA A + + L Sbjct: 13 QLIHDRTGNFGILTAIAIPVVAATAGVAVDVTNMTVSNSQLQQATDAAALATATALANGN 72 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 S+ + + F ++ YL + + T+ + ++Y V Sbjct: 73 ATTSNAQQLATQFVTGQMSNYLSGDT-----NTADALKAGTTANVTSATNSSGGTSYTVA 127 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +++ YD+ +N +S + +GIK+ + + + + S + K+ +S++ +D S SML Sbjct: 128 VNASYDMSVNGMS---QLLGIKTMHVSAASTSTSGSAAAAKQAALSMEIALDKSGSMLLN 184 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + +C + Y N Y YP P Sbjct: 185 TDVIDTSQKSC--------------------------TQYYTEGN---YLYQYPKAKSPC 215 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 IKK ++ A+ +++ + D + VR A ++ V S + Sbjct: 216 --------------YIKKIAALKTAVGTLLDQLDSADPKSQYVRTAAIAWSSEVDSSSAL 261 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 +WG ++ N G T + M AY + +S+E K N +K IV Sbjct: 262 AWGTTTTRSNVIS----GLNANGGTESSAPMALAYKNVSASSEATAQAAKGNTTFQKIIV 317 Query: 373 LLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFF 429 L+TDGEN + + +A C AK G+ I ++AF ++ + L NCA SP+++F Sbjct: 318 LMTDGENNATSSDTKTLATCKAAKDAGVLIYSVAFMA----PDRGQTLLKNCASSPSNYF 373 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRITK 458 +A +L F+ IGN+ +++ +TK Sbjct: 374 DAQQMSDLIAAFK-TIGNQASKQITLLTK 401 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats. Identities = 92/474 (19%), Positives = 183/474 (38%), Gaps = 115/474 (24%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++++ G+F I+TA++MPV+LG G+ +D + + L++A +A + A+ L Sbjct: 11 RRMLGDRGGNFGIMTAIMMPVLLGAAGLAIDYSNMALSKRELQEATDSAALAAATALASG 70 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV-RDIVRDTAVEMN----PRK 126 ++ A E + ++F + N+ D + I T+V+++ Sbjct: 71 AASTTADA------------EAIAKDFVSGQMANYVDTDAISSIKAGTSVDIDVSATATS 118 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 +Y+V +++ Y + P F+ +G K+ + A T S + + +S++ V+D S Sbjct: 119 KSYKVTVATSYGIAATP---FMSVLGYKT--LNIGASTSTSSGTSDTKTALSMELVLDQS 173 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 +G + + Y Sbjct: 174 G---------------------------------SMGEKTTTCATY-------------- 186 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN--- 303 + + + + K ++ A ++ ++ D + VR GA +N Sbjct: 187 -------------NGKNCKTYVTKIDALKKAADALFDALDTADPDHSLVRTGAYSYNNGL 233 Query: 304 ------DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN--- 354 ++ S +WG T+ G T + M+ A +I ++ Sbjct: 234 IYNSQKTQIKSMSGMAWGTA-----TTATYVSGITASGGTDATEPMRQATLSIAKASDGS 288 Query: 355 --EDEVHRMKNNLEAKKYIVLLTDGENTQD--------NEEGIAICNKAKSQGIRIMTIA 404 E + H +K N +YI+L+TDGE T + ++ C+ K+ GI+I T+A Sbjct: 289 DVETQAHAVKGNTIVSRYIILMTDGEMTGNTGVWQSSFDQNVRNQCDATKTAGIKIFTVA 348 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F +K + L CASP +++EA + +L F I E + V +T Sbjct: 349 FMA----PDKGKQLLQYCASPGGNYYEAETMEKLVASFTS-IAKEATKAVTLLT 397 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 231 bits (589), Expect = 1e-58, Method: Composition-based stats. Identities = 72/311 (23%), Positives = 115/311 (36%), Gaps = 59/311 (18%) Query: 161 KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 TVS + +S+ +D S SM D Sbjct: 3 STSGRTVSGHSQSQGSISMFLALDKSGSMGDPTATVNA---------------------- 40 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 D+ P+ CN P + + S H K ++ A + Sbjct: 41 -----DDPTEPFTYDCN--------PHLNKKGTKIIYDTCTGSRAHYYTKIEALKIAAGN 87 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + D + VR GA ++ + +WG+ T V ++ G T + Sbjct: 88 LFSQLNSADPNAEYVRTGAVSYDLVEYTPSKLAWGI-----TAVTSYVNALESGGGTNSS 142 Query: 341 DAMQTAYDTIISSN------EDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-------GI 387 A+ TAY ++ + N ED H++K KKYIV +TDG+N D+ Sbjct: 143 GAVNTAYTSLTAKNAAGNDAEDAAHKLKTGQLPKKYIVFMTDGDNNDDSRGGRSYDTLTK 202 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIG 446 A C+ AK++GI TIAF E + L CAS + +F+A +L F+ IG Sbjct: 203 ATCDTAKAKGIETYTIAFMA----PEGGQALLHYCASDDAHYFQAEKMEDLLAAFK-AIG 257 Query: 447 NEIFERVIRIT 457 + +V R+T Sbjct: 258 AKASAQVTRLT 268 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 222 bits (565), Expect = 8e-56, Method: Composition-based stats. Identities = 85/463 (18%), Positives = 174/463 (37%), Gaps = 83/463 (17%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + F+SK + +G F I++A ++ V L ++D+ + ++ ++ + AII+ Sbjct: 11 RRIYFFSK----NKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAIISG 66 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ S N +++I + + +N N+ +NF I+ + + + Sbjct: 67 CSIVV------SDPKINDLNPQEERIRDVIKKNAYVNMVQNFPAEHAAYIIENANISFSK 120 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 L+++Y+ + Sbjct: 121 D-------LTNKYEYKIT------------------------------------------ 131 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN--GKVGIRDEKLSPYMVSCNKSLYY 242 M + S + F P T S S K + ++ C+ S+ Sbjct: 132 ----MEAKHQLSGKNFILGFLMPNVITHISSISTGIIQKPSDKKAFSVEMVLDCSGSMLD 187 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA----LASVIRSIKKIDNVNDTVRMG 298 + S ++ S + K + ++ A + + +++ ++ R+G Sbjct: 188 SMQESCDLSSGRGGYYFYSKNNNKPKSKIYALKTASSDFVNLIQETVQTFPQIS--ARIG 245 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS-SNEDE 357 FN ++ D S + + +TI + G T M AY+ + + NE + Sbjct: 246 LITFNHYIMQDSKLSNNFNVIKKTISRM-----KPKGGTDTFLPMNAAYEYLNNIPNETK 300 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEE--GIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 H + +N+ K+YI+L+TDGEN + + I +C+ A+ GI I +I +N + Sbjct: 301 AHNISDNVPLKRYIILMTDGENNHPSYDLKTINVCDNARKNGIIIYSI--FLNYYEYTDG 358 Query: 416 RYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 CAS FF AN+T L F+ I + I ++ +RI Sbjct: 359 YELARKCASSEKHFFYANNTKALLDSFKS-IAHAIQDKAVRIA 400 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 206 bits (524), Expect = 5e-51, Method: Composition-based stats. Identities = 93/450 (20%), Positives = 185/450 (41%), Gaps = 54/450 (12%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 F+F KK+I S +F I+ ++++ +L G+L+ V+ + + ++A++ A +AI++ Sbjct: 5 RTFLFNFKKIILSPKANFSILFSVILISILLFIGILIYVLDYYHKKNAMENANTSAILSG 64 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + +I +S N + + I + + R ++ +K++ + + + Sbjct: 65 ASKIIS---RISYFGDNMSSHTHRAIVDDVTRFIKSYIKESLLMDSSVFDISEKNI---- 117 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + +S+ + P +S L K YH + I Sbjct: 118 ------ISQNSKVSITREPHPNVFHEFNNQSILQNKKT-------FYHISVETFYDYHIK 164 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 F ++L+ + + F + G +V + S+ ++ Sbjct: 165 FFDNLLNKK---INSKIISFVPALVKI---------DTGEHPFFFVQLVVDLSASMSCLM 212 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK-IDNVNDTVRMGATFFN 303 DP + E V S ++ K ++ A+ + S+ + DT +G T + Sbjct: 213 NS---DPEHATEFSVCGKSKKN--SKMDALKKAVLLFLDSVDRGSKTQKDTHYIGLTGYT 267 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE-----DEV 358 RV + SWG K+ + IV+ ID N +G T AM+ AY + S + + + Sbjct: 268 TRVEKNIEPSWGTGKVRKYIVEE--IDVNMLGQTDSTPAMKKAYQILTSDKKRNFIRNIL 325 Query: 359 HR--MKNNLEAKKYIVLLTDGENTQD--NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 H+ L +K+++ LTDGEN + + I IC KAK I+I+TI+ + + Sbjct: 326 HKRIKIPPLPFQKFLIFLTDGENNDPKSDVKTIKICEKAKKNSIKILTISINAS----AN 381 Query: 415 ARYFLSNCAS-PNSFFEANSTHELNKIFRD 443 + L C S P ++ T L ++F+D Sbjct: 382 GKRLLKKCVSAPEYYYNVVDTGSLLRVFQD 411 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 199 bits (505), Expect = 7e-49, Method: Composition-based stats. Identities = 90/455 (19%), Positives = 168/455 (36%), Gaps = 85/455 (18%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + G I+TA+ +P++ V GM+++V + + L +++ A+ Sbjct: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 ++ + + +I+ +F N L+ N ++ DIV R Sbjct: 63 TQIMNEGNGNNRKKLKGGDILC-RIKNTWNMSFRNELRDNGFVNDIDDIV---------R 112 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 ++ +V+ + ++ + +S Sbjct: 113 STSLDIVVVPQ-----------------------------------NEGYSIS------- 130 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 ++ + P N + + S + S + M+ Sbjct: 131 --------------AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 LD S S E F DSS I K + ++ +++ +K I +VN+ V+ G F+++ Sbjct: 177 V--LDVSRSMESFFDSS-----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + WGV L R I ST ++ AY+ I H + Sbjct: 230 IEEFFLLEWGVSHLQRKIKYLSKFGV----STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 Query: 366 EAKKYIVLLTDGEN--TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 KK IV +TDGEN T+++++ + CN+AK +G + I V ++ FL CA Sbjct: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACA 340 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 SPNSF+ + H + F IG +I + I K Sbjct: 341 SPNSFYLVENPHSMYDAF-SHIGKDIVTKRIWYDK 374 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 189 bits (480), Expect = 7e-46, Method: Composition-based stats. Identities = 88/454 (19%), Positives = 166/454 (36%), Gaps = 88/454 (19%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + +C G I+TA+L+PV+ V G++++ + + L +++ + Sbjct: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 ++ + +F + KN + R+ +R Sbjct: 63 IL--------------NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR----------- 97 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + F + + + E S+ +ID Sbjct: 98 ----------------ENGFAQDIN-------------------NIERSTSLSIIID--- 119 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 D +D ++ + P + + + ++ + +KS + Sbjct: 120 ---DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA---PLLITSSVKISSKSDIGLDMMM 173 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 LD SLS + K + ++ ++ IK I +VN+ VR G F+ +++ Sbjct: 174 VLDVSLSMNDHFGPG-----MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 +WGV + I + T ++ AY+ I + E H K + + Sbjct: 229 QTFPLAWGVQHIQEKINRLIFGST-----TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 Query: 368 KKYIVLLTDGEN---TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 KKYI+ LTDGEN DN+E + CN+AK +G + I Q FL NCAS Sbjct: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-----FLKNCAS 338 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 P+ F+ ++ +L+ F RIG E+ ++ I K Sbjct: 339 PDRFYSVQNSRKLHDAFL-RIGKEMVKQRILYNK 371 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 186 bits (472), Expect = 6e-45, Method: Composition-based stats. Identities = 54/193 (27%), Positives = 91/193 (47%), Gaps = 13/193 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 KK ++++ A+ +I ++ D VR+GA+ +N ++ W K K F Sbjct: 181 KKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNSKLTGSTKLRWNPGKT-----KEFV 235 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEG 386 G T DA AY + E+ H K+ KK+IV +TDG+N + Sbjct: 236 DALPASGGTDSTDAFDWAYTAVTHKRENNTHDAKSGQVPKKFIVFMTDGDNNYSSADSST 295 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRI 445 +C+ AK GI + T+AF+ + + LS CAS FF+A ++ +L + F++ I Sbjct: 296 KHLCDDAKDDGIEVYTVAFAA----PNRGKQLLSYCASTEEHFFDAQNSAQLIEAFKN-I 350 Query: 446 GNEIFERVIRITK 458 G + V R+T+ Sbjct: 351 GYAASKVVSRLTE 363 Score = 91.2 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 88/248 (35%), Gaps = 19/248 (7%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + T+ +KL+++ G+F +I A +PV+ G + VD + L+ A +A Sbjct: 1 MMSTRITSKIRKLLRNENGNFALIAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAA 60 Query: 62 ITASVPLI-QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + L + + + F +++E F V V V Sbjct: 61 LATAARLSEEENLTAAQAQAFALKFVNGQVKEDFGA---------FNGFSVTPTVNIDPV 111 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 E R + ++V +S L P++ R MG + ++E+ + + S+ Sbjct: 112 ETGGR-TVWKVAVSMEGSQSLTPMA---RIMGKDKLTVSVVGKSESAGEA---QGAFSMA 164 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 V+D S SM + + +N ++ + + + S Y S Sbjct: 165 LVLDRSGSMD--WNLNGQKKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNSKLTGST 222 Query: 241 YYMLYPGP 248 PG Sbjct: 223 KLRWNPGK 230 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 164 bits (414), Expect = 3e-38, Method: Composition-based stats. Identities = 47/186 (25%), Positives = 88/186 (47%), Gaps = 12/186 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A+ S+I+ ++ +D + R+GA ++ +W + V+++ Sbjct: 253 SRMSELKKAVRSLIKELQTVDPDDQFTRLGAYAYHWYYAGKKELTWNKNS-----VRSWV 307 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T A+Q A + +++++E H KN E +I+ +TDG + N Sbjct: 308 NSLPASGGTRAAPAIQKAKNDLLTNSELNAHINKNEQEPDLFILYMTDGIDGDPNWAKRE 367 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGN 447 C AK+ GI I T+AF + R L CA S +++A + +ELNK+F+D I Sbjct: 368 -CTSAKNAGITIYTVAFKAPAS----GRNLLKACATSDAHYYDAKNANELNKVFKD-IAR 421 Query: 448 EIFERV 453 E + + Sbjct: 422 ETTKSI 427 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 41/267 (15%), Positives = 87/267 (32%), Gaps = 12/267 (4%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K G + +L+ ++L V + +D+ + L+ AA A + L+ + Sbjct: 76 KERDGSILPLFGILIMLLLAVVTIGIDMSQTFGERTRLQTAADMAAVQTGRALLAEEITI 135 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + + ++ F V+ V+ T ++ + Y V ++ Sbjct: 136 AQANAYAKDAFNRIASGLSASGDGSSGTSIFGTMTVKPAVQITE-TVDGNTTNYVVKVNG 194 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + +PLS I E+ET ++ +S+ V+D S SM Sbjct: 195 TAKIPASPLSFMFFDGETGKNTISLGFESETTAK-AEAGASLSMALVLDRSGSM------ 247 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 G + V+S + V D+ + + YY S Sbjct: 248 --GWERPSRMSELKKAVRSLIKELQTVDPDDQFTRLGAYAYHW--YYAGKKELTWNKNSV 303 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVI 282 +V+S + ++ A ++ Sbjct: 304 RSWVNSLPASGGTRAAPAIQKAKNDLL 330 >gi|218458490|ref|ZP_03498581.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 220 Score = 155 bits (391), Expect = 2e-35, Method: Composition-based stats. Identities = 56/274 (20%), Positives = 96/274 (35%), Gaps = 58/274 (21%) Query: 29 LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQ 88 + PV+LG GM V V + L++AA +A + + L + S Sbjct: 1 MAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGKIQTSE----------- 49 Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP----RKSAYQVVLSSRYDLLLNPL 144 E RNF N+ V DI T+V + + ++YQV +S YDL +NPL Sbjct: 50 --AEAYARNFVAGQMANYLQSGV-DIKGGTSVNVQTSTSGKSTSYQVTVSPSYDLSVNPL 106 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 ++++G K+ + T TV + +S+ +D S SM + Sbjct: 107 ---MQAVGFKTQHLST--SGTTVGGHSQTQGSISMFLALDKSGSMGESTATVNE------ 155 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 D+ + CN L+Y + + + S Sbjct: 156 ---------------------DDPTETFTYDCN--LHYN------SKNNKWVYDKCTGSR 186 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + K ++ A ++ + D VR G Sbjct: 187 TNYYTKIEALKIAAGNLFSQLNSADPNAQYVRTG 220 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 140 bits (351), Expect = 6e-31, Method: Composition-based stats. Identities = 54/301 (17%), Positives = 107/301 (35%), Gaps = 29/301 (9%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ----PLNCFGQPADRTV 212 + T H + +Q VID + SM + ++ + P Sbjct: 77 PVNTIWTKNLKYELEHSDFSSDVQNVIDDT-SMKLESDSNFKTLSITAISQYKMPFKICN 135 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 K S + S ++ + S+ + + + Sbjct: 136 IHLLCPKNKYVTVPVLSSMKIGRNEGSDIDLMIVLDVSSSMDDNFMKPEEAPC---SRLE 192 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + + ++ ++ +K+ N + R G+ FND V G+ ++ I K Sbjct: 193 VAKKSIRKMLEDFRKVPNYANVFRTGSVGFNDMVQFPMPLKRGLKRIYNDIKKYR----- 247 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAIC 390 GST M+ A++ + + +D R KK ++ LTDGEN + I +C Sbjct: 248 AFGSTNSYVGMKYAWEQLYGNPQDTKDR-------KKIVIFLTDGENMIINATRKTIELC 300 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 N K + I +IA +V+ + L C+S + + A+ L + + IG ++ Sbjct: 301 NDMKKKKAVIYSIALAVDN------KEVLQGCSSSGNVYAADDAQSLVQAY-SLIGKDVM 353 Query: 451 E 451 + Sbjct: 354 K 354 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 123 bits (307), Expect = 8e-26, Method: Composition-based stats. Identities = 41/121 (33%), Positives = 59/121 (48%), Gaps = 13/121 (10%) Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-------GIAICNKAKSQG 397 TA + + ED H++K KKYIV +TDG+N D+ C+ AKS+G Sbjct: 4 TAKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKG 63 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRI 456 I I TIAF + L CAS +S +F+A +L F + IG + +V R+ Sbjct: 64 IEIYTIAFMA----PAGGQALLHYCASDDSHYFQAEKMEDLLAAF-EAIGAKSAAQVTRL 118 Query: 457 T 457 T Sbjct: 119 T 119 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 50/259 (19%), Positives = 95/259 (36%), Gaps = 25/259 (9%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 Q P P D+ +Y+ ++ ++ V ++L P D S Sbjct: 320 QDLKPEGPSAYSVHPNDKGAAAYARCVQRMIDDIQECEDNGVD-PRTLMGDSKVDPNDAS 378 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +S + + + A +I K D + ++ + Sbjct: 379 SRHVVLALDTSGSMDGEPLNETKTATREFASTIFKSDAD-----VCLVSYDSSARNVID- 432 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 +K D + G T I DA++ +Y+ + S D K+ IV Sbjct: 433 ----STDNEYALKAAVRDLSAGGGTNIEDALRVSYERLEGSGSD-----------KRIIV 477 Query: 373 LLTDGENTQD--NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 L++DGE + ++ IA N+ K G+ I T+ F + + + + + + ASP +E Sbjct: 478 LMSDGEANEGLVGDDLIAYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIASPGCHYE 537 Query: 431 ANSTHELNKIFRDRIGNEI 449 + +L F D IG++I Sbjct: 538 VDDASQLRYFFGD-IGDDI 555 >gi|304312669|ref|YP_003812267.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] gi|301798402|emb|CBL46626.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] Length = 347 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 40/164 (24%), Positives = 69/164 (42%), Gaps = 30/164 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ HK +RT++ I G TAI DA+ A Sbjct: 138 RIGLILFGTQAYLQTPLTFD-HKTVRTLLNESRIGI-AGGQTAIGDAIGLA--------- 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + R+KN+ K ++LLTDG NT + + A QG++I T+ ++ + Sbjct: 187 --LKRLKNHKTGSKVLILLTDGANTAGSVSPVQAAELAARQGMKIYTVGVGADEMRIPGV 244 Query: 416 RYFLSNCASPNS-----------------FFEANSTHELNKIFR 442 F S +P++ +F A +T EL +I++ Sbjct: 245 LGFGSQIVNPSADLDEVTMKKIASLTGAQYFRARNTDELRRIYQ 288 >gi|329848522|ref|ZP_08263550.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] gi|328843585|gb|EGF93154.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] Length = 486 Score = 108 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 84/497 (16%), Positives = 156/497 (31%), Gaps = 65/497 (13%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++ ++S G+ +I L + ++ G +D + + + A +A++ A+ Sbjct: 8 MRYFVNAFLRSRGGNTTMIFGLAIFAIMAALGTAIDFAVLQRAKRSTQDALDSAVLAAA- 66 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 I + K + K+ + + K + R V+ + + + + Sbjct: 67 --IVNNSNEGDLKKLAADVFKENLGAADLDAKVTAFKYDAKARTVKATAQGSYDPVIMQL 124 Query: 127 SAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 ++ V S L + L S + + VS Sbjct: 125 FGFKNLPYAVTSDAIKAADGTLEVALVLDNTWSMSATVNGTPKIDILKTAAQGLVSTILT 184 Query: 183 IDF--------------------SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 D +R+M ++ + T + Sbjct: 185 KDNKDYVKIAVVPYADYVNVGMANRNMPWVSVAADYSTTSTKTCKTVSTATQCTGGTKGT 244 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 ++ PY SC + + S K V++ +AS Sbjct: 245 CTGNQDGVPYTYSCW------IVAQTCKTVNVTPYQSCSGGGTTNYKWYGCVKNQVASS- 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDP-SFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 K + T G + ++ S + TI T I Sbjct: 298 ---KVVMPDPTTPYGGLVQTSQTCLNPILPLSNDATVVTNTIKGLVVNIGGYKPETYIPG 354 Query: 342 AMQTAYDTIISSNEDEVHR--MKNNLEAKKYIVLLTDGENT--QDNEEGIAI-------- 389 M + + + NN E +K IVL+TDG NT + GIA+ Sbjct: 355 GMIWGVNALTPPAPFTEGKPYDANNKEPRKTIVLMTDGANTLYANTSGGIAVANATQVAV 414 Query: 390 --------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 C+ AKS+ I I TI F V + KA L CA+ +F+A S+ +L K Sbjct: 415 TYSDQIRVCDYAKSKKIEIYTIGFDVTDS---KALSTLKACATDAQHYFDAKSSADLIKA 471 Query: 441 FRDRIGNEIFERVIRIT 457 F + IG ++ + +R+T Sbjct: 472 F-ETIGGKLSK--VRLT 485 >gi|307943467|ref|ZP_07658811.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773097|gb|EFO32314.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 466 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 75/494 (15%), Positives = 154/494 (31%), Gaps = 77/494 (15%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 +F++K Y K G + A + ++L V G VD R Y H + A A Sbjct: 1 MFESKNSSYFHKFGSDERGSLLPLVAGVCLILLVVAGSAVDYGRALGYRHKIANAVDAAA 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 +T + L ++ + +I L F NL N + I + + Sbjct: 61 LTVAKQL------------STTVLTENQIRTGLKNAFRANL--NAAGINSQGI-DNLDFK 105 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 ++P + V S D+ N F++ GI ++ A ++ + E Sbjct: 106 VDPGEGTLDV--WSSVDIQTN----FIKLGGIGPEKLEVGAASQVNYSRFDVELA----L 155 Query: 182 VIDFSRSM---LDYQRDSEGQPLNCFGQP-----------------------------AD 209 V+D + SM ++ +++ +N + Sbjct: 156 VLDVTGSMRPDMNALKEASKSIVNILLPDDSNSRESKVRISLVPYSQGVNLGSYATRVTN 215 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + N + G + +PY + ++S ++ P + Sbjct: 216 GGSTWRNCVNERSGPQKFTDAPYNYAGSRSDFFHGKPKQFVWDYGWTEQWQTRPEACPKT 275 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV-----HKLIRTIV 324 + ++R+I + + T + +S W Sbjct: 276 AVEPLTADRTKLLRAISGLKDGGGTGGQTGIAWGWYTLSPK---WKNLWPRDSAPATYGT 332 Query: 325 KTFAIDENEMGSTAINDAMQTAY------DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + D + + AY I + ++Y Sbjct: 333 GSHTDDTKKFALIMTDGDFNAAYGWDCGCRKIRDKPLYCRKKSNKKSWIERY--FSPSKI 390 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHEL 437 + + +C++ KS+ I I T+ F + +S CAS +++ A++++EL Sbjct: 391 SHAPAQRAKKLCDEMKSKNIEIFTVYF--DTGGATFGDDLMSYCASGSRNYYRADNSNEL 448 Query: 438 NKIFRDRIGNEIFE 451 + F + I NEI Sbjct: 449 IQAFSN-IANEIQS 461 >gi|118591415|ref|ZP_01548813.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] gi|118436087|gb|EAV42730.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] Length = 474 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 72/505 (14%), Positives = 159/505 (31%), Gaps = 101/505 (20%) Query: 10 YSKKLIK---------SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 Y ++LI + L++ +++ + G+ +DV R L A A Sbjct: 3 YFRRLIGRPLLKGFTGDRKASILPVFGLMVVLIVVIAGITIDVSRTVNAREKLSFAIDAA 62 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK-NFTDREVRDIVRDTA 119 ++ + L ++ ++I+ L +F+ NL F D ++++ Sbjct: 63 ALSVAADL------------STSVMSDEQIKAALADSFKANLADVEFLDEAIKNLSFVVD 110 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 E K + L + ++ ++++G +++ T ++ + V + Sbjct: 111 AENGTIKVSSFATLD---NYFIDMGGYGMQALGPETFNFGTSSQVT------YSRFDVEL 161 Query: 180 QWVIDFSRSM---LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V+D + SM +D RD+ +N S + + L Y Sbjct: 162 ALVVDVTGSMRNDMDTLRDASKGLVNILIPETTEEADSKVRISLVPYSQGVNLGTYAAKV 221 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 +Y ++E D + ++ + + +K D Sbjct: 222 KGGVYGY---ADSSVCVTERQDYDDGEDIYKVRYTDMPYNYY------VKTDPPPKDVFY 272 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G + + R + D ++ G TA + +++I + D Sbjct: 273 GGGSNRCSGTSKMIPLT-----ADRDTLLDAIADLDDNGGTAGQTGVVWGWNSISPNYSD 327 Query: 357 EVHRMKNNLEAK-----KYIVLLTDGE--------------------------------- 378 K+ +++TDG+ Sbjct: 328 VWPLASKPEPYDNDDVLKFAIIMTDGDNNRFYEFVKEREECDWVYSRRYGWQWTCEMVSV 387 Query: 379 ------------NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 N ++ A+C K +GI I + F N + + +CAS Sbjct: 388 NQWQERSESESYNNNSSKAQRALCQAMKDEGISIFGVYFGTNDSS--AGSKNMQSCASTG 445 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 ++++A S+ EL F + I +I + Sbjct: 446 NYYKATSSDELINAFAN-IAKKIQQ 469 >gi|192291928|ref|YP_001992533.1| hypothetical protein Rpal_3558 [Rhodopseudomonas palustris TIE-1] gi|192285677|gb|ACF02058.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 455 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 62/469 (13%), Positives = 143/469 (30%), Gaps = 56/469 (11%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ ++ G+ +I AL + +LG G+ VD R + +L+ A +A + S L Sbjct: 15 RRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAALMLSRDLGVG 74 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + T+ +T++E + + + Sbjct: 75 TITPDQVSSKAQTYFNSL----------------YTNKETGAVTVTATYTAKDGSGSSTI 118 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 +S + + + F++ +G ++ I + + + +D + SM Sbjct: 119 AMSGQGAVQ----TQFMKILGFQTMAIGSSTTTTWGGTR------LRVAMALDVTGSMAS 168 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + S + + R + + ++ S + + Sbjct: 169 AGKMSAMKTAAKNLVDSLRASAQTADDVYISVVPFAQMVNVGSSNRNANWVRWDLWDESN 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDA---LASVIRSIKKIDNVNDT-----VRMGATFFN 303 + + S + V + D D R A ++ Sbjct: 229 GSCSSWWYSTKSSCEYAGRTWTATSHNQWAGCVTDRDQPADTTKDVPTSYATRFPAVDYD 288 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + T +K + G T M A+ ++ + + + Sbjct: 289 ACPQQLLGMTSAYSLSNATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTGDPLNTPAKDS 348 Query: 364 NLEAKKYIVLLTDGENTQDNEEG-------------IAICNKAKSQG-----IRIMTIAF 405 N + I+LL+DG NT D G +C+ ++ + I TI Sbjct: 349 NYKYTDAIILLSDGLNTVDRWYGNGRDWSPQVDARQRILCDNIRASATNTNPVVIYTIQV 408 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + + L CA +FF ++ + F +IG+ + + + Sbjct: 409 NTDGDPES---TVLKYCADSGNFFATTTSSGIGTAF-AQIGSSLSKLRV 453 >gi|39936212|ref|NP_948488.1| hypothetical protein RPA3149 [Rhodopseudomonas palustris CGA009] gi|39650067|emb|CAE28590.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 455 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 62/469 (13%), Positives = 142/469 (30%), Gaps = 56/469 (11%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ ++ G+ +I AL + +LG G+ VD R + +L+ A +A + S L Sbjct: 15 RRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAALMLSRDLGVG 74 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + T+ +T++E + + + Sbjct: 75 TITPDQVSSKAQTYFNSL----------------YTNKETGAVTVTATYTAKDGSGSSTI 118 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 +S + + + F++ +G ++ I + + + +D + SM Sbjct: 119 AMSGQGAVQ----TQFMKILGFQTMAIGSSTTTTWGGTR------LRVAMALDVTGSMAS 168 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + S + + R + ++ S + + Sbjct: 169 AGKMSAMKTAAKNLVDSLRASAQTVDDVYISVVPFAQMVNVGSSNRNASWVRWDLWDESN 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDA---LASVIRSIKKIDNVNDT-----VRMGATFFN 303 + + S + V + D D R A ++ Sbjct: 229 GSCSSWWYSTKSSCEYAGRTWTATSHNQWAGCVTDRDQPADTTKDVPTSYATRFPAVDYD 288 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + T +K + G T M A+ ++ + + + Sbjct: 289 ACPQQLLGMTSAYSLSNATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTGDPLNTPAKDS 348 Query: 364 NLEAKKYIVLLTDGENTQDNEEG-------------IAICNKAKS-----QGIRIMTIAF 405 N + I+LL+DG NT D G +C+ ++ + I TI Sbjct: 349 NYKYTDAIILLSDGLNTVDRWYGNGRDWSPQVDARQRILCDNIRASATNTNPVVIYTIQV 408 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + + L CA +FF ++ + F +IG+ + + + Sbjct: 409 NTDGDPES---AVLKYCADSGNFFATTTSSGIGTAF-AQIGSSLSKLRV 453 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 72/448 (16%), Positives = 145/448 (32%), Gaps = 73/448 (16%) Query: 6 KFIFYSKKL---IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 +F + ++L + +G+ +I AL + V+ G G +D R Y L+ AA +A++ Sbjct: 10 RFAGFRRRLGESCRDQSGNVIMIFALSVFVIFGFVGAAIDFSRVDYARRRLQDAADSAVL 69 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + E A +F +NF V D+ E+ Sbjct: 70 RAMALKSATDESRGVAADKAFA-------------------ENFGHPGVYDLNGALKREV 110 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 N ++S Y + S F G S+ + ++A+T I +V Sbjct: 111 NEN------IISQTYTVHATVSSYFGAFFGKDSYPVTVVSQAKTSLD------VFEIAFV 158 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D + SM + + A + K+ + + + Sbjct: 159 LDTTGSMAE-ANKMPNLKSSVDSAMAGLLQNGKNLSGSKIAVVPFNTQVRLSDATVT--- 214 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + L + H + + ++ ++ D + Sbjct: 215 TMSSQGLSSGWGNCVHDRDLATSHDVSASAAQKGKAQTLY-PLETCDEAS---------- 263 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + S + I G T + +Q + + + Sbjct: 264 ---LKPVQGLSDNISSARNFIKT-----LQPGGYTNVTMGVQWGMEVLSPNQPFSDATEF 315 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIA---------ICNKAKSQGIRIMTIAFSVNKTQQE 413 + +A+K+++++TDG+NT+ A C AK++GI + T+ + Sbjct: 316 GSTKARKFMIVVTDGDNTKSFTSWSASVIDKRTALACENAKAKGITVYTVKIIQGNS--- 372 Query: 414 KARYFLSNCAS-PNSFFEANSTHELNKI 440 L CAS P F++ S ++LN Sbjct: 373 ---NMLRKCASAPEYFYDLTSANQLNAA 397 >gi|86749514|ref|YP_486010.1| hypothetical protein RPB_2394 [Rhodopseudomonas palustris HaA2] gi|86572542|gb|ABD07099.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 456 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 68/471 (14%), Positives = 134/471 (28%), Gaps = 57/471 (12%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 S + +K+ G+ II A+ + M+G G +D R + +++ A +A + S L Sbjct: 14 SSRFVKTDGGNVAIIFAIALLPMIGFIGAAIDYSRANKARTSMQAALDSAALMVSKDLAS 73 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + + + Y + N N T T K Sbjct: 74 GVITAGQVSAKAQS--------YFASLYNNTEAPNITVTATYTAKDSTGSSTVLLKGTGD 125 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 + F+ G + I + A A + + +D + SM Sbjct: 126 ISTE------------FMNMFGFPTLGIGSAATATWGGTR------LRVAIALDVTGSMA 167 Query: 191 DYQRDSEGQPLNCFG-QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + Q ++ + + ++ + N S Y Sbjct: 168 SAGKMPAMQSAAKTLVDNLRANAQTADDLYISIIPFAQMVNVGKSNKNASWIKWDYWEDT 227 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDAL--ASVIRSIKKID-----NVNDTVRMGATFF 302 S + SS + + V + D R A + Sbjct: 228 TGSCNWWWLTTKSSCESAGRTWSSTNQSQWGGCVTDRDQPADTTKDAPTTAATRFPAANY 287 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + T +K + G T M A+ ++ Sbjct: 288 SACPEQILPMTSAYSSSNATTIKDKIDALSPNGGTNQPIGMHWAWMSLQDGAPLNTPAKD 347 Query: 363 NNLEAKKYIVLLTDGENTQDNEEG-------------IAICNKAKSQGI------RIMTI 403 + + I+LL+DG NT D G +C+ ++ I TI Sbjct: 348 ADYKYTDAIILLSDGMNTIDRWYGNGSSWSKDVDARQKLLCDNIRAASAASTTKTVIYTI 407 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + + + L CA +FF + ++ F +IG + + I Sbjct: 408 QVNTDGDPESE---VLKYCADSGNFFATTTASGISTAF-AQIGASLSKLRI 454 >gi|114799275|ref|YP_759187.1| hypothetical protein HNE_0457 [Hyphomonas neptunium ATCC 15444] gi|114739449|gb|ABI77574.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 512 Score = 98.1 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 75/506 (14%), Positives = 146/506 (28%), Gaps = 80/506 (15%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G+ +ITA ++P +L + G+ +D+ + ++ A +A++ + L + ++ Sbjct: 18 EGGNVAMITAFVIPCILALTGIAIDLQNTVRQKSKVQAALDSAVLAGA--LGRQAGNTAA 75 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + + + D DI+ + R+ Y L Sbjct: 76 ETTLDVQTYALALFTDQGGGLDCDPVAVTFDETNLDILG----TVRCRQPTYLSSLIGHD 131 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF------------ 185 +L N S +G S + + + +D Sbjct: 132 ELEFNVASTSTYGVGKLDVAFIFDVSGSMNSYNRLAQLKTAAVAAVDELLPDSRERDGTV 191 Query: 186 ---------SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP--YMV 234 S + Y + Y+S N K I + Y Sbjct: 192 RLAIASYNHSLNAGAYIGAVTETVTLSADGSNSTALSRYNSHNTKRMIDQDSGKRFFYYQ 251 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSS---SLRHVIKKKHLVRDAL------------- 278 S S + F D+ + + DA Sbjct: 252 SGTCSSWNCGKYSSWSWDTKRRFFDDTGLADACVYERTGTQAATDAAPGSGAWIGAGNPR 311 Query: 279 -ASVIRSIKKIDNVNDTVRMGATFFNDRVI----------SDPSFSWGVHKLIRTIVKTF 327 + S K D + AT + + +T++K Sbjct: 312 WSFYAGSSSKYDGWQNVENQNATGYGVGAYEGRHGTCMPSGPVPLT-----EDKTVLKDH 366 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK-----KYIVLLTDGENT-- 380 G TA + + + + L + K ++L+TDG+ Sbjct: 367 VNALVAEGGTAGHLGIAWGWYLVSPEWAAIWPEASEPLPYRQPQTSKAVILMTDGDFNIE 426 Query: 381 ------QDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQE--KARYFLSNCA-SPNSFF 429 + + +C+ K+ I+I T+ F V + Q R L CA SP+ F Sbjct: 427 HPTASRDSFRQSMDLCDGMKASSRRIQIYTVGFQVPSSVQRTGDGRTILEYCATSPSHAF 486 Query: 430 EANSTHELNKIFRDRIGNEIFERVIR 455 A+S EL +++R I I + ++ Sbjct: 487 SADSGEELIEVYRS-IARSISDLRLK 511 >gi|114705525|ref|ZP_01438428.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] gi|114538371|gb|EAU41492.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] Length = 461 Score = 97.0 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 90/497 (18%), Positives = 163/497 (32%), Gaps = 111/497 (22%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 + ++P++L VGG VDV R S L++A A + + +A + Sbjct: 1 MALAILPMLLAVGG-TVDVGRQSSLATDLQEAIDIAALHIA------------KAPSDAI 47 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 ++ + + + N T + + E + +S + P Sbjct: 48 PGEEDVLQLIKSNI-------TTKDSRIALKKLDVTEKD---------VSLHATAEITPF 91 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 FL GIK+ Q + +R + + V+D + SM + + S L+ Sbjct: 92 --FLGLAGIKNLTAQRATKTAREAR-----GEIEVALVLDTTWSMSE-KDSSGKSRLDSL 143 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC----NKSLYYMLYPGPLDPSLSEEHFVD 260 A + V + +++GK + + Y+ N+S + PS Sbjct: 144 KGAAAKLVDTIFTEDGKTRVAVVPYADYVNVGTQHRNQSWLDVPPSYSTTPSERRCETRT 203 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKID--NVNDTVRMGATFFNDRVISDPSFSW---- 314 + + + + V S ++TV++ + S ++ W Sbjct: 204 TRTQCTSYAPTYQCTRTVDGVSESTTCGGGCTSSETVQVAPYEYCTGGGSSRTYDWYGCV 263 Query: 315 -----------------------GVHK----------LIRTIVKTFAIDENEMGS----- 336 G + VKT + + G Sbjct: 264 ASRTVGDYRLTDARPDIRYPGFLGTSRECPGPLLSLSTREADVKTSISNLSYGGGGYRPS 323 Query: 337 TAINDAMQTAYDTIISSNEDEVHR-MKNNLEAKKYIVLLTDGENT--------------- 380 T I + + + E NN +K +VL+TDG NT Sbjct: 324 TFIPAGLIWGLNVLSPPAPFEEQAYDPNNKLPRKALVLMTDGANTMVFNSSDGRHRNARS 383 Query: 381 -----QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANST 434 Q + + I+ICN K GI I T+ F VN + A L CA+ +F+A S Sbjct: 384 GTEVAQSDRDTISICNNIKRSGIEIFTVGFMVNSSS---ALDLLKECATDGEHYFDATSP 440 Query: 435 HELNKIFRDRIGNEIFE 451 EL+ F RI + + + Sbjct: 441 EELHSAFG-RIADGLTQ 456 >gi|323700353|ref|ZP_08112265.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323460285|gb|EGB16150.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 400 Score = 93.1 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 69/464 (14%), Positives = 144/464 (31%), Gaps = 111/464 (23%) Query: 27 ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFP 86 ALL+PV+LGV G+ VD+ L+ A + S+ L Sbjct: 2 ALLLPVLLGVAGIAVDMGNMYMTHTRLQAAVDAGALAGSLEL----------------PY 45 Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 + + ++ N++ + + V T + + +V + + L Sbjct: 46 DPDLSKGIVTQAVNDMVETNMEEAV-----VTEISAGTEIRSVKVTAQAEVRM------L 94 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 + +G+ A+ + + + + + +VID S SM D Q Sbjct: 95 LMEVLGM--------ADKTVEASAMAGFNKLEVVFVIDNSGSMKGTPIDLVKQASEELTD 146 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 + ++ G V R + V + G L+ ++EE + ++L + Sbjct: 147 LLIPDGTTPDTKVGLVPFRGKIRLGEAVDGYAEG-CVNADGSLNTGINEEFMDEYNALPY 205 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K+ + + + S +I I Sbjct: 206 YYKRYITLDTC--------------------------SDIPTVLPLSKNKSTIIAAIGSQ 239 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---- 382 A T I++ ++ + + K + +K +++LTDG+ Sbjct: 240 TATGAAS--GTVISEGIKWGRNILTPDAPFTQAGSK--EDFRKIMIVLTDGDTEDGECGG 295 Query: 383 --------------------------------NEEGIAICNKAKSQGIRIMTIAFSVNKT 410 N + ++ AK GI I +I F + Sbjct: 296 TYRATYRPNNYWTNAYYGMGVDTAHCNDGGVLNADMLSEAQLAKDAGIEIFSIRFG---S 352 Query: 411 QQEKARYFLSNCAS-----PNSFFEANSTHELNKIFRDRIGNEI 449 + AS + +F+A S +++ IF+ +IG ++ Sbjct: 353 SDTTDINLMKEIASSKAGTDDHYFDAPSVYDIPDIFK-QIGKQL 395 >gi|13473479|ref|NP_105046.1| hypothetical protein mll4092 [Mesorhizobium loti MAFF303099] gi|14024228|dbj|BAB50832.1| mll4092 [Mesorhizobium loti MAFF303099] Length = 477 Score = 92.0 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 71/495 (14%), Positives = 151/495 (30%), Gaps = 85/495 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G+F I+ V+ G VD+ + + L+ A+ + + L + Sbjct: 14 FARHSGGNFAILFGFAASVLALAAGFSVDISQLYNAKSGLQGVVDAAVTSTARDLTTGVI 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + + N + + + ++A V Sbjct: 74 KEADA------------SKAVQNFLVANSMAGILQPDQIVL-----DRLVVDRTANTVQA 116 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + D+ L F G+ + + A T S K V++ ++D + SM Sbjct: 117 DAHVDVAL-----FFPVFGMGN---TQRVTASTTSLYSDKTIEVAM--MLDVTGSMAANW 166 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS----------------CN 237 + + + + R Sbjct: 167 WAKTDKIGDLQAAASTAVENLLDNNIDPNNPRVRVAIVPYAEAVNTGGLADSVFVEQAGG 226 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 +L + + + K K D + +++ + + + Sbjct: 227 SNLPPPVPSAGAPIPVGSSVTLRPDKCATERKDKDGYADYSSDGPSELRRNNQNQEYL-- 284 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY--------DT 349 +DR+ + P + + D G TA A+Q Y T Sbjct: 285 AKVNRDDRMGTCPKPELIPLTADKQKLLDTIADFKAAGVTAGGIAVQWGYYMLSPSWRST 344 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-----------------DNEEGIAICNK 392 I+++ +N + K +L+TDG+ +IC+ Sbjct: 345 IVNARLGSGPANFDNRKVGKVAILMTDGQFNTAFAAGRGAPRSQNAGQMSRSNAESICDN 404 Query: 393 AKSQGIRIMTIAF-----SVNKTQQEKARYFLSNCASPN-----SFFEANSTHELNKIFR 442 K GI I TI F S+ T++++A+ L +C++ + ++EA + EL++ F Sbjct: 405 MKRDGIEIFTIGFDLDDPSMTSTERDQAKSVLQDCSTADTSTLKHYYEAATGPELDEAF- 463 Query: 443 DRIGNEIFERVIRIT 457 N I + + R+T Sbjct: 464 ----NAIVQNIERLT 474 >gi|91977525|ref|YP_570184.1| hypothetical protein RPD_3057 [Rhodopseudomonas palustris BisB5] gi|91683981|gb|ABE40283.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 464 Score = 91.6 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 81/491 (16%), Positives = 149/491 (30%), Gaps = 77/491 (15%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + + + G+ +I AL + +LG G +D R S A++ A + + Sbjct: 8 TRSRKVASRFVGDDGGNIAVIFALTLLPILGFIGAAIDYSRASRARTAMQAALDSTALMV 67 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 S L + S ++ + T+ +T E R + T Sbjct: 68 SKDLGADKIKTSEVSEKAQTYFNSL----------------YTGTEARGVTLTTNYTAKD 111 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + VV++ + + F++ G S I + A A + + +D Sbjct: 112 DSGSSTVVVNGDGAVS----THFMKMFGFPSLAIGSAATATWGGTR------LRVAMALD 161 Query: 185 FSRSM------------------LDYQRDSEGQPLNCFGQ--PADRTVKSYSSQNGKVGI 224 + SM +D R S + + P + V SS I Sbjct: 162 VTGSMVLNGSTKLAEMKKAASALVDTLRASAQSKDDLYISVVPFAQMVNVGSSNIDASWI 221 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + SC+KS + + + D S + + + + + Sbjct: 222 KWDVWDETEGSCSKSKFKTKTD--CEDNGRTWTVTDRSKWKGCVTDRD---QPADTTKDA 276 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D +R + +K D G T M Sbjct: 277 PTSDDTRFPALRT-LLGTTSCPAQIFPMTSAYAATDAQKIKDVIDDLVADGGTNQPIGMA 335 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----------------NEEGIA 388 A+ ++ N N + I+LL+DG NT D + Sbjct: 336 WAWMSLQQGNPLNTPAKDPNYKYTDAIILLSDGLNTMDRWPDYGDGQRQFDGKIDARQKL 395 Query: 389 ICNKAK---SQGIR--IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 +C+ K S G R + TI T + L CA +FF + + F Sbjct: 396 LCDNIKLPDSNGKRPVVYTIQV---NTTGDPESTILKYCADGGNFFATTTASGIGTAF-A 451 Query: 444 RIGNEIFERVI 454 +IG+ + + I Sbjct: 452 QIGSSLSKLRI 462 >gi|289178041|gb|ADC85287.1| Fibronectin-binding protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 2710 Score = 91.2 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 50/327 (15%), Positives = 100/327 (30%), Gaps = 58/327 (17%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR-DSEGQPLNCFGQPADRTVKSYS 216 + + A+ + + + I V+D S SM D E + + + Y Sbjct: 124 LSAMSSAQKLIGVSNVTKPLDIVLVLDTSGSMAWGMDGDDEYAYDPVYAADITTSKRYYV 183 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYML-YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 +G + + + S +Y+ D + F L + + ++ Sbjct: 184 RVSGSMTRVYSSANGWYYDAGGSRHYVTPKTSAADSDAAHTQFYSRRRLTTQDTRMYALK 243 Query: 276 DALASVIRSI----KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 A+ I K+ + N R+G + V + + + L T+ D Sbjct: 244 QAVNGFIDQTIAANAKVSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVD-----DL 298 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-------E 384 G+T + MQTA + ++ D K ++ TDG+ T+ N Sbjct: 299 KASGATRADLGMQTANTVLGNARADA----------SKIVIFFTDGQPTKSNGFENDVAN 348 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQE-------------KARYFLSNCAS------- 424 + I K+ G + ++ K+ F+ +S Sbjct: 349 DAIGAAKTMKTNGASVYSVGIFTGANPDANVSSVTGKSDIELKSNAFMQGVSSNYPNATT 408 Query: 425 ----------PNSFFEANSTHELNKIF 441 N + A+ LN +F Sbjct: 409 YTNLGAKAPNSNYYLAASDADTLNAVF 435 >gi|219682744|ref|YP_002469127.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] gi|219620394|gb|ACL28551.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] Length = 2582 Score = 90.8 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 50/327 (15%), Positives = 100/327 (30%), Gaps = 58/327 (17%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR-DSEGQPLNCFGQPADRTVKSYS 216 + + A+ + + + I V+D S SM D E + + + Y Sbjct: 124 LSAMSSAQKLIGVSNVTKPLDIVLVLDTSGSMAWGMDGDDEYAYDPVYAADITTSKRYYV 183 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYML-YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 +G + + + S +Y+ D + F L + + ++ Sbjct: 184 RVSGSMTRVYSSANGWYYDAGGSRHYVTPKTSAADSDAAHTQFYSRRRLTTQDTRMYALK 243 Query: 276 DALASVIRSI----KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 A+ I K+ + N R+G + V + + + L T+ D Sbjct: 244 QAVNGFIDQTIAANAKVSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVD-----DL 298 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-------E 384 G+T + MQTA + ++ D K ++ TDG+ T+ N Sbjct: 299 KASGATRADLGMQTANTVLGNARADA----------SKIVIFFTDGQPTKSNGFENDVAN 348 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQE-------------KARYFLSNCAS------- 424 + I K+ G + ++ K+ F+ +S Sbjct: 349 DAIGAAKTMKTNGASVYSVGIFTGANPDANVSSVTGKSDIELKSNAFMQGVSSNYPNATT 408 Query: 425 ----------PNSFFEANSTHELNKIF 441 N + A+ LN +F Sbjct: 409 YTNLGAKAPNSNYYLAASDADTLNAVF 435 >gi|183601829|ref|ZP_02963198.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|241190320|ref|YP_002967714.1| hypothetical protein Balac_0261 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195726|ref|YP_002969281.1| hypothetical protein Balat_0261 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|183218714|gb|EDT89356.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|240248712|gb|ACS45652.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250280|gb|ACS47219.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295793307|gb|ADG32842.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis V9] Length = 2696 Score = 90.8 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 50/327 (15%), Positives = 100/327 (30%), Gaps = 58/327 (17%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR-DSEGQPLNCFGQPADRTVKSYS 216 + + A+ + + + I V+D S SM D E + + + Y Sbjct: 110 LSAMSSAQKLIGVSNVTKPLDIVLVLDTSGSMAWGMDGDDEYAYDPVYAADITTSKRYYV 169 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYML-YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 +G + + + S +Y+ D + F L + + ++ Sbjct: 170 RVSGSMTRVYSSANGWYYDAGGSRHYVTPKTSAADSDAAHTQFYSRRRLTTQDTRMYALK 229 Query: 276 DALASVIRSI----KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 A+ I K+ + N R+G + V + + + L T+ D Sbjct: 230 QAVNGFIDQTIAANAKVSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVD-----DL 284 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-------E 384 G+T + MQTA + ++ D K ++ TDG+ T+ N Sbjct: 285 KASGATRADLGMQTANTVLGNARADA----------SKIVIFFTDGQPTKSNGFENDVAN 334 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQE-------------KARYFLSNCAS------- 424 + I K+ G + ++ K+ F+ +S Sbjct: 335 DAIGAAKTMKTNGASVYSVGIFTGANPDANVSSVTGKSDIELKSNAFMQGVSSNYPNATT 394 Query: 425 ----------PNSFFEANSTHELNKIF 441 N + A+ LN +F Sbjct: 395 YTNLGAKAPNSNYYLAASDADTLNAVF 421 >gi|331006778|ref|ZP_08330044.1| BatA [gamma proteobacterium IMCC1989] gi|330419396|gb|EGG93796.1| BatA [gamma proteobacterium IMCC1989] Length = 364 Score = 90.4 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 70/190 (36%), Gaps = 28/190 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ ++ I ++ D R+G F + ++ + + + + A Sbjct: 119 TRLAAVKKVVSDFIDQ-RQGD------RIGLILFGTQAYLQTPLTFDTQSVNQFLQE--A 169 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + N+ K + K I+LLTDGENT E + Sbjct: 170 QLGFAGKDTAIGDAIGLSVKRL--KNQSSASSAKPSN--SKVIILLTDGENTAGEVEPLQ 225 Query: 389 ICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP--NSFFEANS 433 A+ G +I T+ ++ L+ A+ +F A + Sbjct: 226 AAKLAEKIGAKIYTVGIGADEMIVRGFFGNRRVNPSASLDEETLTAIANTTGGLYFRARN 285 Query: 434 THELNKIFRD 443 T ELN I+ + Sbjct: 286 TQELNNIYSE 295 >gi|315498202|ref|YP_004087006.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416214|gb|ADU12855.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 489 Score = 90.0 bits (221), Expect = 8e-16, Method: Composition-based stats. Identities = 65/491 (13%), Positives = 138/491 (28%), Gaps = 57/491 (11%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS--VPLIQS 71 + G+ ++ L +++ VD + A A + + P Sbjct: 6 FFRDRRGNTAVMFGLFFSILIVSMAGAVDYSNVISRRSKAQDALDAATLAVAVLRPATVE 65 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + + + + + + + + +N ++ Y+V Sbjct: 66 QAQAAVKLRLDKELGDNPDKVVIGQFNYDTKTRTYYVTAKGTYKPFLLGVVNIKEIPYEV 125 Query: 132 VLSS--------RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + L+L+ + + S + A T + + Sbjct: 126 ISETIQAANGTLELALVLDNTDSMGQILNGSSTRLDVLKTAATNLVNTVMTSANKDYVKV 185 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY-- 241 + + AD TV + + + + +C+ Sbjct: 186 AVVPYADYVNVGLANRSQSWVSVGADYTVPAAAKTCTTISTKQVCTGGVYGTCDSIKDGV 245 Query: 242 -----YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 P + ++ K VR + S + + D + Sbjct: 246 PIKVGCWKTPQTCTTVNITPYQSCNNPQPTYYKWYGCVRHQVDSKTKMLVLPDPLTAY-- 303 Query: 297 MGATFFNDRVISD-PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS-- 353 G + + S + +I + T I + +T+ Sbjct: 304 TGVLETAQKCPTAIQPLSNDKTVVTNSIKGLVNSIGSYKPDTFIPGGLHWGVNTLSPPAP 363 Query: 354 -NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI----------------------- 389 E + KN E KK IVL+TDG NT I Sbjct: 364 FKEGMAYDSKN-KEPKKVIVLMTDGANTLYTNSSGQIVSAATGSPPTISSSLVAPTYTAQ 422 Query: 390 ---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRI 445 C AK + I + I V A L +CA+ +F+A + ++L + F + I Sbjct: 423 DNACKYAKGKNIEVFVIGLGVTD---PTALSALKSCATDAQHYFDAQNANDLIEAF-EII 478 Query: 446 GNEIFERVIRI 456 G ++ V+R+ Sbjct: 479 GGKLS--VVRL 487 >gi|225028486|ref|ZP_03717678.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] gi|224954191|gb|EEG35400.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] Length = 538 Score = 89.3 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 56/270 (20%), Positives = 98/270 (36%), Gaps = 30/270 (11%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD----EKLSPYMVSCNKSLYY 242 +L Y + ++ +P N F + ++++G + E + C Y Sbjct: 152 GELLIYTKWAKNEPDNTFSDYGYYVRFNENAKDGTWKVDTFSGGETNFNNVFLCEWGDYS 211 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + L + + V + + + + A A + SI +N +G Sbjct: 212 VTGNDGLKVTSKKRDIVLTLDISASMDGIPLDETKKAAAKFVDSI-----LNKNSNIGLV 266 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 ++D S L TI + +T I D + AY + Sbjct: 267 SYSDEATSLSGICSNDVFLKNTITS-----LSSAENTNIEDGLSRAYSMLQL-------- 313 Query: 361 MKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-QEKARY 417 KK IVL++DG T +D EE I K K QG+ I T+ F N + + + +Y Sbjct: 314 ---GQSKKKLIVLMSDGLPTLGKDGEELIKYAEKIKDQGVLIYTLGFFQNTEEYKAEGQY 370 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + AS +E +S+ +L F D G Sbjct: 371 LMEKIASEGCHYEVSSSEDLVFFFEDVAGQ 400 >gi|241204947|ref|YP_002976043.1| hypothetical protein Rleg_2227 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240858837|gb|ACS56504.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 429 Score = 89.3 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 72/466 (15%), Positives = 135/466 (28%), Gaps = 95/466 (20%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALL+ + G GM VD L AA A + + + Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVG---SIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ S ++ F + + TD +V + T L Sbjct: 65 AVAAAMTMSGNGTISLGKDDARSIFMSQISGELTDVQVDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F+R +G S I A AE Y + ++D + SM Sbjct: 116 NSQVSFSATVPTTFMRVLGRDSITISGTATAE-----YQTASFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 ++ + + N Sbjct: 171 TATDVSTMEKNTSDTCAFACHETQNNN--------------------------------- 197 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD---- 309 ++ + + V + +VR A + + K ++ RMG F + Sbjct: 198 ---NYYNLAKKLGVSMRIDVVRQATKELTVTAKSTRVSSNQFRMGVYTFGTKAEDAKLTT 254 Query: 310 -PSFSWGVHKLIRTIVKTFAIDENEMG-----STAINDAMQTAYDTIISSNEDEVHRMKN 363 + + K+ + G T+ + A+ I + + Sbjct: 255 ISDPTDDLDKVRSYTDAVDLMTIPFQGYNNDQQTSFDSALTQMKTIITTPGDGS-----T 309 Query: 364 NLEAKKYIVLLTDGENTQDNEEG---------------IAICNKAKSQGIRI---MTIAF 405 +K + ++DG + +G + C K + IRI T Sbjct: 310 ATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCQPLKDKSIRIAVLYTTYL 369 Query: 406 SVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + K Q + + CASP +FE T + + Sbjct: 370 PLPKNSWYNTWIKPFQGEIPTKMQACASPGLYFEVTPTEGIADAMK 415 >gi|116252440|ref|YP_768278.1| hypothetical protein RL2693 [Rhizobium leguminosarum bv. viciae 3841] gi|115257088|emb|CAK08182.1| conserved hypothetical exported protein [Rhizobium leguminosarum bv. viciae 3841] Length = 427 Score = 88.9 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 72/462 (15%), Positives = 137/462 (29%), Gaps = 89/462 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALL+ + G GM VD L AA A + + + Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVG---SIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ S ++ F + + TD + + T L Sbjct: 65 AVAAAMTMSGNGTISLGKDDARNIFMSQMSGELTDVHIDLGINVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F+R +G S I A AE Y + ++D + SM Sbjct: 116 NSQVSFSATVPTTFMRILGRDSITISGAATAE-----YQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 ++ L A + S N Sbjct: 171 TANDVSKLQAKTGCAFACHQMDQSTNN--------------------------------- 197 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD---- 309 + V + +VR A ++ + K +D RMG F + Sbjct: 198 -----YTIAKGLGVAMRIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT 252 Query: 310 -PSFSWGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S + + K+ + + + + +A + + D +N+ Sbjct: 253 ISSPTSDLTKVKNYTDTVDLMTIPYQNYNQDQLTSFDSALTQMNTII-DPAGDGTSNISP 311 Query: 368 KKYIVLLTD--GENTQDNEEGIAI-------------CNKAKSQGIRI---MTIAFSVNK 409 +K + ++D G++ + + C K +G++I T + Sbjct: 312 EKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPS 371 Query: 410 TQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFR 442 Q + + CASP +FE T + + Sbjct: 372 NDWYNKWISPFQSEIPTKMQACASPGFYFEVTPTEGITDAMK 413 >gi|288956977|ref|YP_003447318.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] gi|288909285|dbj|BAI70774.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] Length = 456 Score = 88.1 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 80/475 (16%), Positives = 153/475 (32%), Gaps = 76/475 (16%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G I+ AL V+LG+ G+ +D R + + AA A + S Sbjct: 23 ADRRGSVAIMVALSFLVLLGMLGVAIDFARAQFVSSRIYYAADAATLAVSR--------- 73 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 +N +++ F+ N V + P + V +++ Sbjct: 74 ----ENFQVSTNDQLKALAQSYFDANFPPGTMGATTSLSVATSGTP--PTVQGFTVTVTA 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 L+ PL L I S I + A +++ + + G+ + V+D S SM Q D Sbjct: 128 TLPLVFAPLVETLGGPTIGSVGISKASGAVFTTQTSN-QGGMELVIVLDNSASMKGSQED 186 Query: 196 SEG--QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 G + L T K+ L + + + ++ P Sbjct: 187 LRGGVKALLDMLYGNADTRKNLYVGIVHYSGAVNVLQSALKNKADIVAPVVGGMANCPMA 246 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + ++ S L + K + + GA+ + S Sbjct: 247 TVNGKLNGSRLSNAPPKTF-----------KFDSTTDGVEIQYCGASTLGT----SSALS 291 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS-----NEDEVHRMKNNLEA- 367 + I G T I + + + + S N + L+ Sbjct: 292 PNRGDADKAIKSYV-----AGGDTLIGEGLVWGWRMLTPSWRGLWNTKDQPGASLPLDYD 346 Query: 368 ----KKYIVLLTDGEN---------------------TQDNEEGIAICNKAKSQ-GIRIM 401 KK +VL+TDG N ++ + + + ICN AK + + Sbjct: 347 LPYMKKVLVLMTDGVNHIAGRNYTAYYSDPYQTVADASKADADLMTICNAAKKDHNVVLY 406 Query: 402 TIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 TI + + +Q+ +S+CAS P+ + A +L K F + ++++ Sbjct: 407 TITYGSDTDEQQ-----MSDCASDPSKHYHAALPQDLAKAFTQVGTDLTTMKLVQ 456 >gi|144898053|emb|CAM74917.1| conserved hypothetical protein, secreted [Magnetospirillum gryphiswaldense MSR-1] Length = 460 Score = 87.3 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 79/498 (15%), Positives = 153/498 (30%), Gaps = 98/498 (19%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++L+ G II AL + + G+ VD R + L QA A + + Sbjct: 4 RRLMHDRRGTVAIIFALALIPLSLSVGLAVDTARAYAVKSKLSQALDAAALAVGSSTGTA 63 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E +++ + F+ N K + D V T ++ SA QV Sbjct: 64 AE----------------LQQIGQKFFDANFKDSGLDAAGSFSVSVTGDVVSANGSA-QV 106 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM-- 189 + ++ +GI + + A+ + G+ + V+D + SM Sbjct: 107 QTT------------LMQLVGIDTIAVSESAQVIRSIK------GLELALVLDNTGSMTT 148 Query: 190 ----LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + ++ FG AD + + ++P ++S N + Sbjct: 149 SDNIGALRDAAQELVDILFGGRADHPTLRVAVVPYSASVNPGPIAPTLISGNDAYAPTNL 208 Query: 246 PG-----PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 G + E S++ + + + + S + D G T Sbjct: 209 LGWKGCVIERVGRAMEDSPASTAPWLRYQWLPAIDNYYDATKASTVRADPSQG---NGGT 265 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 N + + GV + + ++ A+ G T + M + Sbjct: 266 GPNLGCPTPITPLTGVKATVDSAIQ--ALRAWSRGGTMGDIGMAWGLRVLSPEPPFTEGL 323 Query: 361 MKNNLEAKKYIVLLTDGEN----------------------------------------- 379 N + K ++L+TDG+N Sbjct: 324 AWNTPKWAKAVILMTDGDNQFYKLTSTTGPNKVNSAVNSDYSGYGRLDQYGALGTTSTTT 383 Query: 380 --TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHE 436 + N +C K +GI + TI F + + CAS +F++ S + Sbjct: 384 AKSVINTRLTQVCQAMKDKGITVYTITF--TSGINQATKDIYKACASSTAKWFDSPSQAD 441 Query: 437 LNKIFRDRIGNEIFERVI 454 L FR I E+ + + Sbjct: 442 LRASFR-AIATELSQLRV 458 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 87.0 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 66/469 (14%), Positives = 139/469 (29%), Gaps = 73/469 (15%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G+F I A+ + ++ G VD+ R E LK A + + + Sbjct: 11 LWQDRRGNFAAIFAIAIIPVVAAAGATVDISRAYIVESRLKAALDASALAVGGATGMTTS 70 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 ++ + A++ F N+ ++ +S V L Sbjct: 71 QMQAMAQSFFN-------------------ANYPAS-----KLGVPGTLSVSQSGNVVSL 106 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S L + + +GI + + ++ + + + + V+D + SM Sbjct: 107 SVHAQLP----TTLMGVVGINTLNVSATSQVTRMGKK------LEVALVLDNTGSMASGG 156 Query: 194 RDSEGQPLNCFG------------------QPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 R + + P + V ++ + ++ +P Sbjct: 157 RMTVLKTAAKNLITTVSAAATNPGDVKVAIVPFNVDVNIGTTNENVSWLHWDEFTPSGGG 216 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 N + + L + + + + + DA + N Sbjct: 217 GNGNGNCNIIQILLGLCNNNNNSNSHAGWEGCVMDRDQNYDAQNTFPPPNPGGSNATRYP 276 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 + N + + + + G+T + ++ + Sbjct: 277 ASNSDSDNSNCNLQTIMPLSTN---WSALNSHIDAMASAGNTNTTIGLAWGWNMLTQGGP 333 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT----QDNEEGIA-----ICNKAKSQGIRIMTIAFS 406 K IV LTDG+NT +N I ICN K+ GI++ ++ Sbjct: 334 LSS-AAAPAANLDKVIVFLTDGDNTRNRWSNNSNTINARTTLICNNIKAAGIKVYSVRV- 391 Query: 407 VNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 E + NCA+ P ++ + EL +F I + I Sbjct: 392 -----IEGNATLIRNCATEPGMYYSVTTASELTSVFAS-IAQSLSNLRI 434 >gi|87310694|ref|ZP_01092822.1| BatA [Blastopirellula marina DSM 3645] gi|87286675|gb|EAQ78581.1| BatA [Blastopirellula marina DSM 3645] Length = 355 Score = 86.2 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 65/197 (32%), Gaps = 27/197 (13%) Query: 267 VIKKKHLVRDALASVIRSIKKID-NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + ++ + +D ++D V G F + L+ + Sbjct: 108 HVDRLTAIKKVAGDFVTGGDNLDGRLSDLV--GLITFAGYADGVTPPTLDHAFLVSQLNH 165 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + + TAI DA+ A + + + + + K I+LLTDGEN + E Sbjct: 166 SQIVTNRSEDGTAIGDAISLAVEKL-----NALDARRKEKIQSKIIILLTDGENNAGDLE 220 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-----------------LSNCAS--PN 426 I A++ GI++ TI L AS Sbjct: 221 PIQAAELAQTMGIKVYTIGVGTKGRAPMPVTDMFGRQSMQWMSVNIDEETLQKVASITGG 280 Query: 427 SFFEANSTHELNKIFRD 443 +F A T L KI+ + Sbjct: 281 KYFRATDTDSLAKIYGE 297 >gi|296108502|ref|YP_003620203.1| hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] gi|295650404|gb|ADG26251.1| Hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] Length = 352 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 51/162 (31%), Gaps = 28/162 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R ++ H ++ I A +T+I DA+ A + Sbjct: 143 RIGLILFGTRAYLQTPLTYDRHSVLMRID--DATAGLAGKTTSIGDAVGLAVKRLQDVPS 200 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA----------- 404 + I+LLTDG N + AK GI+I TI Sbjct: 201 K-----------GRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRALT 249 Query: 405 --FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 F E L A + +F A L I++ Sbjct: 250 GDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQ 291 >gi|148361167|ref|YP_001252374.1| Von Willebrand factor type A (vWA) domain-containing protein [Legionella pneumophila str. Corby] gi|148282940|gb|ABQ57028.1| conserved hypothetical protein [Legionella pneumophila str. Corby] Length = 344 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 51/162 (31%), Gaps = 28/162 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R ++ H ++ I A +T+I DA+ A + Sbjct: 135 RIGLILFGTRAYLQTPLTYDRHSVLMRID--DATAGLAGKTTSIGDAVGLAVKRLQDVPS 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA----------- 404 + I+LLTDG N + AK GI+I TI Sbjct: 193 K-----------GRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRALT 241 Query: 405 --FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 F E L A + +F A L I++ Sbjct: 242 GDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQ 283 >gi|52843052|ref|YP_096851.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630163|gb|AAU28904.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 352 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 51/162 (31%), Gaps = 28/162 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R ++ H ++ I A +T+I DA+ A + Sbjct: 143 RIGLILFGTRAYLQTPLTYDRHSVLMRID--DATAGLAGKTTSIGDAVGLAVKRLQDVPS 200 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA----------- 404 + I+LLTDG N + AK GI+I TI Sbjct: 201 K-----------GRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRALT 249 Query: 405 --FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 F E L A + +F A L I++ Sbjct: 250 GDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQ 291 >gi|54295680|ref|YP_128095.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|53755512|emb|CAH17011.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|307611729|emb|CBX01432.1| hypothetical protein LPW_31221 [Legionella pneumophila 130b] Length = 344 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 51/162 (31%), Gaps = 28/162 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R ++ H ++ I A +T+I DA+ A + Sbjct: 135 RIGLILFGTRAYLQTPLTYDRHSVLMRID--DATAGLAGKTTSIGDAVGLAVKRLQDVPS 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA----------- 404 + I+LLTDG N + AK GI+I TI Sbjct: 193 K-----------GRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRALT 241 Query: 405 --FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 F E L A + +F A L I++ Sbjct: 242 GDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQ 283 >gi|54298847|ref|YP_125216.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] gi|53752632|emb|CAH14067.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] Length = 344 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 51/162 (31%), Gaps = 28/162 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R ++ H ++ I A +T+I DA+ A + Sbjct: 135 RIGLILFGTRAYLQTPLTYDRHSVLMRID--DATAGLAGKTTSIGDAVGLAVKRLQDVPS 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA----------- 404 + I+LLTDG N + AK GI+I TI Sbjct: 193 K-----------GRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRALT 241 Query: 405 --FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 F E L A + +F A L I++ Sbjct: 242 GDFFAPTLSAELDEKTLEEMAKMTGGRYFRATDPESLQSIYQ 283 >gi|319780897|ref|YP_004140373.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166785|gb|ADV10323.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 492 Score = 85.4 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 79/496 (15%), Positives = 160/496 (32%), Gaps = 91/496 (18%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G+F I+ L V+ G V+V + +L+ A+ + + L Sbjct: 14 FARDRGGNFAILFGLSASVLALAVGFSVNVSQLYNARSSLQGVVDAAVTSTARDLTTGAI 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + K+ F N + + + + ++A V Sbjct: 74 KEADANKSVQAFLD------------ANSQAGILQADQIVL-----DRLIVNRTAKTVQA 116 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + D+ L+ G + ++ A + T + + V + ++D + SM Sbjct: 117 DAHVDV-----GLYFPIFG--TGDMKRVAASTTA---LYSDKTVEVAMMLDITGSMAKRG 166 Query: 194 R-DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP--------YMVSCNKSLYYML 244 + D G ++ QN ++ + + V K L Sbjct: 167 KVDKIGDLKTAAKNAVQTMLQKQDPQNPRIRVAIVPYASGVNAGKLAENVYAEKQASTEL 226 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN--------DTVR 296 P P L + + + +K N N DTVR Sbjct: 227 PPVAGSPLLVAKTGKNLLPSFSDYISIVGAAMPRPDNCATERKDKNGNADMSADGPDTVR 286 Query: 297 M---GATFF------NDRVISDPSFSWGVHKLIRTIVK----TFAIDENEMGSTAINDAM 343 G F+ + D + + T D G TA A+ Sbjct: 287 TDGNGKKFYALVNRDDHLGDGDMNRCPDAKVIPLTADSDALLESIEDFRANGFTAGAIAI 346 Query: 344 QTAYDTIISSNEDEVHRM-----KNNLEAKKYI---VLLTDGE---------NTQDNEEG 386 Q Y + + ++ + KK +L+TDG+ ++ + + Sbjct: 347 QWTYYMLSPQWRTAIRNAGLGKGASDADPKKIAKVAILMTDGQFNTAFAGAGDSYNRQGT 406 Query: 387 IA------ICNKAKSQGIRIMTIAFSVN-----KTQQEKARYFLSNCASPN------SFF 429 +A +C+ K+ GI I TI F ++ T++++A+ L +C+S + FF Sbjct: 407 LARGNAETLCDNMKNDGIEIFTIGFDLDDKDMSTTERDQAKAVLKDCSSKDTSGAKRHFF 466 Query: 430 EANSTHELNKIFRDRI 445 + ++ EL+ F++ I Sbjct: 467 DVSTGAELDDAFQEII 482 >gi|148258759|ref|YP_001243344.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] gi|146410932|gb|ABQ39438.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] Length = 449 Score = 85.4 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 63/484 (13%), Positives = 138/484 (28%), Gaps = 96/484 (19%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + G+ + A++ ++ G VD R + L+ A A + A + Sbjct: 10 RFRSDIQGNVAVTFAIVCVPLITAVGCGVDYSRANQLRAKLQSAVDAASVGAVSRTSPAF 69 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + RN N T + +V +K+ + Sbjct: 70 IAAGAMTADGIITA----GNDDARNIFNGNMNGTTGYTLN------SVTPEVKKTGSVLT 119 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + + F+ +GIK+ +Q ++A Y + ++D S SM Sbjct: 120 ATVSFSASVPMM--FMNIVGIKTMTLQGMSKATASMPKY-----IDFYLLLDNSPSMGVA 172 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + A Y + Sbjct: 173 ATPDDVTKMVNATSDAKYGSNRYCAF-----------------------------ACHDY 203 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM-----GATFFNDRVI 307 +F + + V + ++R A ++ + + + RM GA + Sbjct: 204 NDSNNFYNLAKSIGVTTRIDVLRSATQQLMDTATQTQTYPNQFRMAIYDFGAASKTIGLR 263 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ-TAYDTIIS--SNEDEVHRMKNN 364 + + S + + G+ A + T + + +NE Sbjct: 264 ALFALSANLSSAKSAAGNIDLMGV--YGNNDAYTADKDTPFTAVFPAVNNEISTPGDGTT 321 Query: 365 LEAKKYIVLLTDGENTQDNEEGI---------------AICNKAKSQGIRI---MTIAFS 406 KY+ ++DG + N + A+C K++GI+I T Sbjct: 322 GSPLKYLFFVSDGVADESNAACLKPKASGNRCQSPINPALCTTLKNRGIKIAVLYTTYLQ 381 Query: 407 VN------------------KTQQEKARYFLSNCASPNSFFEANS----THELNKIFRDR 444 + + + + CASP +FE + +N +F+ Sbjct: 382 LPTNSWYMSWIDPFNKGPFGPSPNSEIAQNMQACASPGFYFEVSPTQGIADAMNALFKKA 441 Query: 445 IGNE 448 + + Sbjct: 442 VADA 445 >gi|254786433|ref|YP_003073862.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687231|gb|ACR14495.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 347 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 66/194 (34%), Gaps = 38/194 (19%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + +V+ + I ++ R+G F + ++ K + T++ Sbjct: 115 IARILVVKYVVNEFIER-------RESDRLGLILFGSQAYLQAPLTFD-RKTVSTLLDEA 166 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + TAI DA+ A + ++ ++LLTDG NT Sbjct: 167 QLG-FAGEQTAIGDAVGLAIKRLRERPAS-----------QRVLILLTDGANTAGEVAPR 214 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKA----------------RYFLSNCA--SPNSFF 429 + AK GI+I T+ ++ +Q L A + +F Sbjct: 215 QAADLAKQAGIKIYTVGVGADQMEQRMGLFGGFSRTVNPSSDLDEDTLRYMAETTGGLYF 274 Query: 430 EANSTHELNKIFRD 443 A + EL I+ + Sbjct: 275 RARNPQELQAIYEE 288 >gi|163759224|ref|ZP_02166310.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] gi|162283628|gb|EDQ33913.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] Length = 541 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 78/550 (14%), Positives = 161/550 (29%), Gaps = 114/550 (20%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII------- 62 ++ +K +G+F ++ +L ++ GG+ VD V S + L+ A +A + Sbjct: 3 HTSSFLKDTSGNFALVFGILAVPVMVAGGLAVDYVGLSVEKSKLQNAVDSAALLIARAGD 62 Query: 63 -----------------------TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFE 99 +V ++ V + + F + + E Sbjct: 63 MSETQAMKLAKTTITTNYGINVAKVAVSMVDGDATVKASMDQALVFGGFMGRKNAAVSAE 122 Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD----LLLNPLSLFLRSMGIKS 155 +T E+ ++ T + + ++ Q + D L LN L + Sbjct: 123 ATATYAYTKYEIALVLDTTGSMLGGKLTSLQNAVIGLVDGMEALGLNKEQLKFAVVPYAG 182 Query: 156 WLIQTKAEAETVSRSYHKEHGVSI-------------QWVIDFSR-SMLDYQRDSEGQPL 201 ++ T++ + + + DFSR +M ++ + + Sbjct: 183 FVNVGPEYGPTINGAGKVKKPAAAWIDQDAKAPIPQSDLPSDFSRFAMFNHLKVEWPGCV 242 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + + + + + D + + Sbjct: 243 ETRVPADKILHDVKDTVPDITDPKSLFTPFFAIDEPDNKWGYPNSYLPDGGKPVKGNKAT 302 Query: 262 SSLRHV----------IKKKHLVRDALASVIRSIKKI--DNVNDTVRMG-----ATFFND 304 + + KK DA+A + KK+ DN F Sbjct: 303 EAEKQDQLARYGKTGEYKKPKNTDDAIA-LTGKWKKVKVDNSPSNFYSNKKDPKGPGFGC 361 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + K+ T+ GST + + + + + K++ Sbjct: 362 EMEPLVPLTTDFSKIRTTV-----KALEANGSTNMLEGVMWGWRVLSDREPFAQGAPKSD 416 Query: 365 LEAKKYIVLLTDGEN------------------------------------TQDNEEGIA 388 +K ++ LTDG+N +++ A Sbjct: 417 ASVEKIMIFLTDGQNSFGNLNNDLGSAYTSMGYLVDGRLDGMTAANIGQTNNALDKKTKA 476 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGN 447 C AK G+ I TI K L CA S +F+A S +L IF D I Sbjct: 477 ACENAKEDGVTIYTIRLEEADVGTGK---MLEECATSSAHYFDAPSRQQLTPIF-DAIKK 532 Query: 448 EIFERVIRIT 457 + + +R+T Sbjct: 533 GVVK--LRLT 540 >gi|110679843|ref|YP_682850.1| hypothetical protein RD1_2614 [Roseobacter denitrificans OCh 114] gi|109455959|gb|ABG32164.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 488 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 88/519 (16%), Positives = 160/519 (30%), Gaps = 108/519 (20%) Query: 6 KFIFYSKKLI---KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + ++L+ + G I +++ +ML V G+ VD+++ ++ AI+ Sbjct: 12 SLRGFCRRLMGFRREEDGAMTIFATMMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAIL 71 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 AS ++ + +++Y + E D V+ T Sbjct: 72 AAS--------DLDQPL-----PADEVVDDYFAK---------AGMTEFLDDVQITPGAH 109 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 P + V +R S+++ G++S + AE + I V Sbjct: 110 LPTTNFRVVQAEARTRTP----SIYMAMTGVRSLPVYVAGTAEETIENTE------ISLV 159 Query: 183 IDFSRSMLDYQRDSEGQPLNCF--GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +D S SM + + + G + + +S N P + Sbjct: 160 LDISGSMRNNGKIGNLRTAAKDFIGAVLEGNAANTTSLNIVPYAGQTNPGPIVFQRAGGR 219 Query: 241 YYMLYPGPLDPS--LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID-NVNDTVRM 297 + + D + L + FVD + + + L ID + Sbjct: 220 PFATFIEDSDGNEILYGQTFVDDEGNSIDVP-YNTMSSCLDLTNGDFDNIDLPSGGYDQT 278 Query: 298 GATFFNDRVISDPSFSWG----VHKLIRTIVK------TFAIDENEMGSTAINDAMQTAY 347 F N + + WG IR F D T M+ Sbjct: 279 -PYFMNWPIDAPT-MDWGWCPQNKSSIRYAQNNAGQLQDFIDDMRLHDGTGTQYGMKYGV 336 Query: 348 DTIISSNEDE----------------VHRMKNNLEAKKYIVLLTDG---------ENTQD 382 + S+ D + +K+IVL+TDG + Sbjct: 337 ALLNPSSRDTFVALNAAGLVPDGFKDRPADFGTTDTRKFIVLMTDGQITDQFRPEDKNDP 396 Query: 383 NEEGIA-----------------------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + IA ICNKAK+ GI + TIAF + + Sbjct: 397 KNDEIALNQRIGDRDTYATQSTNVANFYSICNKAKAAGITVYTIAFEAPANAITQ----M 452 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 CA+ +FF E+ F+ I +I E +R+T+ Sbjct: 453 RTCATSPAFFYKVEGVEIKTAFKS-IARQINE--LRLTQ 488 >gi|328676285|gb|AEB27155.1| BatA in aerotolerance operon [Francisella cf. novicida Fx1] Length = 333 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 63/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 106 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 165 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 166 D--DASIALPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGT 212 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + + Sbjct: 213 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 272 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 273 FRAQNSSDLKKVYES 287 >gi|59713864|ref|YP_206639.1| hypothetical protein VF_A0681 [Vibrio fischeri ES114] gi|59482112|gb|AAW87751.1| hypothetical membrane spanning protein [Vibrio fischeri ES114] Length = 321 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 78/221 (35%), Gaps = 35/221 (15%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S + + V+ ++ I +K D R+G F D Sbjct: 90 DLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQ-RKGD------RLGLVLFGDHAYLQ 142 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 ++ + + + +T + TAI + + A T I SN ++ Sbjct: 143 TPLTFDRNTVREQLDRTVLNLVGQR--TAIGEGLGLATKTFIESN-----------APQR 189 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-------------QEKAR 416 I+LL+DG NT E + AK +I T+ + Q ++ Sbjct: 190 TIILLSDGANTAGVLEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNTARDLDE 249 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L+ A + +F A + EL +I++ E + + Sbjct: 250 DTLTKIATMTGGQYFRARNADELAEIYQTIDALEPVTQATQ 290 >gi|260466792|ref|ZP_05812977.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259029404|gb|EEW30695.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 492 Score = 84.3 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 79/504 (15%), Positives = 150/504 (29%), Gaps = 107/504 (21%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G+F ++ V+ G V++ + + +L+ A+ + + L + Sbjct: 14 FARDRGGNFAVLFGFAASVLALAAGFSVNISQLYNAKSSLQGVVDAAVTSTARDLTTGVI 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + + N + + ++ K+A V Sbjct: 74 KEADA------------DNSVKAFLVANSAAGILQPDQVVL-----DKLIVDKTAKTVQA 116 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + D+ L + GI + A T + K V++ ++D + SM Sbjct: 117 NVHVDVAL-----YFPLFGIGDMQ---RVAASTTALYSDKTVEVAM--MLDITGSMAKRG 166 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + A V++ R S + Sbjct: 167 NV---DKIGDLRAAARNAVQTMLQNQDPKRPRIRVAIVPYASGVNAGKLAENVYAETQGS 223 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVI--------------------RSIKKIDNVND 293 SE V SSL K L+ + + D Sbjct: 224 SELPPVAGSSLLVAKTGKALLPSFSDYISIVGAAMPHPDNCTTERKNKNGDADLSADGPD 283 Query: 294 TVRM--------GATFFNDRVISD----------PSFSWGVHKLIRTIVKTFAIDENEMG 335 TVR +D + + L+ +I D G Sbjct: 284 TVRTDRNGKKYYALVNRDDHLDGGGMNRCPDAEVIPLTADSDALLDSID-----DFRAAG 338 Query: 336 STAINDAMQTAYDTIISSNEDEVHRM--------KNNLEAKKYIVLLTDGE--------- 378 TA A+Q Y + + + N + K +L+TDG+ Sbjct: 339 YTAGAIAIQWTYYMLSPQWRAAIKNVGLGNGASDANAKKIAKVAILMTDGQFNTAFAGAG 398 Query: 379 ---NTQDNE---EGIAICNKAKSQGIRIMTIAFSVN-----KTQQEKARYFLSNCASPN- 426 N Q + A+C K+ GI I TI F +N T++++A+ L C+S + Sbjct: 399 GSYNGQGDLARGNAEALCGNMKNDGIEIFTIGFDLNDKDMSATERDQAKAVLKGCSSKDA 458 Query: 427 -----SFFEANSTHELNKIFRDRI 445 +FEA++ EL+ F++ I Sbjct: 459 SAAERHYFEASTGAELDAAFQEII 482 >gi|254373668|ref|ZP_04989152.1| conserved hypothetical protein [Francisella novicida GA99-3548] gi|151571390|gb|EDN37044.1| conserved hypothetical protein [Francisella novicida GA99-3548] Length = 339 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 63/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 112 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 171 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 172 D--DASIALPGPQTAIGDAIGLAVKKLKKFPGDS-----------KALILLTDGENNSGT 218 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + + Sbjct: 219 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 278 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 279 FRAQNSSDLKKVYES 293 >gi|328675375|gb|AEB28050.1| BatA in aerotolerance operon [Francisella cf. novicida 3523] Length = 333 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ + + + V + R+G F R ++ + + + + Sbjct: 106 IQDMKKSNGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 165 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 166 D--DASIALPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGT 212 Query: 384 EEGIAICNKAKSQGIRIMTIAFS-------------VNKTQQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + T ++ L A + + Sbjct: 213 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLINTSEDLDTTVLEKIAEMTGGKY 272 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 273 FRAQNSSDLKKVYES 287 >gi|307825379|ref|ZP_07655598.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733554|gb|EFO04412.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 326 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 42/192 (21%), Positives = 72/192 (37%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTV--RMGATFFNDRVISDPSFSWGVHKLIRTIV 324 VI K+ + R A ++ D +N V R+G F + ++ K + T++ Sbjct: 107 VINKRSVDRLTAAKMVA----ADFINRRVGDRVGLILFGTQAYLQTPLTFD-RKTVMTLL 161 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 I +TAI DA+ A V R+K+ + +VL+TDG NT Sbjct: 162 NEAVIGL-AGDNTAIGDAIGLA-----------VKRLKSEQVNSRVLVLMTDGANTAGEV 209 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP--NSFF 429 + A + ++I TI ++ + L A ++ Sbjct: 210 SPLKAAELAAANHLKIYTIGIGADEMIVRSFFGNRKINPSVDLDEKTLIKIAESTGGQYY 269 Query: 430 EANSTHELNKIF 441 A +T ELN I+ Sbjct: 270 RARNTDELNNIY 281 >gi|254372185|ref|ZP_04987677.1| conserved hypothetical protein [Francisella tularensis subsp. novicida GA99-3549] gi|151569915|gb|EDN35569.1| conserved hypothetical protein [Francisella novicida GA99-3549] Length = 339 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 63/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 112 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 171 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 172 D--DASIALPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGT 218 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + + Sbjct: 219 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 278 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 279 FRAQNSSDLKKVYES 293 >gi|118496821|ref|YP_897871.1| von Willebrand factor type A domain-containing protein [Francisella tularensis subsp. novicida U112] gi|118422727|gb|ABK89117.1| von Willebrand factor type A domain protein [Francisella novicida U112] Length = 333 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 63/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 106 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 165 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 166 D--DASIALPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGT 212 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + + Sbjct: 213 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 272 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 273 FRAQNSSDLKKVYES 287 >gi|86357991|ref|YP_469883.1| hypothetical protein RHE_CH02376 [Rhizobium etli CFN 42] gi|86282093|gb|ABC91156.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 427 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 76/462 (16%), Positives = 140/462 (30%), Gaps = 89/462 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I + +G+F I+TALLM ++G GM VDV L AA A + + + Sbjct: 8 FISNRSGNFGIMTALLMVPLMGAAGMAVDVAHALSLRTQLYAAADAAAVG---SIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ + + F + TD + + T L Sbjct: 65 AVAAAMTMNGNGTVSLGKTDARNIFMSQTSGELTDIHIDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F+R G S +I A AE Y + ++D + SM Sbjct: 116 NSQVSFTATVPTTFMRIFGRDSIIISGTATAE-----YQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S+ L A + S N + KSL + Sbjct: 171 TASDVSKLQAKTGCAFACHQMDQSTNN-------------YTIAKSLGVTM--------- 208 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD---- 309 + +VR A ++ + K +D RMG F + Sbjct: 209 ----------------RIDVVRQATQALTDTAKAERVSSDQFRMGVYTFGTKAEDAKLTT 252 Query: 310 -PSFSWGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + K+ + + ++ + +A I + D +N+ Sbjct: 253 ISGLTSDLTKVKNYTNAVDLMTIPYQNYNSDQLTSFDSAMTQINTII-DPAGDGTSNISP 311 Query: 368 KKYIVLLTD--GENTQDNEEGIAI-------------CNKAKSQGIRI---MTIAFSVNK 409 +K + + D G++ + + C K +G++I T + Sbjct: 312 EKILFFVADGVGDSYKPSTCTKKTTGGRCQEPIDTTFCKPLKDRGVKIAVLYTTYLPLPS 371 Query: 410 TQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFR 442 Q + + CASP +FE T + + Sbjct: 372 NSWYNTWIKPFQNEIPTKMQACASPGLYFEVTPTDGIADAMK 413 >gi|158425008|ref|YP_001526300.1| von Willebrand factor type A domain-containing protein [Azorhizobium caulinodans ORS 571] gi|158331897|dbj|BAF89382.1| von Willebrand factor type A domain protein [Azorhizobium caulinodans ORS 571] Length = 343 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 35/159 (22%), Positives = 61/159 (38%), Gaps = 24/159 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ R ++ ++R +++T +I TAI DA+ A T+ + + Sbjct: 136 RIGLILFSTRAYVQAPLTFD-RNVVRDLLRTSSIGMTGQE-TAIGDAIALAVKTLRTRPQ 193 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++ +VLLTDG N I AK+ G++I TI + Sbjct: 194 E-----------QRVLVLLTDGANNSGMLSPIPAAEIAKANGVKIYTIGVGADAFAVGQR 242 Query: 413 ------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L A + +F A L I+ D Sbjct: 243 MVNPSFDLDEGALEQIAQMTGGRYFRARDAAGLAAIYND 281 >gi|254368552|ref|ZP_04984568.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] gi|157121455|gb|EDO65646.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] Length = 339 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 112 IQDMKKANGQMESRFDLVMRVANQFIDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 171 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 172 D--DASIALPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGT 218 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI ++ ++ L A + + Sbjct: 219 LQPLQAAEIAKQYHIKIYTIGLGGDQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 278 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 279 FRAQNSSDLKKVYES 293 >gi|326795817|ref|YP_004313637.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546581|gb|ADZ91801.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 337 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 37/163 (22%), Positives = 60/163 (36%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + S+ + + I +T TAI DA+ + Sbjct: 138 RIGVIVFGTKAYLQAPLSFDTKTVRQLIQETQI--GFAGEKTAIGDAIGLGIKQLSELPS 195 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---- 411 D KK ++L+TDG NT + N A QG+ I TI ++ + Sbjct: 196 D-----------KKVLILMTDGANTAGRVSPLQAANFAAEQGVTIHTIGIGADEMEVQGF 244 Query: 412 ---------QEKARYFLSNCAS--PNSFFEANSTHELNKIFRD 443 ++ L N AS ++ A ST +L +I+ D Sbjct: 245 FGPQTVNPSEDLDEALLENVASLTGGKYYRAKSTSDLEEIYGD 287 >gi|209549601|ref|YP_002281518.1| hypothetical protein Rleg2_2008 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209535357|gb|ACI55292.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 429 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 77/478 (16%), Positives = 142/478 (29%), Gaps = 95/478 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALLM +LG GM VD L AA A + + + Sbjct: 8 FISDRSGNFGIMTALLMVPLLGTAGMAVDFAHAMSLRTQLFAAADAAAVG---SIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ + + F + + D V + T L Sbjct: 65 AVAAAMTMTGNGTISLGKTDARSIFLSQVSGELADVNVDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F+R +G S I A AE ++ S+ + ++D + SM Sbjct: 116 NSQVSFTAVVPTTFMRVLGKDSITISGTATAEYLTASF-----MDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + + SC + + Sbjct: 171 TAKDVATMEKNTSD---------------------------SCAFACHETENKN------ 197 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD---- 309 ++ + + V + +VR A + + K + RMG F + Sbjct: 198 ---NYYNLAKTLGVSMRIDVVRQATKELTLTAKSTRVSTNQFRMGVYTFGTKAEDANLTT 254 Query: 310 -PSFSWGVHKLIRTIVKTFAIDENEMG-----STAINDAMQTAYDTIISSNEDEVHRMKN 363 + + K+ + + G T+ ++A+ D I + + Sbjct: 255 ISDPTDDLDKVRTYTDAVDLMTIPKQGYNNDQQTSFDNALTQMKDIITTPGDGS-----T 309 Query: 364 NLEAKKYIVLLTDGENTQDNEEG---------------IAICNKAKSQGIRI---MTIAF 405 +K + ++DG + +G + C K +GIRI T Sbjct: 310 ATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCKPLKDKGIRIAVLYTTYL 369 Query: 406 SVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + K Q + + CASP +FE T + + I I Sbjct: 370 PLPKNSWYNTWISPFQSQIPTKMQECASPGLYFEVTPTEGIADAMKALFLKAIRAPRI 427 >gi|218462234|ref|ZP_03502325.1| hypothetical protein RetlK5_23393 [Rhizobium etli Kim 5] Length = 66 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 24/70 (34%), Positives = 36/70 (51%), Gaps = 6/70 (8%) Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGN 447 C+ AKS+GI I TIAF + L CAS +S +F+A +L F+ IG Sbjct: 1 TCDTAKSKGIEIYTIAFMA----PAGGQALLHYCASDDSHYFQAEKMEDLLAAFK-AIGA 55 Query: 448 EIFERVIRIT 457 + ++ R+T Sbjct: 56 KASSQLTRLT 65 >gi|254501086|ref|ZP_05113237.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] gi|222437157|gb|EEE43836.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] Length = 465 Score = 83.5 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 74/511 (14%), Positives = 146/511 (28%), Gaps = 130/511 (25%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K ++ G I A ++ V++ +GG +D+ R L A A ++ + L Sbjct: 8 KFNRNQDGSILPIFAGMVLVLVVIGGAAIDISRAVNAREKLAYAIDAAALSVATDL---- 63 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ +I+ + +F + N +D E D D ++ + +A V Sbjct: 64 --------STTVLRDNQIKTRIENSF----RANLSDAEFLDQAID-NLDFDVDSNAGTVT 110 Query: 133 LSSRYDLLLNPLSLFLRSMGI-KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 +SS L + FL G K L T + + V + V+D + SM Sbjct: 111 VSSSAGLN----NYFLNIPGFGKDGLGPDVFNFGTSAEVNYSRFDVELALVVDVTGSMAG 166 Query: 192 YQRD-------------------------------SEGQPLNCFGQP-ADRTVKSYSSQN 219 S+G L + + + + N Sbjct: 167 DMGALRDAAEEVVDILIEDDASNSASKVRISLVPYSQGVNLGSYASTVTNGSTSWRNCVN 226 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 + G + + Y S Y+ S+ + Sbjct: 227 EREGQQKYTDAVYNYDGTNSEYFHGLQSYFIWDYGSSENWSSARDDCPSSSLQPLTSDKN 286 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++I I+ + + +WG + L + D + Sbjct: 287 TLISDIRNLSSG------------GGTGGQTGVAWGWYTLSPNWTSLWPTDSDPEPYGNG 334 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE--------------------- 378 + + KK+ +++TDG+ Sbjct: 335 TP----------------------DDDVKKFALIMTDGDFNAQYGKEERTTCTGRGRNRV 372 Query: 379 -----------------NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 N +C+ K++ I I T+ F + +S Sbjct: 373 CTTNEYWVERYHRYSDYNDPPATRARTLCDAMKAENIEIFTVFF--DTGGSAFGDDLMSY 430 Query: 422 CAS-PNSFFEANSTHELNKIFRDRIGNEIFE 451 CAS + ++EA++ EL F + I I + Sbjct: 431 CASGSDYYYEADNKDELITAFSN-IAKRIQQ 460 >gi|194324498|ref|ZP_03058270.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|194321333|gb|EDX18819.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 339 Score = 83.5 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 63/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 112 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 171 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 172 D--DASIALPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGT 218 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + + Sbjct: 219 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 278 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 279 FRAQNSSDLKKVYES 293 >gi|269104787|ref|ZP_06157483.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] gi|268161427|gb|EEZ39924.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] Length = 321 Score = 83.5 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 78/210 (37%), Gaps = 35/210 (16%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + I + ++ L++ I +K D R+G F D Sbjct: 88 AVDLSGSMAIKDMQTQSGQSIDRLTAIKHVLSNFIEK-RKGD------RLGLVLFGDHAY 140 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++ H + + + +T STAI + + A T I S Sbjct: 141 LQTPLTFDRHTVEQQLDRTVL--GLVGQSTAIGEGLGIATKTFIKSK-----------AP 187 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEK 414 ++ I+LL+DG NT + + AK G+ I T+ ++ Q+ Sbjct: 188 QRVIILLSDGANTAGVIDPLEAAKLAKESGVTIYTVGIGADEMLQRSIFGVQKVNPSQDL 247 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIFR 442 L+ A + +F A + EL+KI++ Sbjct: 248 DEKTLTKIAQMTGGKYFRARNPQELDKIYQ 277 >gi|208780564|ref|ZP_03247903.1| von Willebrand factor type A domain protein [Francisella novicida FTG] gi|208743539|gb|EDZ89844.1| von Willebrand factor type A domain protein [Francisella novicida FTG] Length = 333 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 63/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 106 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 165 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 166 D--DASIALPGPQTAIGDAIGLAVKKLKKFPGDS-----------KALILLTDGENNSGT 212 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + + Sbjct: 213 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 272 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 273 FRAQNSSDLKKVYES 287 >gi|149176865|ref|ZP_01855475.1| BatA [Planctomyces maris DSM 8797] gi|148844302|gb|EDL58655.1| BatA [Planctomyces maris DSM 8797] Length = 356 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 69/197 (35%), Gaps = 26/197 (13%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + +++ + ++++ + + +G F + L+ + Sbjct: 108 HVDRLTAIKNVAGKFVEGKEELEGRFNDL-VGLMTFAGYADGITPPTLDHPYLVSQLNNI 166 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI DA+ A + + + + ++ K I+LLTDGEN E Sbjct: 167 QIVTNRSEDGTAIGDAISLAVEKL-----NALDARRDEKVKSKVIILLTDGENNAGEVEP 221 Query: 387 IAICNKAKSQGIRIMTIAFS------VNKTQQEKARYF------------LSNCA--SPN 426 I A++ GI++ TI V T + L A + Sbjct: 222 IQAAELAETLGIKVYTIGVGTKGEAPVPVTDPFSGKQVVQWMPVNIDEATLQKVADLTHG 281 Query: 427 SFFEANSTHELNKIFRD 443 +F A T L KI+ + Sbjct: 282 KYFRATDTDSLEKIYHE 298 >gi|89255637|ref|YP_512998.1| hypothetical protein FTL_0203 [Francisella tularensis subsp. holarctica LVS] gi|134302613|ref|YP_001122584.1| hypothetical protein FTW_1793 [Francisella tularensis subsp. tularensis WY96-3418] gi|156501587|ref|YP_001427652.1| hypothetical protein FTA_0219 [Francisella tularensis subsp. holarctica FTNF002-00] gi|167009921|ref|ZP_02274852.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella tularensis subsp. holarctica FSC200] gi|224456527|ref|ZP_03665000.1| hypothetical protein FtultM_01598 [Francisella tularensis subsp. tularensis MA00-2987] gi|254367031|ref|ZP_04983067.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|290953465|ref|ZP_06558086.1| hypothetical protein FtulhU_03745 [Francisella tularensis subsp. holarctica URFT1] gi|295313263|ref|ZP_06803900.1| hypothetical protein FtulhU_03730 [Francisella tularensis subsp. holarctica URFT1] gi|89143468|emb|CAJ78644.1| hypothetical membrane protein [Francisella tularensis subsp. holarctica LVS] gi|134050390|gb|ABO47461.1| conserved membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis WY96-3418] gi|134252857|gb|EBA51951.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|156252190|gb|ABU60696.1| conserved membrane protein with von Willebrand factor, type A domain [Francisella tularensis subsp. holarctica FTNF002-00] gi|282158589|gb|ADA77980.1| hypothetical protein NE061598_01650 [Francisella tularensis subsp. tularensis NE061598] Length = 333 Score = 83.1 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 63/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 106 IQDMKKANGQMESRFDLVMRVANQFIDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 165 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 166 D--DASIALPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGT 212 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + + Sbjct: 213 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 272 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 273 FRAQNSSDLKKVYES 287 >gi|114798549|ref|YP_759188.1| hypothetical protein HNE_0458 [Hyphomonas neptunium ATCC 15444] gi|114738723|gb|ABI76848.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 460 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 66/489 (13%), Positives = 158/489 (32%), Gaps = 82/489 (16%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + ++ +G+ +I AL + ++G+ G +D + + ++QA +A++ A+ + Sbjct: 6 RNFFRNESGNVAMIAALTIIPIVGIAGFAIDFQVTTTQKARVQQAVDSAVLAATKSMQDG 65 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + K + + K + N NN N T+ ++ I +E + Sbjct: 66 -KDRAYSLKEANDYFKGIL------NQSNNSGLNCTNIDLVYIDETEELEGH-------- 110 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + L+ + GI+ + A + I +V D S SM + Sbjct: 111 -VECSQNTTLS------KVAGIRHLDFNVSSAATYGIGK------LEIAFVFDVSGSMAN 157 Query: 192 ---------YQRDSEGQPLNCFGQPADR---------------TVKSYSSQNGKVGIRDE 227 R++ L G D + + + R E Sbjct: 158 DNRMGNLKVAAREAVNTLLPVEGYAGDPEDVRLAMVSYDTMVNAGPYFKAVTNQDPERTE 217 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS--LRHVIKKKHLVRDALASVIRSI 285 Y+ Y + + S + ++ + Sbjct: 218 PFYGYIRERTTCRRYRNNGTCREWNYEWRGPYHRSYTIKSTCVWEREGAERYTDASPGHN 277 Query: 286 KKIDNVNDTVRMGATFFNDRVIS--------DPSFSWGVHKLIRTIVKTFAIDENEM--G 335 + + V+ T ++ + ++ +KL I G Sbjct: 278 RWLPPVSATFDSYNDSWSTDHQTDPWCNDNTPIPLTYNRNKLHDFIDDMTPRRNTAGHIG 337 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ---------DNEEG 386 + ++++ + + + + K +++++DG+ + + Sbjct: 338 QAWGWYLVSPEWNSVWPAGSKALPYDEPDAT--KVVIMMSDGQYNETRHNNAYPSSVTQA 395 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDRI 445 AIC+K K + + I T+ F + L+ CAS +F ++ + EL + ++ I Sbjct: 396 EAICDKMKEKEVVIYTVGFDAGY-----GQDVLNYCASNPAFAYKPTNGQELTEAYKS-I 449 Query: 446 GNEIFERVI 454 I + I Sbjct: 450 ARSISDLRI 458 >gi|187932172|ref|YP_001892157.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713081|gb|ACD31378.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 333 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 63/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 106 IQDMKKANGQMESRFDLVMRVANQFIDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 165 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 166 D--DASIALPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGT 212 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + + Sbjct: 213 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 272 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 273 FRAQNSSDLKKVYES 287 >gi|270158235|ref|ZP_06186892.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|289163509|ref|YP_003453647.1| hypothetical protein LLO_0165 [Legionella longbeachae NSW150] gi|269990260|gb|EEZ96514.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|288856682|emb|CBJ10493.1| putative unknown protein [Legionella longbeachae NSW150] Length = 342 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 68/189 (35%), Gaps = 35/189 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++V+ A +R + D ++G F R ++ H ++ + A Sbjct: 115 SRLNIVKSAAEQFVRE-RSGD------KIGLILFGTRAYLQTPLTYDRHSILLRL--EDA 165 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +T+I DA+ A + S+ + + I+LLTDG N + Sbjct: 166 TAGLAGKTTSIGDAVGLAVKRLDSAPKK-----------GRVIILLTDGANNSGVLAPLK 214 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK-ARYFLSNCASP--------------NSFFEANS 433 AK +GI+I TI + + + + A+ +F A Sbjct: 215 AAELAKEEGIKIYTIGLGSEGDSRALVGDFLMQSPAADLDEETLKKMSDMTGGRYFRATD 274 Query: 434 THELNKIFR 442 T L+ I++ Sbjct: 275 TESLHLIYK 283 >gi|152995759|ref|YP_001340594.1| von Willebrand factor type A [Marinomonas sp. MWYL1] gi|150836683|gb|ABR70659.1| von Willebrand factor type A [Marinomonas sp. MWYL1] Length = 342 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 68/199 (34%), Gaps = 35/199 (17%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 D + + + L+ I+ ++ D R+G F + S+ K Sbjct: 103 TDMALNGQPANRLEAAKSVLSDFIQE-RRGD------RIGIIVFGSKAYLQAPLSFDT-K 154 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 I +V+ I TAI DA+ + D KK ++L+TDG Sbjct: 155 TINQLVQEAQIG-FAGEQTAIGDAIGLGIKRLEDKPSD-----------KKVLILMTDGA 202 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA-- 423 NT + A SQ ++I TI + + L N A Sbjct: 203 NTAGRVQPQQAATFAASQNVKIHTIGIGADSMIVQSFFGPKAINPSSDLDETLLKNIAAQ 262 Query: 424 SPNSFFEANSTHELNKIFR 442 + +F A ST +L I++ Sbjct: 263 TGGEYFRAKSTEDLQAIYQ 281 >gi|56707447|ref|YP_169343.1| hypothetical protein FTT_0293 [Francisella tularensis subsp. tularensis SCHU S4] gi|110669918|ref|YP_666475.1| hypothetical protein FTF0293 [Francisella tularensis subsp. tularensis FSC198] gi|115314141|ref|YP_762864.1| hypothetical protein FTH_0198 [Francisella tularensis subsp. holarctica OSU18] gi|254370860|ref|ZP_04986865.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874284|ref|ZP_05246994.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] gi|56603939|emb|CAG44926.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis SCHU S4] gi|110320251|emb|CAL08309.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis FSC198] gi|115129040|gb|ABI82227.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica OSU18] gi|151569103|gb|EDN34757.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254840283|gb|EET18719.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] Length = 339 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 63/195 (32%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ A + + V + R+G F R ++ + + + + Sbjct: 112 IQDMKKANGQMESRFDLVMRVANQFIDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKML 171 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K ++LLTDGEN Sbjct: 172 D--DASIALPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGT 218 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + + Sbjct: 219 LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKY 278 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L K++ Sbjct: 279 FRAQNSSDLKKVYES 293 >gi|253584083|ref|ZP_04861281.1| BatA protein [Fusobacterium varium ATCC 27725] gi|251834655|gb|EES63218.1| BatA protein [Fusobacterium varium ATCC 27725] Length = 319 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 34/160 (21%), Positives = 55/160 (34%), Gaps = 25/160 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F + ++ + + K D TAI + A + + Sbjct: 122 RLSLIVFGGDAYTKVPLTFDHNVIKEMTRKLTVDDITSNTRTAIGMGIGVALNRL----- 176 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK------ 409 K++ K I+LLTDGEN A + AK GI+I TI + Sbjct: 177 ------KDSEAKSKVIILLTDGENNSGEMSPSAAADIAKELGIKIYTIGIGAKEIKVPSF 230 Query: 410 ------TQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 E L + A + +F A+ + E +IF Sbjct: 231 FGYKTVKNTELDENMLKSIAETTGGEYFRASDSKEFKEIF 270 >gi|90417299|ref|ZP_01225225.1| batB protein, putative [marine gamma proteobacterium HTCC2207] gi|90330884|gb|EAS46147.1| batB protein, putative [marine gamma proteobacterium HTCC2207] Length = 330 Score = 82.3 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 70/193 (36%), Gaps = 35/193 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + V+ + + + ++ D R+G F ++ ++ K ++T++ Sbjct: 112 QTVNRLMAVKAVVGNFVTE-REGD------RLGLILFGEKAYLQTPLTFD-RKTMQTLLY 163 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G TAI DA+ + + E+ + ++LLTDG N + Sbjct: 164 EAQLGFAGNG-TAIGDAIGLSVKRLQQRPENH-----------RVVILLTDGANNAGELD 211 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQ-------------QEKARYFLSNC--ASPNSFFE 430 + A S ++I TI + + L+ A+ +F Sbjct: 212 PLKAAELASSAKVKIYTIGVGAETQEAWGLFGKRVTNPSADLDEQTLTAIAEATGGQYFR 271 Query: 431 ANSTHELNKIFRD 443 A + EL I+++ Sbjct: 272 ARNPEELMAIYQE 284 >gi|197336671|ref|YP_002158318.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197313923|gb|ACH63372.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 321 Score = 82.3 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 79/221 (35%), Gaps = 35/221 (15%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S + + V+ ++ I +K D R+G F D Sbjct: 90 DLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQ-RKGD------RLGLVLFGDHAYLQ 142 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 ++ + + + +T +M TA+ + + A T I SN ++ Sbjct: 143 TPLTFDRNTVREQLDRTVLRLVGQM--TAMGEGLGLATKTFIESN-----------APQR 189 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-------------QEKAR 416 I+LL+DG NT E + AK +I T+ + Q ++ Sbjct: 190 TIILLSDGANTAGVLEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNTARDLDE 249 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L+ A + +F A + EL +I++ E + + Sbjct: 250 DTLTKIATMTGGQYFRARNADELAEIYQTIDALEPVTQATQ 290 >gi|327190622|gb|EGE57710.1| hypothetical protein RHECNPAF_409007 [Rhizobium etli CNPAF512] Length = 427 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 76/467 (16%), Positives = 135/467 (28%), Gaps = 99/467 (21%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII-TASVPLIQSL 72 I +G+F I+TALLM ++G GM VD L AA A + + + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ + + K + + L + D + + Sbjct: 68 AAMAMNGNGTISLGKTDARDIFMSQVSGELAEVHVDLGIDVTKTANKLN----------- 116 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 S+ + F+R G S I A AE Y + ++D + SM Sbjct: 117 --SQVSFTATVPTTFMRIFGRDSITISGTATAE-----YQTAAFMDFYILLDNTPSMGVG 169 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 S+ L A + S N + KSL + Sbjct: 170 ATPSDVSKLEAKVGCAFACHQMDKSTNN-------------YTIAKSLGVAM-------- 208 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD--- 309 + +VR A ++ + K +D RMG F + Sbjct: 209 -----------------RIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLT 251 Query: 310 --PSFSWGVHKLIRTIVKTFAIDE-----NEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + K+ + N T + AM I D Sbjct: 252 TISGLTSDLTKVKNYTDAVDLMTIPYQNYNSDQITNFDSAMTQMNTII-----DLAGDGT 306 Query: 363 NNLEAKKYIVLLTD--GENTQDNEEGIAI-------------CNKAKSQGIRI---MTIA 404 +N A+K + ++D G++ + + C K +G++I T Sbjct: 307 SNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTY 366 Query: 405 FSVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + Q + + CASP +FE + T + + Sbjct: 367 LPLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTDGITDAMK 413 >gi|190892054|ref|YP_001978596.1| hypothetical protein RHECIAT_CH0002466 [Rhizobium etli CIAT 652] gi|190697333|gb|ACE91418.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 427 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 77/466 (16%), Positives = 137/466 (29%), Gaps = 97/466 (20%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALLM ++G GM VD L AA A + + + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVG---SIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ + + F + + + V + T L Sbjct: 65 AVAAAMAMNGNGTISLGKTDARNIFMSQVSGELAEVHVDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F++ G S I A AE Y + ++D + SM Sbjct: 116 NSQVSFTATVPTTFMQIFGRDSITISGTATAE-----YQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S+ L A + S N + KSL + Sbjct: 171 TPSDVSKLEAKVGCAFACHQMDKSTNN-------------YTIAKSLGVAM--------- 208 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD---- 309 + +VR A ++ + K +D RMG F + Sbjct: 209 ----------------RIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT 252 Query: 310 -PSFSWGVHKLIRTIVKTFAIDE-----NEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + K+ + N T + AM I D + Sbjct: 253 ISGLTSDLTKVKSYTDAVDLMTIPYQNYNSDQITNFDSAMTQMNTII-----DPAGDGTS 307 Query: 364 NLEAKKYIVLLTD--GENTQDNEEGIAI-------------CNKAKSQGIRI---MTIAF 405 N A+K + ++D G++ + + C K +G++I T Sbjct: 308 NTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYL 367 Query: 406 SVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + Q + + CASP +FE + T + + Sbjct: 368 PLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTDGITDAMK 413 >gi|257469959|ref|ZP_05634051.1| hypothetical protein FulcA4_11506 [Fusobacterium ulcerans ATCC 49185] gi|317064188|ref|ZP_07928673.1| BatA protein [Fusobacterium ulcerans ATCC 49185] gi|313689864|gb|EFS26699.1| BatA protein [Fusobacterium ulcerans ATCC 49185] Length = 319 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 34/160 (21%), Positives = 55/160 (34%), Gaps = 25/160 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F + ++ + + K D TAI + A + + Sbjct: 122 RLALIVFGGDAYTKVPLTFDHNVIKEMTGKLTVDDITSNTRTAIGMGIGVALNRL----- 176 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK------ 409 K++ K I+LLTDGEN A + AK GI+I TI + Sbjct: 177 ------KDSEAKSKVIILLTDGENNSGEMSPSAAADIAKELGIKIYTIGIGAKEIKVPSF 230 Query: 410 ------TQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 E L + A + +F A+ + E +IF Sbjct: 231 FGYTTVKNTELDENMLKSIAETTGGEYFRASDSKEFKEIF 270 >gi|325106974|ref|YP_004268042.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324967242|gb|ADY58020.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 396 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 69/444 (15%), Positives = 126/444 (28%), Gaps = 92/444 (20%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G ++ A L+ VML + DV L + A L ++ + Sbjct: 16 RRGAMLVLIAALLSVMLILVVFTTDVAYMQLVRTQLHVSTDAAAKAGMEALARTESRGQA 75 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 R F K I ++ +++ TD E + V L Sbjct: 76 RVVAKDIFSKNLIGGRELKLHNKDIEFGRTDANPDGTWEFLPNERPFQAIRISVNLDDNR 135 Query: 138 DLLLNPL--SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 N LF + +G S+ + A + I +D S SM Sbjct: 136 QKGRNGSVPLLFGKVLGQSSFATNHSSVAANLVHE--------IVLCLDRSHSM------ 181 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 DE Y Y Y P +P S Sbjct: 182 ----------------------------CFDETGVDYAYPPGTPSYPAGYITPPNPVGS- 212 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIK--KIDNVNDTVRMGA---TFFNDR----- 305 + ++ A+ + ++ +I V G+ ++ Sbjct: 213 --------------RWAKLQGAIQVFVDTLDDLQIVPDVGVVTWGSDITLSWSWYPFQGR 258 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V+ D ++ + I + MG T ++ + + + ++ Sbjct: 259 SFPAVMVDVPLGQNLNLVSPAIAAKL--GDIMMGGTNMSSGIDRSVSLLTANG------- 309 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + A+K I+L++DG+ + N A + I I TIAF + + Sbjct: 310 -THSLAQKTIILMSDGQWNAG-RNPLDAANDAADKNITIHTIAFL------NGDQSVMRQ 361 Query: 422 CA--SPNSFFEANSTHELNKIFRD 443 A + FF A L F++ Sbjct: 362 IAERTGGKFFNAPDGESLEDTFKE 385 >gi|300023811|ref|YP_003756422.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299525632|gb|ADJ24101.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 466 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 68/473 (14%), Positives = 148/473 (31%), Gaps = 74/473 (15%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +K + G I+ L+ V+ + G+ VD R+ A A++ + L + Sbjct: 30 EKFSRDTRGDVAILFGLMALVLFAMIGLAVDYGRFVNARSQTIAATDAAVLAGARALQTN 89 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + ++ + + ++ Q + L + ++ + + D A M +A V Sbjct: 90 GGDQAAALRVAQSYYAQATKNRL----------SLSNDTINFAIADNATAMVTTGNA--V 137 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM-L 190 + + L LR G + ++ + E + I ++D + SM Sbjct: 138 ITTPFMGLAGTGSLPILRKDG-------SDYSKAVLAVGGNAELNLEIAMMLDITGSMRG 190 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 D + + ++S+ V + P + + Sbjct: 191 QKLTDMKAAASDLLNIVVWTDQSKFTSKVAIVPFAYDVRLPAAAFKKATGTTSTNYPCVV 250 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 E + D++ + + + T ++ Sbjct: 251 ERTGTEKYTDAAPATGKYVMV------------------HNTSSTKKNKTTYSPTCDVAS 292 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS------NEDEVHRMKNN 364 S ++ + + GSTA + A+ + + + N Sbjct: 293 SAEVLPLTSDKSTLLAKVNGLSTAGSTAGHIGTAWAWYMLAPNWSSLWTSASSTPAAYNA 352 Query: 365 LEAKKYIVLLTDGENTQ-------------------------DNEEGIAICNKAKSQGIR 399 +K VL+TDGE + + ++ C K++GI Sbjct: 353 DNLRKIAVLMTDGEYNTQYTTNGVPDDSSSLTRCPNAANGVCSSAQAVSQCTAMKAKGIE 412 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFE 451 + T+ F ++ + A LS CA+ +S F+ + + L FRD I +I Sbjct: 413 VYTVGFQLDN---QTAIDTLSQCATDSSHFYNSTTGDALKAAFRD-IALKIST 461 >gi|323136144|ref|ZP_08071226.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] gi|322398218|gb|EFY00738.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] Length = 652 Score = 81.6 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 71/203 (34%), Gaps = 46/203 (22%) Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + + + L +++ G T I++ + A+ T+ Sbjct: 451 TYGPNASCPEPLTRLTNNLSTVTAAIDSMNYWLNGGTVISEGLMWAWRTLSPQKPYADGA 510 Query: 361 MKNNLEAKKYIVLLTDGEN----------------------------------------T 380 + + KK IVL+TDG N T Sbjct: 511 AYTDKKTKKVIVLMTDGVNGLADNGNAASANISDYSAYGYMGASRLSVADGVTTYAGLQT 570 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAF----SVNKTQQEKARYFLSNCAS-PNSFFEANSTH 435 ++ C+ AK++GI I T+ F ++ T+Q ++ LS CAS P F A + Sbjct: 571 FLDDRLKKACDNAKAKGISIYTVMFNHNGFLSATEQARSATLLSYCASKPEYAFLATDSA 630 Query: 436 ELNKIFRDRIGNEIFERVIRITK 458 LN F +I + +R+T+ Sbjct: 631 ALNSAFG-QIASSAAASPLRLTR 652 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 54/165 (32%), Gaps = 31/165 (18%) Query: 29 LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQ 88 ++PV G +D R + +L++AA + + ++ Sbjct: 8 MIPVTFM-AGAAIDYGRATLLRSSLQKAADA------------GALAAGARTSLTQLARE 54 Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFL 148 +I +N + N + + T E P +QV + + + + Sbjct: 55 QIA-------KNAVLANLGAKARN--LSLTITETEPSAGVFQVQIQASIA------TSIM 99 Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + + +EA V S + + I +D + SM D Sbjct: 100 KVARFDTIPVVVTSEARVVGGSTN---PIEIALALDNTGSMRDDM 141 >gi|218887819|ref|YP_002437140.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758773|gb|ACL09672.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 406 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 73/473 (15%), Positives = 155/473 (32%), Gaps = 110/473 (23%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 ++ A+L+PV+LG+ G+ +D + L+ A A + S E+ + Sbjct: 3 MLMAVLLPVVLGLAGLGIDSGMLYLAHNRLQGAVDAAALAGS-------LELPYDPQLDK 55 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 K + +Y+ N+ + K + P V + + + Sbjct: 56 GLVKGAVNQYMAANYPAAVLKG----------------VTPGTEERSVTVKAEATVD--- 96 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 ++F+ ++GI S ++ +A A + + + +VID + SM Sbjct: 97 -TIFMGALGIGSSTVRAQATAGYNN--------LEVVFVIDNTGSMKGTAIQQANAAATQ 147 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 + S + G V R + P V G L PS E + + Sbjct: 148 LAELIMPDGMETSVKVGLVPFRGKVHIPAGVDGLADG-CRNADGTLAPSWILEEYKQTKY 206 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ + + I V + + ++ I Sbjct: 207 RYPTGSSLNVPK-------GTCDSIPRV------------------QALTSNRTTIVSAI 241 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD- 382 K A+ + T I++ ++ + E + +N + +K +++LTDG+ Sbjct: 242 AKQDALGDAS--GTVISEGIKWGRHVLT--PEAPFTQGSSNKDMRKVMIVLTDGDTEDGK 297 Query: 383 -----------------------------------NEEGIAICNKAKSQGIRIMTIAFSV 407 N ++ AK +GI I I + Sbjct: 298 CGGNYALNYTPNAYWTNAYYGMFDMNTHCENGGKLNAAMLSEAQIAKDKGIEIFAIRYGD 357 Query: 408 NKTQQEKARYFLSNCAS-----PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + + + AS + ++ A S ++L +IF+ +IG ++ R++R Sbjct: 358 SDSTD---ISLMKAIASSKAGTDDHYYNAPSAYDLEEIFK-KIGRQLGWRLLR 406 >gi|330447847|ref|ZP_08311495.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492038|dbj|GAA05992.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 321 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 75/209 (35%), Gaps = 35/209 (16%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + I + V+ L+ I +K D R+G F D Sbjct: 88 AVDLSGSMSIPDMVTKNGQSIDRLTAVKHVLSDFIEK-RKGD------RLGLVLFADHAY 140 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++ + + + + +T STAI + + A N+ Sbjct: 141 LQTPLTFDRNTVEQQLDRTVL--GLIGQSTAIGEGLGIA-----------TKTFINSKAP 187 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEK 414 ++ I+LL+DG NT + + AK G++I T+ ++ Q+ Sbjct: 188 QRVIILLSDGANTSGVIDPLEAAKLAKESGVKIYTVGVGADQMVQKGFFGDRLVNPSQDL 247 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIF 441 L+ A + +F A + +L KI+ Sbjct: 248 DEKTLTEIAKMTGGEYFRARNPQQLEKIY 276 >gi|225377140|ref|ZP_03754361.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] gi|225211045|gb|EEG93399.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] Length = 1406 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 51/309 (16%), Positives = 99/309 (32%), Gaps = 40/309 (12%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT---- 211 + A +++ + V D S SM + Sbjct: 698 LDASSLATSQSTVEKIQTVDAM---MVFDLSGSMNEIMSGQNQLKDIGEFSRVKNQMDIN 754 Query: 212 ----VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD------PSLSEEHFVDS 261 Y D+ + + + ++Y +D S + DS Sbjct: 755 KVYYWNKYEKSGWWPWTYDKSVGMGTAAVSGNVYAKYPVKYIDGQWKKYVDGSYQSISDS 814 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-----NDRVISDPSFSWGV 316 + K ++DA + + I + G F ++ + G+ Sbjct: 815 DVMAVWTSKISALKDAASGFVTGISDTSPDSLV---GIATFYGIGNGWNSSTEGKLNHGL 871 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 K+ + + G T+ ++ AY + + + KKY++L +D Sbjct: 872 SKVNKNEMLKSVNALFADGGTSPQKGLEHAYSELQKAEDGN----------KKYVILFSD 921 Query: 377 GENTQDNEE--GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 GE + N++ A K K G ++T+ +N E A + AS F A++ Sbjct: 922 GEPSDSNDKMETEASAVKLKEAGYTVITVGLGLNN---ETATWLGEKVASAGCAFTADTA 978 Query: 435 HELNKIFRD 443 ELNKIF++ Sbjct: 979 EELNKIFQN 987 >gi|84515372|ref|ZP_01002734.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] gi|84510655|gb|EAQ07110.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] Length = 485 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 87/511 (17%), Positives = 154/511 (30%), Gaps = 110/511 (21%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 Y + G I+T LL+ ML +GGM VD +R+ L+ + A++ A+ Sbjct: 21 YLHAFGRDEDGSVIIMTILLLVTMLIMGGMAVDFMRYEARRATLQSVSDRAVLAAAS--- 77 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 N + +E+Y + F + V + + ++ Sbjct: 78 ----------LNQTLDSRDVVEDYFAK-------AGFPNALVGAPI------VVDNGNSR 114 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V + S L +N +LR G+ ++ A V I V+D S SM Sbjct: 115 TVTVRSA--LDVNTF--YLRLAGMDRLTAPARSSATEGVGK------VEISLVLDISGSM 164 Query: 190 LDYQRDSEGQP-----LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 R Q PA+ S + + + +L Sbjct: 165 RFSNRFVNMQAAAIAFAEEVLDPANGGTVSLTIIPYAGATNPGPEMFAFMGGVRYPDTLL 224 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKH----LVRDALASVIRSIKKIDNVNDTVRMGAT 300 +++F S + + A + + + D Sbjct: 225 AGDDGILGTEDDYFFPQVSSCVEMVGSDWSSAGLPGAGRAQVPHFQVWDIARSV-----M 279 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE--- 357 + S + + + ++F T + AM+ A T+ S++ Sbjct: 280 DWGWCPQDRSSIQYAMATPAQA--RSFINGLRMHDGTGTHYAMKYALATLDPSSQPAFMH 337 Query: 358 -VHRMKNNLEA--------------KKYIVLLTDGE------------------------ 378 H + + KK IVL+TDG+ Sbjct: 338 LSHPGRGLVPPQFANRPAAWDDPETKKIIVLMTDGDITQQERPRIAQQERDIDYIISRSI 397 Query: 379 NTQDNEEGI-----------AICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 N +DN AIC A ++ + + T+AF V + + NCAS Sbjct: 398 NGRDNRGQFVDAATNVGRFEAICTLANQPARSVDVYTVAFEVQPNSAADLQ--MRNCASD 455 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 S F S EL +F I I + + + Sbjct: 456 PSMFFRTSGAELIDVF-SGIAERITDLRLNL 485 >gi|320158179|ref|YP_004190557.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933491|gb|ADV88354.1| BatA [Vibrio vulnificus MO6-24/O] Length = 323 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 67/191 (35%), Gaps = 35/191 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V++ + I ++ D R+G F D + + + +T Sbjct: 109 YIDRLSAVKNVVTQFIEQ-RQGD------RLGLVLFADHAYLQTPLTADRQTVANQLNQT 161 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI D + A T + S ++ ++LL+DG NT + Sbjct: 162 IIGLIGQK--TAIGDGLALATKTFVDS-----------EAPQRVVILLSDGSNTAGTLDP 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ-------------QEKARYFLSNCA--SPNSFFEA 431 I N AK G++I TI + + + L+ A + +F A Sbjct: 209 IEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRA 268 Query: 432 NSTHELNKIFR 442 EL I++ Sbjct: 269 RDAQELQAIYQ 279 >gi|163731887|ref|ZP_02139334.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] gi|161395341|gb|EDQ19663.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] Length = 468 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 78/509 (15%), Positives = 158/509 (31%), Gaps = 107/509 (21%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G I +++ +ML V G+ VD+++ ++ AI+ AS Sbjct: 3 FKREEDGAMTIFATIMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAAS-------- 54 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 ++ + +++Y + + + D VR T P + V Sbjct: 55 DLDQPL-----PADEVVDDYFAK---AGMTEFLND------VRITPGSDLPTTNFRIVQA 100 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +R S+++ G+++ + AE + I V+D S SM + Sbjct: 101 EARTRTP----SIYMAMTGVRTLPVYVSGTAEETIEK------IEISLVLDISGSMRNNG 150 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + + A + + + + + +P + ++ D + Sbjct: 151 KIGNLRTAAKDFIGAVLEGNAAKTTSLNIVPYAGQTNPGRIVFERAGGLPFATFIEDSNG 210 Query: 254 SEEHF------VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 E + + +S+ D S +I + F N + Sbjct: 211 DEILYGQTIVDDEGNSIDVPYNTMSSCLDLTNSDFDNIDL--PSGGYDQT-PYFMNWPID 267 Query: 308 SDPSFSWG----VHKLIRTIVK------TFAIDENEMGSTAINDAMQTAYDTIISSNED- 356 + WG + IR F D T M+ + S+ + Sbjct: 268 APT-MDWGWCPQNNSSIRYAQNDAGRLQDFIDDMRLHDGTGTQYGMKYGVALLNPSSRNT 326 Query: 357 ---------------EVHRMKNNLEAKKYIVLLTDGENTQ-------------------- 381 + +K+IVL+TDG+ T Sbjct: 327 FLALNAAGLVPDGFKNRPADFGTTDTRKFIVLMTDGQITDQFRPEDKNDPKNDEIALNQR 386 Query: 382 -DNEEGIAI-----------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 + + + CNKAK++GI + TIAF + + CA+ +FF Sbjct: 387 TGDRDTYSTQSTNVTNFYSVCNKAKAEGITVYTIAFEAPADAVTQ----MRTCATSPAFF 442 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ F+ I +I E +R+T+ Sbjct: 443 YKVEGVQIKTAFKS-IARQINE--LRLTQ 468 >gi|90577284|ref|ZP_01233095.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] gi|90440370|gb|EAS65550.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] Length = 321 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 74/209 (35%), Gaps = 35/209 (16%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + I + V+ L+ I +K D R+G F D Sbjct: 88 AVDLSGSMSIPDMVTKNGQSIDRLTAVKHVLSDFIEK-RKGD------RLGLVLFADHAY 140 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++ + + + +T STAI + + A N+ Sbjct: 141 LQTPLTFDRKTVEQQLDRTVL--GLIGQSTAIGEGLGIA-----------TKTFINSKAP 187 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEK 414 ++ I+LL+DG NT + + AK G++I T+ ++ Q+ Sbjct: 188 QRVIILLSDGANTSGVIDPLEAAKLAKESGVKIYTVGVGADQMVQQGFFGDRIVNPSQDL 247 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIF 441 L+ A + +F A + +L KI+ Sbjct: 248 DEKTLTEIAKMTGGEYFRARNPQQLEKIY 276 >gi|89072369|ref|ZP_01158948.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] gi|89051901|gb|EAR57353.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] Length = 321 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 75/209 (35%), Gaps = 35/209 (16%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + + + V+ L+ I +K D R+G F D Sbjct: 88 AVDLSGSMSIPDMVTKNGQSVDRLTAVKHVLSDFIEK-RKGD------RLGLVLFADHAY 140 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++ + + + +T STAI + + A N+ Sbjct: 141 LQTPLTFDRKTVEKQLDRTVL--GLIGQSTAIGEGLGIA-----------TKTFINSKAP 187 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEK 414 ++ I+LL+DG NT + + AK G++I T+ ++ Q+ Sbjct: 188 QRVIILLSDGANTSGVIDPLEAAKLAKESGVKIYTVGVGADQMVQQGFFGDRIVNPSQDL 247 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIF 441 L++ A + +F A + +L KI+ Sbjct: 248 DEKTLTDIAKMTGGEYFRARNPQQLEKIY 276 >gi|254514588|ref|ZP_05126649.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] gi|219676831|gb|EED33196.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] Length = 347 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 66/193 (34%), Gaps = 39/193 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ V+ L S S + D R+G F R S+ + + R ++++ Sbjct: 117 RRIDAVKQ-LGSDFMSRRSGD------RLGLILFGSRAYLQSPLSFDIQTVQRFLLESQI 169 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ A + + ++LLTDG++T + + Sbjct: 170 --GFAGQETAIGDAIGLAVKRLQERPATS-----------RVLILLTDGQDTASTVDPLE 216 Query: 389 ICNKAKSQGIRIMTIAFSVNK-------TQQEKARYF----------LSNCASP--NSFF 429 N A G+RI TI + AR L AS +F Sbjct: 217 AANLAADLGVRIYTIGIGADSLTLPGLLGSPLGARTVNPSADLDENSLIAIASSTGGQYF 276 Query: 430 EANSTHELNKIFR 442 A EL ++R Sbjct: 277 RARDPEELATVYR 289 >gi|90021389|ref|YP_527216.1| BatB protein [Saccharophagus degradans 2-40] gi|89950989|gb|ABD81004.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 341 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 71/195 (36%), Gaps = 38/195 (19%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I + +V+ + I + D R+G F ++ + + +V+ Sbjct: 113 QQIPRIAVVKHIVGDFIER-RVGD------RLGLVLFGTSAYLQSPLTFDRTTVKQLLVE 165 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + +TAI DA+ + + + + ++LLTDG+NT Sbjct: 166 SQI--GFAGPNTAIGDAIGLSIKRLRDRPAEN-----------RVVILLTDGQNTAGEVS 212 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKT----------------QQEKARYFLSNCA--SPNS 427 + AK G+++ TI N+ ++ L+ A + Sbjct: 213 PRQAADLAKQSGVKVYTIGVGANEMIVSDGFFGNFQRKINPSRDLDEDTLTYIAETTGGR 272 Query: 428 FFEANSTHELNKIFR 442 +F A+S ELN+I++ Sbjct: 273 YFRAHSPQELNQIYQ 287 >gi|254281808|ref|ZP_04956776.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] gi|219678011|gb|EED34360.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] Length = 328 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 66/182 (36%), Gaps = 27/182 (14%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ A+AS S + D R+G F R ++ + R I + Sbjct: 115 VSRIEAVK-AIASDFTSQRVGD------RVGLILFGTRAYVQAPLTFDTATVTRFIRE-- 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A TAI DA+ A + + + ++LLTDG++T + + Sbjct: 166 AQLGFAGEDTAIGDALGLAIKRLRERPAES-----------RVLILLTDGQDTASTVDPM 214 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKA-----RYFLSNC--ASPNSFFEANSTHELNKI 440 A GI++ TI S + L+ A+ +F A + EL I Sbjct: 215 EATALAAESGIKVYTIGISRRIGARAGGSGEVDEALLNAIAEATGGEYFRARNPAELQSI 274 Query: 441 FR 442 + Sbjct: 275 YG 276 >gi|209809314|ref|YP_002264852.1| hypothetical protein VSAL_II0524 [Aliivibrio salmonicida LFI1238] gi|208010876|emb|CAQ81278.1| putative membrane protein [Aliivibrio salmonicida LFI1238] Length = 320 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 72/198 (36%), Gaps = 35/198 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + V+ ++ I +K D R+G F D ++ + + Sbjct: 99 DMKTDSGFVDRLTAVKRVVSDFIEK-RKGD------RLGLVLFGDHAYLQTPLTFDRNTV 151 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + +T + TAI + + A T I SN ++ I+LL+DG N Sbjct: 152 QEQLNRTVLGLVGQR--TAIGEGLGLATKTFIESN-----------APQRTIILLSDGAN 198 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-------------QEKARYFLSNCA--S 424 T + I AK +I T+ + Q ++ L+ A + Sbjct: 199 TAGVLDPIEAAQLAKDNNAKIYTVGIGAGEMQVRGFFGNQTVNTARDLDEDTLTKIATMT 258 Query: 425 PNSFFEANSTHELNKIFR 442 +F A + EL +I++ Sbjct: 259 GGQYFRARNADELAEIYQ 276 >gi|149911739|ref|ZP_01900346.1| von Willebrand factor type A domain protein [Moritella sp. PE36] gi|149805212|gb|EDM65230.1| von Willebrand factor type A domain protein [Moritella sp. PE36] Length = 330 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 70/191 (36%), Gaps = 35/191 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + LV+ +A I+ +K D R+G FF D ++ + + + + Sbjct: 109 VDRLSLVKTVVADFIQQ-RKGD------RVGLIFFADNAYLQAPLTFDLKTVSGYMQQ-- 159 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A+ TAI + + A + R +K ++LLTDG+N+ + + Sbjct: 160 AVLGLVGEQTAIGEGIGLA-----------LKRFDAADNPQKVLILLTDGQNSAGEVKPL 208 Query: 388 AICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEAN 432 A+ QG++I TI + ++ L A + +F A Sbjct: 209 DAAKFAQEQGVKIYTIGVGADAYYKRTLFGNQKVDPSRDLDEVTLKTIAAQTGGQYFRAR 268 Query: 433 STHELNKIFRD 443 L I+ + Sbjct: 269 DASSLAAIYAE 279 >gi|254496635|ref|ZP_05109500.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254354157|gb|EET12827.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 342 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 36/171 (21%), Positives = 57/171 (33%), Gaps = 34/171 (19%) Query: 293 DTVR------MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 VR +G F R ++ ++ I A +T+I DA+ A Sbjct: 126 QFVRDRLGDKIGLILFGSRAYLQTPLTYDRQTVLLRI--EDATVGLAGKTTSIGDAVGLA 183 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + I+LLTDG N E + AK +GI+I TI Sbjct: 184 VKRLDAVPQK-----------GRVIILLTDGANNSGILEPLKAAELAKDEGIKIYTIGLG 232 Query: 407 VNKTQQEKARYFLSNCA---------------SPNSFFEANSTHELNKIFR 442 + FL A + +F A T LN I++ Sbjct: 233 AATDPRALTNGFLMQAAAADLDEETLKEMSAMTGGRYFRATDTATLNSIYK 283 >gi|237737388|ref|ZP_04567869.1| BatA protein [Fusobacterium mortiferum ATCC 9817] gi|229421250|gb|EEO36297.1| BatA protein [Fusobacterium mortiferum ATCC 9817] Length = 319 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 40/186 (21%), Positives = 62/186 (33%), Gaps = 29/186 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + + A + I K N R+ F + ++ + + K Sbjct: 100 KPNRLETAKKLLEEFIDKRIND----RISLVVFGGDAYTKVPLTFDHNVVKDITSKLTTD 155 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D TAI + + + + K++ K I+L+TDGEN + Sbjct: 156 DITSNNRTAIGMGLGVSLNRL-----------KDSEAKSKVIILMTDGENNSGEMSPMGA 204 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNCASP--NSFFEANSTH 435 AK GI+I TI + Q E L N AS +F A S Sbjct: 205 SEIAKELGIKIYTIGIGAREIQIRVPFGHTTVKNTELDENLLKNIASTTGGEYFRAGSEK 264 Query: 436 ELNKIF 441 E +IF Sbjct: 265 EFQEIF 270 >gi|192360615|ref|YP_001982630.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686780|gb|ACE84458.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 318 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 39/175 (22%), Positives = 69/175 (39%), Gaps = 24/175 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + V+ + + + R+G F + ++ V + +++ Sbjct: 113 ITRLMAVKKVVGDFVAR-------RQSDRLGLVLFGTQAFLQAPLTFDVKTVQEMLIE-- 163 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A +TAI DA+ + + R++ AK+ I+LLTDGENT Sbjct: 164 AESGYAGEATAIGDAIALS-----------IKRLREQPNAKRVIILLTDGENTAGELGIA 212 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + A +I TIAFS +E + + A + FF A +T +L +I Sbjct: 213 TATDLAVKANTKIYTIAFS--PYDREVDSHSMQQIAEQTGGEFFRARNTRDLEEI 265 >gi|254292617|ref|YP_003058640.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] gi|254041148|gb|ACT57943.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] Length = 514 Score = 80.0 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 75/537 (13%), Positives = 160/537 (29%), Gaps = 121/537 (22%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+ + + + AL + V+L + G +D R + L+ A +A++ A+ + S Sbjct: 9 KQFLNATNAGVAPMFALFLTVILFIIGFTIDFRRMDSAKMHLQAATDSAVLAAARAYLTS 68 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 +V + + ++ +YL N ++ NF + +++ + ++ + + ++ Sbjct: 69 SVQVKETKRQEDS--QKIASDYLTANLLSSS-NNFENNQIQLVFKEDGEIVGNASTKIKL 125 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + F G ++ A A + I V+D S SM Sbjct: 126 I--------------FGGLFGKSDVVLPALAAATVGDSRK-----LEIVLVLDTSGSMSS 166 Query: 192 ------------------YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + + + P + TV + G Sbjct: 167 QNRMKQLRTASINFVNSVFDNAVYERTVQVGVVPWNATVNINMDRPGTWDASPGPAIHNS 226 Query: 234 VSCNKSLYYM--------LYPGPLDPSLSEEHFVDSSSL-------RHVIKKKHLVRDAL 278 N + LYP S K + Sbjct: 227 NYGNGTNQVTSFQDFTENLYPPGFSDFGSYSDSDIDDDFGSSGWLGCITATKDERKISSS 286 Query: 279 ASVIRSIKKIDNVNDT--VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + + + + R A ++ + + I+K N G+ Sbjct: 287 GN-VTPLTDVPPSKMKWPARKVA-GWDPNSDCPSPML-AMSQSRPQIIKKL-NQLNPSGN 342 Query: 337 TAINDAMQTAY---------DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 T + + Y + N D ++ +++K +++LTDGENT N EG Sbjct: 343 THADIGLMWGYRMFSQQANWNNFFGYNSDTKPDSFHSTKSRKIMIMLTDGENTATNSEGY 402 Query: 388 AI----------------------------------------------CNKAKSQGIRIM 401 + C +S+ + + Sbjct: 403 SYYGWCTYTNHYNKWGRYTGSTKDCEVPKGINKDEISNNDLNSLMLDACEVIRSKDVELF 462 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 TIA ++ A L CA +S +EL++ F++ + + +R++K Sbjct: 463 TIALDLHSYYDSTAIALLRECAGSDSHAYNIKGNELDETFQE-----LASKALRLSK 514 >gi|53802771|ref|YP_115472.1| batB protein [Methylococcus capsulatus str. Bath] gi|53756532|gb|AAU90823.1| putative batB protein [Methylococcus capsulatus str. Bath] Length = 328 Score = 80.0 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 72/187 (38%), Gaps = 35/187 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + V+ ++ I + D R+G F ++ ++ K + ++ AI Sbjct: 116 RLEAVKRVASAFIER-RSGD------RIGLILFGEQAYLQVPLTFD-RKTVEKLLDEAAI 167 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 TAI DA+ A + + D ++ ++LL+DG NT + + Sbjct: 168 GLAGDK-TAIGDAIGLAIKRLRDNPAD-----------QRVLILLSDGANTAGQVQPLQA 215 Query: 390 CNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEANST 434 A +G++I TI ++ ++ ++ A + +F A +T Sbjct: 216 AELAAREGLKIYTIGVGADEMIVRDFFGTRRVNPSEDLDEAAMTAIAEKTGGRYFRARNT 275 Query: 435 HELNKIF 441 EL++I+ Sbjct: 276 EELDRIY 282 >gi|126731914|ref|ZP_01747718.1| BatB protein, putative [Sagittula stellata E-37] gi|126707741|gb|EBA06803.1| BatB protein, putative [Sagittula stellata E-37] Length = 323 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 66/176 (37%), Gaps = 23/176 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +V++ I D R+G F+DR ++ + +R ++ Sbjct: 116 TRLSIVKETADDFISR-------RDGDRLGLVLFSDRAYLQAPLTFD-REAVRKLLDQAQ 167 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI DA+ + + ED + +VLLTDG N + Sbjct: 168 VGLTGQK-TAIGDAIAVSVKRLKDRPED-----------GRVLVLLTDGANNEGVMSPDK 215 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFR 442 + A GIRI TI +++ + L A+ ++F A L +I+R Sbjct: 216 AADLAAKLGIRIYTIGVGSARSR-DLDERTLRQIADATGGAYFRATDVQGLAQIYR 270 >gi|222147837|ref|YP_002548794.1| hypothetical protein Avi_1104 [Agrobacterium vitis S4] gi|221734825|gb|ACM35788.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 483 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 73/514 (14%), Positives = 152/514 (29%), Gaps = 93/514 (18%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 +F K I G+F I++A+L+ +L G VD + + ++ + I A+ Sbjct: 2 RFKDLLSKFIADDNGNFAIMSAILLMPLLLAVGAAVDYSSARDHRNDIQ-VTADSAILAA 60 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 S V S A ++ K+ + + + R + ++ Sbjct: 61 ASSYSSSSGVDSLAAGIDSYLDSKLTDQGSNDVDTAAVP----------KRLSGPTLSAD 110 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 +V+ + F++ G+K+ + K+ A + + V+D Sbjct: 111 GKEICIVVGEGV------PTSFMQLAGVKTVDVSAKSCAALPG-----NIDLEVSLVLDV 159 Query: 186 SRSMLDYQRDSEGQP-LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S SM++ R Q + F + + + + + Y Sbjct: 160 SSSMIEEGRFVPMQTAVKSFLTSFANDATVAKRSKIAIAPFSSRFNIGLTHKDWLKAYGG 219 Query: 245 YPGPLDPSLSEEHFVDSS--SLRHVIKKKHLVRDALASVI--------RSIKKIDNVNDT 294 + + S S I + ++ ++ DN Sbjct: 220 NDAVPSRWTDPKSYYKDSKYSFSQWIDNVTTLAYTSSNYYWIGCVEPRADVEMKDNGA-- 277 Query: 295 VRMGATFFND---RVISDPSFSWGVHKLIR-------------TIVKTFAIDENEMGSTA 338 +G +D + + + + +++ D GST Sbjct: 278 --IGTYGLSDAPPSTEAFVAQDYNTGSSTSFCPPPIVPLTSSFSTLQSAIADMTSEGSTR 335 Query: 339 INDAMQTAYDTIISSNEDE----VHRMKNNLEAKKYIVLLTDGENT-------------- 380 ++ M + T+ + + KK IV +TDGE Sbjct: 336 LDAGMLAGWYTLSPKWRSAWGGGTAPADYSEKVKKVIVFMTDGEMNVKFGSTDPAKSSTE 395 Query: 381 ----------------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 + C+ KS I I I++S Q L C+S Sbjct: 396 KLDWICDKNRTKSCNDTATNALLTTCDSIKSNNIEIYAISYSSEADVQN-----LQTCSS 450 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ + ST + ++ I I +R+T+ Sbjct: 451 GTKYYFSASTTNIKDVYT-AISKNIIGSTVRLTQ 483 >gi|323138519|ref|ZP_08073587.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] gi|322396153|gb|EFX98686.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] Length = 458 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 68/501 (13%), Positives = 146/501 (29%), Gaps = 104/501 (20%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++ + + G +I L + M + G VD + L A A + A Sbjct: 13 SRVRKRLRNFRANERGSIAMIFGLALIPMFMMMGAAVDYTQAVTVRSRLNHLADRAALAA 72 Query: 65 SVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 Q + + N F ++ + ++ D +R R +E+ Sbjct: 73 VKAAAQKESDCVANPAGNNVSNFQGCGQKDIIKAGVAAGVQYMNGDPLMRGADRKPTIEL 132 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + + ++ ++ D+ N R MG+++ + K + ++ ++ + Sbjct: 133 SSSEGSWSATVNYSADIPTN----IARLMGVQTIPVNGKVTSNIALGTHMY---LNFHLL 185 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D S SM + R + L+ +C+ Y Sbjct: 186 LDRSMSMGIGATSDDIS-------------------------RLQALTGCAFACHSEGYE 220 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDTVRMGAT 300 Y D + + + +RDA +++ K + N + ++MG Sbjct: 221 AQY-------------YDQPKAQGIRFRIDDLRDATGALVAQAKMVASANAREHIQMGVY 267 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 FN V + + + + ++ T DA+ I N + Sbjct: 268 AFNHHVSPLVEMTSDLTNVANAVKNLDLPTHDD--GTQAADAVTWLVANKIKGNGTGL-- 323 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEG---------------------------IAICNKA 393 + + + L+TDG + C+ Sbjct: 324 --TSAAPLEIVFLVTDGVEDGIYTGWNKMVGPTGLPLPWWPSWMTKAPTSAFPVTACDAL 381 Query: 394 KSQGIR---IMT--------------IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 KS+G + T I L CAS FF A+ + Sbjct: 382 KSKGAIVAVVYTTYVPFPGTVQYDRLIGPFAPNISPN-----LQGCASQGYFFTASEPGD 436 Query: 437 LNKIFRDRIGNEIFERVIRIT 457 + + + + E +++T Sbjct: 437 ITRGMQSLFNRALQELALKLT 457 >gi|146337718|ref|YP_001202766.1| hypothetical protein BRADO0587 [Bradyrhizobium sp. ORS278] gi|146190524|emb|CAL74523.1| conserved hypothetical protein; putative vWFA domain [Bradyrhizobium sp. ORS278] Length = 442 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 59/484 (12%), Positives = 140/484 (28%), Gaps = 103/484 (21%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ G+ + A++ ++ G VD R + L+ A A + A + Sbjct: 10 RFRRNAGGNVAVTFAIVCVPVITAVGCGVDYSRTNQMRAKLQAAVDAASVGAVSRTSPAF 69 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + R N T + + +K+ + Sbjct: 70 IAAGAMTTDGVI----AAGNDDARKIFNGNMSGTTGYTLDSL------TPEVKKTGSVLT 119 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + L F+ +G K+ +Q + A+ Y + ++D S SM Sbjct: 120 ATVSFSATVPTL--FMSIVGYKTMSLQGSSTAKASMPKY-----IDFYLLLDNSPSMGVA 172 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ + + Sbjct: 173 ATPADVTKMVSATSDKCAFACHDYNDAN-------------------------------- 200 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM-----GATFFNDRVI 307 ++ + + V + ++R A ++ + ++ ++ RM GA+ + Sbjct: 201 ----NYYNLAKTLGVTTRIDVLRSATQQLMDTAQQTQTYSNQFRMAIYDFGASSKTIGLR 256 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ-TAYDTIIS--SNEDEVHRMKNN 364 + + S + + G+ A + T Y T + +NE + Sbjct: 257 ALFALSSSLTSAKSAAGNIDLMGV--YGNNDSFTADKDTPYTTALPAINNEIATPGDGTS 314 Query: 365 LEAKKYIVLLTDGENTQDNEEGI---------------AICNKAKSQGIRI---MTIAFS 406 KY+ ++DG + N + A+C K++GI+I T Sbjct: 315 GSPLKYLFFVSDGVADESNAACLKPKASGNRCQSPINPALCTALKNRGIKIAVLYTTYLQ 374 Query: 407 VN------------------KTQQEKARYFLSNCASPNSFFEANS----THELNKIFRDR 444 + + + + CAS +FE + +N +F+ Sbjct: 375 LPTNSWYMSWIDPFNKGPFGPSPNSEIAQNMQACASDGFYFEVSPTQGIADAMNALFKKA 434 Query: 445 IGNE 448 + + Sbjct: 435 VADA 438 >gi|332716075|ref|YP_004443541.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] gi|325062760|gb|ADY66450.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] Length = 429 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 74/479 (15%), Positives = 146/479 (30%), Gaps = 103/479 (21%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K + +G+F I+TALLM + G G+ +D+ R + L+QAA +A + A + S Sbjct: 2 KNFWQEKSGNFGILTALLMVPLCGAAGVALDITRGMSVKADLQQAADSAALAAVADMSAS 61 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + + E F+ N + T+V+++ K V Sbjct: 62 VQAAKKMSGDGVIPVGN---EEARAFFDGNQR-------GDADYTITSVDVSVIKHGNVV 111 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 S + ++ L S A T + Y E ++D + SM Sbjct: 112 ESSVSFKASVSTTLSGLLGKDFVSV-------AGTATAKYETETFSDFYLLLDNTPSMGV 164 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 ++ L V+ D Sbjct: 165 GATPTDVATL--------------------------------VANTGDKCAFACHIVKDG 192 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + V + +V A AS++ + K ++ RM F +R Sbjct: 193 VADPNSYYFKAKKLGVTTRIDVVAKATASLMDTAKSTRKSSNQYRMAVYTFGERAEDTKL 252 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM---------- 361 + ++ + + G + M Y + + + R Sbjct: 253 LE------VVSLTSDLDAAKKKAGEINL---MSIPYQGYNNDQQTDFDRALIQIGDKVGS 303 Query: 362 ----KNNLEAKKYIVLLTD--GENTQDNEEGIAI-------------CNKAKSQGIRI-- 400 ++ K I ++D G++ + + + C K K +G RI Sbjct: 304 SGTGASSANPDKVIFFVSDGVGDSYKPSSCTKKLTGGRCQEPIDIKDCTKLKEKGFRIAV 363 Query: 401 -MTIAFSVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKI----FRDRI 445 T + Q + + +CASP +FE + + ++ F+ I Sbjct: 364 LYTTYLPLPTNDWYNSWIKPFQAEIGSRMQSCASPGLYFEVSPSQGISDAMTVLFKKAI 422 >gi|92117939|ref|YP_577668.1| hypothetical protein Nham_2418 [Nitrobacter hamburgensis X14] gi|91800833|gb|ABE63208.1| conserved hypothetical protein [Nitrobacter hamburgensis X14] Length = 483 Score = 78.9 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 74/502 (14%), Positives = 151/502 (30%), Gaps = 82/502 (16%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + + G+ I A+ + MLG G VD R + +++ A +A + + Sbjct: 9 RIRSSAVRFGQDLRGNIAPIFAIALLPMLGFVGAAVDYTRANAARSSMQAAMDSAALMVA 68 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + ++++ + + + N L N + T Sbjct: 69 KDANAASPQMTADQVTAA-----------AQKYFNALYHNTDAQGASVSAVYTPYNNGTP 117 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + VVLS ++ + F++ +G +T + A + + +D Sbjct: 118 AT---VVLSGSGNVQ----TDFMKVVGFPQISFKTNSTATWG------NTKLRVAMALDV 164 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + SM + + + + + I + + N + + Sbjct: 165 TGSMSSAGKLVQMKIAAKKLIDTLKASATAEGDVYISIIPFNVMVNVGANNNTASWLEWE 224 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKK---KHLVRDALASVIRSIKK-IDNVNDTVRMGATF 301 G D S S S K + I S K + + + G+ Sbjct: 225 DGSYDNSSSNYGSCSGSGKSKPNTKSSCIAAGKTWTPKNISSWKGCVTDRGPVSKPGSGD 284 Query: 302 FNDRVISDPS-------FSWGVHKLIRTIV--------------------KTFAIDENEM 334 ++ + + +I+ K + Sbjct: 285 YDTTKDEPVASTPYTLYLARNYSTCPSSILPMTSAYDSKESDSSTDDSTLKGKINNLVAN 344 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---------- 384 G+T AMQ A+ + + + I+LL+DG NTQD Sbjct: 345 GATNQAIAMQMAWMMLQPTAPFPAPAKDEKYKYTDAIILLSDGLNTQDRWYGNGSDWSSQ 404 Query: 385 -EGIA--ICNKAKSQGI---------RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 + +CN K+ I RI TI + + + L NCA+ FF + Sbjct: 405 VDTRQALLCNNIKNDPISKTDPTRRTRIYTIQVNTDGDPES---TVLKNCATDG-FFPTS 460 Query: 433 STHELNKIFRDRIGNEIFERVI 454 + + F +IG + + I Sbjct: 461 TASGIASAF-AQIGASLSQLRI 481 >gi|312961300|ref|ZP_07775805.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] gi|311284958|gb|EFQ63534.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] Length = 362 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 67/197 (34%), Gaps = 34/197 (17%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D + + LV+ L ++ ++ D R+G F + ++ Sbjct: 104 FPDMHWRDEDVSRLSLVKHLLGDFLQQ-REGD------RVGLILFGSQAYLQAPLTFD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +RT + I +TAI DA+ A + R++ + ++L+TDG Sbjct: 156 RTVRTWLDEARIGI-AGKNTAIGDAIGLALKRL---------RLRPAQ--SRVLILITDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNC--A 423 N + + A +G++I I + Q L A Sbjct: 204 ANNAGQIDPLTAARLAAEEGVKIYPIGIGADPEQTGSLGILGVNPSLDLDEPALKAIAAA 263 Query: 424 SPNSFFEANSTHELNKI 440 + +F A EL I Sbjct: 264 TGGQYFRARDGEELQAI 280 >gi|119476361|ref|ZP_01616712.1| batB protein, putative [marine gamma proteobacterium HTCC2143] gi|119450225|gb|EAW31460.1| batB protein, putative [marine gamma proteobacterium HTCC2143] Length = 354 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 66/194 (34%), Gaps = 38/194 (19%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ L I+ +K D R+G F + ++ + R + + Sbjct: 114 VPRIVAVKTVLNEFIQR-RKGD------RLGLILFGSQAYVQAPLTFDQTTVQRFM-REA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I +TAI DA+ + + D + ++LLTDG+N I Sbjct: 166 QIGFAGEENTAIGDAIGLSVKRLRDRPGD-----------RHVMILLTDGQNNGGKINPI 214 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEK--ARYFLSNCASP-----------------NSF 428 A + GI I TI ++ F S +P + Sbjct: 215 PASKIAANNGIIIYTIGVGADEMVMPGVLGSSFGSRRVNPSADLDEKTLQQVATATGGQY 274 Query: 429 FEANSTHELNKIFR 442 F A + EL KI+R Sbjct: 275 FRARNPQELEKIYR 288 >gi|37676036|ref|NP_936432.1| hypothetical protein VVA0376 [Vibrio vulnificus YJ016] gi|37200576|dbj|BAC96402.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 323 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 67/191 (35%), Gaps = 35/191 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V++ + I ++ D R+G F D + + + +T Sbjct: 109 YIDRLSSVKNVVTQFIEQ-RQGD------RLGLVLFADHAYLQTPLTADRQTVANQLNQT 161 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI D + A T + S ++ ++LL+DG NT + Sbjct: 162 IIGLIGQK--TAIGDGLALATKTFVDS-----------EAPQRVVILLSDGSNTAGTLDP 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ-------------QEKARYFLSNCA--SPNSFFEA 431 I N AK G++I TI + + + L+ A + +F A Sbjct: 209 IEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRA 268 Query: 432 NSTHELNKIFR 442 EL I++ Sbjct: 269 RDAQELQTIYQ 279 >gi|27367909|ref|NP_763436.1| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|27359482|gb|AAO08426.1| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 323 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 67/191 (35%), Gaps = 35/191 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V++ + I ++ D R+G F D + + + +T Sbjct: 109 YIDRLSAVKNVVTQFIEQ-RQGD------RLGLVLFADHAYLQTPLTADRQTVANQLNQT 161 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI D + A T + S ++ ++LL+DG NT + Sbjct: 162 IIGLIGQK--TAIGDGLALATKTFVDS-----------EAPQRVVILLSDGSNTAGTLDP 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ-------------QEKARYFLSNCA--SPNSFFEA 431 I N AK G++I TI + + + L+ A + +F A Sbjct: 209 IEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKVATMTGGQYFRA 268 Query: 432 NSTHELNKIFR 442 EL I++ Sbjct: 269 RDAQELQTIYQ 279 >gi|148258701|ref|YP_001243286.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] gi|146410874|gb|ABQ39380.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] Length = 511 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 80/492 (16%), Positives = 145/492 (29%), Gaps = 87/492 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS- 71 +L + II A+ + +L G VD + + L+ AA A + + Sbjct: 27 RLGHDERANISIIFAMALLPILSAIGCAVDYTQATRLRSKLQSAADAASVASISQQSLGY 86 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + ++ + N N+L +T+ + V T V++ Sbjct: 87 NAALQMTSDGTVQVAVEEATKLFNGNAANSL--GYTNLSLNAQVMKTGVKLAA------- 137 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML- 190 ++ D+ + F+ +G + + +++ + Y + ++D S SM Sbjct: 138 TVAFSADVP----TTFMTVVGYRKLTVTGTSKSTSSLPPY-----LDFYLMLDVSGSMGL 188 Query: 191 ----------------DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 +Y++ G C A + N +S Sbjct: 189 PSTDAEQTRLAAINPDNYKQYPNGCTFACHFTSASSCPAANQKYNTNGSCMGYPMSRVSY 248 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDN 290 S K L LS V S L DA+ + ++ + Sbjct: 249 SGVKKLLTNNGGKLPSSLLSSLTAVTSCPTDGSDACIQLRADAVGAAVQQLLVTANATQK 308 Query: 291 VNDTVRMGATFFNDRVISDPSFS---WGVHKLIRTIV----------KTFAIDENEMGST 337 + R+G F + + + G TI T A G T Sbjct: 309 TPNQFRIGLYPFVRYLYAYSPLTASINGSPTTPGTINHAAANLASQLDTGANASLGSGGT 368 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-----QDNEEGIAI--- 389 +A T I S + + Y+ L+TDG + G Sbjct: 369 HFENAFPTMNGIITSVGDGSASNKT-----QPYVFLITDGAQNPQVYWNGSWSGSNSATT 423 Query: 390 -----CNKAKSQGIRIMTIAF----SVNKTQQEKARYF------------LSNCASPNSF 428 C KS+GI + + N T + F L CASP F Sbjct: 424 MDTSKCTTLKSRGIIVSVLYIPYQPIQNPTSFANSEDFYANANIPKIPPSLQACASPGYF 483 Query: 429 FEANSTHELNKI 440 + ANS ++ Sbjct: 484 YTANSPADITAA 495 >gi|86747937|ref|YP_484433.1| hypothetical protein RPB_0811 [Rhodopseudomonas palustris HaA2] gi|86570965|gb|ABD05522.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 435 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 67/488 (13%), Positives = 145/488 (29%), Gaps = 122/488 (25%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + +G+ +I A+ + +LG G +D + L+ A A++ A Sbjct: 15 RRFGRDRSGNIAVIFAIALLPILGFIGAAIDYATANRIRTKLQSAQDAAVLLAVS----- 69 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + + F + ++ E + ++ Sbjct: 70 --------NSEINRTTAQAKADAEQFFN----ATIGAYGLTATIKIEVTENDGKR----- 112 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 S+ D + FL +G + I ++ + Y ++D S SM Sbjct: 113 --SATADFTSTVTTNFLNLIGYPTLAIGNRSTSTVSRPIYQD-----FYLLLDNSPSMGV 165 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 ++ + VG +K + + S Sbjct: 166 AATTADIATM--------------------VGNTSDKCAFACHDLSDSN----------- 194 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-----NDRV 306 ++ + + V + +VR A+ + + + VN+ RM + + Sbjct: 195 -----NYYNLAKKLGVKMRIDVVRQAVQQLTSTATLMTAVNNQFRMAVYTLGGSCASLGL 249 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDE-----NEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + S S + + + N T N A+ TI SS Sbjct: 250 TTIASLSSAMSSVQTAAGAIDLMSIPKQNYNNDQCTDFNSALAAMNTTIPSSGTGTA--- 306 Query: 362 KNNLEAKKYIVLLTDGENTQDNEE----------------GIAICNKAKSQGIRI---MT 402 + +K++ ++DG +N + C K +GI+I T Sbjct: 307 ---AQPQKWLFFVSDGVADFNNPSGCTQPTVSGGRCQEPLTVTQCKAMKDRGIQIAVLYT 363 Query: 403 --------------IA-FSVNKTQQEKARYF---LSNCASPNSFFEANST----HELNKI 440 IA F+ + +CASP+ +FE + T ++ + Sbjct: 364 TYLALPTNQWYNDHIAPFNAGPYGPSVNSQIAAKMKSCASPDFYFEVSPTQGISEAMDAL 423 Query: 441 FRDRIGNE 448 F+ + Sbjct: 424 FKKAVAKA 431 >gi|167626845|ref|YP_001677345.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596846|gb|ABZ86844.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 333 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 62/195 (31%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ + + + V + R+G F ++ + + + + Sbjct: 106 IQDMQKSNGKMESRFDLVMRVANEFLDTRQGDRVGLILFGTWAYLQTPLTFDIPTVKKML 165 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K +VLLTDGEN Sbjct: 166 D--DASIALPGPQTAIGDAIGLAVKKLKRYPGDS-----------KALVLLTDGENNSGA 212 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + F Sbjct: 213 LQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIATMTGGKF 272 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L +++ Sbjct: 273 FRAQNSADLKQVYES 287 >gi|254875972|ref|ZP_05248682.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841993|gb|EET20407.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 339 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 62/195 (31%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ + + + V + R+G F ++ + + + + Sbjct: 112 IQDMQKSNGKMESRFDLVMRVANEFLDTRQGDRVGLILFGTWAYLQTPLTFDIPTVKKML 171 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K +VLLTDGEN Sbjct: 172 D--DASIALPGPQTAIGDAIGLAVKKLKRYPGDS-----------KALVLLTDGENNSGA 218 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + F Sbjct: 219 LQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIATMTGGKF 278 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L +++ Sbjct: 279 FRAQNSTDLKQVYES 293 >gi|241667423|ref|ZP_04755001.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 333 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 62/195 (31%), Gaps = 35/195 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ + + + V + R+G F ++ + + + + Sbjct: 106 IQDMQKSNGKMESRFDLVMRVANEFLDTRQGDRVGLILFGTWAYLQTPLTFDIPTVKKML 165 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A TAI DA+ A + D K +VLLTDGEN Sbjct: 166 D--DASIALPGPQTAIGDAIGLAVKKLKRYPGDS-----------KALVLLTDGENNSGA 212 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSF 428 + + AK I+I TI + ++ L A + F Sbjct: 213 LQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIATMTGGKF 272 Query: 429 FEANSTHELNKIFRD 443 F A ++ +L +++ Sbjct: 273 FRAQNSTDLKQVYES 287 >gi|327542237|gb|EGF28726.1| BatA aerotolerance operon protein [Rhodopirellula baltica WH47] Length = 345 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 62/166 (37%), Gaps = 25/166 (15%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F ++ + ++ + +T + + TAI DA+ + + + + + + Sbjct: 127 GLITFAAYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALSVEKLNALDARQ 186 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VNKTQ 411 +++ K ++LLTDGENT + I A++ GI+I I V Sbjct: 187 ERKVQ-----SKILILLTDGENTAGELDPIQAAELAETLGIKIYAIGVGTKGKAPVPVRD 241 Query: 412 QEKARYFLSNC--------------ASPNSFFEANSTHELNKIFRD 443 R L + +F A T L+ I+R+ Sbjct: 242 PFTGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSLDAIYRE 287 >gi|149187170|ref|ZP_01865468.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] gi|148838706|gb|EDL55645.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] Length = 324 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 71/211 (33%), Gaps = 35/211 (16%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + I + V+ ++ + +K D R+G +F D ++ + Sbjct: 101 DMAYQDDYIDRLTAVKHVVSDFVDR-RKGD------RVGLVYFADHAYLQTPLTFDRETV 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + +T TAI D + A T + SN ++ ++LL+DG N Sbjct: 154 KTQLNQTVLKLI--GTQTAIGDGIGLATKTFVDSN-----------APQRVMILLSDGSN 200 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-------------QEKARYFLSNCA--S 424 + + AK G I TI + Q ++ L A + Sbjct: 201 NAGVLDPVQAAEIAKKYGTTIYTIGVGAGEMQVKDFFMTRTVNTAEDLDEKTLIKIANIT 260 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 +F A + EL I+ + ++ + Sbjct: 261 GGQYFRARNADELATIYDTINALQPVQQATQ 291 >gi|78357411|ref|YP_388860.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219816|gb|ABB39165.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 402 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 79/473 (16%), Positives = 154/473 (32%), Gaps = 116/473 (24%) Query: 27 ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFP 86 A+L+PV+LG+ G+ +D L+ A A + S+ L Sbjct: 2 AVLLPVILGIMGLGLDSGMLYLSHSRLQAAVDAAALAGSLQL----------------PY 45 Query: 87 KQKIEEYLIRNFEN-NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 +++ L+R + + NF V+ ++ P V +++ + + Sbjct: 46 DPAMDKGLVRAAVDEYMHANFPQAVVQSVL--------PGAEERSVTVNAEATVG----T 93 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 +F+ ++GI S ++ +A A + + + +VID S SM + Sbjct: 94 IFMGALGIGSSTVRAQASAGYNN--------LEVVFVIDNSGSMKGSPINETNAAATRLV 145 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + S + G V R + P V S + + ++ L Sbjct: 146 DLIMPEGMATSVKIGLVPFRGKVRIPADVDGLPS-----------GCRNADGSLNEDGLL 194 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 KK ND +R+ + + G+ TI + Sbjct: 195 DEYKKPEYRY--------------PYNDRLRVTPYSCSS-----IPLTQGLTADRATITQ 235 Query: 326 T--FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + T I++ ++ A + E + + +K I+LLTDG+ N Sbjct: 236 AIGRQDARGDSSGTVISEGLKWARHVLT--PEAPFTEGSSAKDMRKVIILLTDGDTEDGN 293 Query: 384 EEGIAI----------------------CNK--------------AKSQGIRIMTIAFSV 407 G C AK GI I I + Sbjct: 294 CGGNYSVYYRPNNYWTNAYYGMMDMDSHCEDGGVLNNAMLSEAALAKDAGIEIFAIRYG- 352 Query: 408 NKTQQEKARYFLSNCAS-----PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + R + AS + +F+A S ++++ +F+ IG ++ R++R Sbjct: 353 --SSDAVDRNLMRAVASSKEGTDDHYFDAPSPYDIDDVFK-LIGRQLGWRLLR 402 >gi|78484419|ref|YP_390344.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362705|gb|ABB40670.1| Type A von Willebrand factor-like [Thiomicrospira crunogena XCL-2] Length = 349 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 78/196 (39%), Gaps = 32/196 (16%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 D + + V+ + + I+ ++ D RMG F + ++ ++ Sbjct: 118 TDMPLRGVEVDRLTAVKSVVKNFIQK-RQGD------RMGLVVFGSQAFLQSPLTYDLN- 169 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + T++ I +TAI DA+ A + ++E + ++LLTDG Sbjct: 170 TVETLLNETEIGMAGN-NTAIGDAIGIALKHLHQNSEKKA-----------VLILLTDGS 217 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------LSNCA--SPN 426 NT + + +A+ G++I TI N+ A F L A + Sbjct: 218 NTAGAVQPLDAAKQAQEMGLKIYTIGIGQNQATGLDAFIFGPNRNMDTTTLQKIAELTQG 277 Query: 427 SFFEANSTHELNKIFR 442 FF A T++LN+I++ Sbjct: 278 RFFMAKDTNQLNEIYQ 293 >gi|90422080|ref|YP_530450.1| hypothetical protein RPC_0556 [Rhodopseudomonas palustris BisB18] gi|90104094|gb|ABD86131.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 453 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 73/494 (14%), Positives = 132/494 (26%), Gaps = 109/494 (22%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 G+ I+ A + +L G +D R + ++ AA A SV + Sbjct: 9 FDSFHHDRRGNIAILFAFSLIPLLVAIGCAIDYARATQIRSKMQSAADAA----SVGSVS 64 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 A + T + N N + D AV + Sbjct: 65 KASPAFLAAGSMTTDGPIAVGSTDATNIFNGNMAS--QSGYTLSKLDAAVTKSGATLTST 122 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V S+ + FL +G + I T + + ++D S SM Sbjct: 123 VTFSASVA------TTFLTIIGKTALAI-----GGTSVSTSSMPVYIDFYLLLDNSPSMG 171 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 ++ + +S D K+L Sbjct: 172 VGATPTDVATMV-----------DNTSDKCAFACHDVNDEHNYYELAKTL---------- 210 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 V + ++RDA ++ + + RM F S Sbjct: 211 ---------------GVKTRIDVLRDATQQLMDTAAATATYPNQFRMAIYDFGASAQSAA 255 Query: 311 --SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS--NEDEVHRMKNNLE 366 L AID + ND T+Y ++ + + + Sbjct: 256 LRRLFALSSSLSSAKTAAGAIDLMTVKGQNDNDDRDTSYSKLLPAIDKQITAAGAGTSDA 315 Query: 367 AKKYIVLLTDG---ENTQDNEEGI------------------AICNKAKSQGIRI---MT 402 +KY++ ++DG E + + A+C +G+++ T Sbjct: 316 PQKYLLFVSDGVADETNAGCAKTMKNAFWGNKSPRCQSPIDPALCKAMTDRGVKVAVLYT 375 Query: 403 --------------------IA-FSVNKTQQEKARYF---LSNCASPNSFFEANS----T 434 IA F+V + CASP +FE + Sbjct: 376 TYLALPLKQANGDPSWYASWIAPFNVGPYGPSPNSEIANNMKACASPGFYFEVSPTDGIA 435 Query: 435 HELNKIFRDRIGNE 448 +N IFR + + Sbjct: 436 DAMNAIFRKAVADA 449 >gi|32475535|ref|NP_868529.1| BatA [Rhodopirellula baltica SH 1] gi|32446077|emb|CAD75906.1| BatA [Rhodopirellula baltica SH 1] Length = 357 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 62/166 (37%), Gaps = 25/166 (15%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F ++ + ++ + +T + + TAI DA+ + + + + + + Sbjct: 139 GLITFAAYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALSVEKLNALDARQ 198 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VNKTQ 411 +++ K ++LLTDGENT + + A++ GI+I I V Sbjct: 199 ERKVQ-----SKILILLTDGENTAGELDPVQAAELAETLGIKIYAIGVGTTGKAPVPVRD 253 Query: 412 QEKARYFLSNC--------------ASPNSFFEANSTHELNKIFRD 443 R L + +F A T L+ I+R+ Sbjct: 254 PFTGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSLDAIYRE 299 >gi|88704964|ref|ZP_01102676.1| conserved hypothetical protein [Congregibacter litoralis KT71] gi|88700659|gb|EAQ97766.1| conserved hypothetical protein [Congregibacter litoralis KT71] Length = 344 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 64/193 (33%), Gaps = 39/193 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ V+ L S S + D R+G F R S+ + + R +++ A Sbjct: 109 RRIDAVKQ-LGSDFMSRRSGD------RLGLILFGSRAYLQSPLSFDIQTVQRFLLE--A 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ A + + +VLLTDG++T + + Sbjct: 160 QIGFAGQETAIGDAIGLAVKRLQERPASS-----------RVLVLLTDGQDTASTVDPLE 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK--ARYFLSNCASP-----------------NSFF 429 N A G+RI TI + + +P +F Sbjct: 209 AANLAADLGVRIYTIGIGADSLTLPGLLGSPLGARTVNPSADLDESTLIEIARSTGGQYF 268 Query: 430 EANSTHELNKIFR 442 A EL ++R Sbjct: 269 RARDPEELATVYR 281 >gi|114778216|ref|ZP_01453088.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] gi|114551463|gb|EAU54018.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] Length = 355 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 61/161 (37%), Gaps = 22/161 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F ++ K + T++ A+ +TAI DA+ A + SN Sbjct: 152 RVGLILFGSNAYVQTPLTFD-RKTVITLLDEAAVGL-AGKATAIGDAIGLAVKRLEQSNR 209 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 D+ K ++ ++LLTDG NT A G+ I TI + Sbjct: 210 DKRIASK-----EQVLILLTDGVNTAGQLSAPQAAELAAEHGLTIYTIGIGADAMTVQSF 264 Query: 411 --------QQEKARYFLSNCASP--NSFFEANSTHELNKIF 441 + L++ A+ +F A+ T EL KI+ Sbjct: 265 FGTQRVNPSADLDEKMLTDIATKTGGRYFRAHDTQELQKIY 305 >gi|331090683|ref|ZP_08339532.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] gi|330400097|gb|EGG79748.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] Length = 3699 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 61/415 (14%), Positives = 110/415 (26%), Gaps = 50/415 (12%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN--NLKKNFTDREVRDIVRDT 118 I+ V L+ S S K + + + + +T Sbjct: 19 AISMVVGLVPVNSITSYAKDKSAGNVKSQPRAAFDEESVSDESSLETWTKVVENSTENIG 78 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + ++ S + L + G +L+ A + T + S + + Sbjct: 79 RIWVDKTVSDKNIKLPASSQ-----GKEIEIDKGTSDFLVGLSALSSTSNISTVADKPLD 133 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I +V+D S SM D N R + V I + Sbjct: 134 IVFVLDTSGSMSDPMEYIYSPTYNVVTDG--RVEYYAEVEGNYVKIDRITGLFGFFKHWE 191 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDNVNDT 294 + P + + F + K ++ A+ + I + Sbjct: 192 VAGKEVTPKKNESDTNGIQFYTRREKPNSQSKMGALKIAVNQFAQETAKRNDSITDAAKQ 251 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 RM F+ S + +T N G+T N M+ A +++ + Sbjct: 252 HRMSIVTFSSESYIRQSLKAYNSNTVSEFERTI-NGLNANGATYANLGMEKAKESLKNVR 310 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGE------NTQDNEEGIAICNKAKSQGIRIMTI----- 403 E +K ++ TDG + I K +I +I Sbjct: 311 EKA----------QKVVIFFTDGTPGRSGFDDDTANNTIQAAKSLKDDLTKIYSIGVFDQ 360 Query: 404 ---------------AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 S N K + N + A ELNKIF + Sbjct: 361 ANPDNTSSSFNAYMHGVSSNYPNATKWTELGERAENSNYYKAAQDADELNKIFEE 415 >gi|303240108|ref|ZP_07326629.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302592377|gb|EFL62104.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 323 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 63/193 (32%), Gaps = 38/193 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + R + + R+ F + + + + ++ Sbjct: 103 RLEVARKTIQDFVDQRPSD-------RIALIAFAGTAYTRVPLTLDHNVVRESLQDISFK 155 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 NE G TAI A+ + + K + K ++LLTDG+N + + Sbjct: 156 SVNEEG-TAIGMAISVGLNRL-----------KKSTSPSKIMILLTDGDNNAGSIDPNTA 203 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKAR-----------------YFLSNCA--SPNSFFE 430 AK GI+I TI +KT L A + ++ Sbjct: 204 STLAKDSGIKIYTIGVGSDKTIIPGTNEFGQTVYQEYESGLLNEDLLKKIAETTNGQYYR 263 Query: 431 ANSTHELNKIFRD 443 A ++ L+++F + Sbjct: 264 AKDSNALSQVFAN 276 >gi|90414549|ref|ZP_01222523.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] gi|90324356|gb|EAS40922.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] Length = 321 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 71/210 (33%), Gaps = 35/210 (16%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + I + V+D LA I +K D R+G F Sbjct: 88 AVDLSGSMSIEDMITQSGESIDRLAAVKDVLAEFIEQ-RKGD------RLGLVLFAQHAY 140 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++ + + + + +T STAI + + A N+ Sbjct: 141 LQTPLTFDRNTVKQQLERTVL--GLIGQSTAIGEGLGIA-----------TKTFINSEAP 187 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEK 414 ++ I+LL+DG NT E + A + I T+ + Q+ Sbjct: 188 QRVIILLSDGANTAGVIEPLEAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNPSQDL 247 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIFR 442 L+ A + +F A + EL I++ Sbjct: 248 DERMLTKIADMTGGQYFRARNPQELEHIYQ 277 >gi|94499146|ref|ZP_01305684.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] gi|94428778|gb|EAT13750.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] Length = 340 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 70/189 (37%), Gaps = 35/189 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ + + +K D R+G F ++ ++ + + R + + Sbjct: 112 VNRLQTVKAVVTDFVEE-RKGD------RLGLILFGEQAYIQTPLTFDLSTVKRLLDE-- 162 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A+ TAI DA+ + E + ++LLTDG+NT E + Sbjct: 163 AVVGLAGNKTAIGDAIGLGVKRLQDLPESN-----------RVLILLTDGQNTAGEIEPL 211 Query: 388 AICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEAN 432 A+ G++I I ++ ++ L+ A + ++ A Sbjct: 212 KAAELAEKAGVKIYAIGIGADEMVIQGFFGPRRVNPSRDLDEDTLTAIAENTGGQYYRAR 271 Query: 433 STHELNKIF 441 + +EL +I+ Sbjct: 272 NVNELEQIY 280 >gi|90424817|ref|YP_533187.1| hypothetical protein RPC_3326 [Rhodopseudomonas palustris BisB18] gi|90106831|gb|ABD88868.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 479 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 60/476 (12%), Positives = 134/476 (28%), Gaps = 42/476 (8%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +K + G+ ++ + + ++ G+ VD R + A++ AA +A + S Sbjct: 10 LRRTAKAFHAADDGNIAVLFGIAVIPLISFVGVAVDYSRATAARSAMQGAADSATLMVSK 69 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF-ENNLKKNFTDREVRDIVRDTAVEMNPR 125 + S + + K I N + +V +T+ M Sbjct: 70 DYAAGVIRASDIQATAEKYFKALYTSPGINNVTVTATYTARSANGSSTVVMNTSGSMPTS 129 Query: 126 --------KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + +S + +++ L G W + A + + Sbjct: 130 FLKVAGFTALPFTASSTSTWGATRLRVAMALDVTGSMDWDDKLTAMKTAAIKLVNTLKAT 189 Query: 178 S-----IQWVIDFSRSMLDYQRDSEG------QPLNCFGQPADRTVKSYSSQNGKVGIRD 226 + + I M++ ++ + T S + Sbjct: 190 ASTDADVYISIIPFNVMVNVGTANKDAEWLDWDTDYGSCKSNRTTQNSCQAAGETWSWWA 249 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV--IKKKHLVRDALASVIRS 284 + + + P S + D ++ IK + Sbjct: 250 NSCTSRYTRKSTCVAGGETWIPSGVSNWKGCVTDRTTSNDYDVIKTPPTTATPATLFLAK 309 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ A N+ + +K + G+T + Sbjct: 310 SYSACPLSLLPMKAAYSSNES----------DTSTAESTLKGKINKLDAEGNTNQPIGLF 359 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG---------IAICNKAKS 395 A+ ++ + + I+LL+DG+NTQ +C+ K Sbjct: 360 WAWMSLQTGVPLNTPAKDTEYKYTDAIILLSDGDNTQSGNSNSVSAIDARQKKLCDNIKD 419 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 TI T + L CAS FF++ + ++ F+ IG+ + + Sbjct: 420 PLNGTTTIFTIQVNTDGDDESAVLKYCASDGQFFQSTTADQIEIAFQS-IGSSLTK 474 >gi|95928343|ref|ZP_01311091.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95135614|gb|EAT17265.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 329 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 66/191 (34%), Gaps = 35/191 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + ++ + I ++ D R+G F ++ ++ H + ++ Sbjct: 114 VDRLTALKAVAGAFIDQ-RQGD------RIGLILFGEQPYIQAPLTFD-HNTVTRLLHEA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + TAI DA+ A + + K ++LLTDG + + + + Sbjct: 166 VVGLAGNK-TAIGDAIGLAVKRLRKDPQA-----------KNVLILLTDGASNSGSLDPL 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ-------------EKARYFLSNCA--SPNSFFEAN 432 A +G+++ TI + + L A + +F A Sbjct: 214 KAAKLAAQRGLKVYTIGIGAEAVEVGSFFFKRTVNPSLDLDEKTLKAIAETTGGRYFRAR 273 Query: 433 STHELNKIFRD 443 T EL +I++ Sbjct: 274 DTEELAQIYQQ 284 >gi|119504633|ref|ZP_01626712.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] gi|119459655|gb|EAW40751.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] Length = 332 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 68/182 (37%), Gaps = 27/182 (14%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ A+A+ S + D R+G F R ++ V + + I Sbjct: 114 VSRITAVK-AIAADFASRRTGD------RVGLILFGTRAYVQAPLTFDVKTVKQFI--EE 164 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A TAI DA+ A + D + ++LLTDG++T + + Sbjct: 165 AQLGFAGEDTAIGDALGLAVKRLRERPADS-----------RVLILLTDGQDTASTVDPM 213 Query: 388 AICNKAKSQGIRIMTIAFS-----VNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKI 440 A ++I TI S + + E L+ A+ +F A + EL I Sbjct: 214 EAAALASEMNVKIYTIGISRRLGTSSNSSGEVDEALLTAIAQATGGRYFRARTPKELQDI 273 Query: 441 FR 442 ++ Sbjct: 274 YQ 275 >gi|83647467|ref|YP_435902.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83635510|gb|ABC31477.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 345 Score = 76.9 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 66/200 (33%), Gaps = 35/200 (17%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 D + + +V+ + I +++ D R+G F + ++ + Sbjct: 109 TDLQLKGNQATRLDVVKSVVTDFI-QVRQGD------RLGLILFGAQPYIQAPLTYDLVT 161 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + + A +TAI DA+ + D + +VLLTDG Sbjct: 162 VGELLNE--ATLGIAGNATAIGDAIGLGIKRLRERPADS-----------RVLVLLTDGA 208 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNK-------------TQQEKARYFLSNCA-- 423 NT A GI+I T+ ++ + L + A Sbjct: 209 NTGGEVSPEQAAKLAADAGIKIYTVGVGADEIIRRGIFGYRKENPSADLDETLLQSIADE 268 Query: 424 SPNSFFEANSTHELNKIFRD 443 + +F A +T EL I+ Sbjct: 269 TDGQYFRARNTGELELIYES 288 >gi|284040938|ref|YP_003390868.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283820231|gb|ADB42069.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 359 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 38/157 (24%), Positives = 60/157 (38%), Gaps = 12/157 (7%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS--S 353 R+G F S + + L + + TAI DA+ + + + Sbjct: 153 RIGLVIFAGEAFSLCPLTTDYNLLNQYL-NDLNDGMIRTSGTAIGDALARCINRMRDRPA 211 Query: 354 NEDEVHRMKNNL---EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + K E K I+LL+DG+NT N + I + AK+ I+I TIA Sbjct: 212 ASSDTTQAKTEQWKSERSKVIILLSDGDNTAGNLDPITAASLAKAFNIKIYTIAVGQPVA 271 Query: 411 QQEKA----RYFLSNCAS--PNSFFEANSTHELNKIF 441 +A L A+ SFF A + L +F Sbjct: 272 SASEASTVDEGILKKIATIGKGSFFRAVDSGRLKTVF 308 >gi|254481548|ref|ZP_05094792.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038176|gb|EEB78839.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 345 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 65/194 (33%), Gaps = 39/194 (20%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ + I ++ D R+G F S+ + R +++ Sbjct: 115 VSRIRAVKQVGSRFIEQ-REGD------RLGLILFGSNAYVQSPLSFDTATVKRFLLE-- 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A TAI DA+ A + + + ++LL+DG++T + + + Sbjct: 166 AQIGFAGQDTAIGDAIGLAVKRLKERPAEN-----------RVLILLSDGKDTASSVQPL 214 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEK--ARYF---------------LSNCA--SPNSF 428 A GIRI TI + F L A + + Sbjct: 215 NAAKLAADLGIRIYTIGIGADSLTMPGLFGSSFGARQVNPSAELDEAGLQQIAKITDGKY 274 Query: 429 FEANSTHELNKIFR 442 F A + EL I++ Sbjct: 275 FRARNPEELANIYQ 288 >gi|90418244|ref|ZP_01226156.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337916|gb|EAS51567.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 489 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 85/514 (16%), Positives = 151/514 (29%), Gaps = 120/514 (23%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +++ G ++TAL++ M+ + G +D++ L+ ++ A+ Sbjct: 15 FLRAKAGSIPVMTALMLVPMIVISGGAIDLIAHERLRSVLQDGLDRGVLAAAS------- 67 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 P++ IE +L D + ++ VE + Sbjct: 68 ------LTQTRPPRETIESFLKAAVTKGSYA--LDVKADELSNAKRVEASATAVT----- 114 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + FLR +GI ++ AEAE K + I ++D S SM + Sbjct: 115 ----------DTAFLRLIGIDKLTVEAHAEAE------EKRKNIEISLLLDMSGSMRFDK 158 Query: 194 RDSEGQPLNCF-GQPADRTVKSYSSQNGKVGIRDEKLSPYM-VSCNKSLYYMLYPGPLDP 251 S P KS+ G D + + S+ +L+ Sbjct: 159 SGSYPGPSGAMRINYLRPAAKSFMDMVLADGAEDYTTVSIVPYAGQVSIGPVLFDALARN 218 Query: 252 SLSEEHFVDSSSLRHVIK-------------------KKHLVRDALASVIRSIKKIDNVN 292 + R+ ++ A + I + Sbjct: 219 RRQHDRSSCFQFGRNDFTLGVPDFANLPQTQHFTQANHHDALKKAGEAQITEPWWCPDDP 278 Query: 293 DTVRMGAT-------FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST-------- 337 R G T + S S L R I + T Sbjct: 279 HDPRPGTTPDFVAGEGKDTDRTSVSFLSNDREYLKRQIDNY---KLYDGTGTPIALKWGL 335 Query: 338 -----AINDAMQTAYDTIISSNEDEVHRMKNNLEAK-------KYIVLLTDGE------- 378 AI ++ A S E ++ +N A K++VL+TDG Sbjct: 336 LLLDPAIQPMLREAARYRALSEELDIDARFSNRPASFTDPDTMKFLVLMTDGAISSQRIP 395 Query: 379 -----------NTQDNEE----------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 N N + A+C AK + + + TI F VN T ++ Sbjct: 396 KDASKPVQYYNNGSLNTDLYSVGDAERFAAALCTAAKQKNVIVFTIGFDVNDTAAKQ--- 452 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 +SNCAS F + ++ F+ I I + Sbjct: 453 -MSNCASGAERFYRVNALDIQDAFKS-IATAIQK 484 >gi|146307954|ref|YP_001188419.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145576155|gb|ABP85687.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 334 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 63/193 (32%), Gaps = 33/193 (17%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + D I + LV+ L I ++ D R+G F + ++ Sbjct: 104 YEDMHWDEQPISRLELVKRLLGDFIED-RRGD------RVGLILFGSQAYLQAPLTFD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +RT + I +TAI DA+ A + + +VL+TDG Sbjct: 156 HTVRTWLDEAMIGI-AGKNTAIGDAIGLAVKRLRQRPAQS-----------RVLVLITDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-----------LSNC--AS 424 N + + A +G+RI I + Q F L A+ Sbjct: 204 ANNGGEIDPMVAAQLAAEEGVRIYAIGIGADPRQSGVLGAFGFSALDLDETSLRAIAEAT 263 Query: 425 PNSFFEANSTHEL 437 +F A + EL Sbjct: 264 GGEYFRARNQAEL 276 >gi|54303502|ref|YP_133495.1| hypothetical protein PBPRB1845 [Photobacterium profundum SS9] gi|46916932|emb|CAG23695.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 321 Score = 76.2 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 66/190 (34%), Gaps = 35/190 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + V+D LA I +K D R+G F ++ + + + + +T Sbjct: 108 IDRLAAVKDVLAEFIEQ-RKGD------RLGLVLFAQHAYLQTPLTFDRNTVKQQLERTV 160 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 STAI + + A N+ ++ I+LL+DG NT E + Sbjct: 161 L--GLIGQSTAIGEGLGIA-----------TKTFINSEAPQRVIILLSDGANTAGVIEPL 207 Query: 388 AICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEAN 432 A + I T+ + Q+ L+ A + +F A Sbjct: 208 EAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNPSQDLDERMLTKIADMTGGQYFRAR 267 Query: 433 STHELNKIFR 442 + EL I++ Sbjct: 268 NPQELEHIYQ 277 >gi|170750695|ref|YP_001756955.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] gi|170657217|gb|ACB26272.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] Length = 345 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 55/155 (35%), Gaps = 12/155 (7%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI--ISS 353 R+G F D+ + S+ + R + + A ST I D + A + + Sbjct: 148 RIGLVIFADQAYVAAAPSFDTAAVARALDE--ATIGISGRSTGIGDGLGLALRRLDPRDA 205 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K + K ++LL+DG N + A+ GI++ TIA Sbjct: 206 GGEAASGSKPGEKPAKAVILLSDGANNAGQTAPKDVAELARELGIKVYTIALGPRDMADA 265 Query: 414 KARY------FLSNC--ASPNSFFEANSTHELNKI 440 L + AS F +T +L ++ Sbjct: 266 DGEQDVVDTETLRDMARASGGEAFRVRTTEDLVRV 300 >gi|88858061|ref|ZP_01132703.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] gi|88819678|gb|EAR29491.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] Length = 328 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 72/196 (36%), Gaps = 32/196 (16%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L + I ++ D R+G F D + ++ + Sbjct: 103 DMAYQGRYVDRLSMVKAVLKNFIAQ-RQGD------RLGLILFGDTAFLQTPLTRDLNTV 155 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + +++ I +TAI DA+ A V R ++ + +VLLTDGEN Sbjct: 156 SK-MLEEAQIGLVGR-ATAIGDALGLA-----------VKRFSQKQDSNRILVLLTDGEN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN----------KTQQEKARYFLSNCASP--NS 427 T N A+ +GI++ T+ L A+ Sbjct: 203 TAGNLAPEEALLLAREEGIKVYTVGVGSQGGNRFNLFSMSGSSSLDESLLQKIATETGGL 262 Query: 428 FFEANSTHELNKIFRD 443 +F A L +I+++ Sbjct: 263 YFRATDVASLQQIYQE 278 >gi|326792960|ref|YP_004310781.1| von Willebrand factor A [Clostridium lentocellum DSM 5427] gi|326543724|gb|ADZ85583.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 903 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 69/401 (17%), Positives = 135/401 (33%), Gaps = 71/401 (17%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA--- 128 +E+ ++ P +Y + E N + + + +V T V+ + Sbjct: 114 VEDWLDQSGQVTNDPNWLTTDYKVEVDEINFMELTSQAHNKRVVFKTTVKADGSSKGGFS 173 Query: 129 -----------YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 VV+ ++ L + + L + + W I+ K E + V + Sbjct: 174 RYSSSNVAASSADVVVVTQDGLTVTKTAKELAAKNV--WEIEVKVEGKNVV----LQEAT 227 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + V+D S SM D A + + + + + C Sbjct: 228 DVVLVLDRSGSMGQGVVDKNN-------PNAQKCTVLTCTNSNRWHRHNA-------DCY 273 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 YY+L F+ +S + + V+DA + + ++++ ++VN +V Sbjct: 274 DEEYYILKCTQNHTHTLPGDFIANSC---YVSRADKVKDASYTFLDTLQEKEDVNISVVT 330 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 +++ + G+ T ++ A + Sbjct: 331 --YAGTASKVTNSNLKSGIESAYN---------VLGTDGTNTGRGIEIASQIL------- 372 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 N+ K IV+L+DGE+ N N AK++G + TI Sbjct: 373 ----SNSTAPNKMIVVLSDGESNAGNSRT--AANSAKNKGCIVYTIG--AGIASGSNGAK 424 Query: 418 FLSNCASPN------SFFEANST-HELNKIFRDRIGNEIFE 451 L +CAS + F+ A+ T + LN+IF + I EI E Sbjct: 425 ELFDCASVDQSTNKAKFYLADDTGNALNEIFAE-IAGEIQE 464 >gi|91975399|ref|YP_568058.1| hypothetical protein RPD_0919 [Rhodopseudomonas palustris BisB5] gi|91681855|gb|ABE38157.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 435 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 67/487 (13%), Positives = 142/487 (29%), Gaps = 122/487 (25%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +G+ +I A+ + +LG G VD S L+ A A++ A + Sbjct: 16 RFASDRSGNIAVIFAIALLPILGFIGAAVDYTNASRVRAKLESAQDAAVLLAVSNSAIN- 74 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + V+ ++ F + + + E + ++SA Sbjct: 75 KTVADAQADAVQFFN----------------ATLDGYGLSATIDLSVSENDGKRSAVSSF 118 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 SS + FL +G + I ++ + Y V ++D S SM Sbjct: 119 SSSV-------KTHFLDMIGYPTLAIGNRSTSTVSLPVY-----VDFYLLLDNSPSMGVA 166 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 S+ + + +S D S + K L + Sbjct: 167 ATTSDIATMV-----------ANTSDQCAFACHDLSTSNNYYNLAKKLGVTM-------- 207 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN-----DRVI 307 + +VR A+ + + + V + RMG F + Sbjct: 208 -----------------RIDVVRQAVQRLTTTATAMSAVTNQFRMGVYTFGSSCTAIGLT 250 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-----STAINDAMQTAYDTIISSNEDEVHRMK 362 + + S + + ++ + G T + ++ I S Sbjct: 251 TVANLSSSMSSVQTSVGTIDLMTIPYQGYNNDQCTDFDGSLTAINSAIPSPGSGI----- 305 Query: 363 NNLEAKKYIVLLTDG--ENTQDNEE--------------GIAICNKAKSQGIRI------ 400 + + +K++ ++DG + + +A C KS+GI+I Sbjct: 306 -STQPQKWLFFVSDGVADANYPSTCTKPTVSGGRCQEPLTVAQCTAIKSRGIQIAVLYTT 364 Query: 401 ------------MTIAFSVNKTQQEKARYF---LSNCASPNSFFEANS----THELNKIF 441 F+ + +CASP +FE + ++ +F Sbjct: 365 YLALPTNSWYNTYIAPFNPGPYGPSTNSQIAANMQSCASPGFYFEVSPTQGIAEAMDALF 424 Query: 442 RDRIGNE 448 + + Sbjct: 425 KKAVAKA 431 >gi|256823198|ref|YP_003147161.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796737|gb|ACV27393.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 348 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 58/163 (35%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F ++ ++ + K ++T++ I TAI D + A + + Sbjct: 131 RVGMILFGEQAYLQTPLTFDL-KTVQTMLDETTIGLAGSSRTAIGDGIGLAVKRLRERDA 189 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 + + ++LLTDG+N + A+ GI I TI ++ Sbjct: 190 NN-----------RVLILLTDGQNNTGALNPLQAAELAEHAGITIYTIGVGADEMIVKNR 238 Query: 411 -------QQEKARYFLSNCA----SPNSFFEANSTHELNKIFR 442 S A + +F A T E+ +I++ Sbjct: 239 FFGNRRINPSLELDEESLIAVAEKTGGRYFRARDTKEMEEIYQ 281 >gi|167032571|ref|YP_001667802.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166859059|gb|ABY97466.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 358 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 67/204 (32%), Gaps = 39/204 (19%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D I + LV+ + ++ + R+G F + ++ Sbjct: 104 FPDMQWKNEDISRLDLVKALMGDFLQ-------DREGDRVGLILFGSQAYLQAPLTFD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +RT + I +TAI DA+ A + + +VL+TDG Sbjct: 156 RTVRTFLDEAKIGI-AGKNTAIGDAIGLAVKRLRQRPAQS-----------RVLVLITDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYF----------LSNCA-- 423 N + A +G+RI TI N + L A Sbjct: 204 ANNGGQIHPLTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADI 263 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 + ++F A+ EL D IG+ Sbjct: 264 THGAYFRAHDGAEL-----DAIGD 282 >gi|330504126|ref|YP_004380995.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328918412|gb|AEB59243.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 334 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 37/183 (20%), Positives = 62/183 (33%), Gaps = 33/183 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + LV+ L I ++ D R+G F + ++ +RT + Sbjct: 114 ISRLELVKRLLGDFIEG-RRGD------RVGLILFGSQAYLQAPLTFD-RHTVRTWLDEA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I +TAI DA+ A + + +VL+TDG N + + Sbjct: 166 LIGI-AGKNTAIGDAIGLAVKRLRQRPAQS-----------RVLVLITDGANNGGEIDPM 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYF-----------LSNCA--SPNSFFEANST 434 A +G+RI TI + Q F L + + +F A + Sbjct: 214 VAAQLAADEGVRIYTIGIGADPQQSGAFGSFGFSALDLDETSLRAISDTTGGEYFRARNQ 273 Query: 435 HEL 437 EL Sbjct: 274 AEL 276 >gi|317154611|ref|YP_004122659.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] gi|316944862|gb|ADU63913.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] Length = 395 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 61/459 (13%), Positives = 136/459 (29%), Gaps = 111/459 (24%) Query: 32 VMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIE 91 ++L V G+ VD+ L+ A + S E+ S +Q + Sbjct: 1 MLLAVAGLAVDMGNMYVTHTRLQAAVDAGALAGS-------LELPYDPDLSKGIVQQAVS 53 Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 + + N + + ++ ++P VV++++ + L L + + Sbjct: 54 DMIHTNMPDAVVES----------------VSPGTEVRSVVVTAKAKVNL----LVMGFL 93 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + ++ A A + + I +VID S SM + + Sbjct: 94 NLADQWVEAGAAAGF--------NKLEIVFVIDNSGSMKGTPINLVKEASIGLTDLLIPD 145 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 G + + + K G + + V++ + Sbjct: 146 -----------GQQPDTKVGLVAFRGKVRLGGDVDGLEAGCRNADGSVNTGIHEDFMSMY 194 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + I + + S ++ I A+ Sbjct: 195 WALSSYYRNQID-LDTC---------------SSIPESRPLSQDKGDIVEGINSQTALGS 238 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--------- 382 T I++ ++ A + K + +K +++LTDG+ Sbjct: 239 AS--GTVISEGIKWARHMLTPEAPYTQAGDK--KDFRKIMIVLTDGDTEDGECGGSYRAS 294 Query: 383 ---------------------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 N++ +A AK +GI I I F V+ Sbjct: 295 FRPNNYWTNAYYGMGVDTAHCQDGGVLNQDMLAEAQLAKDEGIEIFAIRFGVSDNTD--- 351 Query: 416 RYFLSNCASP-----NSFFEANSTHELNKIFRDRIGNEI 449 + AS + +F+A S +++ +F+ +IG ++ Sbjct: 352 ISLMKQIASSKAGTNDHYFDAPSVYDIPDVFK-KIGKQL 389 >gi|83859217|ref|ZP_00952738.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] gi|83852664|gb|EAP90517.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] Length = 436 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 72/484 (14%), Positives = 153/484 (31%), Gaps = 91/484 (18%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + G+ II AL V++ G +D R + L+ A + + A+ Sbjct: 8 FSRWSDDRRGNVAIIMALCSGVLVTAVGGALDYSRSTTVSSELQSALDSGALAAAS---- 63 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 L + + + + + ++ L + V DI ++ V Sbjct: 64 -LTQDRNPEDVVRAYVEAALADH------PQLLASLQLDVVADISLNSRV---------- 106 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ + + L +GI + ++ +EA R V I V+D S SM Sbjct: 107 VNATASVAMP----TTMLGLVGINTLTLEHASEAIEQVRD------VEISLVLDVSGSMG 156 Query: 191 DYQRDSEGQPLNCF---GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + ++ F AD ++ S G + + + G Sbjct: 157 GSKINALQDAAIEFVEIVLAADAAERTSISVIPYNGGVRTPREVNQDIVSGNNNHRRQSG 216 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D + + + S + + ++ Sbjct: 217 CVDMGTDYPVEMTLPYREMEFTEYYGSEQTGNSSSAFCPRSNMESEF------------- 263 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND--AMQTAYDTIISSNEDEVHRMKNNL 365 S ++ I A + + A+ A+ + + + ++ Sbjct: 264 ----LSQNEGRMRGLINSLRAEGNTGLDVATMWGARALDPAWRGNLGGSFSDRPASYDDR 319 Query: 366 EAKKYIVLLTDGENT------------------------------QDNEEGIAICNKAKS 395 + K +V++TDGE T Q E C+ A+ Sbjct: 320 DTIKILVVMTDGEATAQIRSEEYTYYDWWGRERTGTRSYELYSARQARENMAEACDIAEG 379 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 G++I TIAF ++ R + NCA+ P ++++ + ++ + F I +I + Sbjct: 380 NGVQIYTIAFQLSGQTN---RDLMRNCANKPQNYYQVENL-DIAEAF-SSIAADI--NRL 432 Query: 455 RITK 458 R+T+ Sbjct: 433 RLTR 436 >gi|148548919|ref|YP_001269021.1| von Willebrand factor, type A [Pseudomonas putida F1] gi|148512977|gb|ABQ79837.1| von Willebrand factor, type A [Pseudomonas putida F1] Length = 358 Score = 75.8 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 66/204 (32%), Gaps = 39/204 (19%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D I + LV+ + ++ + R+G F + ++ Sbjct: 104 FPDMQWKNEDISRLDLVKALMGDFLQ-------DREGDRVGLILFGSQAYLQAPLTFDRR 156 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +++ A +TAI DA+ A + + +VL+TDG Sbjct: 157 TVRTFLIE--AQIGIAGKNTAIGDAIGLAVKRLRERPAQS-----------RVLVLITDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYF----------LSNCA-- 423 N + A +G+RI TI N + L A Sbjct: 204 ANNGGQIHPLTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADI 263 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 + ++F A+ EL D IG+ Sbjct: 264 THGAYFRAHDGAEL-----DAIGD 282 >gi|312878233|ref|ZP_07738157.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311794982|gb|EFR11387.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 1221 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 66/170 (38%), Gaps = 31/170 (18%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + A S + ++ + D R F+D + + I + + Sbjct: 554 KIAAKSFVDALIQGD------RAAVVDFDDYGYLLQPLTTDFQTVKNAIDR-----IDSW 602 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I + ++ A +IS + D+ K I+LLTDGE DN +AK Sbjct: 603 GGTNIAEGIRIANHQLISQSSDDR---------IKVIILLTDGEGYYDNNLT----TEAK 649 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 + GI I TI + + L N A+ +F +S +L ++F+ Sbjct: 650 NNGITIYTIGLGTSVDE-----NLLRNIATQTGGMYFPVSSASQLPQVFK 694 >gi|229590954|ref|YP_002873073.1| hypothetical protein PFLU3509 [Pseudomonas fluorescens SBW25] gi|229362820|emb|CAY49730.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 362 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 62/197 (31%), Gaps = 34/197 (17%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D + + + LV+ L + + R+G F + ++ Sbjct: 104 FPDMNWQGEDVSRLSLVKHLLGDFLE-------GREGDRVGLILFGSQAYLQAPLTFD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +RT + I +TAI DA+ A + + ++L+TDG Sbjct: 156 RTVRTWLDEARIGI-AGKNTAIGDAIGLALKRLRQRPAQS-----------RVLILVTDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNC--A 423 N + + A +G++I I + Q L A Sbjct: 204 ANNAGQIDPLTAARLAAEEGVKIYPIGIGADPEQTGSLGILGVNPSLDLDEPALKAIAEA 263 Query: 424 SPNSFFEANSTHELNKI 440 + +F A EL I Sbjct: 264 TGGQYFRARDGEELQAI 280 >gi|223936327|ref|ZP_03628239.1| von Willebrand factor type A [bacterium Ellin514] gi|223894845|gb|EEF61294.1| von Willebrand factor type A [bacterium Ellin514] Length = 338 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 60/180 (33%), Gaps = 33/180 (18%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK + + R+G F + + L++ + + N TAI A+ Sbjct: 124 LKKFVDKRQSDRIGLVVFGTQAYVAVPPTLDHEFLLKNLERLGIGSINGN-QTAIGSALS 182 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 T+ + + K I+L+TDG+N + A++ GI+I TI Sbjct: 183 TSMNRLRELKSKS-----------KIIILMTDGQNNAGKVPPLTAAEAARALGIKIYTIG 231 Query: 405 F-------SVNKTQQEKARYFLSNC--------------ASPNSFFEANSTHELNKIFRD 443 T + + ++ A+ST L KI+ D Sbjct: 232 VGTKGVARMAVGTDPFSGQKIYQQVPVDIDEGTLTSISKMTNAKYYRADSTATLEKIYAD 291 >gi|26988754|ref|NP_744179.1| von Willebrand factor type A domain-containing protein [Pseudomonas putida KT2440] gi|24983548|gb|AAN67643.1|AE016394_4 von Willebrand factor type A domain protein [Pseudomonas putida KT2440] gi|313499848|gb|ADR61214.1| Von Willebrand factor type A domain-containing protein [Pseudomonas putida BIRD-1] Length = 358 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 66/204 (32%), Gaps = 39/204 (19%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D I + LV+ + ++ + R+G F + ++ Sbjct: 104 FPDMQWQNEDISRLDLVKALMGDFLQ-------DREGDRVGLILFGSQAYLQAPLTFDRR 156 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +++ A +TAI DA+ A + + +VL+TDG Sbjct: 157 TVRTFLIE--AQIGIAGKNTAIGDAIGLAVKRLRERPAQS-----------RVLVLITDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYF----------LSNCA-- 423 N + A +G+RI TI N + L A Sbjct: 204 ANNGGQIHPLTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADI 263 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 + ++F A+ EL D IG+ Sbjct: 264 THGAYFRAHDGAEL-----DAIGD 282 >gi|312793553|ref|YP_004026476.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180693|gb|ADQ40863.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 726 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 66/170 (38%), Gaps = 31/170 (18%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + A S + ++ + D R F+D + + I + + Sbjct: 59 KIAAKSFVDALIQGD------RAAVVDFDDYGYLLQPLTTDFQTVKNAIDR-----IDSW 107 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I + ++ A +IS + D+ K I+LLTDGE DN +AK Sbjct: 108 GGTNIAEGIRIANHQLISQSSDDR---------IKVIILLTDGEGYYDNNLT----TEAK 154 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 + GI I TI + + L N A+ +F +S +L ++F+ Sbjct: 155 NNGITIYTIGLGTSVDE-----NLLRNIATQTGGMYFPVSSASQLPQVFK 199 >gi|148261962|ref|YP_001236089.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|326405471|ref|YP_004285553.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] gi|146403643|gb|ABQ32170.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|325052333|dbj|BAJ82671.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] Length = 431 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 70/454 (15%), Positives = 131/454 (28%), Gaps = 62/454 (13%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 K L G+ IITAL+ ++ + GM +D + ++ A A + A Sbjct: 4 LFIKFKALCGDRRGNIAIITALVSLTLIFILGMGIDYGLAIDRKSQMESYADAAALAAVT 63 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 P + + + S+ F Q + T V D V + Sbjct: 64 PAMVAAGQSSAITTAQNVFNAQAL----------------TMTGVTYNANDVTVSIATSG 107 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ MG S I +A A T + ++D S Sbjct: 108 DKRTATV----QYQAQSQAMLPDVMGFGSIKIGGQATATTTIAPN-----IDFYLLLDDS 158 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM S + +N + +SL L Sbjct: 159 PSMAIAATQSGINTMVANTTAQGGCAFGCHEENPSADKLGNPYGEDNYALARSLGVTLR- 217 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 +D + +++ +K R A+ + + I N+ + Sbjct: 218 --IDMLRQATQDLMTTAQTTETQKGTTYRMAIYTFDIGLNTIGNL-----------TSDL 264 Query: 307 ISDPSFSWGVHKLIRTIVK-TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + L D N+ T + A+ + I+ + + Sbjct: 265 SQAQTEAGNIQLLEVYSNNWLTQNDYNDDEDTNYDTALN-GINAIMPNPGNGTGAA--GD 321 Query: 366 EAKKYIVLLTDGENTQDNEEGIA-------ICNKAKSQGIRI---MTIAFSVNKTQ---- 411 ++ + +TDG +D +C K++GIRI T + Sbjct: 322 TPQEVLFFVTDGVEDEDVNGNRQQSLLNTDLCTAIKNRGIRIAVLYTEYLPLPTNSWYNT 381 Query: 412 -----QEKARYFLSNCASPNSFFEANSTHELNKI 440 Q L CASP +FE S +++ Sbjct: 382 YIAPFQNSIAPTLQQCASPGLYFEVKSGGDISAA 415 >gi|327193254|gb|EGE60160.1| hypothetical protein RHECNPAF_1700073 [Rhizobium etli CNPAF512] Length = 457 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 52/397 (13%), Positives = 120/397 (30%), Gaps = 83/397 (20%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + L + TG+ I+ AL + ML G D +R ++ A+I + Sbjct: 24 RLVRAFRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIA-A 82 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V I + E+ + + + + ++E E+ Sbjct: 83 VKQINNSEDTDALKQKVYDWFHAQVENSYALG-----------------------EIEID 119 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + ++ + + F++ I + + + + + SY +++ VID Sbjct: 120 TTNHNITATASGTVP----TTFMKIANIDTVPVSVGSAVKGPATSY-----LNVYIVIDR 170 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML S + Sbjct: 171 SPSMLLAATTSGQSTM-------------------------------YSGIGCQFACHTG 199 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + D S+ +++ + + DA+ V+ I + D+ ++ +++G D Sbjct: 200 DAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYSLGDT 259 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY--------DTIISSNEDE 357 + + + + +M Y I+ + D Sbjct: 260 TKEVLAPTLDTSNARKRLSDDSY-------GLTSATSMNYTYFDVALAALQKIVGTGGDG 312 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 ++ K ++LLTDG +Q N K Sbjct: 313 ----TSSANPLKLVLLLTDGVQSQRGWVVKNSSNLKK 345 >gi|323135950|ref|ZP_08071033.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] gi|322399041|gb|EFY01560.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] Length = 432 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 72/489 (14%), Positives = 155/489 (31%), Gaps = 106/489 (21%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + +++ G I L + + + G VD + S + AL QAA ++ L ++ Sbjct: 3 RSFLQNRRGGVAIFFGLALMPLALMAGGAVDFSQISRQKSALNQAADAGVL---TALKEA 59 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E++ + + +++ + N V T +N S Sbjct: 60 REQLKQGKPDWQSIAEKQGGKAFTNNA-----------SKIGGVSGTGATINLSLSGG-- 106 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 VLS + N + FLR G+ + ++ A A + Y I +VID S SM Sbjct: 107 VLSGSLNYAANAPTHFLRIAGLNTINLKGSASATMSAAQYRD-----IHFVIDVSASMGI 161 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 ++ Q + V C + ++ P Sbjct: 162 GATKADQQAM-----------------------------QNSVGCAVACHHAEAADPATD 192 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 +L+ + ++ + +VR A+ + I + + R+ F++ + + Sbjct: 193 NLAAVRAIGATL------RIDVVRKAVMDALAKI----PNDGSTRVAIHSFSNSLKTVFP 242 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT-------AYDTIISSNEDEVHRMKNN 364 S + I +EN G T + ++ A + + +S + + Sbjct: 243 LSTNIAGAISATQSIDLTNENGQGGTNFHYSLNQLNNLLASAGNGLTASQPRGFVLLATD 302 Query: 365 LEAKKYIVLLTDGENTQDNEEGI------------------------AICNKAKSQGIRI 400 + DG + + A C+ K++G + Sbjct: 303 AVEDSSLFFYADGVAPPFARQWVEPNFVVGNPSYFAWGLHYVQAPDAANCSAIKAKGYTM 362 Query: 401 MTI--------AFSVNKTQQEKAR------YFLSNCAS-PNSFFEANSTHELNKIFRDRI 445 MT+ + +++CAS P+ +F A S E+++ + + Sbjct: 363 MTLETEYLIPDGVYNPTFDAVRGDMGPAMTKSMTDCASAPDYYFHAESPQEIDRAVQTMV 422 Query: 446 GNEIFERVI 454 + + Sbjct: 423 SKTVNLSLT 431 >gi|327538644|gb|EGF25299.1| protein containing von Willebrand factor, type A domains [Rhodopirellula baltica WH47] Length = 388 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 67/443 (15%), Positives = 139/443 (31%), Gaps = 81/443 (18%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K ++S +G ++ +L+PVML V ++VV L+ + A A L + Sbjct: 26 KARLRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVT 85 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + YL R + + + E P K V Sbjct: 86 GDKAEAIEAAERLLEANP---YLDRTLSIG-DADIIFGKSNRTEENRRYEFTPDKKVNSV 141 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 L + + P+ + I+ I+ ++ + + I V+D S SM Sbjct: 142 GLRA-FGADDVPMLFPTMGVPIEFRPIK---------QAVATQVELDIAIVLDRSGSMAF 191 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + P K+G Sbjct: 192 SHDEVAKNGSPSSAPP-----------GWKMG---------------------------- 212 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 H + K D +A+V + +++ + R+ + ++D+ +D Sbjct: 213 --------------HAVPKNARWLDTVAAVNGFLDIMEDSSHDERVSLSTYSDKSKADVK 258 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + ++ + + G+T I + T+ N A + + Sbjct: 259 LTGDYTEIRAAMNAHSTN--FKGGATNIGSGILEGGATLGDKNLAR-------SWASRVL 309 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 ++++DG + E I + ++ I I T+ FS QE + +S F A Sbjct: 310 IVMSDGIHNTG-IEPIPAAQQVANEKIMIFTVTFSNEANVQEMEKVAVSG---GGQHFHA 365 Query: 432 NSTHELNKIFRDRIGNEIFERVI 454 + +L + FR +I + + Sbjct: 366 KDSQQLAEAFR-KIAKSLPTLIT 387 >gi|312881786|ref|ZP_07741560.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] gi|309370537|gb|EFP98015.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] Length = 323 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 70/208 (33%), Gaps = 35/208 (16%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 LD S S S + I + V++ ++ + +K D R+G F D Sbjct: 91 LDLSYSMSQEDMQDSSGNYIDRLTAVKNVVSQFAQQ-RKGD------RLGLVLFADHAYL 143 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + + + + TAI + + A T I S + Sbjct: 144 QTPLTLDRNTISEQVNSLVLQLIGQK--TAIGEGIGLATKTFIDS-----------DAPQ 190 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKA 415 + ++LL+DG NT + I N AK I TI + Q+ Sbjct: 191 RVMILLSDGSNTSGVLDPIEAANIAKKYNATIYTIGVGAGEMMVKDFFMTRKVNTAQDLD 250 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIF 441 L + A + +F A + EL I+ Sbjct: 251 EKTLMSIAKITGGQYFRARNAQELATIY 278 >gi|86356688|ref|YP_468580.1| hypothetical protein RHE_CH01044 [Rhizobium etli CFN 42] gi|86280790|gb|ABC89853.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 445 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 62/453 (13%), Positives = 148/453 (32%), Gaps = 52/453 (11%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + I +G+F I+TALL+ +LG G VD L AA A + + P ++ Sbjct: 4 RFIFDRSGNFGIMTALLVVPLLGAAGTAVDFASALSLRTELYAAADAAAVGSITPTSEAA 63 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + ++ + + + + + F+ + V+++ +K + Sbjct: 64 AQANTMSGDGSLTLGKSEAQKIF----------FSQMSKKQGDAPVTVDISVQKKGDTLS 113 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + ++ + + F++ MG + A A+ + SY + ++D + SM Sbjct: 114 STVSFNATMP--TTFMQVMGFDEIAVTGAATAQYQTPSY-----MDFFMLLDNTPSMGVA 166 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQN-----GKVGIRDEKLSPYMVSCNK-SLYYMLYP 246 + + K + + G+ D+ + N ++ + Sbjct: 167 ATTDDITAMKKATANGHDGGKDKNCAFACHIVSEKGVEDKNSYYNVARNNGVTIRIDVVA 226 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + +++ S + + + A + + K+ ++N + GA + Sbjct: 227 SAVKALMAKAKDTQSMPSQFRVAAYTSGKTAQDAKAAKLFKVSDLNYDL--GAVAAAANM 284 Query: 307 ISDPSFSW-----GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 I S + +K + T ++A + ++ + ++ Sbjct: 285 IKLMSIPYQNYYSDQQTSFDEALKGIEGEIKGNIGTGTSNADRQKIVFFVADGVGDSYKP 344 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGI--AICNKAKSQGIRI---MTIAFSVNKTQ----- 411 K G N E I C K K +GI++ T + Sbjct: 345 TGCTSPK--------GANGGRCIEPIDTTYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDW 396 Query: 412 ----QEKARYFLSNCASPNSFFEANSTHELNKI 440 + + + CA+P +F + T + + Sbjct: 397 VKPFETRIAAKMEECATPGFYFAVSPTEGIEEA 429 >gi|329848392|ref|ZP_08263420.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328843455|gb|EGF93024.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 434 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 69/488 (14%), Positives = 145/488 (29%), Gaps = 106/488 (21%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+ + G+ +I L +PV+ G VD R + L+ AA A + SV + Sbjct: 6 KRFFRDTRGNVIMIIGLALPVVFLAIGGAVDFSRVMQLKKELQDAADVASV-GSVAVNSY 64 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + +++ +SF + + N + + ++ +I ++ ++ Sbjct: 65 AYKANTKGHSSFKTGENQALAIFNSNVKKH-------NDLNNIKVKAKIKKQSTNLVSEI 117 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 +++ Y L MG+ + I K+ + + Y + ++D S SM Sbjct: 118 GVTADYR------PYLLGLMGMNTMPITIKSTSSSTFPPY-----IDFYLLLDNSPSMGV 166 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + + + + + + YY L Sbjct: 167 GATTKDIDTMVANTSDK------------------CAFACHQMDKAGNDYYAL------- 201 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + V + +VR A +++ + K + D RM F Sbjct: 202 ----------AKKLKVTTRIDVVRQATQNLMTTAKNTQTLTDQYRMAIYHFGMAADQI-- 249 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAIN-DAMQTAYDTIISSNEDE------------- 357 V + + S A D M Y S + Sbjct: 250 ---DSKNPAPYEVSALTTNLSTSASNAAKIDLMTIPYQNYNSDRQTNFPSYLLGMNKVIP 306 Query: 358 -VHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-------------ICNKAKSQGIRI--- 400 ++ + ++ + ++DG N + C K++G++I Sbjct: 307 SSGDGSSSSKPQQVLFFVSDGANDGYDCAYSNGASCRRISPLDTPQCKAMKARGVKIAVL 366 Query: 401 MT--------------IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 T +A V+ T Q A+ + CA+ +FE +++ Sbjct: 367 YTTYLPLPTNAFYNSHLAKYVSPTSQLAAK--MQECATEGLYFEVGPNEGISEAMNALFA 424 Query: 447 NEIFERVI 454 I I Sbjct: 425 KVISTVRI 432 >gi|239995770|ref|ZP_04716294.1| von Willebrand factor, type A [Alteromonas macleodii ATCC 27126] Length = 358 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 37/163 (22%), Positives = 58/163 (35%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D ++ + T++ I TAI DA+ A E Sbjct: 133 RIGLILFADTAYVQAPLTYD-RDTVSTLLSEAVIGL-VGEQTAIGDAIGLAVKRFDEREE 190 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 ++LLTDG+NT N A S+G+++ TI +K Sbjct: 191 SN-----------NVLILLTDGQNTAGNITPEQAKELAISKGVKVYTIGVGADKMLIQSF 239 Query: 411 --------QQEKARYFLSNCASP--NSFFEANSTHELNKIFRD 443 QE L+N A+ +F A + EL I++ Sbjct: 240 FGSRQINPSQELDEGMLTNIATSTGGQYFRARNAQELQAIYQQ 282 >gi|330810109|ref|YP_004354571.1| hypothetical protein PSEBR_a3255 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327378217|gb|AEA69567.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 359 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 51/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TAI DA+ A + + Sbjct: 135 RVGLILFGSQAYLQAPLTFD-RRTVRHWLDEARIGI-AGKNTAIGDAIGLALKRL---RQ 189 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 H + ++L+TDG N + + A +G++I I Q Sbjct: 190 RPAH--------SRVLILVTDGANNGGEIDPLTAARLAADEGVKIYPIGIGAAPEQSGTT 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A + +F A +L I Sbjct: 242 GSLGVNPSLDLDEPTLKEIAEVTGGQYFRAQDGEQLLGI 280 >gi|332307030|ref|YP_004434881.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] gi|332174359|gb|AEE23613.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] Length = 338 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 35/161 (21%), Positives = 54/161 (33%), Gaps = 28/161 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G FF D ++ + + ++ I TAI DA+ A S E Sbjct: 132 RLGLIFFADTAYLQAPLTYD-RETVSQLLGESLIGL-VGEQTAIGDAIGLAIKRFQSKKE 189 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 K ++LLTDG+NT N A + G+ + TI ++ Sbjct: 190 SN-----------KVLILLTDGQNTAGNISPQQANELAINNGVTLYTIGVGADQMMVQSI 238 Query: 411 --------QQEKARYFLSNCASP--NSFFEANSTHELNKIF 441 QE L+ A +F A L I+ Sbjct: 239 FGSRQVNPSQELDESMLTQLAESTGGRYFRARDAESLKAIY 279 >gi|268316013|ref|YP_003289732.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333547|gb|ACY47344.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 329 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 57/166 (34%), Gaps = 31/166 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + + L+ + + + TAI A+ TA + + Sbjct: 128 RIGLVVFAGQAFTQVPPTLDYRFLLTMLQRLQVGRLED--GTAIGTAIATAINRL----- 180 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 KN+ K I+LLTDG+N + + + A+ GIRI TI S Sbjct: 181 ------KNSEARSKVIILLTDGQNNRGEIDPLTAAELARQAGIRIYTIGLSGRGEAPYPV 234 Query: 416 R----------------YFLSNCA--SPNSFFEANSTHELNKIFRD 443 + + A + +F A L I+ + Sbjct: 235 QTPFGTRPQPVPVEIDEAMMREVAEKTGGRYFRATDARTLEAIYAE 280 >gi|295691296|ref|YP_003594989.1| TadE family protein [Caulobacter segnis ATCC 21756] gi|295433199|gb|ADG12371.1| TadE family protein [Caulobacter segnis ATCC 21756] Length = 531 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 75/532 (14%), Positives = 140/532 (26%), Gaps = 86/532 (16%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++ + ++L + G I ALL M + L+D+ R S H ++ A A + A Sbjct: 2 SRLTRFFRRLGRDDRGAIAIQFALLAIPMSILVFALIDLGRISLQRHQMQDALDAATLMA 61 Query: 65 SVPLIQSLEEVSS-------RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + + E+ S N++ T I Sbjct: 62 ARSTAVTDAELESVGDPAFLAEIAGLNLGLSASNASFKAGAGNHIIGTATATVKPIIANL 121 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + + VV SS+ + L + G + ++T A ++ Sbjct: 122 WTTDDFNLTATSDVVRSSKNLEVAVVLDITGSMSGSRITDLKTGASDLVDIVVKDQQAPF 181 Query: 178 SIQWVID------FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + I + D R + + V + ++ VG + Sbjct: 182 YSKVAIVPYSVGVNVGTYADAVRGAVIARTITGVSKTNAAVVASAAHGFIVGDKVTISGV 241 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + Y + D + ++ + + N Sbjct: 242 SGPTMLNGNTYNITAASADSFTINANTSNAPKYVSGGVATCDTSTNPGCLNFTFTSASNT 301 Query: 292 NDT--------VRMGATFFNDRVISDPSFSWGVHKLI---------------------RT 322 +T R G + D S R Sbjct: 302 KETRTLSTCVTERTGTYAYTDIAPSIAPVGRNYPTTKLENSMQPNPCPTATITPLSSDRV 361 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISS------NEDEVHRMKNNLEAKKYIVLLTD 376 +K + GSTA + + + N + N+ + K +VL+TD Sbjct: 362 TLKGQINALSIGGSTAGQIGFAWGWYMVSPNFGYLWPNATQRPAPYNSKDLVKVVVLMTD 421 Query: 377 GE------------------------------NTQDNEEGIAICNKAKSQGIR--IMTIA 404 G N + +C+ K ++ I T+ Sbjct: 422 GAFNTPYCKGVIAKDAGSGSGAVDDHINCVATNGDAFTQTRKLCDAMKDPSLKLTIFTVG 481 Query: 405 FSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERVI 454 F + A L CA+ +F A + EL F+ I EI I Sbjct: 482 F--DVGGDANAVNMLKYCATDAQHVYFPATGS-ELKTAFKS-IAQEISSLRI 529 >gi|291514853|emb|CBK64063.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 328 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 68/218 (31%), Gaps = 39/218 (17%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 +D L+ + + + ++ S I D D R+G F Sbjct: 84 NTEGIDIMLAIDVSGSMLARDFKPDRITAAKEVAGSFI-----ADRYGD--RIGLVAFAG 136 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + L + + + + TAI + + TA + + + + Sbjct: 137 EAFTQSPLTTDQSTLQTLLARIRSGLIED--GTAIGNGLATAINRL-----------RES 183 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ------------ 412 K I+LLTDG N Q + AK+QGIR+ TI Sbjct: 184 DAKSKVIILLTDGVNNQGQIAPMTAAEIAKAQGIRVYTIGVGTEGMAPYPAIDMFGNLTF 243 Query: 413 -----EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L + + +F A +L ++ + Sbjct: 244 VNQKVEIDEKVLKAISDMTGGRYFRATDKEKLKAVYDE 281 >gi|218528586|ref|YP_002419402.1| hypothetical protein Mchl_0543 [Methylobacterium chloromethanicum CM4] gi|218520889|gb|ACK81474.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 518 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 78/511 (15%), Positives = 156/511 (30%), Gaps = 77/511 (15%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + +G +I AL + M + GM +D + + + A + + + + Sbjct: 14 RFRHTESGSVLVIFALALVPMAFLAGMTIDYAQNTNLRQQAQVAVDATALALAKLPLDTT 73 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS----A 128 ++ + + ++ I ++ N EV T + Sbjct: 74 DKDLAAKAEAQVLT--ALKGLPIDALTVTMRHNGDLIEVAA-KGATPTSLTRLAGFMSMP 130 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMG--IKSWLIQTKAEAETVSRSYHKEH----------- 175 V S + ++L L + G + L KA A + S ++ Sbjct: 131 LSVSAISNRSMTNLEIALVLDNTGSMKGTKLTNLKAAARDLVTSLFQQADPAKPNALKIG 190 Query: 176 --GVSIQWVIDFSRSMLDYQRDSEGQPL-----NCFGQPADRTVKSYSSQNGKVGIRDEK 228 S+ + + D+ + P+ N G PA+R G + + Sbjct: 191 VVPFSMTVNVGSGFAGSDWLDINAKSPIHQQIFNAQGVPANRFSLFADMGKPWAGCVESR 250 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK-- 286 +PY V + +E DS ++ + A + R ++ Sbjct: 251 PAPYDVQDTAPSQATPSTLFVPFFAPDESDNDSRAVNDYMADLPSGGSAGGASNRQLQGM 310 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVH-------KLIRTIVKTFAIDENEMGSTAI 339 + ++ T D + G +T + +G T I Sbjct: 311 TAKYDKNAFKVSTTARQDGTNYLFGPNAGCEIQPLTRLTTSQTQLTNAIAAMTVIGDTNI 370 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-------------------- 379 + + + + + ++ KK+IVL+TDG+N Sbjct: 371 PIGLAWGWHLLSPNGPFKDGVAYGEIKTKKFIVLMTDGQNQSAVSSSDNRSYYSGLGFIW 430 Query: 380 --------TQDNEEGIAI-------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA- 423 + AI C+ + I++ + VN L CA Sbjct: 431 QNRIGTTSNDNAVRTKAIDTRLTLLCDNIRKARIQVFAVRVEVNDGDSA----VLKACAT 486 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 SPN FF+ ++ L +FR I ++I E I Sbjct: 487 SPNMFFDVKNSSGLPAVFR-AIADQISELRI 516 >gi|167752252|ref|ZP_02424379.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] gi|167660493|gb|EDS04623.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] Length = 328 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 71/218 (32%), Gaps = 39/218 (17%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 +D L+ + + + + + I + R+G F Sbjct: 84 NTEGIDIVLAIDISTSMLAQDLQPDRIQAAKQVAGNFITD----RPGD---RIGLVAFAG 136 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + ++T++ E G TAI + + TA + + SN Sbjct: 137 EAFTQSPLTTD-QGTLQTLLGRLRSGVVEDG-TAIGNGLATAINRLRESNAKS------- 187 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA--------------FSVNKT 410 K I+LLTDGEN + + A+ QGIR+ TI F Sbjct: 188 ----KVIILLTDGENNRGEIAPLTAAEIARDQGIRVYTIGVGTRGTAPYPTVDFFGNPTV 243 Query: 411 QQEKAR---YFLSNCA--SPNSFFEANSTHELNKIFRD 443 Q K + L A + +F A +L I+ + Sbjct: 244 VQAKVQIDEKILGEIADLTGGRYFRATDNAKLQSIYDE 281 >gi|31789431|gb|AAP58546.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 327 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 69/185 (37%), Gaps = 18/185 (9%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 + + + + V+DA+ + +R R+G F+D ++ Sbjct: 107 LKTGTTAAGRTRMDAVKDAVRTFVR-------GRRDDRIGLVVFSDNAYVISPLTFDHQY 159 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 L+ + G TAI D + A + + + + +VL TDGE Sbjct: 160 LLDYLGFVDGEILLGEGQTAIGDGLALASAVL-------ARQAGRDARGHQVVVLFTDGE 212 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-ARYFLSN--CASPNSFFEANSTH 435 + + + I + +AKS GIR+ I ++ + + L A+ +F A+S Sbjct: 213 SNRG-RDPIEVVGEAKSAGIRVHVIGVDLDAEVKTRPGVQLLRRGVVAAGGRYFAADSER 271 Query: 436 ELNKI 440 +L Sbjct: 272 DLLTA 276 >gi|87121300|ref|ZP_01077190.1| batB protein, putative [Marinomonas sp. MED121] gi|86163457|gb|EAQ64732.1| batB protein, putative [Marinomonas sp. MED121] Length = 333 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 64/197 (32%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + L I R+G F + S+ + + Sbjct: 107 DMKINQQAANRLDAAKQVLNRFITE-------RQGDRIGIIVFGSKAYLQAPLSYDLDTI 159 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +T +TAI DA+ + + + D K+ ++L+TDG N Sbjct: 160 AQLVNETQI--GFAGENTAIGDAIGLGIKRLANIDAD-----------KRVMILMTDGAN 206 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--S 424 T + A QG++I TI + + L A + Sbjct: 207 TAGRVKPDQAAQFAAKQGVKIHTIGIGAEQMVSQGFFGPRVINPSTDLDEELLQKVADLT 266 Query: 425 PNSFFEANSTHELNKIF 441 +F A ST EL I+ Sbjct: 267 QGQYFRAKSTQELASIY 283 >gi|104780882|ref|YP_607380.1| hypothetical protein PSEEN1727 [Pseudomonas entomophila L48] gi|95109869|emb|CAK14574.1| conserved hypothetical protein; Willebrand factor type A domain protein [Pseudomonas entomophila L48] Length = 358 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 64/197 (32%), Gaps = 34/197 (17%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D I + LV+ + ++ R+G F + ++ Sbjct: 104 FPDMQWQGDEISRLDLVKALMGDFLQ-------DRQGDRVGLILFGSQAYLQAPLTFD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +RT + I +TAI DA+ A + + ++L+TDG Sbjct: 156 RTVRTFLDEAQIGI-AGKNTAIGDAIGLAVKRLRQRPAQS-----------RVLILITDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYF----------LSNCA-- 423 N + A +G+RI TI N + L A Sbjct: 204 ANNGGQIHPLTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEASLKEIAGI 263 Query: 424 SPNSFFEANSTHELNKI 440 + ++F A+ ELN I Sbjct: 264 THGTYFRAHDGAELNAI 280 >gi|152990340|ref|YP_001356062.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422201|dbj|BAF69705.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 289 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 64/184 (34%), Gaps = 28/184 (15%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 SL K +V+ + K+ D+ +G F ++ L Sbjct: 91 ESLYDEKSKFEVVKSMAQNFFH--KRFDDN-----IGIVIFGSFAYIAAPLTYDTKALDF 143 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I + +TAI + + + I + D +K ++L+TDG + Sbjct: 144 LIN--YLEPSIAGNNTAIGEGL---WQGIKALQADTAK--------QKVLILITDGHHNS 190 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 + KAK GI+I TI + ++ L A S FF A S +L Sbjct: 191 GSISPRQAVEKAKKLGIKIYTIGLG------DADKHLLEQIAKESGGKFFYAKSEEDLQS 244 Query: 440 IFRD 443 IF + Sbjct: 245 IFSE 248 >gi|330862285|emb|CBX72446.1| hypothetical protein YEW_HH31780 [Yersinia enterocolitica W22703] Length = 457 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 69/482 (14%), Positives = 147/482 (30%), Gaps = 74/482 (15%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + K+ G I ++ P + + + ++ + + L A + A + + Sbjct: 11 FNHFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALA-- 68 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 I++ E + + N N + +V + N Sbjct: 69 -IENNEIPDEPQQ-------------IKNNALVLSYVNAYLPSKKFLVPIININDNTHYL 114 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y ++ Y S F ++ + A + + +V D+S Sbjct: 115 EYNAAVTMAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEAS---EPTDVIFVADYSG 171 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML--- 244 SML +++ + K + I P+ + ++ Sbjct: 172 SMLYNFNENKPRDHERIDALRSAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQK 231 Query: 245 ------------YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 P S + ++ L I + S+ + + ID Sbjct: 232 TYCHFPFSPKIHKPKGNYLSDEIKRSSNTLLLLDYIGDIIDYDKTIDSITGNAQTIDIPM 291 Query: 293 DTVRMGAT------FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 VR G ++ + + I+ G T I+ + +A Sbjct: 292 SDVRFGDVCLQGSNAYSLEQEQYI---NNIDNI---------IEMEPHGWTLISSGILSA 339 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAI---------CNKAKS 395 + I + H KK +++L+DG +T D + +GI I C + K Sbjct: 340 -NNIFKNKAKNGH--------KKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKE 390 Query: 396 QGIRI--MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I++ + IA+S + + E C ++++EA++ HEL + + V Sbjct: 391 NDIQMAFIAIAYSPDNNKNEPYHINWKKCVGEDNYYEAHNAHELEHKLQQAVSGSTTREV 450 Query: 454 IR 455 R Sbjct: 451 GR 452 >gi|254443725|ref|ZP_05057201.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258033|gb|EDY82341.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 339 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 49/238 (20%), Positives = 86/238 (36%), Gaps = 38/238 (15%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R + ++ S ++ +L +E++FVD R + V+ L++ I Sbjct: 71 RPQAVTTERHSKSRGYDIVLAVDLSRSMEAEDYFVD----RKRSNRLQAVKPVLSAFI-- 124 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 N + R+G F R + ++ L R + + TAI D++ Sbjct: 125 -----NRRENDRIGLIAFAGRAYTVAPLTFDHKWLARQTERLQIGLIED--GTAIGDSLA 177 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A ++ ++ +IVLLTDGENT + + AK GIR+ TIA Sbjct: 178 VATSRLLEGAKERA-----GEREGAFIVLLTDGENTAGMMDPMEGATLAKDAGIRVYTIA 232 Query: 405 FSVNKTQQ-----------EKARYFLS-------NCA--SPNSFFEANSTHELNKIFR 442 N + FL A + FF A ++ +++ F Sbjct: 233 AGKNGYVPFPRRNERGERIGTTQEFLRVDTETLMKIANETNGEFFRAENSDTIDQAFE 290 >gi|316933619|ref|YP_004108601.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] gi|315601333|gb|ADU43868.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] Length = 483 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 61/475 (12%), Positives = 141/475 (29%), Gaps = 48/475 (10%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G+ +I + + +LG G VD R S A++ A + + + L Sbjct: 19 AASGGNIAVIFGIALLPLLGFVGAAVDYSRASRARTAMQSALDSTALMVAKDLTSGKITA 78 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY-QVVLS 134 + + T+ + + + + +T + + + T + + +V+ Sbjct: 79 ENVQSAANTYFTSLYKNTDAPSID--VTATYTPKTSSENAKLTVGGTGSINTEFMKVMNI 136 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS---------------I 179 S+ L + + + + + + ++ + + + Sbjct: 137 SQMSLGASSTTTWGGTRLRVALALDVTGSMDSAGKLSAMKTAAKQLIDTLKATSTTKEDV 196 Query: 180 QWVIDFSRSMLDYQRDSEGQP-LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I M++ ++ L+ K + + G K Sbjct: 197 YISIVPFNVMVNVGPGNKNATWLDWDTSYGSCKSKYTTKNACQAGGDSWNYWSNTCQSQK 256 Query: 239 SLYYMLYPG----PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 +L G S + V + + K ++ + D + Sbjct: 257 TLKSACQAGGHTWTASNVNSWKGCVTDRTQNYDTTKTEPTSATPDTLFLAQNYSDCMASL 316 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + M + + S T +K + G T M A+ T+ ++ Sbjct: 317 LPMKSAYEATESDSSTD---------ATTLKGRINTLDAQGGTNQGIGMFWAWMTLQATA 367 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQD-------------NEEGIAICNKA--KSQGIR 399 + + IVLL+DG NT++ ++ +C+ K G+ Sbjct: 368 PLYTPAKDSEYKYTDAIVLLSDGMNTKNRWYGNGSNWSPQVDDRQKILCDNITTKVNGVP 427 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 TI T + L C S FF + + F++ +G + + I Sbjct: 428 ETTIYTIQVNTSGDPESSVLKYCGSTGGFFSTTTASGIQSAFQE-VGASLTKLRI 481 >gi|332534652|ref|ZP_08410484.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] gi|332035932|gb|EGI72413.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] Length = 328 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 67/198 (33%), Gaps = 34/198 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L+ I R+G F D + V K Sbjct: 103 DMAYNGQYVDRLTMVKAVLSDFIEQ-------RQGDRLGLILFGDTAFLQTPLTRDV-KT 154 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + ++ I +TAI DA+ + + +E + +VLLTDG+N Sbjct: 155 VSKMLSEAQIGLVGR-ATAIGDALGLSVKRFANKDESN-----------RIVVLLTDGQN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN------------KTQQEKARYFLSNCA--SP 425 T N A+ +GI++ TI + + L A + Sbjct: 203 TAGNLNPEDALLLAREEGIKVYTIGVGSDNPRGFSLFNVGGSSGSNLDESLLKKIAEQTG 262 Query: 426 NSFFEANSTHELNKIFRD 443 +F A L +I+ + Sbjct: 263 GLYFRAKDVAGLQQIYAE 280 >gi|329894014|ref|ZP_08270022.1| BatA [gamma proteobacterium IMCC3088] gi|328923357|gb|EGG30676.1| BatA [gamma proteobacterium IMCC3088] Length = 339 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 69/191 (36%), Gaps = 35/191 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +++ VRD A I ++ D R+G F R S+ + + + Sbjct: 113 QAVRRIDAVRDIGAEFIER-REGD------RVGLILFGSRAYMQSPLSFDRDTVKQFL-- 163 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + A TAI DA+ A + + + ++LLTDG++T + + Sbjct: 164 SEAQIGFAGSETAIGDALGLAVKRLRDK-----------EDGDRVVILLTDGQDTASSVD 212 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP--NSFFE 430 + A + G+++ TI ++ E LS A +F Sbjct: 213 PLDATALAANYGVKVYTIGIGADEMLVPSLFGNRRVNPSAELDEETLSAMAESTGGRYFR 272 Query: 431 ANSTHELNKIF 441 A S EL KI+ Sbjct: 273 ARSPDELAKIY 283 >gi|325273881|ref|ZP_08140055.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] gi|324100983|gb|EGB98655.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] Length = 311 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 66/204 (32%), Gaps = 39/204 (19%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D I + LV+ L ++ + R+G F + ++ Sbjct: 57 FPDMQWKNEDISRLDLVKALLGDFLQ-------DREGDRVGLILFGSQAYLQAPLTFD-R 108 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +RT + I +TAI DA+ A + + +VL+TDG Sbjct: 109 RTVRTFLDEAQIGI-AGKNTAIGDAIGLAVKRLRQRPAQS-----------RVLVLITDG 156 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYF----------LSNCA-- 423 N + A +G+RI TI N + L Sbjct: 157 ANNGGQIHPLTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALREIGEI 216 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 + ++F A+ EL D IG+ Sbjct: 217 THGAYFRAHDGAEL-----DAIGD 235 >gi|294054315|ref|YP_003547973.1| hypothetical protein Caka_0779 [Coraliomargarita akajimensis DSM 45221] gi|293613648|gb|ADE53803.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 345 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 58/146 (39%), Gaps = 19/146 (13%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S+ +++ + ++ + I K + R+G F + Sbjct: 104 LDLSTRENIVTRLDAAKEVVQEFIG---KRPHD----RIGLVAFAADAFVVSPLTLNHDW 156 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 L + + + D N G TAI A+ + + + +++ + ++LLTDGE Sbjct: 157 LKKNVQRLELGDINLSG-TAIGTALGASVNRL-----------RDHESRSRIVILLTDGE 204 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA 404 N ++ AKS +++ TIA Sbjct: 205 NNSGTLSPLSAAEAAKSLNVKVYTIA 230 >gi|306821351|ref|ZP_07454960.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304550638|gb|EFM38620.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 467 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 46/200 (23%), Positives = 76/200 (38%), Gaps = 29/200 (14%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 ++ +DSS L R+ I + DN V F+ + Sbjct: 25 DGINIAFVIDSSGSMFYNDPNGLRREVTHKFIDRLT--DNDMAAV----IGFDYKATVLE 78 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 F+ KL + K G T I A+ AYD + + + + K+ Sbjct: 79 QFTSNKEKLHDAVDK-----IRSDGGTNIGRAVSIAYDLFNNLDNNRKEK------YPKF 127 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 ++LLTDG+ E I AK GI+I TI +++ L + A + + Sbjct: 128 LILLTDGDGDYSEEYTI----LAKKAGIKIYTIGLGNGVSEK-----LLKDIAKGTDGEY 178 Query: 429 FEANSTHELNKIFRDRIGNE 448 F A +LNKIF ++I ++ Sbjct: 179 FHAKDASKLNKIF-EKIADK 197 >gi|147921050|ref|YP_685140.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] gi|110620536|emb|CAJ35814.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] Length = 1310 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 38/156 (24%), Positives = 63/156 (40%), Gaps = 20/156 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + +T VK + G T I+ ++ A E Sbjct: 925 QVGVVSFYTSASLNSALKQMNSGTNKTTVKNAINSLSASGGTDISSGIKKA------IAE 978 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEK 414 + H+ AK+YI++LTDG + + I +KAK++G I TI ++ Sbjct: 979 LDAHKRST---AKQYIIVLTDGYSQYPEFDLIE-ADKAKAKGYTIFTIGMGMADE----- 1029 Query: 415 ARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 L AS P ++ S +L + D IG EI Sbjct: 1030 --DTLKKIASKPEYYYRVLSPEQLEAAYYD-IGQEI 1062 >gi|190894968|ref|YP_001985261.1| hypothetical protein RHECIAT_PC0000634 [Rhizobium etli CIAT 652] gi|190700629|gb|ACE94711.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 444 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 52/390 (13%), Positives = 122/390 (31%), Gaps = 69/390 (17%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + L + TG+ I+ AL + ML G D +R ++ A+I + Sbjct: 11 RLVRAFRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIA-A 69 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V I + E+ + + + ++E E+ Sbjct: 70 VKQINNSEDTDALKQKVSDWFHAQVENSYALG-----------------------EIEID 106 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + ++ + + F++ I + + + + + SY +++ VID Sbjct: 107 TTNHNITATASGTVP----TTFMKIANIDTVPVSVGSAVKGPATSY-----LNVYIVIDR 157 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML S + Sbjct: 158 SPSMLLAATTSGQSTM-------------------------------YSGIGCQFACHTG 186 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + D S+ +++ + + DA+ V+ I + D+ ++ +++G D Sbjct: 187 DAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYSLGDT 246 Query: 306 VISDPSFSWGVHKLIRTI-VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + + + ++ + + D A I+ + D ++ Sbjct: 247 TKEVLAPTLDTSNARKRLSDDSYGLTSATSMNYTYFDVALAALQKIVGTGGDG----TSS 302 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 K ++LLTDG +Q N K Sbjct: 303 ANPLKLVLLLTDGVQSQRGWVVKNSSNLKK 332 >gi|83859216|ref|ZP_00952737.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] gi|83852663|gb|EAP90516.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] Length = 441 Score = 73.9 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 68/474 (14%), Positives = 146/474 (30%), Gaps = 100/474 (21%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G+ ++ A+L+ ++ G +D R ++ A + A+ Sbjct: 25 RHFTQDVRGNVAMMFAMLLGPLVVSVGGALDYSRTFTIGAEIQSAMDAGTLAAAS----- 79 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 L + + + E+ ++ V+++ + Sbjct: 80 LSQGEDPETIVRNYITAALSEH------------------NGVLERLNVQVSSDLAINSR 121 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 +++ + + L L +G + + +EA R+ + I V+D S SM Sbjct: 122 EVTADAVISVPTL--MLGIIGYDALTLNRVSEANERVRN------LEISLVLDISGSMSG 173 Query: 192 YQ----RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR-DEKLSPYMVSCNKSLYYMLYP 246 + RD+ + + P + S S G+R + ++ +V + L Sbjct: 174 SKITALRDAAEEFVGVMMDPDLEGLTSLSVIPYNGGVRLPQTVTNDLVPGTPNDSGCLEL 233 Query: 247 GPLDP--------SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 G DP + D + + + + + +R Sbjct: 234 GVSDPVTMDLAANGYDWLDWQDRDQRGWRSSAFCPEENEATVFLEQTPSV--LVNLIRDL 291 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 N + +WG L G A A AYD + Sbjct: 292 DAGGNTGLDVAT--AWGARALDPAW------RGRLGGDFASRPA---AYDDPSTM----- 335 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-----------CNKAKSQ----------- 396 K +V++TDG T ++A+ Sbjct: 336 ----------KVLVVMTDGAATAQIRRAQNWYGDWYSYEIYSASQARDNMADACDAAEAE 385 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 G+ I TIAF V+ + R + +CAS P +++ + +++ F + I ++ Sbjct: 386 GVHIYTIAFQVSGSTN---RNLMRDCASRPENYYAVENL-DISAAF-NSIAADL 434 >gi|332140758|ref|YP_004426496.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] gi|327550780|gb|AEA97498.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] Length = 349 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 59/163 (36%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D ++ + T++ I TAI DA+ A +E Sbjct: 133 RLGLILFADTAYVQAPLTYD-RDTVSTLLSEAVIGL-VGEQTAIGDAIGLAVKRFDERDE 190 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 ++LLTDG+NT N A ++G+++ TI +K Sbjct: 191 SN-----------NVLILLTDGQNTAGNITPEQAKELAINKGVKVYTIGVGADKMLIQSF 239 Query: 411 --------QQEKARYFLSNCASP--NSFFEANSTHELNKIFRD 443 QE L++ A+ +F A + EL I++ Sbjct: 240 FGSREINPSQELDEGMLTDIATSTGGQYFRARNAQELEAIYQQ 282 >gi|318604213|emb|CBY25711.1| protein TadG, associated with Flp pilus assembly [Yersinia enterocolitica subsp. palearctica Y11] Length = 457 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 68/482 (14%), Positives = 147/482 (30%), Gaps = 74/482 (15%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + K+ G I ++ P + + + ++ + + L A + A + + Sbjct: 11 FNHFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALT-- 68 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 I++ E + + N N + +V + N Sbjct: 69 -IENNEIPDEPQQ-------------IKNNALVLSYVNAYLPSKKFLVPIININDNTHYL 114 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y ++ Y S F ++ + A + + +V D+S Sbjct: 115 EYNAAVTMAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEAS---EPTDVIFVADYSG 171 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML--- 244 SML +++ + K + I P+ + ++ Sbjct: 172 SMLYNFNENKPRDHERIDALRSAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQK 231 Query: 245 ------------YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 P S + ++ L I + S+ + + ID Sbjct: 232 TYCHFPFSPKIHKPKGNYLSDEIKRSSNTLLLLDYIGDIIDYDKTIDSITGNAQTIDIPM 291 Query: 293 DTVRMGAT------FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 VR G ++ + + I+ G T I+ + +A Sbjct: 292 SDVRFGDVCLQGSNAYSLEQEQYI---NNIDNI---------IEMEPHGWTLISSGILSA 339 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAI---------CNKAKS 395 + + + H KK +++L+DG +T D + +GI I C + K Sbjct: 340 -NNLFKNKAKNGH--------KKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKE 390 Query: 396 QGIRI--MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I++ + IA+S + + E C ++++EA++ HEL + + V Sbjct: 391 NDIQMAFIAIAYSPDNNKNEPYHINWKKCVGEDNYYEAHNAHELEHKLQQAVSGSTTREV 450 Query: 454 IR 455 R Sbjct: 451 GR 452 >gi|212635916|ref|YP_002312441.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212557400|gb|ACJ29854.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 333 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 57/163 (34%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + + A TAI +A+ Sbjct: 130 RIGLILFADHAYLQSPLTQDRRTVAQYLNE--AEIGLVGRQTAIGEAIALG--------- 178 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 V R + + ++LLTDG N + + A +GI I T+ ++ Sbjct: 179 --VKRFDQVENSNRVLILLTDGSNNAGSISPEQATDIAAKRGITIYTVGVGAEVMERRTL 236 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L A + S+F A +T EL +I+++ Sbjct: 237 FGKERVNPSMDLDETQLKQIAEKTGGSYFRARNTEELERIYQE 279 >gi|312622403|ref|YP_004024016.1| von willebrand factor type a [Caldicellulosiruptor kronotskyensis 2002] gi|312202870|gb|ADQ46197.1| von Willebrand factor type A [Caldicellulosiruptor kronotskyensis 2002] Length = 1166 Score = 73.5 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 46/251 (18%), Positives = 88/251 (35%), Gaps = 39/251 (15%) Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 L F ++ V + + + K+++ + + +++D Sbjct: 440 LMTFVPLDEQGVDPVNKVVWAKTNHFTTFVLFYIPTWKAIWEVPINKGEREINQQINYID 499 Query: 261 ------SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 SS + A S + ++ + D R F+D + Sbjct: 500 LVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGD------RAAVVDFDDFGYLLQPLTT 553 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS-SNEDEVHRMKNNLEAKKYIVL 373 + I + + G T I + ++ A +IS S+ED + K I+L Sbjct: 554 DFQAVKNAIDR-----IDSWGGTNIAEGIRIANQQLISLSSEDRI----------KVIIL 598 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEA 431 LTDGE DN +AK+ GI I TI + + L + A+ +F Sbjct: 599 LTDGEGYYDNNLT----TEAKNNGITIYTIGLGTSVDE-----NLLRDIATQTGGMYFPV 649 Query: 432 NSTHELNKIFR 442 +S +L ++F+ Sbjct: 650 SSASQLPQVFK 660 >gi|291547618|emb|CBL20726.1| fibro-slime domain [Ruminococcus sp. SR1/5] Length = 1928 Score = 73.5 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 51/335 (15%), Positives = 108/335 (32%), Gaps = 60/335 (17%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVS-IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 +++ I A + + S + V+ I V+D S SM + + A Sbjct: 1065 RTYDITINATSTSTSSIIETKTSVADIMLVLDVSGSMGEDITSYSYTFVANNTSEARDDK 1124 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD---SSSLRHVIK 269 K + D + + + + P +++ + + + Sbjct: 1125 KLLNRNVTYYIEVDGSYKEMWYYSSYNKGWRVGPRGSSDDAAKDKYNNCKIYTRTSTTET 1184 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW----------GVHKL 319 + +++A+ I K + ++G T F+ + + G Sbjct: 1185 RLDALKNAVNQFIDDTAKKSPNS---KIGITVFSSTDDYNRPYGNHGTSVSLGEVGTADS 1241 Query: 320 IR-TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + T +K F D G T ++ A + + + + KY+VL TDG+ Sbjct: 1242 AKVTELKNFVKDLKANGGTDPAVGLEDAKNKLDAMVDTNP----------KYVVLFTDGK 1291 Query: 379 NTQDNEEGIAIC-------------------NKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 T + + + AK+ + TI F++N ++A+ FL Sbjct: 1292 PTGGGNKWNSNAQKNAETQAGELKTGLRNNVDNAKNP-YTVYTIGFALNDEG-DRAKTFL 1349 Query: 420 SN-----------CASPNSFFEANSTHELNKIFRD 443 S +S + A+ L +IF+ Sbjct: 1350 SGGTYDGKKDPGIASSSDCAKTADDAASLTQIFQS 1384 >gi|84502751|ref|ZP_01000870.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] gi|84389146|gb|EAQ01944.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] Length = 470 Score = 73.5 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 78/506 (15%), Positives = 167/506 (33%), Gaps = 138/506 (27%) Query: 23 FIITALLMPVMLG---VGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 ++ AL+M ++L V G+ VDV+R ++Q + + A+ Sbjct: 31 GVMLALVMFMLLTMMTVAGIGVDVMRTEMERTRIQQVIDASTLAAA-------------H 77 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 K++ PKQ + +Y F + + + + +++ V + +L Sbjct: 78 KDNALDPKQVVLDY------------FDKAALASYISADDILVGGGETSTAVEV----NL 121 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS--- 196 + F+R +G +S+ + + AE + + V+D S SM D +R S Sbjct: 122 TAQVKTPFIRHLGNESFNVPARGRAEQAYGNSE------VSLVLDISGSMDDNRRMSRLH 175 Query: 197 --------------EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 ++ P V ++ +R Y V + Sbjct: 176 RAANEFVDTVLTPDSVDRVSVSLIPYTGDVNVGWDIFSRMNVRQLHDYSYCVQFTPDDFS 235 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P D + +HF + + I +S + + + Sbjct: 236 TTAIDPEDAYIQGQHFSHVDARFNYISCPT----------QSYETVTPFSQ--------- 276 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 ++ + +++L + I G+ +++A + + ++ ++ + Sbjct: 277 -----NNAALEAQINRLTGRERTSIHIGIK-WGAAMLDEAFRPLVNDLVDNSIVDEAFRD 330 Query: 363 NNLEAK----KYIVLLTDGENT---------------------------QDNEEGI---- 387 K IV++TDG NT ++ +G Sbjct: 331 RPAPFTSNTLKVIVVMTDGMNTETKRIKEFAYDTPDMRAHWARHAMDDWDNDVDGSVEDH 390 Query: 388 ----------------AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 ICN AK+ GI I +I F +N ++ + +CAS S F Sbjct: 391 LFDTYYDTAIGNALLQNICNAAKANGIIIYSIGFEINNDAAQE----MEDCASSPSHFYR 446 Query: 432 NSTHELNKIFRDRIGNEIFERVIRIT 457 ++++ F I ++ + +R+T Sbjct: 447 VEGVQISEAF-SSIAQQL--KQLRLT 469 >gi|77359908|ref|YP_339483.1| von Willebrand factor type A [Pseudoalteromonas haloplanktis TAC125] gi|76874819|emb|CAI86040.1| conserved protein of unknown function; putative Von Willebrand factor type A domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 328 Score = 73.5 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 66/198 (33%), Gaps = 34/198 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L I R+G F D + V K Sbjct: 103 DMAYNGQYVDRLTMVKAVLTDFIEQ-------RQGDRLGLILFGDTAFLQTPLTRDV-KT 154 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + ++ I +TAI DA+ + + ++ + +VLLTDG+N Sbjct: 155 VSKMLSEAQIGLVGR-ATAIGDALGLSVKRFANKDKSN-----------RIVVLLTDGQN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK------------TQQEKARYFLSNCA--SP 425 T N + A+ GI++ TI + + L A + Sbjct: 203 TAGNLKPEEALLLARDAGIKVYTIGVGSDNPRGFSLFNMGGMSGDTIDEGLLKRIAEQTG 262 Query: 426 NSFFEANSTHELNKIFRD 443 +F A L +I+ + Sbjct: 263 GLYFRAKDVAGLQQIYAE 280 >gi|260774144|ref|ZP_05883059.1| protein BatA [Vibrio metschnikovii CIP 69.14] gi|260611105|gb|EEX36309.1| protein BatA [Vibrio metschnikovii CIP 69.14] Length = 322 Score = 73.5 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 40/268 (14%), Positives = 78/268 (29%), Gaps = 44/268 (16%) Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + Q G+ + + + ++Y P Sbjct: 31 TTAVPLTYLPPSSDNHQPQQWLMKGLVLTLWICLVAAAARPVWYGEPITHSQPHRDMMLV 90 Query: 259 VDSSSLRH---------VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 VD S + I + V+ L+ I ++ D R+G FF D Sbjct: 91 VDLSYSMNQKDMQIGDDYIDRLTAVKQVLSDFIAQ-RQGD------RLGLIFFADHAYLQ 143 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + + + + A+ TAI D + A + ++ Sbjct: 144 TPLTLDRTTVAQQLNQ--AVLRLIGTQTAIGDGIGLA-----------TKTFIESDAPQR 190 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKAR 416 ++LL+DG N + I + AK I T+ + ++ Sbjct: 191 VMILLSDGSNNAGVLDPIEAAHIAKQYHTTIYTVGVGAGEMMVRDFFMTRRINTAEDLDE 250 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFR 442 L A + +F A H+L I++ Sbjct: 251 ETLQKIADLTGGQYFRARDKHDLQTIYQ 278 >gi|77459433|ref|YP_348940.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77383436|gb|ABA74949.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 359 Score = 73.5 bits (178), Expect = 8e-11, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 65/197 (32%), Gaps = 34/197 (17%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D + + LV+ L + S D R+G F + ++ Sbjct: 104 FPDMQWNDEDVSRLSLVQHLLGDFLES-------RDGDRVGLILFGSQAYLQAPLTFD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +R + I +TAI DA+ A + RM+ + ++L+TDG Sbjct: 156 RTVRVWLDEARIGI-AGKNTAIGDAIGLALKRL---------RMRPAQ--SRVLILVTDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA-- 423 N + + A S+G++I I + + L A Sbjct: 204 ANNGGEIDPLTAAKLAASEGVKIYPIGIGADPEESGATALLGGNPTLDLDEPALKAIAEV 263 Query: 424 SPNSFFEANSTHELNKI 440 + +F A +L I Sbjct: 264 TGGRYFRARDGKQLQAI 280 >gi|156308416|ref|XP_001617662.1| hypothetical protein NEMVEDRAFT_v1g225902 [Nematostella vectensis] gi|156195093|gb|EDO25562.1| predicted protein [Nematostella vectensis] Length = 273 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 39/195 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ ++ I D +ND R+G + + + +++++ Sbjct: 50 RLDALKRVASTFIE-----DRIND--RIGLVVYAGESYTRTPITSDKTVILQSLKTVEYD 102 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T I + TA + I K++ + I+LLTDG N + Sbjct: 103 DSIIADGTGIGVGLATAINRI-----------KDSKAKSRVIILLTDGVNNAGTIDPRMA 151 Query: 390 CNKAKSQGIRIMTIAFSVN-------KTQQEKAR------------YFLSNCA--SPNSF 428 + AK GI++ TI N QE + + A + + Sbjct: 152 ADIAKQYGIKVYTIGIGTNGMALFPYAKDQETGKFLFRNMQVEIDEKLMKEIAEMTDGKY 211 Query: 429 FEANSTHELNKIFRD 443 F A +L I+ + Sbjct: 212 FRATDDKKLKAIYAE 226 >gi|83941160|ref|ZP_00953622.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] gi|83846980|gb|EAP84855.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] Length = 480 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 83/515 (16%), Positives = 161/515 (31%), Gaps = 133/515 (25%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G I+ ++ +M+ VGG+ +D +R L+ + A++ A+ L Sbjct: 30 RFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAA-----DL 84 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +++ + + + N + NF V Sbjct: 85 DQMRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFR----------------------TVT 122 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + D+ + F+ G + + ++AE + V I V+D S SM Sbjct: 123 VDASKDMD----TQFIGRFGFPTLEVPAHSQAE------ERVAKVEISLVLDISGSMATN 172 Query: 193 QR-----DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 R D+ L+ + + + S S + L ++ N+ + + Sbjct: 173 NRLGEVQDAADIFLDTVLKDENEDLISVSLVPYSEQVNAGPLIMDRMNVNR-KHDYSHCI 231 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 D + ++SS+ + ++ D + + D R+ Sbjct: 232 DFDNGDFDSIAMNSSTRYNQMQHFQWNYDGRNNYRD--DTVCPRYDYERI---------- 279 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT-------AYDTIISSNEDEVHR 360 FS L I D T+I M+ A+ I +S + H Sbjct: 280 --TPFSQNKRTLKNQID-----DLVPRAGTSIFLGMKWAAAMLDPAFRDINNSLVNAGHV 332 Query: 361 MK---------NNLEAKKYIVLLTDGENTQD----------------------------- 382 + + E K ++L+TDG N Sbjct: 333 DREFYNRPASYTDSETLKTVILMTDGANDNSFRISNTYYNEDSEYVHWNRYNLWWYLRRE 392 Query: 383 -------------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 N IC+ AK++ I I +I F V+ + +CA Sbjct: 393 VNSRYWGYFYYQKYNKSLGNTLLSNICDAAKAKRIVIWSIGFEVDDEDVPA----MQDCA 448 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 S S F EL++ FR I +I + +R+T+ Sbjct: 449 SSPSHFFRVEGVELSEAFR-AIARQINQ--LRLTQ 480 >gi|228472814|ref|ZP_04057572.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] gi|228275865|gb|EEK14631.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] Length = 332 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 69/195 (35%), Gaps = 42/195 (21%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ A I+ + D R+G ++ + + +++ + + Sbjct: 112 RIEALKRVAAQFIQQ-RASD------RIGIVVYSGESYTKVPATTDKSIVLQALKEIRQG 164 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + + TAI + TA + + K++ K I+L+TDG N + ++ Sbjct: 165 EIED--GTAIGMGLGTAINRL-----------KDSKTKSKVIILMTDGVNNTGVIDPLSA 211 Query: 390 CNKAKSQGIRIMTIAF---------------------SVNKTQQEKARYFLSNCASPNSF 428 AK GIR+ TI V EK +S + + Sbjct: 212 AELAKEYGIRVYTIGIGTNGKALSPVAYNPDGSFQYDMVPVEIDEKLLAEISKI-TGGKY 270 Query: 429 FEANSTHELNKIFRD 443 F A ++L +I+ + Sbjct: 271 FRATDNNKLAQIYTE 285 >gi|326335930|ref|ZP_08202107.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691894|gb|EGD33856.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 332 Score = 73.1 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 67/194 (34%), Gaps = 40/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ + I +K D R+G ++ + + +++++ Sbjct: 112 RIEALKRVASQFIEE-RKSD------RIGIVVYSGESYTKVPATTDKSIVLQSLKDIKQG 164 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + + TAI + TA + + K++ K I+L+TDG N + ++ Sbjct: 165 EIED--GTAIGMGLGTAINRL-----------KDSKTKSKVIILMTDGVNNTGVIDPLSA 211 Query: 390 CNKAKSQGIRIMTIAFSVN-------KTQQEKARY-----------FLSNCASP--NSFF 429 AK GIR+ TI N + + L A +F Sbjct: 212 AELAKEYGIRVYTIGIGTNGKALSPVAYNPDGSLQYDMVPVEIDEKLLGEIAQSTGGKYF 271 Query: 430 EANSTHELNKIFRD 443 A +L +I+ + Sbjct: 272 RATDNKKLAQIYTE 285 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 88/251 (35%), Gaps = 39/251 (15%) Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 L F ++ V + + + K+++ + + +++D Sbjct: 440 LMTFVPLDEQGVDPVNKVVWAKTNHFTTFVLFYIPTWKAIWEVPINKGEREINQQVNYID 499 Query: 261 ------SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 SS + A S + ++ + D R F++ + Sbjct: 500 LVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGD------RAAVVDFDNFGYLLQPLTT 553 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS-SNEDEVHRMKNNLEAKKYIVL 373 + I + + G T I + ++ A +IS S+ED + K I+L Sbjct: 554 DFQAVKNAIDR-----IDSWGGTNIAEGIRIANQQLISRSSEDRI----------KVIIL 598 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEA 431 LTDGE DN +AK+ GI I TI + + L + A+ +F Sbjct: 599 LTDGEGYYDNNLT----TEAKNNGITIYTIGLGTSVDE-----NLLRDIATQTGGMYFPV 649 Query: 432 NSTHELNKIFR 442 +S +L ++F+ Sbjct: 650 SSASQLPQVFK 660 >gi|332162963|ref|YP_004299540.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325667193|gb|ADZ43837.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] Length = 457 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 69/482 (14%), Positives = 147/482 (30%), Gaps = 74/482 (15%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + K+ G I ++ P + + + ++ + + L A + A + + Sbjct: 11 FNHFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALT-- 68 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 I++ E + + N N + +V + N Sbjct: 69 -IENNEIPDEPQQ-------------IKNNALVLSYVNAYLPSKKFLVPIININDNTHYL 114 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y ++ Y S F ++ + A + + +V D+S Sbjct: 115 EYNAAVTMAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEAS---EPTDVIFVADYSG 171 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML--- 244 SML +++ + K + I P+ + ++ Sbjct: 172 SMLYNFNENKPRDHERIDALRSAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQK 231 Query: 245 ------------YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 P S + ++ L I + S+ + + ID Sbjct: 232 TYCHFPFSPKIHKPKGNYLSDEIKRSSNTLLLLDYIGDIIDYDKTIDSITGNAQTIDIPM 291 Query: 293 DTVRMGAT------FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 VR G ++ + + I+ G T I+ + +A Sbjct: 292 SDVRFGDVCLQGSNAYSLEQEQYI---NNIDNI---------IEMEPHGWTLISSGILSA 339 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAI---------CNKAKS 395 + I + H KK +++L+DG +T D + +GI I C + K Sbjct: 340 -NNIFKNKAKNGH--------KKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKE 390 Query: 396 QGIRI--MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I++ + IA+S + + E C ++++EA++ HEL + + V Sbjct: 391 NDIQMAFIAIAYSPDNNKNEPYHINWKKCVGEDNYYEAHNAHELEHKLQQAVSGSTTREV 450 Query: 454 IR 455 R Sbjct: 451 GR 452 >gi|149376601|ref|ZP_01894361.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] gi|149359119|gb|EDM47583.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] Length = 340 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 67/191 (35%), Gaps = 35/191 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + V+ L I+ ++ D R+G F ++ + +RT++ Sbjct: 113 INRLQAVKRVLDDFIQR-REGD------RLGLLLFGTEPYIQAPLTFDLA-TVRTLLHEA 164 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I +TAI DA+ A + ++ ++ ++LLTDG NT Sbjct: 165 GIGM-AGRATAIGDALGLAVKRLRDRPQE-----------QRVVILLTDGANTAGEIAPD 212 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKA-------------RYFLSNCA--SPNSFFEAN 432 AK+ G+RI TI Q L+ A + +F A Sbjct: 213 KAAEIAKAAGVRIYTIGIGAETMVQRGLLGSRRVNPSRDLDEGLLTRIAQQTGGEYFRAR 272 Query: 433 STHELNKIFRD 443 S EL I+ Sbjct: 273 SLPELELIYES 283 >gi|262193845|ref|YP_003265054.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077192|gb|ACY13161.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 346 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 65/205 (31%), Gaps = 32/205 (15%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D + + +V+D + + +D ++ +G F + + Sbjct: 106 LDLADGGLDQTRLEVVKDVFRAFVAGEDGLDGRSNDT-IGLVSFAGFADTRCPLTLNHGS 164 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 L+ + + E TAI D + A + + + + + + I+LLTDG Sbjct: 165 LLTILDDLEIVRERAEDGTAIGDGLGLAVERL-----------RESEASSRVIILLTDGV 213 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA------------------RYFLS 420 N E + A GI++ TI + + L Sbjct: 214 NNAGIETPLEAAELASRLGIKVYTIGAGTDGVAPVRVTNPLTGAEELRPMPVEIDEATLE 273 Query: 421 NCA--SPNSFFEANSTHELNKIFRD 443 A + +F A L +++ Sbjct: 274 AIAEHTGGRYFRATDGDGLRQVYEQ 298 >gi|87311197|ref|ZP_01093320.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] gi|87286105|gb|EAQ78016.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] Length = 373 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 67/448 (14%), Positives = 122/448 (27%), Gaps = 98/448 (21%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G I+ A+L+PV+L + VDV L+ A +A + L Sbjct: 15 ISRRRGAVLILIAVLLPVILWMAAFCVDVAYMQLTRTELRIATDSAARAGARTLS----- 69 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM------NPRKSA 128 + + + EY +N D +V+ + ++ + K Sbjct: 70 ----LEQDASLAHKSAIEYAAKNNVAGNTLTLADSDVQIGLSVRTDDVGRFTFSSGGKLL 125 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 V ++ R ++ L I T S + I V+D S S Sbjct: 126 NSVNVTGRRTQQAPDGAVRLYLTPIFGHEFFQPVADATAS-----QIDRDIALVVDRSGS 180 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M + + +S N P Sbjct: 181 MTFRINRN--------------SYESGWRNN---------------------------DP 199 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + VDS + + + + V + +N Sbjct: 200 VPSRARWWALVDS----------------VDGFLTELGST-PQLELV--SLSTYNSSAKI 240 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 D + ++ + GST I M T+ + + K Sbjct: 241 DEQLTDKYSRIEDALDDYSR--RYPDGSTNITAGMDRGISTLQNKKYARPYASKT----- 293 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPN 426 +V++TDG + + A + A I + TI +S + + A Sbjct: 294 --MVVMTDGNHNYGSSPTNAAYDAASDD-IVVHTITYSDGAN-----QSLMREVARIGGG 345 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI 454 + A EL +IFR+ I + Sbjct: 346 QHWHAPDGDELEEIFRE-IARNAPTLLT 372 >gi|70730104|ref|YP_259843.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344403|gb|AAY92009.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 358 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 61/197 (30%), Gaps = 34/197 (17%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D + + +LV+ L + ++ D R+G F + ++ Sbjct: 104 FPDMQWQDEDVSRLNLVKHLLGDFLEH-REGD------RVGLILFGSKAYLQAPLTFD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +R + I +TAI DA+ A + + ++L+TDG Sbjct: 156 HTVRVWLDEAKIGI-AGKNTAIGDAIGLALKRLRQRPAQS-----------RVLILVTDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA-- 423 N I A +G++I I + Q L A Sbjct: 204 ANNGGEIAPITAARLAAEEGVKIYPIGIGADPEQSATLGVLGINPSLDLDEPALKELAQV 263 Query: 424 SPNSFFEANSTHELNKI 440 + +F A EL I Sbjct: 264 TGGRYFRARDGQELQAI 280 >gi|254448210|ref|ZP_05061672.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] gi|198262077|gb|EDY86360.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] Length = 336 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 74/213 (34%), Gaps = 40/213 (18%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + S+ E+ D+ R + + +D ++ ++ D R+ F Sbjct: 94 ISGSMEEQDMDDNGQRR---SRIAVTKDVAMDFVKQ-REGD------RIALVLFGTHPYL 143 Query: 309 DPSFSWG----VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 ++ + + + + + +TAI DA+ A V R+++ Sbjct: 144 QTPLTFDHPTVMQHIYEAQLTMADDLQRGIHATAIGDAIGLA-----------VKRLRDI 192 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA--------- 415 K ++LLTDG + + A +G++I TI + Q Sbjct: 193 DAPDKTLILLTDGSDNASQVAPLKAAQIAAREGLKIYTIGLGAEQRQASLLGFDFGFGKN 252 Query: 416 ----RYFLSNC--ASPNSFFEANSTHELNKIFR 442 L + A+ +F A + EL +I++ Sbjct: 253 REIDEKTLKDIAKATDGRYFRARNPEELREIYQ 285 >gi|313207255|ref|YP_004046432.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446571|gb|ADQ82926.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023479|gb|EFT36485.1| aerotolerance operon BatA [Riemerella anatipestifer RA-YM] gi|325335298|gb|ADZ11572.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Riemerella anatipestifer RA-GD] Length = 330 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 75/219 (34%), Gaps = 41/219 (18%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 +D LS + + + + +++ + I+ + D R+G ++ Sbjct: 85 TKGMDIVLSIDVSLSMLAKDLEPDRLTALKEIARTFIKQ-RTTD------RIGLVEYSGE 137 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + ++ +D G T I D + A + S Sbjct: 138 ALMRVPLTSDHRVVEEELMSFNPMDLE--GGTNIGDGLAVAVSHLRKSKAKS-------- 187 Query: 366 EAKKYIVLLTDGENTQDNE-EGIAICNKAKSQGIRIMTIAFSVN---------------- 408 K I+L+TDG NT DN + A++ I++ TI N Sbjct: 188 ---KIIILMTDGVNTIDNAMSPLTAAELARNNDIKVYTIGIGSNGLALMPTQQDIFGNLV 244 Query: 409 --KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + Q + Y L + A + +F A S L +I+ + Sbjct: 245 FTEEQVKIDEYLLRDVAQITGGKYFRATSNESLKQIYEE 283 >gi|260914303|ref|ZP_05920772.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] gi|260631404|gb|EEX49586.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] Length = 584 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 83/553 (15%), Positives = 160/553 (28%), Gaps = 114/553 (20%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + T+ K + G + ++TALL +L + VD + L QA A + Sbjct: 16 WSTRCFQKLKDFYQEEKGVYAVMTALLSFPLLVLIAFTVDGTGIILDKVRLAQATDQAAL 75 Query: 63 TASV--------PLIQSLEEVSSRAKNSFTFPKQKI---EEYLIRNFENNLKKNFTDREV 111 P+ + + S + F K+ ++ + L K + E Sbjct: 76 LLVAENNAYRKNPMHDDVTKQSVSKEELSKFSGDKLSAQKDKRNQELIQGLAKMYLRSEN 135 Query: 112 RDIVRD--------------------TAVEMNPRKSAYQVVLSSRYDLLL-NPLSLFLRS 150 + + R+ + + PLS L Sbjct: 136 KAQKDNHLPVTIDQPFDYKCEELDLINPKNQYSRRKPVTCYVQGSVNREFWIPLSADLVK 195 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML-------------------- 190 K+ + + + V + V DFS SML Sbjct: 196 THTKNGRLPINSGISYAVKEKAIVIPVDLMLVSDFSGSMLWDLKNNENAQYPNRKIDILR 255 Query: 191 ----DYQRDSEGQPLNCFGQPADR-TVKSYSSQNGKVGIRDEKLSPYMVSCN-------- 237 D Q L+ P +R +++ + G ++ + PY + Sbjct: 256 SVVSDIQNILFPTKLSEDASPYNRMGFAAFAGGTRQRGDKNSCVMPYYLKSGVHDFRVAY 315 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK------HLVRDALAS----------- 280 L Y G D + R V +R AL Sbjct: 316 WQLDSFNYRGSPWDCKDTNVLDDRGNPRPVNACLIKGNPEDALRTALNDRHLSTSMKLIF 375 Query: 281 -----VIRSIKKIDN-VNDTVRMGATFFND-------RVISDPSFSWGVHKLIRTIVKTF 327 V ++IK+++N + V +N+ + S +W + V Sbjct: 376 EDVLDVDKTIKQVENFDGNRVNDYKLTYNNPDHCLGGNEGVETSQAWFTKSKPK--VAEA 433 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNE-DEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 GSTA + + ++ N E K ++ +++L+DGE+ + + Sbjct: 434 LSKIKPTGSTAASSGFIIGANLLMDKNTVPEAQPAKLGTNTQRILMVLSDGEDNRPTFDT 493 Query: 387 IAI------CNKAKSQGIRI---------MTIAFSVNKTQQEKARYFL-SNCASPNSFFE 430 + C+ + + + IAF+ Q + C N+++E Sbjct: 494 LTTLLNAGLCDNIRKKADSLQDPKFNTLPTKIAFAAFGFQPPPEQKAAWQKCVGENNYYE 553 Query: 431 ANSTHELNKIFRD 443 +S L F+ Sbjct: 554 PSSKEALLDAFKQ 566 >gi|192288907|ref|YP_001989512.1| hypothetical protein Rpal_0477 [Rhodopseudomonas palustris TIE-1] gi|192282656|gb|ACE99036.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 443 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 70/479 (14%), Positives = 147/479 (30%), Gaps = 91/479 (18%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G+ +I AL++ ++ G VD R + L+ AA A + + ++ Sbjct: 9 RFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVG---AVSRTS 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S + +R F NL T V Sbjct: 66 PAYVAAGAMSGDGVISSGADDALRIFNGNLNGLTGYTLASSSATVTKASD---------V 116 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++S+ ++F++ +G+ + + + A Y + ++D S SM Sbjct: 117 VTSQVTFSAQISTMFMKVVGMSAMTVGGTSTATASMPKY-----IDFYLLLDNSPSMGVG 171 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ + + NKS D + Sbjct: 172 ATPTDVSAMIA------------------------------ATANKSSDDHCAFACHDVN 201 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP-- 310 ++ ++ + + + + ++R A ++ + ++ RM F + Sbjct: 202 -NKNNYYNLAKALGITTRIDVLRSATQQLMDTASATATYSNQFRMAIYDFGASAQTAGLR 260 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS--SNEDEVHRMKNNLEAK 368 + L AID + N+ T+Y I +NE + + Sbjct: 261 NLFSLSASLSSAKTAASAIDLMTVKGQNDNNDQDTSYTAIFPAINNEISSPGSGVSGSPQ 320 Query: 369 KYIVLLTDGENTQDNEEGI--------------AICNKAKSQGIRI---MT--------- 402 KY+ ++DG + N + A+C K +GI+I T Sbjct: 321 KYLFFVSDGVADEYNPSCLKPKTGNRCQSPINPALCKTLKDRGIKIAVLYTTYLNLPSND 380 Query: 403 -----IA-FSVNKTQQEKARYF---LSNCASPNSFFEANS----THELNKIFRDRIGNE 448 IA F+ + CASP +FE + +N +F+ + + Sbjct: 381 WYKKWIAPFNAGPYGPSPNSEIAQNMEACASPGFYFEVSPTQGIAEAMNALFKKAVADA 439 >gi|126174972|ref|YP_001051121.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125998177|gb|ABN62252.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 339 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 59/163 (36%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +A+ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGEAIGLAVKRFDKMDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNSGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL++I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQE 278 >gi|153001301|ref|YP_001366982.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151365919|gb|ABS08919.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 340 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 59/163 (36%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +A+ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGEAIGLAVKRFDKMDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL++I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYKE 278 >gi|332291974|ref|YP_004430583.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170060|gb|AEE19315.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 334 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 62/194 (31%), Gaps = 39/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ AS I N R+G + + + ++ + Sbjct: 113 RLEALKKVAASF---INGRPND----RIGLIEYAGESFTKTPITSDKSIVLSALKSIQYN 165 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + E G TAI + T + + K++ K I+L+TDGEN + Sbjct: 166 NIIE-GGTAIGMGLATGVNRL-----------KDSKALSKVIILMTDGENNAGQIDPRIA 213 Query: 390 CNKAKSQGIRIMTIAFS------VNKTQQEKA------------RYFLSNCA--SPNSFF 429 A+ GI++ TI + L A + +F Sbjct: 214 AELAQEFGIKVYTIGMGTNGMALSPYARNANGTFVYENIQVTIDEELLEEIAATTGGQYF 273 Query: 430 EANSTHELNKIFRD 443 A + +L +I+ + Sbjct: 274 RATNNEKLQEIYDE 287 >gi|189219434|ref|YP_001940075.1| hypothetical protein Minf_1423 [Methylacidiphilum infernorum V4] gi|189186292|gb|ACD83477.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 334 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 60/167 (35%), Gaps = 27/167 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R + ++ + L R I + A + TAI DA+ A + E Sbjct: 132 RIGLVAFAGRAYTVCPLTFDHNWLKRKIDQLQAGTIED--GTAIGDALGLALSRLEGKKE 189 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA------FSVNK 409 + + +++LLTDG N N I A + + TI ++ Sbjct: 190 SGERKKIGS-----FLILLTDGANNCGNLTPIEAARLAAHAAVPVFTIGAGINGEVTMPV 244 Query: 410 TQQEKAR------------YFLSNCA--SPNSFFEANSTHELNKIFR 442 +E+ + L N A + +F A ++ + F+ Sbjct: 245 MDEERRKIGSQTVVSEVDEGLLRNIAQLTGGEYFRATDSNAIVSAFQ 291 >gi|188578240|ref|YP_001915169.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522692|gb|ACD60637.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 335 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 67/189 (35%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 121 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLRDSVV 176 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + + Sbjct: 177 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLDPLK 223 Query: 389 ICNKAKSQGIRIMTIAFS-----------VNKTQQEKA-RYFLSNCA--SPNSFFEANST 434 AK++G+RI TIAF + + L A + FF A T Sbjct: 224 AAELAKAEGVRIYTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDT 283 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 284 EELAGIYAE 292 >gi|315126124|ref|YP_004068127.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] gi|315014638|gb|ADT67976.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] Length = 327 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 67/197 (34%), Gaps = 33/197 (16%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L+ I R+G F D + V K Sbjct: 103 DMAYNGQYVDRLTMVKAVLSDFIEQ-------RQGDRLGLILFGDTAFLQTPLTRDV-KT 154 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + ++ I +TAI DA+ + S E + +VLLTDG+N Sbjct: 155 VSKMLSEAQIGLVGR-ATAIGDALGLSVKRFASKKESN-----------RIVVLLTDGQN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIA-----------FSVNKTQQEKARYFLSNCA--SPN 426 T N A+ +GI++ TI F++ L A + Sbjct: 203 TAGNLNPEDALLLAREEGIKVYTIGVGSDNPRGFSLFNMGSGGSNLDEGLLKKIAEQTGG 262 Query: 427 SFFEANSTHELNKIFRD 443 +F A L +I+ + Sbjct: 263 LYFRAKDVAGLQQIYAE 279 >gi|150024244|ref|YP_001295070.1| BatA protein [Flavobacterium psychrophilum JIP02/86] gi|149770785|emb|CAL42250.1| BatA protein [Flavobacterium psychrophilum JIP02/86] Length = 333 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 64/194 (32%), Gaps = 39/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ AS + + + R+G + + + ++ I Sbjct: 112 RMEALKEVAASFVEA-------RQSDRIGVVVYTAEAYTKTPVTSDKAVVLDAINTIKYD 164 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + + G T I + TA + + K++ K I+L+TDG N E + Sbjct: 165 NVLQDG-TGIGMGLATAVNRL-----------KDSKAKSKVIILMTDGVNNAGFIEPVTA 212 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ------------------EKARYFLSNCA--SPNSFF 429 AK GI++ TI N E + + A + +F Sbjct: 213 AEFAKEFGIKVYTIGIGTNGNAPFPYAIAPNGGFLYKMLPVEIDEQLMKDIAKKTGGKYF 272 Query: 430 EANSTHELNKIFRD 443 A S L I+ + Sbjct: 273 RAQSNSSLESIYSE 286 >gi|109897980|ref|YP_661235.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] gi|109700261|gb|ABG40181.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] Length = 343 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 36/161 (22%), Positives = 59/161 (36%), Gaps = 28/161 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G FF D ++ + + ++ I TAI DA+ A Sbjct: 132 RLGLIFFADTAYLQAPLTYD-RETVSQLLNESLIGL-VGEQTAIGDAIGLA--------- 180 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 + R K+ E+ K ++LLTDG+NT N A + G+ + TI ++ Sbjct: 181 --IKRFKSKEESNKVLILLTDGQNTAGNITPEQANELAINNGVTLYTIGVGADQMLVQSI 238 Query: 411 --------QQEKARYFLSNCASP--NSFFEANSTHELNKIF 441 QE L+ A +F A L +I+ Sbjct: 239 FGSRQVNPSQELDEGMLTTLAESTGGRYFRARDAQSLTEIY 279 >gi|170720775|ref|YP_001748463.1| von Willebrand factor type A [Pseudomonas putida W619] gi|169758778|gb|ACA72094.1| von Willebrand factor type A [Pseudomonas putida W619] Length = 358 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 66/204 (32%), Gaps = 39/204 (19%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F D + + LV+ L ++ + R+G F + ++ Sbjct: 104 FPDMQWKDEEVSRLDLVKALLGDFLQ-------DREGDRVGLILFGSQAYLQAPLTFD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +RT + I +TAI DA+ A + + +VL+TDG Sbjct: 156 RTVRTFLDEAQIGI-AGKNTAIGDAIGLAVKRLRQRPAQS-----------RVLVLITDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYF----------LSNCA-- 423 N + A + +RI TI N + L A Sbjct: 204 ANNGGRIHPLTAARLAAQEDVRIYTIGIGANPEASGTPGLLGLNPSLDLDEASLKEIADL 263 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 + ++F A+ EL D IG+ Sbjct: 264 THGAYFRAHDGAEL-----DAIGD 282 >gi|182414212|ref|YP_001819278.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841426|gb|ACB75678.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 377 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 60/191 (31%), Gaps = 34/191 (17%) Query: 279 ASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 I ++ I V R+G F+ R + ++ L + + Sbjct: 132 GDRINRLQAIKPVIQAFIERRPSDRIGIVLFSGRAYTMAPLTFDHRWLGSQLERIKVGLI 191 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + TAI D + + + R ++VLLTDG N + + Sbjct: 192 ED--GTAIGDGLGVGLTRLE-----QAQRESGGKRQGAFVVLLTDGANNRGSLTPQQAAE 244 Query: 392 KAKSQGIRIMTIAFSVNKTQQ-----EKARYF-------------LSNCA--SPNSFFEA 431 AK++GI + TI + +K R L + A + FF A Sbjct: 245 LAKARGIPVYTIGAGQDGIVPFPVFDDKGRKLGYRRIMSDLDEGALRDIAEMTGGHFFRA 304 Query: 432 NSTHELNKIFR 442 + FR Sbjct: 305 ADVGTVESAFR 315 >gi|160875970|ref|YP_001555286.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160861492|gb|ABX50026.1| von Willebrand factor type A [Shewanella baltica OS195] gi|315268165|gb|ADT95018.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 339 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 59/163 (36%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +A+ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGEAIGLAVKRFDKMDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL++I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQE 278 >gi|313159758|gb|EFR59115.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 330 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 63/193 (32%), Gaps = 41/193 (21%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ S I D D R+G F + + ++T++ Sbjct: 109 RITAAKEVAGSFI-----ADRYGD--RIGLVAFAGEAFTQSPLTTD-QGTLQTLLARIRS 160 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G TAI + + TA + + E E K I+LLTDG N + Sbjct: 161 GLIEDG-TAIGNGLATAINRL---RESEAK--------SKVIILLTDGVNNRGEIAPQTA 208 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYF-------------------LSNCA--SPNSF 428 AK+QGIR+ TI A L + A + + Sbjct: 209 AEIAKAQGIRVYTIGVGTEGMAPYPAVDIYGTPTGGTVMAKVEIDEKTLRSIAEQTGGQY 268 Query: 429 FEANSTHELNKIF 441 F A +L I+ Sbjct: 269 FRATDKAKLKAIY 281 >gi|167624593|ref|YP_001674887.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] gi|167354615|gb|ABZ77228.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] Length = 345 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 53/162 (32%), Gaps = 28/162 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +A+ A + Sbjct: 129 RIGLILFADHAYLQSPLTQDRRTVAQYL-KEAQIGL-VGKQTAIGEAIALAVKRFDKVEQ 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N A +GI I TI + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGAISPEQATQIAAKRGITIYTIGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + L A + +F A +T EL +I++ Sbjct: 236 FGKERVNPSMDLDESQLQEIAKTTGGQYFRARNTEELEQIYQ 277 >gi|187736265|ref|YP_001878377.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426317|gb|ACD05596.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 328 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 41/193 (21%), Positives = 63/193 (32%), Gaps = 29/193 (15%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A VI + R+G F + S + H L+ +I++ F + Sbjct: 119 AAKHVITQFVDSRPDD---RIGIVGFAGKTKSFCPLTLD-HALVNSIIRDFHPRMIQADG 174 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI A+ A + E + K I+L+TDG + + A Sbjct: 175 TAIGSAIAAAATRLDDRKETK----------SKIIILVTDGASNSGQISPLVAAENAAKL 224 Query: 397 GIRIMTIAFSVNKTQQEKARYF--------LSNCA--SPNSFFEANSTHELNKIFRDRIG 446 GI+I TIA + L A + F A + NK F IG Sbjct: 225 GIKIYTIAVGTEEGTLANGMVVQSEFDEPTLRKIAQLTGGEHFRATNMASFNKAFTS-IG 283 Query: 447 N----EIFERVIR 455 E + +R Sbjct: 284 KLEKSEAKVQTVR 296 >gi|323493530|ref|ZP_08098652.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] gi|323312353|gb|EGA65495.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] Length = 322 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 62/197 (31%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S I + V+ ++ + + R+G F D + + + Sbjct: 101 DMLSQDDYIDRLTAVKKVVSDFAQQ-------REGDRLGLVLFADHAYLQTPLTLDRNTI 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + TAI + + A T I S ++ ++LL+DG N Sbjct: 154 AKQVESLVLRLIGDK--TAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 200 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--S 424 T + I AK I TI + ++ L A + Sbjct: 201 TSGVLDPIEAAKIAKKYNATIYTIGVGAGEMMVKEFFMTRKVNTAKDLDENTLQEIADLT 260 Query: 425 PNSFFEANSTHELNKIF 441 +F A + EL I+ Sbjct: 261 GGQYFRARNADELATIY 277 >gi|42524204|ref|NP_969584.1| hypothetical protein Bd2794 [Bdellovibrio bacteriovorus HD100] gi|39576412|emb|CAE80577.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 336 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 65/198 (32%), Gaps = 42/198 (21%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK-TFA 328 + ++ +A I + + D R+G F + + +++ + + + A Sbjct: 110 RLEAAKETIAKFISA-RTSD------RIGLVVFAGESFTMVPPTLDYQMILQRVNEISSA 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TA+ AM A R+K++ + ++ +TDGEN + Sbjct: 163 SSAKIKDGTALGVAMANA-----------AGRLKDSQARSRVMIFMTDGENNSGTIDPET 211 Query: 389 ICNKAKSQGIRIMTIAFSVN---------------------KTQQEKARYFLSNCASP-- 425 AK GI++ +I + + L AS Sbjct: 212 GLEIAKGYGIKVYSIGIGKDGPTRIPVYSRDIFGQKVKTYQPFESTVNEDLLGRMASDTG 271 Query: 426 NSFFEANSTHELNKIFRD 443 ++ A + L K+F D Sbjct: 272 GKYYRATTEGALQKVFSD 289 >gi|330959358|gb|EGH59618.1| von Willebrand factor, type A [Pseudomonas syringae pv. maculicola str. ES4326] Length = 353 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 55/159 (34%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TAI DA+ A + Sbjct: 135 RVGLILFGTQAFLQAPLTYD-RRTVRVWLDEAKIGI-AGKNTAIGDAIGLALKRLRMRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + + +VL+TDG N + I A +G++I T+ + + Sbjct: 193 NS-----------RVLVLVTDGANNGGQIDPITAARLAADEGVKIYTVGIGSDPDKNALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L + A S +F A EL+KI Sbjct: 242 GVLGLNPSLDLDEPTLKDIASLSGGQYFRARDGAELDKI 280 >gi|86134839|ref|ZP_01053421.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821702|gb|EAQ42849.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 336 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 59/195 (30%), Gaps = 41/195 (21%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ + + R+G + + + + RTI + Sbjct: 115 RLEALKKVAVDFVDR----RPND---RIGIVVYAGESFTQTPITSDKTIVKRTINRLQWG 167 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G TAI + + + + K++ K I+LLTDG N N + Sbjct: 168 QLE--GGTAIGMGLGSRVNRL-----------KDSKAKSKVIILLTDGVNNAGNIDPTTA 214 Query: 390 CNKAKSQGIRIMTIAFSVNKT-------QQEKA------------RYFLSNCA--SPNSF 428 AK GI++ TI N L N A + + Sbjct: 215 TELAKELGIKVYTIGIGTNGMADFPWSKDPRTGMLNFRKQQVQIDEDLLKNIAEETQGKY 274 Query: 429 FEANSTHELNKIFRD 443 F A L +I+ + Sbjct: 275 FRATDNTSLKEIYDE 289 >gi|209809179|ref|YP_002264717.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] gi|208010741|emb|CAQ81132.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] Length = 422 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 67/459 (14%), Positives = 146/459 (31%), Gaps = 64/459 (13%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV------ 66 KL + GH I+ A+++P + G+ + D R + ++ AA+ A + S Sbjct: 2 KLRRHQKGHAAILFAMMIPALFGIFTLASDGARAIQTKARIEDAAEVATLAVSAHNDPNQ 61 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 S+ + + I + N K+N E+ + AV PR Sbjct: 62 DYGGGGSPSSANQQIVTDYINAYISDVDSINEIKVYKRNC--EEIPECKAGLAVG-EPRY 118 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 ++V +++ P + + MG +++R Y E V + + DFS Sbjct: 119 FEHEVGVTTS-QKSWFPGNDAIVGMGD-----SFSTSGHSLARKYQSE-AVDVMFAADFS 171 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM D + D K N ++ + Y Y Sbjct: 172 GSMGDRWT-GGNKKYEDLIDIIDSISKELQKFNDLEHNDNDNTMGITAYNEYT--YSQYS 228 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 G ++ ++ + + + + + K D+ N+ +N Sbjct: 229 GSSGGWWGDDCYLSQAESDGFWGGVSISKT-IDGLWNE-KSKDHCNN-------SYNSGR 279 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 +D + +V G T+ A+ + Sbjct: 280 FNDIPLTSNF-----DVVNQDVSRFWPEGGTSSYQALIRGAQLLTY-----------GTN 323 Query: 367 AKKYIVLLTDGENTQDNEEGIAI----CNKAKSQ------------GIRIMTIAFSVNKT 410 +++ +++L+DG +T +N + C + ++ I F + Sbjct: 324 SRRLLIVLSDGMDTDNNLTSSLVNAGMCRDIQQGLESDKTLDNRPIRAQMAVIGFDYEPS 383 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + L +C + ++A ++ ++ + I EI Sbjct: 384 ENQA----LKDCVGAENVYKAENSDDILNTILELISEEI 418 >gi|49087064|gb|AAT51411.1| PA3073 [synthetic construct] Length = 341 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 62/197 (31%), Gaps = 34/197 (17%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + D + I + L++ I ++ D R+G F + ++ Sbjct: 104 YRDMRWQDYEISRLELIKKLFGDFIED-RRGD------RVGLILFGSQAYLQAPLTFD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +R + I +TAI DA+ A + + + +VL+TDG Sbjct: 156 HTVRVWLDEAQIGI-AGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCASP 425 NT A Q ++I TI + + L A Sbjct: 204 ANTGGQIAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDEPTLRGIAES 263 Query: 426 --NSFFEANSTHELNKI 440 +F A S+ EL I Sbjct: 264 TGGEYFRARSSAELESI 280 >gi|39933553|ref|NP_945829.1| hypothetical protein RPA0476 [Rhodopseudomonas palustris CGA009] gi|39647399|emb|CAE25920.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 443 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 70/479 (14%), Positives = 147/479 (30%), Gaps = 91/479 (18%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G+ +I AL++ ++ G VD R + L+ AA A + + ++ Sbjct: 9 RFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVG---AVSRTS 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S + +R F NL T V Sbjct: 66 PAYVAAGAMSGDGVISSGADDALRIFNGNLNGLTGYTLASSSATVTKASD---------V 116 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++S+ ++F++ +G+ + + + A Y + ++D S SM Sbjct: 117 VTSQVTFSAQISTMFMKVVGMSAMAVGGTSTATASMPKY-----IDFYLLLDNSPSMGVG 171 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ + + NKS D + Sbjct: 172 ATPTDVSAMIA------------------------------ATANKSSDDHCAFACHDVN 201 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP-- 310 ++ ++ + + + + ++R A ++ + ++ RM F + Sbjct: 202 -NKNNYYNLAKALGITTRIDVLRSATQQLMDTATATATYSNQFRMAIYDFGASAQTAGLR 260 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS--SNEDEVHRMKNNLEAK 368 + L AID + N+ T+Y I +NE + + Sbjct: 261 NLFSLSASLSSAKTAASAIDLMTVKGQNDNNDQDTSYTAIFPAINNEISSPGSGVSGSPQ 320 Query: 369 KYIVLLTDGENTQDNEEGI--------------AICNKAKSQGIRI---MT--------- 402 KY+ ++DG + N + A+C K +GI+I T Sbjct: 321 KYLFFVSDGVADEYNPSCLKPKTGNRCQSPINPALCKTLKDRGIKIAVLYTTYLNLPSND 380 Query: 403 -----IA-FSVNKTQQEKARYF---LSNCASPNSFFEANS----THELNKIFRDRIGNE 448 IA F+ + CASP +FE + +N +F+ + + Sbjct: 381 WYKKWIAPFNAGPYGPSPNSEIAQNMEACASPGFYFEVSPTQGIAEAMNALFKKAVADA 439 >gi|304411849|ref|ZP_07393460.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303383|ref|ZP_07583138.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349709|gb|EFM14116.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306913743|gb|EFN44165.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 339 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 59/163 (36%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +A+ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGEAIALAVKRFDKMDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL++I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQE 278 >gi|217972770|ref|YP_002357521.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217497905|gb|ACK46098.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 340 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 59/163 (36%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +A+ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGEAIALAVKRFDKMDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL++I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQE 278 >gi|84386788|ref|ZP_00989813.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] gi|84378316|gb|EAP95174.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] Length = 404 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 65/365 (17%), Positives = 128/365 (35%), Gaps = 38/365 (10%) Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 + + + + + + +E S ++ L L+PL + + + T Sbjct: 70 MMRAYQEYYKPTLKGVSGLEPEIIGSECRISL----GYSLSPLLPNFQYESYATKVTATG 125 Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 ++V S + V+D S SM S Q L A T++S S+ Sbjct: 126 GGYKSVVESKQSSIPTELVLVLDVSGSMG-----SNIQSLKSILSNALNTIQSQSNNAND 180 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + + P+ S P L + + +D + R+ L D LA++ Sbjct: 181 LDSVSISIVPF-----DSGVAAQRPPWLSKEAAGIYCIDGLNYRNGNFSAALTVDNLATL 235 Query: 282 IRS--IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 +K G ++ + ++ +I G T Sbjct: 236 HSQQPVKFAKPN------GWLSDCNQSSPMLPLTSVFSRVRNSI-----NSLTANGGTRS 284 Query: 340 NDAMQTAYDTIISS-NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI---AICNKAKS 395 + +I S + + E ++ +VL TDG + D + + C A + Sbjct: 285 FHGLLWGVRQLIPSWQQAWGINVSTVPETRRKLVLFTDGADEGDTFDQLVNAGFCTTAIN 344 Query: 396 Q-GIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 Q GI + I + V+ ++ + CA +P+ F A +T +LN+ F D + E Sbjct: 345 QYGIEMNFIGYGVSSSRIAQ----FERCAGNPSRVFSATNTTQLNEYFSDILAVEYSAT- 399 Query: 454 IRITK 458 I++T+ Sbjct: 400 IKLTR 404 >gi|126662671|ref|ZP_01733670.1| batA protein [Flavobacteria bacterium BAL38] gi|126626050|gb|EAZ96739.1| batA protein [Flavobacteria bacterium BAL38] Length = 334 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 66/194 (34%), Gaps = 38/194 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ A+ ++ D +ND R+G + + + +++++ Sbjct: 112 RLEALKKVAATFVQ-----DRIND--RIGLVVYAGESYTRTPVTSDKTIILQSLKSVEFD 164 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T I + TA + I K++ + I+LLTDG N + Sbjct: 165 DSIIADGTGIGVGLATAINRI-----------KDSKAKSRIIILLTDGVNNSGTIDPRTA 213 Query: 390 CNKAKSQGIRIMTIAF------SVNKTQQEKAR------------YFLSNCA--SPNSFF 429 + AK GI++ TI + + + A + +F Sbjct: 214 ASIAKEYGIKVYTIGIGTNGKAMFPVAKDANGKLVFKMMPVEIDEKLMQEIAKNTDAKYF 273 Query: 430 EANSTHELNKIFRD 443 A S +L I+ + Sbjct: 274 RATSNKKLQAIYDE 287 >gi|146282738|ref|YP_001172891.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri A1501] gi|145570943|gb|ABP80049.1| von Willebrand factor type A domain protein [Pseudomonas stutzeri A1501] Length = 339 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 61/186 (32%), Gaps = 34/186 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + LV+ L I + D R+G F + ++ + +R + Sbjct: 115 TRLELVKVLLGDFIEQ-RHGD------RVGLILFGSKAYLQAPLTFD-RRTVRVWLDEAR 166 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +TAI DA+ A + + + +VL+TDG N + + Sbjct: 167 VGIAGS-NTAIGDAIGLAVKRLRERPTNS-----------RVLVLITDGANNGGELDPLL 214 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA--SPNSFFEANST 434 A + +RI TI + F L A + +F A S+ Sbjct: 215 AATLAAEESVRIHTIGIGAVPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRAASS 274 Query: 435 HELNKI 440 EL I Sbjct: 275 AELKAI 280 >gi|152985991|ref|YP_001347440.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] gi|150961149|gb|ABR83174.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] Length = 337 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 60/187 (32%), Gaps = 34/187 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + LV+ I ++ D R+G F + ++ +R + Sbjct: 114 ISRLELVKKLFGDFIEG-RRGD------RVGLILFGSQAYLQAPLTFD-RHTVRVWLDEA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I +TAI DA+ A + + + +VL+TDG NT Sbjct: 166 QIGI-AGKNTAIGDAIGLALKRLRQRPAES-----------RVLVLITDGANTGGQISPQ 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFEANS 433 A + ++I TI + + L A + +F A S Sbjct: 214 TAARLAAEERVKIYTIGIGADPQQGGVIGLFGLNPGLDLDEPVLRGIAETTGGEYFRARS 273 Query: 434 THELNKI 440 + EL I Sbjct: 274 SAELESI 280 >gi|254283762|ref|ZP_04958730.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] gi|219679965|gb|EED36314.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] Length = 325 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 71/210 (33%), Gaps = 25/210 (11%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S ++ + + V+ + + + R+G F + Sbjct: 99 AIDLSASMDYRDFPGPDGKPVSRFDAVQRVVDQFVA-------NREGDRVGLIVFGAKAY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ ++ + TA+ D++ A SS D Sbjct: 152 LQLPFTRDLNTARALVDLMQVGMAGP--QTALGDSIGLAIRAFESSEVD----------- 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 + ++LLTDG +T I A+ GI I TI + E F L++ A Sbjct: 199 DRVLILLTDGNDTASKMTPINAAEIAQLNGIEIYTIGIGDAEATGEDRIDFETLASIAER 258 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 S FF+A L +++ DRI + Sbjct: 259 SGGQFFDAQDETALRQVY-DRIDALAVADI 287 >gi|260425757|ref|ZP_05779737.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260423697|gb|EEX16947.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 479 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 79/524 (15%), Positives = 165/524 (31%), Gaps = 149/524 (28%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ +G + +L +M+ GG+ +D++ ++ A++ A+ L Sbjct: 28 RRFAGDESGSMTYMAVVLSMMMMIFGGLGIDMIYAELQRTKVQNTLDRAVLAAA-DLDNE 86 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 LE K + + LI +V+++ + V Sbjct: 87 LEAQG---VVEDYMDKMALADALI-----------------------SVDVDEGLNYRTV 120 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 V + N F++ +G+ + +A ++ + + + V + V+D S SM D Sbjct: 121 VAEGYKTMPSN----FMQILGVDNL------QAYGLAEATERINKVEVSLVLDISGSMDD 170 Query: 192 -----YQRDSEGQPLNCFGQPAD------------------RTVKSYSSQNGKVGIRDEK 228 +D+ G ++ + + SY S N K G Sbjct: 171 NDKLANMQDAAGTFIDTLLAEGNEDLVSISLVPYSEQVNAGPEILSYLSANWKHGYSHCI 230 Query: 229 LSPYMVSCNKSL-----YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 P V + +L Y + + D+ R+ ++ +++ Sbjct: 231 EMPNSVFGSAALDFSRTYEQMQHYQWNYDGYNNTLSDTVCPRYGYERIQAWSHDASALKA 290 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA-INDA 342 + ++ R G + F WG L+ + A GS + + Sbjct: 291 QVNQLQP-----RAGTSIF-------MGMKWGTA-LLDPSTRPIASGMIARGSVDQVFEG 337 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE------------------------ 378 AY ++ + K +VL+TDG+ Sbjct: 338 RPVAY---------------DDTDVLKTVVLMTDGQHDRSYRIQDWAYNSESEYAHWNRY 382 Query: 379 -----------------------NTQDNEEGI-AICNKAKSQGIRIMTIAFSVNKTQQEK 414 N + + +IC AK+QGI I ++ F V + Sbjct: 383 NLWYYLSRYVSSYERSSFYYQKYNADLGDALLGSICAAAKAQGIIIWSVGFEVG----DH 438 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + +CAS + F E+ + F I + + + +R+T+ Sbjct: 439 GADVMESCASSPAHFFRVEGVEITEAF-STIAHTLNQ--LRLTQ 479 >gi|127513358|ref|YP_001094555.1| von Willebrand factor, type A [Shewanella loihica PV-4] gi|126638653|gb|ABO24296.1| von Willebrand factor, type A [Shewanella loihica PV-4] Length = 339 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 56/163 (34%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F D + + + + + A TAI +A+ A + + Sbjct: 135 KLGLILFADHAYLQAPLTQDRRSVAQFLTE--AQIGLVGKQTAIGEAIALAVKRFDKAKQ 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV-------- 407 + ++LLTDG N + + A +G+ I TI Sbjct: 193 SN-----------RVLILLTDGSNNSGSITPEQAADIAAKRGVTIYTIGVGAEVMERRTL 241 Query: 408 -------NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++A+ L + +F A ++ EL +I+++ Sbjct: 242 FGKERVNPSMDLDEAQLTLLAQKTKGRYFRARNSDELEQIYQE 284 >gi|295132198|ref|YP_003582874.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980213|gb|ADF50678.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 334 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 58/168 (34%), Gaps = 32/168 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G ++ + + ++R + + E G TAI + T+ + + Sbjct: 132 RIGLVLYSGESFTKTPITSDKSVVLRALEDVEFNNILESG-TAIGSGLATSVNRL----- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 K++ K I+LLTDG N + AK GI++ TI N Sbjct: 186 ------KDSKAESKVIILLTDGVNNSGFIDPKVASELAKEFGIKVYTIGVGTNGMALTPV 239 Query: 411 -------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 Q E L A + +F A + +L I+ + Sbjct: 240 GIAANGRFQFGNRQVEIDEDLLKQIADETGGKYFRATNNEKLEDIYDE 287 >gi|261250853|ref|ZP_05943427.1| protein BatA [Vibrio orientalis CIP 102891] gi|260937726|gb|EEX93714.1| protein BatA [Vibrio orientalis CIP 102891] Length = 322 Score = 71.2 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 60/191 (31%), Gaps = 35/191 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I + V++ ++ ++ + R+G F D + + + Sbjct: 107 QYIDRLSAVKNVVSDFVKQ-------REGDRLGLVLFADHAYLQTPLTLDRETISDQVNS 159 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + TAI + + A T + S ++ +VLL+DG NT E Sbjct: 160 LVLRLIGDK--TAIGEGIGLATKTFVDS-----------EAPQRVMVLLSDGSNTSGVLE 206 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFE 430 + AK I TI + ++ L A + +F Sbjct: 207 PLEAARIAKKYNATIYTIGIGAGEMMVKEFFMTRKVNTAKDLDEKTLKQIADLTGGQYFR 266 Query: 431 ANSTHELNKIF 441 A + EL I+ Sbjct: 267 ARNADELATIY 277 >gi|190575666|ref|YP_001973511.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] gi|190013588|emb|CAQ47223.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] Length = 334 Score = 71.2 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 35/161 (21%), Positives = 61/161 (37%), Gaps = 26/161 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F DR + + + + + + TAI DA+ A + S E Sbjct: 144 RIGLLIFGDRAYTLTPLTADLASVRDQLRDSVV--GLAGRETAIGDAIGLAVKRLRSQPE 201 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 ++ ++LLTDG + E + A+++G+RI T+AF + + + Sbjct: 202 -----------GQRVLILLTDGVSNAGVLEPLRAAEVAQAEGVRIHTVAFGGDGSMRFLG 250 Query: 416 -----------RYFLSNCAS--PNSFFEANSTHELNKIFRD 443 L AS FF A T +L I+ + Sbjct: 251 IPISADQDPVDEATLKKIASLTGGQFFRARDTAQLAGIYAE 291 >gi|193214188|ref|YP_001995387.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193087665|gb|ACF12940.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 340 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 63/193 (32%), Gaps = 38/193 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + I + D R+G F+ + + + L I + A Sbjct: 120 RIEAAKSVATDFIHQ-RLSD------RIGLVVFSGKSFTQCPLTLDYRLLTNFISELKAG 172 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G TAI A+ TA + + + + K I+LLTDG+N E + Sbjct: 173 TIEEDG-TAIGTAIATATNRL-----------RESTAKSKVIILLTDGQNNAGEIEPVTA 220 Query: 390 CNKAKSQGIRIMTIAFSV------NKTQQEKARYF-----------LSNCA--SPNSFFE 430 A + GI+I T+ + + L+ A S +F Sbjct: 221 AELAAALGIKIYTVGAGTRGYARYPIPDPLFGKRYVQMKVDVDDSTLTRIARISGGRYFR 280 Query: 431 ANSTHELNKIFRD 443 A L K + + Sbjct: 281 ATDLESLKKTYHE 293 >gi|107102622|ref|ZP_01366540.1| hypothetical protein PaerPA_01003686 [Pseudomonas aeruginosa PACS2] Length = 340 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 59/187 (31%), Gaps = 34/187 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + L++ I ++ D R+G F + ++ +R + Sbjct: 114 ISRLELIKKLFGDFIED-RRGD------RVGLILFGSQAYLQAPLTFD-RHTVRVWLDEA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I +TAI DA+ A + + + +VL+TDG NT Sbjct: 166 QIGI-AGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGANTGGQIAPQ 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCASP--NSFFEANS 433 A Q ++I TI + + L A +F A S Sbjct: 214 IAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARS 273 Query: 434 THELNKI 440 + EL I Sbjct: 274 SAELESI 280 >gi|15598269|ref|NP_251763.1| hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] gi|9949180|gb|AAG06461.1|AE004731_9 hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] Length = 340 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 59/187 (31%), Gaps = 34/187 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + L++ I ++ D R+G F + ++ +R + Sbjct: 114 ISRLELIKKLFGDFIED-RRGD------RVGLILFGSQAYLQAPLTFD-RHTVRVWLDEA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I +TAI DA+ A + + + +VL+TDG NT Sbjct: 166 QIGI-AGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGANTGGQIAPQ 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCASP--NSFFEANS 433 A Q ++I TI + + L A +F A S Sbjct: 214 IAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARS 273 Query: 434 THELNKI 440 + EL I Sbjct: 274 SAELESI 280 >gi|254241773|ref|ZP_04935095.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] gi|126195151|gb|EAZ59214.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] Length = 340 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 59/187 (31%), Gaps = 34/187 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + L++ I ++ D R+G F + ++ +R + Sbjct: 114 ISRLELIKKLFGDFIED-RRGD------RVGLILFGSQAYLQAPLTFD-RHTVRVWLDEA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I +TAI DA+ A + + + +VL+TDG NT Sbjct: 166 QIGI-AGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGANTGGQIAPQ 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCASP--NSFFEANS 433 A Q ++I TI + + L A +F A S Sbjct: 214 IAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARS 273 Query: 434 THELNKI 440 + EL I Sbjct: 274 SAELESI 280 >gi|218890727|ref|YP_002439591.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] gi|254236045|ref|ZP_04929368.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|126167976|gb|EAZ53487.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|218770950|emb|CAW26715.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] Length = 340 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 59/187 (31%), Gaps = 34/187 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + L++ I ++ D R+G F + ++ +R + Sbjct: 114 ISRLELIKKLFGDFIED-RRGD------RVGLILFGSQAYLQAPLTFD-RHTVRVWLDEA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I +TAI DA+ A + + + +VL+TDG NT Sbjct: 166 QIGI-AGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGANTGGQIAPQ 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCASP--NSFFEANS 433 A Q ++I TI + + L A +F A S Sbjct: 214 IAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARS 273 Query: 434 THELNKI 440 + EL I Sbjct: 274 SAELESI 280 >gi|86361153|ref|YP_473040.1| hypothetical protein RHE_PF00423 [Rhizobium etli CFN 42] gi|86285255|gb|ABC94313.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 545 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 49/368 (13%), Positives = 119/368 (32%), Gaps = 71/368 (19%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L + G+ I+ AL + ML G D +R ++ A+I +V I + Sbjct: 108 RSLERDRGGNVGIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIA-AVKQINN 166 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E+ + + + +++ T +++ + + Sbjct: 167 TEDTDALKEKVSDWFHAQVDN-----------------------SYTLGDIDIDTVNHNI 203 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 ++ + + F++ I++ + + + + SY +++ VID S SML Sbjct: 204 TATANGTVP----TTFMKIANIETVPVSVASAVKGPATSY-----LNVYVVIDTSPSMLL 254 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 S + +G + + + K+ Y Y Sbjct: 255 AATTSGQSTMYS-----------------GIGCQFACHTGDAHTVGKTKYANNYAYSTAK 297 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV--ISD 309 + + + DA+ V+ I + D ++ +++G D + + Sbjct: 298 KIKL--------------RADVAGDAVREVLDMIDESDENHERIKVGLYSLGDTLSEVLA 343 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 P+ S + +T + ++ T + + + ++ K Sbjct: 344 PTLSTDTARTRLADASYGLTSATSKAATYFDVSLATLKQKVGAGGDGT-----SSGSPLK 398 Query: 370 YIVLLTDG 377 ++LLTDG Sbjct: 399 LVLLLTDG 406 >gi|221135318|ref|ZP_03561621.1| von Willebrand factor, type A [Glaciecola sp. HTCC2999] Length = 342 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 44/197 (22%), Positives = 69/197 (35%), Gaps = 37/197 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + + +V+ L I+ + D R+G F D ++ + ++ Sbjct: 119 NEVDRLVMVKTVLGDFIQR-RVGD------RIGLILFADTAYLQAPLTFD-RTTVEQLLS 170 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 I STAI DA+ A R + K +VLLTDG+NT N Sbjct: 171 ETVIGLVGD-STAIGDAIGLA-----------AKRFSDKPNVNKVLVLLTDGQNTAGNIT 218 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP--NSFFE 430 + A Q I+I I + + L+ A +F Sbjct: 219 PDQALSLAVDQNIKIYPIGVGADAMTVNSLFGQRQVNPSADLDEGLLTRLAKDTGGQYFR 278 Query: 431 ANSTHELNKIFR--DRI 445 A T EL +I+R DRI Sbjct: 279 ARDTQELEQIYRLLDRI 295 >gi|24374613|ref|NP_718656.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349233|gb|AAN56100.1|AE015746_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 338 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 57/163 (34%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +++ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGESIALAVKRFDKMDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + +VLLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLVLLTDGSNNAGNIEPQQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNC--ASPNSFFEANSTHELNKIFRD 443 + L A+ +F A ++ EL +I+++ Sbjct: 236 FGRERVNPSMDLDENQLQQIADATHGRYFRARNSEELEQIYQE 278 >gi|229526203|ref|ZP_04415607.1| protein BatA [Vibrio cholerae bv. albensis VL426] gi|229336361|gb|EEO01379.1| protein BatA [Vibrio cholerae bv. albensis VL426] Length = 318 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I SN ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDSN-----------APQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + + AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAADIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|254525166|ref|ZP_05137221.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] gi|219722757|gb|EED41282.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] Length = 334 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 61/161 (37%), Gaps = 26/161 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F DR + + + + + + TAI DA+ A + S E Sbjct: 144 RIGLLIFGDRAYTLTPLTADLASVRDQLRDSVV--GLAGRETAIGDAIGLAVKRLRSQPE 201 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 ++ ++LLTDG + E + A+++G+RI T+AF + + + Sbjct: 202 -----------GQRVLILLTDGVSNAGVLEPLRAAEVARAEGVRIHTVAFGGDGSMRVFG 250 Query: 416 RYF-----------LSNCA--SPNSFFEANSTHELNKIFRD 443 L A + FF A T +L I+ + Sbjct: 251 ISISADQDPVDEATLKKIAGMTGGQFFRARDTAQLAGIYAE 291 >gi|194367004|ref|YP_002029614.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] gi|194349808|gb|ACF52931.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] Length = 334 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 61/161 (37%), Gaps = 26/161 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F DR + + + + + + TAI DA+ A + S E Sbjct: 144 RIGLLVFGDRAYTLTPLTADLASVRDQLRDSVV--GLAGRETAIGDAIGLAVKRLRSQPE 201 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 ++ ++LLTDG + E + A+++G+RI T+AF + + + Sbjct: 202 -----------GQRVLILLTDGVSNAGVLEPLRAAEVARAEGVRIHTVAFGGDGSMRLFG 250 Query: 416 -----------RYFLSNCA--SPNSFFEANSTHELNKIFRD 443 L A + FF A T +L I+ + Sbjct: 251 IPISADQDPVDEATLKKIATMTGGQFFRARDTAQLAGIYAE 291 >gi|223558081|gb|ACM91085.1| aerotolerance protein BatA [uncultured bacterium Rlip1] Length = 332 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 74/201 (36%), Gaps = 36/201 (17%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A +V K + RMG F+ + + H ++ ++ + G Sbjct: 115 TAAKNVASDFVKGRPGD---RMGLVIFSGETFTQVPLTTD-HGVMLNMLAEMKNGLIDDG 170 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI D + TA + K++ K ++LLTDG N + + AK Sbjct: 171 -TAIGDGLATAISRL-----------KDSEAISKVVILLTDGMNNAGSVDPYTAAEIAKL 218 Query: 396 QGIRIMTIAFSVNKTQQEKAR----------------YFLSNCAS--PNSFFEANSTHEL 437 GIR+ TI T + L++ AS +F A S +L Sbjct: 219 YGIRVYTIGVGSYGTAPYPVQTPFGTQIQQMKVEIDEKLLASVASMTGGKYFRATSNQKL 278 Query: 438 NKIFRDRIGNEIFERVIRITK 458 ++I+ + +++ I +T+ Sbjct: 279 DEIYEEI--DKLERSKIEVTE 297 >gi|119775307|ref|YP_928047.1| von Willebrand factor type A domain-containing protein [Shewanella amazonensis SB2B] gi|119767807|gb|ABM00378.1| von Willebrand factor type A domain protein [Shewanella amazonensis SB2B] Length = 327 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 53/163 (32%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + + I TAI +++ A + E Sbjct: 128 RIGLILFGDHAYLQSPMTQDRRSVAQYL-REAQIGL-VGKQTAIGESIALAVKRFENLEE 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + +VLLTDG N + A + + I TI ++ Sbjct: 186 SN-----------RVLVLLTDGTNNAGSISPDKAAAIAAERKVTIYTIGVGAEMMERRSF 234 Query: 413 ----------EKARYFLSNC--ASPNSFFEANSTHELNKIFRD 443 + L A+ +F A S+ +L I+++ Sbjct: 235 FGRDRVNPSMDLDEEQLQRIANATQGKYFRARSSEDLAAIYQE 277 >gi|167946540|ref|ZP_02533614.1| BatB protein, putative [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 345 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 55/165 (33%), Gaps = 31/165 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D ++ + + + + + G TAI D + + E Sbjct: 141 RIGLIIFGDTSYVLSPLTFDRNAIHQLLDG--IVPTLAGGGTAIGDGIGLGIKKLRERPE 198 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------- 408 + ++L+TDG+N + AK +GIRI TI Sbjct: 199 -----------GSRVLILVTDGKNETGTIPPLKAAQLAKQEGIRIYTIGVGSTKNRVRLL 247 Query: 409 ---------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 T L A + ++F AN T L K+++ Sbjct: 248 SPDLRTYEIATGLAIDEETLQQIAETTGGAYFRANDTAGLEKVYQ 292 >gi|116329598|ref|YP_799317.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332487|ref|YP_802204.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122491|gb|ABJ80384.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127354|gb|ABJ77446.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 312 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 62/174 (35%), Gaps = 21/174 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + L I +K D R+G F + G + + I+ T Sbjct: 108 TRLGVSKKLLRKFIDK-RKSD------RLGLVVFAGAAYLQAPLT-GDRESLNEILGTIE 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI DA+ + +R++ + K IVL+TDG + + + Sbjct: 160 EETVAEQGTAIGDAIILS-----------TYRLRASQARSKVIVLITDGVSNTGKIDPVT 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKI 440 + A+ G++I ++ E L ++ FF A E+ + Sbjct: 209 ATDLAEHIGVKIYSVGIGKEDGSYEINFEILRELSASTGGKFFRAEDPEEMKAV 262 >gi|327481077|gb|AEA84387.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri DSM 4166] Length = 339 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 62/186 (33%), Gaps = 34/186 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + LV+ L I + D R+G F + ++ + +R + + Sbjct: 115 TRLELVKVLLGDFIEQ-RHGD------RVGLILFGSKAYLQSPLTFD-RRTVRVWLDEAS 166 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +TAI DA+ A + + + +VL+TDG N E + Sbjct: 167 VGIAGS-NTAIGDAIGLALKRLRERPANS-----------RVLVLVTDGANNGGEIEPLL 214 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA--SPNSFFEANST 434 A + +RI TI + F L A + +F A S+ Sbjct: 215 AATLAAEENVRIHTIGIGAVPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRAASS 274 Query: 435 HELNKI 440 +L I Sbjct: 275 EQLQAI 280 >gi|319426861|gb|ADV54935.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 339 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 59/163 (36%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +A+ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGEAIALAVKRFDKIDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL++I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQE 278 >gi|120598362|ref|YP_962936.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|146293560|ref|YP_001183984.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|120558455|gb|ABM24382.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|145565250|gb|ABP76185.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 339 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 59/163 (36%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +A+ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGEAIALAVKRFDKIDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL++I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQE 278 >gi|237800421|ref|ZP_04588882.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] gi|331023280|gb|EGI03337.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] Length = 352 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 65/197 (32%), Gaps = 34/197 (17%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + D + + LV+ L + S +K D R+G F + ++ Sbjct: 104 YPDMQWKNDEVSRLVLVQQLLGDFLES-RKGD------RVGLILFGTQAFLQAPLTYD-R 155 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +R + I +TA+ DA+ A + + + +VL+TDG Sbjct: 156 QTVRVWLDEAKIGI-AGKNTAVGDAIGLALKRLRMRPANS-----------RVLVLVTDG 203 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA-- 423 N + + A +G++I TI + + L A Sbjct: 204 ANNAGQIDPLTAARLAADEGVKIYTIGIGSDPEKNALQSALGLSASLDLDEPTLKEIARL 263 Query: 424 SPNSFFEANSTHELNKI 440 S +F +L KI Sbjct: 264 SGGQYFRTRDGDQLEKI 280 >gi|229514670|ref|ZP_04404131.1| protein BatA [Vibrio cholerae TMA 21] gi|229348650|gb|EEO13608.1| protein BatA [Vibrio cholerae TMA 21] Length = 318 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I SN ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDSN-----------APQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|229527849|ref|ZP_04417240.1| protein BatA [Vibrio cholerae 12129(1)] gi|229334211|gb|EEN99696.1| protein BatA [Vibrio cholerae 12129(1)] gi|327485392|gb|AEA79798.1| BatA aerotolerance operon [Vibrio cholerae LMA3894-4] Length = 318 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I SN ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDSN-----------APQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKLNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|153830331|ref|ZP_01982998.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|148874174|gb|EDL72309.1| conserved hypothetical protein [Vibrio cholerae 623-39] Length = 318 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I SN ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDSN-----------APQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|121586746|ref|ZP_01676529.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121728206|ref|ZP_01681240.1| conserved hypothetical protein [Vibrio cholerae V52] gi|147672023|ref|YP_001215942.1| hypothetical protein VC0395_1106 [Vibrio cholerae O395] gi|153816797|ref|ZP_01969464.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|227811796|ref|YP_002811806.1| hypothetical protein VCM66_A0168 [Vibrio cholerae M66-2] gi|229506663|ref|ZP_04396172.1| protein BatA [Vibrio cholerae BX 330286] gi|262167807|ref|ZP_06035508.1| protein BatA [Vibrio cholerae RC27] gi|298500027|ref|ZP_07009833.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|121549043|gb|EAX59080.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121629529|gb|EAX61953.1| conserved hypothetical protein [Vibrio cholerae V52] gi|126512600|gb|EAZ75194.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|146314406|gb|ABQ18946.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227010938|gb|ACP07149.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|227014797|gb|ACP11006.1| conserved hypothetical protein [Vibrio cholerae O395] gi|229357014|gb|EEO21932.1| protein BatA [Vibrio cholerae BX 330286] gi|262023715|gb|EEY42415.1| protein BatA [Vibrio cholerae RC27] gi|297542008|gb|EFH78059.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 318 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I SN ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDSN-----------APQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|217978613|ref|YP_002362760.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217503989|gb|ACK51398.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 325 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 67/184 (36%), Gaps = 30/184 (16%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 H + ++ A++I+ + D R+G F +R + S+ V + RT+ + Sbjct: 117 HAASRIDALKRVGAALIKR-RTGD------RIGLVIFAERAYAAAPLSFDVDAVSRTLAE 169 Query: 326 TFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 I +G STAI + + A + S + IVLL+DG N Sbjct: 170 ---IPLGLVGHSTAIGEGLGLALKRLTESK-----------APSRVIVLLSDGANDAGTT 215 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY------FLSNCAS--PNSFFEANSTHE 436 + + A + G++I TI V TQ L A F +T + Sbjct: 216 DPTGVAELANNLGVKIYTIGLGVVDTQTFNGLGDPVDFLALQRLAEIGGGEAFRVRTTED 275 Query: 437 LNKI 440 L Sbjct: 276 LAYA 279 >gi|113969745|ref|YP_733538.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113884429|gb|ABI38481.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 338 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 58/163 (35%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +++ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGESIALAVKRFDKMDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N + A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGNIDPDQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL +I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELEQIYQE 278 >gi|116051069|ref|YP_790101.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa UCBPP-PA14] gi|296388430|ref|ZP_06877905.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa PAb1] gi|313108364|ref|ZP_07794396.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] gi|115586290|gb|ABJ12305.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa UCBPP-PA14] gi|310880898|gb|EFQ39492.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] Length = 340 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 60/187 (32%), Gaps = 34/187 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + L++ I ++ D R+G F + ++ +R + Sbjct: 114 ISRLELIKKLFGDFIED-RRGD------RVGLILFGSQAYLQAPLTFD-RHTVRVWLDEA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I +TAI DA+ A + + + +VL+TDG NT Sbjct: 166 QIGI-AGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGANTGGQIAPQ 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFEANS 433 A Q ++I TI + + L A + +F A S Sbjct: 214 IAAQLAAEQQVKIYTIGVGADPQQGGVPGLFGFNPGLDLDEPTLRGIAEITGGEYFRARS 273 Query: 434 THELNKI 440 + EL I Sbjct: 274 SAELESI 280 >gi|218675994|ref|YP_002394813.1| hypothetical protein VS_II0212 [Vibrio splendidus LGP32] gi|218324262|emb|CAV25554.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 347 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 65/190 (34%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ I +K D R+G F D + L + + + Sbjct: 133 YIDRLSAVKHVLSDFIER-RKGD------RVGLVLFADHAYLQTPLTLDRDTLSQQLNQ- 184 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 A+ + TAI D + A T + S ++ ++LL+DG NT + Sbjct: 185 -AVLKLIGTQTAIGDGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP 232 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 + + AK I T+ + Q+ L A + +F A Sbjct: 233 LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRA 292 Query: 432 NSTHELNKIF 441 + EL I+ Sbjct: 293 RDSKELATIY 302 >gi|254460794|ref|ZP_05074210.1| conserved hypothetical protein [Rhodobacterales bacterium HTCC2083] gi|206677383|gb|EDZ41870.1| conserved hypothetical protein [Rhodobacteraceae bacterium HTCC2083] Length = 480 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 79/495 (15%), Positives = 147/495 (29%), Gaps = 96/495 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +G I ++ ++L +GG+ VD++R L+ AI++A+ L+ Sbjct: 32 FKDDESGSLVIFAVFMVLMILTIGGIGVDLMRSERDRTVLQHTLDRAILSAA-----DLD 86 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + + E + N + N+ T Sbjct: 87 QTQTPQAVVDDYFETAGLESFLSNVTVDQGINYKTVGAEAQSITT--------------- 131 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML--- 190 + F++ G+ + AE + V I V+D S SM Sbjct: 132 -----------TAFMKMAGVDTLNATAAGVAEERIAN------VEISMVLDISGSMGIGS 174 Query: 191 --DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 R + +N P + + S S + ++ N Y Sbjct: 175 KMTQLRSAATSFVNTVLSPENEDLVSVSLVPYSQHVNAGPKIYNELNTNHRHNYSHCVEM 234 Query: 249 LDPSLSE----------------------EHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 D + SE D+ R+ ++ +++ I Sbjct: 235 ADSAYSETELDLSVTYDQMQHFQWNYSGANQLTDTICPRYSYERITAFSQDASALNAQIA 294 Query: 287 KIDNVNDT-VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI------ 339 ++ T + MG + + + I F T Sbjct: 295 QLQPRAGTQIFMGMKWAAAMLDPAFNPVVNALVTSNDIDSVFDNRPAAFDDTETLKTVVL 354 Query: 340 -NDAMQTAYDTI----ISSNEDEVH-----------RMKNNLEAKKYIVLLTDGENTQDN 383 D ++ I S+ D H R N +Y D Sbjct: 355 MTDGKNSSSMRIKSWAYDSSSDYYHWSRYNLWYYLRRNVNRHYHSRYYWFTHDAAQGDAL 414 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + ICN +K GI I +I F V+ + ++NCAS S F E+++ F D Sbjct: 415 LD--DICNASKDAGIVIWSIGFEVD----DHGADVMANCASSPSHFFRVEGIEISEAF-D 467 Query: 444 RIGNEIFERVIRITK 458 I +I + +R+T+ Sbjct: 468 AIARQINQ--LRLTQ 480 >gi|86144576|ref|ZP_01062908.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] gi|85837475|gb|EAQ55587.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] Length = 330 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 65/190 (34%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ I +K D R+G F D + L + + + Sbjct: 116 YIDRLSAVKHVLSDFIER-RKGD------RVGLVLFADHAYLQTPLTLDRDTLSQQLNQ- 167 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 A+ + TAI D + A T + S ++ ++LL+DG NT + Sbjct: 168 -AVLKLIGTQTAIGDGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP 215 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 + + AK I T+ + Q+ L A + +F A Sbjct: 216 LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRA 275 Query: 432 NSTHELNKIF 441 + EL I+ Sbjct: 276 RDSKELATIY 285 >gi|84385370|ref|ZP_00988402.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] gi|84379967|gb|EAP96818.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] Length = 319 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 65/190 (34%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ I +K D R+G F D + L + + + Sbjct: 105 YIDRLSAVKHVLSDFIER-RKGD------RVGLVLFADHAYLQTPLTLDRDTLSQQLNQ- 156 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 A+ + TAI D + A T + S ++ ++LL+DG NT + Sbjct: 157 -AVLKLIGTQTAIGDGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP 204 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 + + AK I T+ + Q+ L A + +F A Sbjct: 205 LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRA 264 Query: 432 NSTHELNKIF 441 + EL I+ Sbjct: 265 RDSKELATIY 274 >gi|119470787|ref|ZP_01613398.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] gi|119446014|gb|EAW27293.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] Length = 328 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 67/198 (33%), Gaps = 34/198 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L+ I + D R+G F D + + K Sbjct: 103 DMAYNGQYVDRLTMVKAVLSDFIEQ-RTGD------RLGLILFGDTAFLQTPLTRDL-KT 154 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + ++ I +TAI DA+ + S ++ + +VLLTDG+N Sbjct: 155 VTKMLNEAQIGLVGR-ATAIGDALGLSVKRFASKDDSN-----------RIVVLLTDGQN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAF------------SVNKTQQEKARYFLSNCASP-- 425 T N A+ +GI++ TI L N A Sbjct: 203 TAGNLNPDDALLLAREEGIKVYTIGVGSDNPRGFSLFNMGGSGGSNLDERLLKNIADDTG 262 Query: 426 NSFFEANSTHELNKIFRD 443 +F A L +I+ + Sbjct: 263 GLYFRAKDVAGLKQIYAE 280 >gi|163734461|ref|ZP_02141901.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] gi|161392469|gb|EDQ16798.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] Length = 327 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 75/215 (34%), Gaps = 29/215 (13%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + + + +++ VRD + + + + RM F + Sbjct: 99 AIDISGSMDTRDFTDASNENVQRLAGVRDVVRAFVE-------GREGDRMALIVFGSKAY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + ++ + +T TAI DA+ + T +S Sbjct: 152 LQSPLTEDTGTIVELLDQTEVGMAGPH--TAIGDAIGLSIRTFEAS-----------EIE 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 ++ ++LL+DG +T + A+ G+ I TIA E L + A Sbjct: 199 QRLLILLSDGADTASRMSPLNAAEIARGAGVEIFTIAVGDPDATGENRVDVAALQDIANR 258 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + S+F A L D I I E R+T+ Sbjct: 259 TSGSYFFAADQAAL-----DEIYARIDELAPRLTE 288 >gi|13476511|ref|NP_108081.1| hypothetical protein mlr7847 [Mesorhizobium loti MAFF303099] gi|14027272|dbj|BAB54226.1| mlr7847 [Mesorhizobium loti MAFF303099] Length = 548 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 75/546 (13%), Positives = 150/546 (27%), Gaps = 116/546 (21%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII---------- 62 K +S +G+F ++ L +P +L VDV + L+ A A + Sbjct: 5 KFWRSKSGNFALLMGLGLPAILSAVAFAVDVSTVMRAKSNLQNALDAANLASSHLGDLDI 64 Query: 63 --------------TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF---------- 98 L + ++ +F K + NF Sbjct: 65 SRTDAFDRYFQANIAGHGELANAQATLTVDRGVNFIKTKAVASADVNLNFGFLFGHNRHI 124 Query: 99 --ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS-------------RYDLLLNP 143 + + ++ EV ++ +T R +A + S + L P Sbjct: 125 AVDASAVESDNQLEVVLVLDNTGSMAGARMTALRTATKSLLDTLEATKSPTRQIRASLVP 184 Query: 144 LSLFLRSMGIK---SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 + G + SW+ + + + + Sbjct: 185 FVTAVNVNGDEFDPSWIDMDGKSSTNGVNFPVIDGKRPNHMALFKQLKDTGWTEAGWNGT 244 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 A + S D PY + P + + + Sbjct: 245 GWKGCVEARPGAYNISDTPPDPDKPDTLFVPYFAPDD----PEDAQKPSSSYGNAAKYYN 300 Query: 261 SSSLRHVIKKKHLVRDA-------LASVIRSIKKIDNVND--TVRMGATFFNDRVISDPS 311 +S L V K + L+S+ + D + A + Sbjct: 301 NSYLDDVSDKTKTAKLKGNRLGIDLSSLADPVPPADKDAKEKVAKYVAPTKALITETGSP 360 Query: 312 FSWGVHKLIRTIVKTFAIDENE------------MGSTAINDAMQTAYDTIISSNEDEVH 359 + G ++ T V + D ++ T +++ + + + Sbjct: 361 ITVGPNRACPTPVVSLTDDFDKLRKAASEMTEWNGSGTNVSEGLSWGMRVLSPAAPYTDG 420 Query: 360 RMKNNLEAKKYIVLLTDGEN---------TQDNEE------------------------- 385 K ++LLTDGEN T+ + Sbjct: 421 APWKTPGISKIVLLLTDGENVVYGASEQPTKSDYTSYGYLAGGRFGSDDQTAAARNVDGW 480 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDR 444 ++C + K+QG++I T V ++ R S CAS P+ ++ N +L +F+ Sbjct: 481 TKSVCTQLKNQGVQIYT---MVLQSDTAANRALYSACASDPSGYYAVNDPAKLPDVFQ-H 536 Query: 445 IGNEIF 450 I N+ Sbjct: 537 IANKFS 542 >gi|117919904|ref|YP_869096.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117612236|gb|ABK47690.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 338 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 58/163 (35%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +++ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGESIALAVKRFDKMDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL +I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELEQIYQE 278 >gi|114046974|ref|YP_737524.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113888416|gb|ABI42467.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 338 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 58/163 (35%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +++ A +E Sbjct: 129 RIGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGESIALAVKRFDKMDE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A ++ + I T+ + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A ++ EL +I+++ Sbjct: 236 FGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELEQIYQE 278 >gi|78776847|ref|YP_393162.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497387|gb|ABB43927.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 307 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 82/219 (37%), Gaps = 26/219 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + + + F ++L + + + + + I+ + DNV Sbjct: 80 NGKDIVLAIDASGSMNSTGFDFEGEAALPQKLSRFEIAKIVASEFIQK-RLSDNV----- 133 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G + D ++ + +I + ++ +TAI +A+ Sbjct: 134 -GIVLYGDFAFIASPITYEKNIIIEML--SYLNQGMAGQNTAIGEAIA-----------M 179 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + K++ K +VLLTDGE+ + AK + I+I TI N+ + ++A Sbjct: 180 SLRAFKHSKAKSKIVVLLTDGEHNSGDISPKDALVLAKEENIKIYTIG-MGNRGEADEA- 237 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L A S FF A + EL +I+ + I ++ Sbjct: 238 -LLKKIADESGGEFFYATNAKELKEIY-EHIDELESSKI 274 >gi|15600942|ref|NP_232572.1| hypothetical protein VCA0172 [Vibrio cholerae O1 biovar eltor str. N16961] gi|229510539|ref|ZP_04400019.1| protein BatA [Vibrio cholerae B33] gi|229517329|ref|ZP_04406774.1| protein BatA [Vibrio cholerae RC9] gi|229605140|ref|YP_002875844.1| protein BatA [Vibrio cholerae MJ-1236] gi|254286663|ref|ZP_04961618.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|254850438|ref|ZP_05239788.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255746016|ref|ZP_05419963.1| protein BatA [Vibrio cholera CIRS 101] gi|262162145|ref|ZP_06031160.1| protein BatA [Vibrio cholerae INDRE 91/1] gi|9657562|gb|AAF96085.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|150423247|gb|EDN15193.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|229345365|gb|EEO10338.1| protein BatA [Vibrio cholerae RC9] gi|229352984|gb|EEO17924.1| protein BatA [Vibrio cholerae B33] gi|229371626|gb|ACQ62048.1| protein BatA [Vibrio cholerae MJ-1236] gi|254846143|gb|EET24557.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255735770|gb|EET91168.1| protein BatA [Vibrio cholera CIRS 101] gi|262028220|gb|EEY46878.1| protein BatA [Vibrio cholerae INDRE 91/1] Length = 318 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 66/197 (33%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + +T TAI + + A T I S ++ ++LL+DG N Sbjct: 153 ANQLNQTVLKLI--GTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAANIAKQYNTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|218515577|ref|ZP_03512417.1| hypothetical protein Retl8_18742 [Rhizobium etli 8C-3] Length = 54 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 13/39 (33%), Positives = 23/39 (58%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYE 50 ++++ G+F I+TA+L PV+LG GM + V + Sbjct: 12 RRMLSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSK 50 >gi|110677910|ref|YP_680917.1| hypothetical protein RD1_0526 [Roseobacter denitrificans OCh 114] gi|109454026|gb|ABG30231.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 327 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 75/215 (34%), Gaps = 29/215 (13%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + + + +++ VRD + + + + RM F + Sbjct: 99 AIDISGSMDTRDFTDASNENLQRLAGVRDVVRAFVE-------GREGDRMALIVFGSKAY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + ++ + +T TAI DA+ + T +S Sbjct: 152 LQSPLTEDTGTIVELLDQTEVGMAGPH--TAIGDAIGLSIRTFEAS-----------EIE 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 ++ ++LL+DG +T + A+ G+ I TIA E L + A Sbjct: 199 QRLLILLSDGADTASRMSPLNAAEIARGAGVEIFTIAVGDPDGTGENRVDVAALQDIANR 258 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + S+F A L D I I E R+T+ Sbjct: 259 TSGSYFFAADQAAL-----DEIYARIDELAPRLTE 288 >gi|226943994|ref|YP_002799067.1| von Willebrand factor, type A (VWA) domain-containing protein [Azotobacter vinelandii DJ] gi|226718921|gb|ACO78092.1| von Willebrand factor, type A (VWA) domain protein [Azotobacter vinelandii DJ] Length = 335 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 39/187 (20%), Positives = 64/187 (34%), Gaps = 34/187 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + LV+ L I ++ D R+G F + ++ + +RT ++ Sbjct: 114 IGRLELVKHLLGQFIED-RRGD------RVGLILFGSQAYLQAPLTFD-RRTVRTWLEEA 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 AI TAI DA+ + + ++L+TDG NT Sbjct: 166 AIGI-AGKDTAIGDAIGLGLKRLRQRPAQS-----------RVLILVTDGANTAGEIAPS 213 Query: 388 AICNKAKSQGIRIMTIAFSVNK--TQQEKARYF----------LSNCA--SPNSFFEANS 433 A ++G+RI TI + L A + S+F A S Sbjct: 214 VAARLAAAEGVRIHTIGIGADPRQDGPPGLLGLTPGLDLDEPTLRAIAEETGGSYFRARS 273 Query: 434 THELNKI 440 + EL I Sbjct: 274 SEELRAI 280 >gi|149371021|ref|ZP_01890616.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] gi|149355807|gb|EDM44365.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] Length = 334 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 55/168 (32%), Gaps = 32/168 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + ++ ++ E G TAI + TA + + Sbjct: 132 RIGLVEYAGESYTKTPLTSDKTVVLSSLNSIEYNSIIE-GGTAIGMGLATAVNRL----- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K + K I+LLTDGEN + A GI++ TI N Sbjct: 186 ------KESTAKSKVIILLTDGENNSGFIDPKIASELAVEFGIKVYTIGLGTNGMASSPI 239 Query: 416 ------------------RYFLSNCA--SPNSFFEANSTHELNKIFRD 443 L A + +F A S +LN+I+ + Sbjct: 240 GILPNGRFQYGNQPVKIDETLLKEIAKTTGGQYFRATSNTKLNEIYEE 287 >gi|83955719|ref|ZP_00964299.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] gi|83840013|gb|EAP79189.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] Length = 480 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 87/515 (16%), Positives = 167/515 (32%), Gaps = 133/515 (25%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G I+ ++ +M+ VGG+ +D +R L+ + A++ A+ Sbjct: 30 RFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAA------- 82 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + PK +E+Y + + + + ++ V D + V ++ K Sbjct: 83 ------DLDQMRDPKTVVEDYFAK---SGMTEFLSNVVVDDGLNFRTVTVDASK------ 127 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 N + F+ G + + ++AE + V I V+D S SM Sbjct: 128 ---------NMDTQFIGRFGFPTLEVPAHSQAE------ERVAKVEISLVLDISGSMATN 172 Query: 193 QRDSEGQP-----LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 R E Q L+ + ++ + S S + L ++ N+ + + Sbjct: 173 NRLGEVQNAADIFLDTVLKDENQDLISVSLVPYSEQVNAGPLIMDRMNVNR-KHDYSHCI 231 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 D + ++SS+ + ++ D + + D R+ Sbjct: 232 DFDNGDFDSIAMNSSTRYNQMQHFQWNYDGRNNYRD--DTVCPRYDYERI---------- 279 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN---- 363 FS L I D T+I M+ A + + D + + N Sbjct: 280 --TPFSQNKRTLKNQID-----DLVPRAGTSIFLGMKWAAAMLDPAFRDINNSLVNAGYV 332 Query: 364 ------------NLEAKKYIVLLTDGEN-------------------------------- 379 + E K ++L+TDG N Sbjct: 333 DREFYNRPASYTDSETLKTVILMTDGANDNSYRIRSNYYDSDSEYVHWNKYNLWWYLRRE 392 Query: 380 ----------------TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 T N IC+ AK++ I I +I F V+ + +CA Sbjct: 393 VDSRYWGYFYYHKYNKTLGNTLLSNICDAAKAKRIVIWSIGFEVDDEDVPA----MQDCA 448 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 S S F EL++ FR I +I + +R+T+ Sbjct: 449 SSPSHFFRVEGVELSEAFR-AIARQINQ--LRLTQ 480 >gi|312621090|ref|YP_003993818.1| protein tadg, associated with flp pilus assembly [Photobacterium damselae subsp. damselae] gi|311872811|emb|CBX86902.1| Protein TadG, associated with Flp pilus assembly [Photobacterium damselae subsp. damselae] Length = 436 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 69/473 (14%), Positives = 144/473 (30%), Gaps = 78/473 (16%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL K+ GH I+ A+++PV+ G+ + D R + ++ A + A + + Sbjct: 2 KLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH--NDP 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLI---RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 S + ++ +YL + ++ R DI ++ + + Sbjct: 60 NVNSDGLGSGSKVNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRFF 119 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + L + + + S T + ++ V + + DFS+SM Sbjct: 120 EYEVEA-----LTTQNSWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAADFSKSM 174 Query: 190 LDY---QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + R + K + ++ S S + Sbjct: 175 EEPWTGGRQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKF----- 229 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS-------IKKIDNVNDTVRMGA 299 + ++ + + +S H KK ++ L ++ K D Sbjct: 230 DNYNSCFMKQDYFEKNSRDHRKKKYVDIKRTLNNIFIEKGNDSCGFKSDDPDA------- 282 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 V D + + I+K TA + + Sbjct: 283 ------VFHDIYLTNDFDTFNKEIMKFR-----PGNGTASYQGIIRS-----------AQ 320 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIA--------------ICNKAKS--------QG 397 ++ +++ +++++DG + G +CNK + G Sbjct: 321 MLRKGTNSRRLLIIISDGNDWYYPYSGYKETDKEIANKLVNAGMCNKIRETLNLDKTPSG 380 Query: 398 IRIMT-IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 I T IA K + L NCA ++ F+A EL I EI Sbjct: 381 QEIKTRIAVIGFDYDANKNKALL-NCAGEDNVFKAQYRDELLDQILSLITEEI 432 >gi|86131264|ref|ZP_01049863.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] gi|85818675|gb|EAQ39835.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] Length = 334 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 61/194 (31%), Gaps = 39/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ +S I N R+G + + + ++ + Sbjct: 113 RLEALKKVASSF---INGRPND----RIGLVEYAGESFTKTPITSDKSIVLSALKGIQYN 165 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G TAI + T + I K++ K I+L+TDGEN + Sbjct: 166 SIIE-GGTAIGMGLATGVNRI-----------KDSKALSKVIILMTDGENNAGQIDPRIA 213 Query: 390 CNKAKSQGIRIMTIAFS------VNKTQQEKA------------RYFLSNCA--SPNSFF 429 A+ GI++ TI + L A + +F Sbjct: 214 AELAQEFGIKVYTIGMGTNGTALSPYARNPNGTFVYENIQVTIDEELLEEIAETTGGQYF 273 Query: 430 EANSTHELNKIFRD 443 A + +L +I+ + Sbjct: 274 RATNNKKLQEIYDE 287 >gi|85712923|ref|ZP_01043963.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] gi|85693229|gb|EAQ31187.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] Length = 328 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 72/190 (37%), Gaps = 35/190 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + +V+ L I +K D R+G F D ++ + + + + ++ Sbjct: 110 VDRLEMVKAVLGDFIER-RKGD------RLGLILFADTAFLQTPITYDRNTVQQMLDESV 162 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A V R K + + +VLLTDG+NT N Sbjct: 163 L--GLVGERTAIGDAIALA-----------VKRFKGKQQTNRVLVLLTDGQNTAGNLSPE 209 Query: 388 AICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEAN 432 AK+ +RI IA + ++ + N A + +F A Sbjct: 210 QALELAKAYDVRIYPIAVGAEEVVVDSVFGRRKVNPSRDLDVPLMQNLADETGGEYFRAR 269 Query: 433 STHELNKIFR 442 ST EL +I++ Sbjct: 270 STEELERIYQ 279 >gi|148974032|ref|ZP_01811565.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] gi|145965729|gb|EDK30977.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] Length = 330 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 64/190 (33%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ I +K D R+G F D + L + + + Sbjct: 116 YIDRLSAVKHVLSDFIER-RKGD------RVGLVLFADHAYLQTPLTLDRDTLSQQLNQ- 167 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 A+ TAI D + A T + S ++ +VLL+DG NT + Sbjct: 168 -AVLRLIGNQTAIGDGIGLATKTFVDS-----------DAPQRVMVLLSDGSNTAGVLDP 215 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 + + AK I T+ + Q+ L A + +F A Sbjct: 216 LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRA 275 Query: 432 NSTHELNKIF 441 + EL I+ Sbjct: 276 RDSKELATIY 285 >gi|313203640|ref|YP_004042297.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442956|gb|ADQ79312.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 327 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 68/201 (33%), Gaps = 36/201 (17%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A SV ++ +G F + + + + + Sbjct: 111 EAAKSVATEFILSRPNDN---IGLVIFARESFTQCPLT--TDHAVLVNLFNGVNNGMIED 165 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + A + I K+ K I+LLTDG N + I AK+ Sbjct: 166 GTAIGLGLANAVNRI-----------KDGKSKSKVIILLTDGSNNSGDIAPITAAEIAKT 214 Query: 396 QGIRIMTIAFS------VNKTQQEKARY----------FLSNCA--SPNSFFEANSTHEL 437 GIRI TI + + +Y L N A + +F A +L Sbjct: 215 FGIRIYTIGVGTHGVINIPVSTPMGIQYQRVQSEFDAKSLENIANLTGGKYFGATDNSKL 274 Query: 438 NKIFRDRIGNEIFERVIRITK 458 I+++ +++ + I+I + Sbjct: 275 RNIYQEI--DKLEKTRIKIQQ 293 >gi|258620051|ref|ZP_05715090.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258587409|gb|EEW12119.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 308 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D RMG F D + + Sbjct: 90 DMQSGQQMVDRLTAVKQVLSEFITK-REGD------RMGLILFADHAYLQTPLTLDRQTV 142 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I + + A+ + TAI + + A T I S ++ ++LL+DG N Sbjct: 143 ISQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 189 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L AS Sbjct: 190 TAGVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTT 249 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 250 GGQYFRARNQQDLQSIY 266 >gi|328469247|gb|EGF40193.1| hypothetical protein VP10329_10201 [Vibrio parahaemolyticus 10329] Length = 334 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 44/271 (16%), Positives = 80/271 (29%), Gaps = 44/271 (16%) Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + Q + Q + + + +V+C + +++ Sbjct: 39 EETRQAEIKLAYLPESKDSHKPKQWQQKSLSVAIWTLLVVACARPVWFGEPVEFQPKYRD 98 Query: 255 EEHFVDSSSLRH---------VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 VD S I + V+ L+ + +K D R+G F D Sbjct: 99 LMLVVDLSGSMQKEDMNLDGEYIDRLSAVKKVLSDFVAK-RKGD------RLGVVLFGDH 151 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + +I+ I K I TAI D + T + S Sbjct: 152 AYLQTPLTADRQTVIQQI-KQTVIGLVGQ-RTAIGDGIGLGTKTFVDS-----------D 198 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQ 412 ++ ++LL+DG NT + I AK I T+ + Sbjct: 199 APQRVMILLSDGSNTAGVLDPIEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAA 258 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + L+ A + +F A +L KI+ Sbjct: 259 DLDEQTLTKIAEMTGGQYFRARDAEQLEKIY 289 >gi|149918750|ref|ZP_01907237.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] gi|149820351|gb|EDM79767.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] Length = 350 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 56/151 (37%), Gaps = 15/151 (9%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F + + H ++R ++ + + TAI + + + + Sbjct: 166 RIALVGFGAHASTIAPLTLD-HAVLRNLIVQVRLGVVDGQETAIGAGLGVSLNRL----- 219 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV---NKTQQ 412 K + A K IVLLTDG + D + + A +G+ I T+ +++ Sbjct: 220 ------KESQAATKIIVLLTDGVHNADGMDPDTVAQTAAERGVVIYTVLMGQQTGDRSSV 273 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + + A+ + A T L F+D Sbjct: 274 DAGQLERLAGATDGYAYLAEDTQTLETSFQD 304 >gi|146298482|ref|YP_001193073.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152900|gb|ABQ03754.1| BatA-like protein [Flavobacterium johnsoniae UW101] Length = 334 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 61/194 (31%), Gaps = 39/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ A + + R+G + + + ++ I Sbjct: 113 RMEALKRVAADFVEE----RPND---RIGLVLYASEAYTKTPVTSDKPIILEAIKGIRYD 165 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T I + TA + + K++ + I+LLTDG N E Sbjct: 166 TVLQDG-TGIGMGLATAVNRL-----------KDSKAKSRVIILLTDGVNNAGFIEPETA 213 Query: 390 CNKAKSQGIRIMTIAFSVN-------KTQQEKA-----------RYFLSNCA--SPNSFF 429 + AK GI++ TI N + + A + ++F Sbjct: 214 ADIAKQYGIKVYTIGLGTNGMAESPYAYAPNGGFLFKMQKVEIDERLMKSIAKKTDGTYF 273 Query: 430 EANSTHELNKIFRD 443 A S +L +I+ Sbjct: 274 RATSNDKLAEIYNS 287 >gi|149909171|ref|ZP_01897828.1| hypothetical protein PE36_09171 [Moritella sp. PE36] gi|149807695|gb|EDM67641.1| hypothetical protein PE36_09171 [Moritella sp. PE36] Length = 402 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 66/450 (14%), Positives = 152/450 (33%), Gaps = 68/450 (15%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I+ G + ++P ++ + + V +S QAA +A++ + Sbjct: 4 IQRQRGAITLTFTFMLPAIVSLLAITVFFAMYSQVVIRAGQAADSAVLACA--------- 54 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +N + I +Y NF + +V++N +S Sbjct: 55 ---YQQNDTGVVTEGILDYYRPNFVLP-------------ELNKSVKLNSNNGC---QIS 95 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK---EHGVSIQWVIDFSRSMLD 191 ++Y ++ ++ + ++ + + ++ + ++ V V+D S SM Sbjct: 96 AQYRFEPAMVNALPVAIDSDTEVVSNSQSSAKLVQNVNVNGIQNPVDFSLVLDISGSMTW 155 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + + + + + + S + P+ S L P Sbjct: 156 HLPELKKIITDVISDIVPSSNQVRFS-----------IVPFQTGVGVSGAPWLLSSEASP 204 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + +L + + + ++ R T F Sbjct: 205 KCVDGLVYRNGNLDADKT-VQSLNYSSDRL--DFNEVTPGRWLDRCSETSF------ILP 255 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS--NEDEVHRMKNNLEAKK 369 + ++++IR + GSTA T+ E +V ++++ ++ Sbjct: 256 LTNNLNRVIRYVESLD----TSGGSTASYQGFIWGVRTLTDQWQKEWQVTPVQSSSLTQR 311 Query: 370 YIVLLTDGENTQ----DNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCAS 424 I+ TDG++ + ++ +C+ + I++ I F V+ ++ + F CA Sbjct: 312 LILF-TDGDDNRRDYFNDLMSAGLCDVIQQDLNIQVSFIGFGVSA---DRIKQF-KQCAG 366 Query: 425 -PNSFFEANSTHELNKIFRDRIGNEIFERV 453 S F+AN+T EL F D I I +V Sbjct: 367 RNGSVFDANNTAELADYFEDAININIETKV 396 >gi|227818462|ref|YP_002822433.1| hypothetical protein NGR_b02140 [Sinorhizobium fredii NGR234] gi|227337461|gb|ACP21680.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 440 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 58/152 (38%), Gaps = 12/152 (7%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++L+K +G+F ++TALL+PV+ G + +++ + ++ A A I A + Sbjct: 9 QRLLKDQSGNFGLMTALLVPVLFLSGSVALNIANATREASKMQDALDAAAIKAVRSYGEG 68 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E + R E F N + TD AVE ++ + Sbjct: 69 ESENAVR------------TEANRLFFANFQTPSATDGYNSASPESPAVEFTFSETGQET 116 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 S+ Y NP+ L+ I + + Sbjct: 117 RASASYAAQYNPVFWGLQPFVISRRSVAARLT 148 >gi|332519334|ref|ZP_08395801.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045182|gb|EGI81375.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 334 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 66/188 (35%), Gaps = 35/188 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +AL +V K + R+G + + + ++R++ + + E G Sbjct: 115 EALKNVASEFIKGRPND---RIGLVEYAGESYTKTPITSDKSIVLRSLQEIRYNNIIE-G 170 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA + + K++ K I+LLTDG N + A Sbjct: 171 GTAIGMGLATAVNRL-----------KDSKAKSKVIILLTDGVNNSGSINPKIASELAVE 219 Query: 396 QGIRIMTIAFSVNKTQ------QEKAR------------YFLSNCA--SPNSFFEANSTH 435 GI+ TI N ++ + L A + +F A + Sbjct: 220 FGIKTYTIGLGTNGMALSPIAIKQNGQFQYGRVKVEIDETLLKEIAQVTGGKYFRATNNK 279 Query: 436 ELNKIFRD 443 +L +I+ + Sbjct: 280 KLAEIYDE 287 >gi|261867447|ref|YP_003255369.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412779|gb|ACX82150.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] Length = 545 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 71/522 (13%), Positives = 149/522 (28%), Gaps = 84/522 (16%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 K K+ +++ G + IITALL +L VD + L QA A + Sbjct: 9 AKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLL 68 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR--------------- 109 Q + +Q+IE ++ + R Sbjct: 69 IAEDNQYRKNKDHSDVKRQNVSQQEIEREGRDFSSAKVQAQWKKRNQELVQGLVKLYLRS 128 Query: 110 -EVRDIVRDTAVEM----------------NPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 + + + V + N +A + + + + +++ Sbjct: 129 DDSKGQKNSSPVTIKEPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLV 188 Query: 153 IKSWLIQTK----AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP- 207 S L + + + + + V D SRSM+ + P Sbjct: 189 SSSQLYDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMMWAINATGNNPPEVNYPNR 248 Query: 208 ----------------ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 K S ++G + Y D Sbjct: 249 RIDALREAVEGIEKILLPAQNKGDVSPYNRMGFVSFAAGTRQRDELTNCVLPYYVKSEDK 308 Query: 252 SLSEEHFVDSS-SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + + I K + D + ++I +I + R + + + Sbjct: 309 KREISAKFKKGYNGGNHIAKGFELLDRDLDIPKTIDQISQFDGQKRTYDFTLDSKTSRNY 368 Query: 311 SFSWGVHKLIRTI--------VKTFAIDENEMGSTAINDAMQTAYDTIISSNED-EVHRM 361 V+K V + G TA+ + + ++ N+D E Sbjct: 369 CLEDNVNKKTTQAWFDKNNRAVASALKQIIPRGGTAVTSGIFIGTNLMMEKNKDFEAMPN 428 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGI------AICNKAKS--QGIR----------IMTI 403 K ++ +++L+DGE+ +++ + +C + K G++ I + Sbjct: 429 KIGTNTRRILMILSDGEDNIPSKDTLVKLMEAGLCTRVKEKIDGLQDSNYPKVETRIAFV 488 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 AF N +K + C + ++ +S L F+ I Sbjct: 489 AFGFNP--PQKQQEVWKKCVG-DQYYSVSSKQALFDAFKQII 527 >gi|153825062|ref|ZP_01977729.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] gi|149741387|gb|EDM55421.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] Length = 318 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I SN ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDSN-----------APQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAANIAKQYHTTIYTLGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|261212659|ref|ZP_05926943.1| protein BatA [Vibrio sp. RC341] gi|260837724|gb|EEX64401.1| protein BatA [Vibrio sp. RC341] Length = 232 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 66/197 (33%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I + R+G F D + + Sbjct: 14 DMQSGQQMVDRLTAVKQVLSDFIAK-------REGDRIGLILFADHAYLQTPLTLDRETV 66 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + A+ + TAI + + A ++ ++ I+LL+DG N Sbjct: 67 TQQLNQ--AVLKLIGTQTAIGEGIGLA-----------TKIFIDSDAPQRVIILLSDGSN 113 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L AS Sbjct: 114 TAGVLDPLEAANIAKQYHSTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIASTT 173 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 174 GGQYFRARNQQDLQNIY 190 >gi|153802375|ref|ZP_01956961.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|124122094|gb|EAY40837.1| conserved hypothetical protein [Vibrio cholerae MZO-3] Length = 318 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 68/197 (34%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I S ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + + AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAADIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|332664649|ref|YP_004447437.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333463|gb|AEE50564.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 328 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 68/185 (36%), Gaps = 36/185 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + HK++ T ++ E G TAI + A + + Sbjct: 127 RIGLVVFAGEAFTQCPLTTD-HKILETFLEQLECGNLEDG-TAIGMGLAGAVNRL----- 179 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA----------- 404 K + K I+LLTDG N + + AK GI++ +I Sbjct: 180 ------KKSPAKSKVIILLTDGVNNVGYFKPLTAGELAKELGIKVYSIGVGTIGEALTPV 233 Query: 405 -------FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG--NEIFERV 453 F ++ Q E L A + +F A + +L +I+ + I + +V Sbjct: 234 SRLSDGSFFLDYAQVEIDEELLREIARMTGGQYFRAKNNQDLRQIY-NTIDRLEKTEIQV 292 Query: 454 IRITK 458 RI K Sbjct: 293 TRIKK 297 >gi|269105138|ref|ZP_06157832.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] gi|268160588|gb|EEZ39087.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] Length = 436 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 75/463 (16%), Positives = 140/463 (30%), Gaps = 58/463 (12%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL K+ GH I+ A+++PV+ G+ + D R + ++ A + A + + Sbjct: 2 KLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH--NDP 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLI---RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 S + ++ +YL + ++ R DI ++ + + Sbjct: 60 NVNSDGLGSGSKVNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRFF 119 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + L + + + S T + ++ V + + DFS+SM Sbjct: 120 EYEVEA-----LTTQNSWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAADFSKSM 174 Query: 190 LDY---QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + R + K + ++ S S + Sbjct: 175 EEPWTGGRQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKF----- 229 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS-------IKKIDNVNDTVRMGA 299 + ++ + + +S H KK ++ L ++ K D + Sbjct: 230 DNYNSCFMKQDYFEKNSRDHRKKKYVDIKRTLNNIFIEKGNDSCGFKSDDPDAVFHDIYL 289 Query: 300 T----FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 T FN + G I + + T + IIS Sbjct: 290 TNDFDTFNKEIRKFRP-GNGTASCQGIIRSAQMLRK----GTNSRRLL-----IIISDGN 339 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS--------QGIRIMT-IAFS 406 D + E K I N N +CNK + G I T IA Sbjct: 340 DWYYPYSGYKETDKEI------ANKLVNAG---MCNKIRETLNLDKTPSGQEIKTRIAVI 390 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K + L NCA ++ F+A EL I EI Sbjct: 391 GFDYDANKNKALL-NCAGEDNVFKAQYRDELLDQILSLITEEI 432 >gi|166713250|ref|ZP_02244457.1| hypothetical protein Xoryp_17865 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 335 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 67/189 (35%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 121 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLRDSVV 176 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + + Sbjct: 177 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLDPLK 223 Query: 389 ICNKAKSQGIRIMTIAFS-----------VNKTQQEKA-RYFLSNCA--SPNSFFEANST 434 AK++G+RI TIAF + + L A + FF A T Sbjct: 224 AAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDT 283 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 284 EELAGIYAE 292 >gi|153812017|ref|ZP_01964685.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] gi|149831916|gb|EDM87002.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] Length = 2099 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 67/434 (15%), Positives = 149/434 (34%), Gaps = 57/434 (13%) Query: 48 YYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIE---EYLIRNFENNLKK 104 Y E+ ++A A + + + ++ S + F ++I +Y+ + E + Sbjct: 1140 YKENGTEEATD-APLNNTSFTLDEMKNTSD-GVYTQIFQNEQIGGNEKYIYKVEETGSQV 1197 Query: 105 NFTDREVRDIVRDTAVEMNPRKS------AYQVVLSSRYD-LLLNPLSLFLRSM-----G 152 N E V V+ + + + + +++ Y + +N + + ++ Sbjct: 1198 NGYTVETTQTVSGGDVQSDGKSTKIGEKDSATFTITNTYTPIDINSVIEYNKTATLLDWN 1257 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 +++ I A ++T + + I V+D S SM + + + Sbjct: 1258 QRTYKIDLTASSKT---TQSMKTPYDIVLVLDQSGSMSQKFVEYNKINGSSMFWRKTYYI 1314 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 K+ + ++ + Y S Y +DP+ ++ + S+ K Sbjct: 1315 KTQNGIYQQLSWSWDNTWSYTDS------YSGKTVTVDPNTTDVYVAQKSNQ----TKID 1364 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT-----F 327 ++ A + + + N N R+G F++ + + L + Sbjct: 1365 ALKSAATTF---VNNVANKNSDCRVGIVTFSNDGYI-KPITNNSYTLAKVGTSKGDIINT 1420 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG----ENTQDN 383 G T + A + ++ + ++ KK +V LTDG NT + Sbjct: 1421 IDGLKTGGDTYPAKGLDKANEIFSENSSNSWETVEQTDGRKKMVVFLTDGVPAPANTNNF 1480 Query: 384 EEGIAIC--NKAK---SQGIRIMTIAF---------SVNKTQQEKARYFLSNCASPNSFF 429 +E +A N AK QG+ + N + Q +Y S +S + Sbjct: 1481 DENLAGAGTNSAKILHDQGVATYALGIFGAANSDGTMDNASVQRIDKYMQSIASSHEKYM 1540 Query: 430 EANSTHELNKIFRD 443 A+S L+ +F Sbjct: 1541 TADSVDNLSSLFES 1554 >gi|58580793|ref|YP_199809.1| hypothetical protein XOO1170 [Xanthomonas oryzae pv. oryzae KACC10331] gi|58425387|gb|AAW74424.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 335 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 67/189 (35%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 121 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLRDSVV 176 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + + Sbjct: 177 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLDPLK 223 Query: 389 ICNKAKSQGIRIMTIAFS-----------VNKTQQEKA-RYFLSNCA--SPNSFFEANST 434 AK++G+RI TIAF + + L A + FF A T Sbjct: 224 AAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDT 283 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 284 EELAGIYAE 292 >gi|258647263|ref|ZP_05734732.1| BatA protein [Prevotella tannerae ATCC 51259] gi|260852912|gb|EEX72781.1| BatA protein [Prevotella tannerae ATCC 51259] Length = 334 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 54/170 (31%), Gaps = 34/170 (20%) Query: 297 MGATFFNDRVISDPSFSWGVH---KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G T F + + + + + + TAI + +A + S Sbjct: 129 IGLTVFGGEAYTQCPLTTDHSALLNMFKQVNCDLQKEGVISPGTAIGMGLSSAVSHLEQS 188 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 K I+LLTDGEN + AK GIRI TI+ + + Sbjct: 189 KSKS-----------KVIILLTDGENNAGEISPLTAAEMAKRLGIRIYTISVGTDAAVNQ 237 Query: 414 KARYF------------------LSNCASP--NSFFEANSTHELNKIFRD 443 L A+ F++A S +L I+++ Sbjct: 238 TVATLPNGETYEAAIKQNTDPKTLEAIANSTGGKFYQARSKAKLRDIYQN 287 >gi|332993941|gb|AEF03996.1| von Willebrand factor, type A [Alteromonas sp. SN2] Length = 344 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 35/161 (21%), Positives = 55/161 (34%), Gaps = 28/161 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D ++ + T++ I TAI DA+ A E Sbjct: 133 RLGLILFADTAYLQAPLTYD-RDTVSTLLSESVIGL-VGEQTAIGDAIGLAVKRFDEKEE 190 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 ++LLTDG+NT A ++ +++ TI +K Sbjct: 191 SN-----------NVLILLTDGQNTAGFITPEQAKELAVNKKVKVYTIGVGADKMLIQSF 239 Query: 411 --------QQEKARYFLSNCASP--NSFFEANSTHELNKIF 441 QE LS+ AS +F A +EL I+ Sbjct: 240 FGSRQVNPSQELDEDMLSDLASSTGGQYFRARDVNELEAIY 280 >gi|262172998|ref|ZP_06040675.1| protein BatA [Vibrio mimicus MB-451] gi|261890356|gb|EEY36343.1| protein BatA [Vibrio mimicus MB-451] Length = 318 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFITK-REGD------RVGLILFADHAYLQTPLTMDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I + + A+ + TAI + + A T I S ++ ++LL+DG N Sbjct: 153 ISQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L AS Sbjct: 200 TAGVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGQYFRARNQQDLQSIY 276 >gi|227820127|ref|YP_002824098.1| transmembrane protein [Sinorhizobium fredii NGR234] gi|227339126|gb|ACP23345.1| putative transmembrane protein [Sinorhizobium fredii NGR234] Length = 451 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 47/373 (12%), Positives = 117/373 (31%), Gaps = 65/373 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ G+ + A+ + M+ G +D R + ++ A++ A Sbjct: 19 MVSDRGGNVALTVAICIIPMILAVGAGLDYTRAYNVQSRMQSDLDAALVAAI-------- 70 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 ++I+EY +K F + + E+ KS + + Sbjct: 71 --------------KEIDEYDEDEIAEKIKDWFDAQSEKQSATYDLTEITVDKSGHTITA 116 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S+ + + + IK+ + + E + SY + + VID S SML Sbjct: 117 SASGTVP----TTLMTLADIKTVPVGVISAIEGPATSY-----LEVYIVIDKSPSMLLAA 167 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + L ++C + + P + ++ Sbjct: 168 TSEDQAMLRADAN---------------------------ITCEFACHDTKDPVKKNGTV 200 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + + V + + DA+ V+ + D + +++G + + + Sbjct: 201 IASTYYNYIKSLGVKLRTDVALDAVEEVLDMVDAADEDHARIKVGLYSLGETISEVLEPT 260 Query: 314 WGVHKLIRTI--VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + + + + M +T A++ + ++ + + K + Sbjct: 261 YSTSTARKKLSDDSSGLTSATSMSATYFQTALKALKKKVGTAGDGT-----SAASPLKLV 315 Query: 372 VLLTDGENTQDNE 384 +LLTDG + + Sbjct: 316 LLLTDGVQSNRDW 328 >gi|197335948|ref|YP_002155278.1| hypothetical protein VFMJ11_0524 [Vibrio fischeri MJ11] gi|197317438|gb|ACH66885.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 463 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 57/372 (15%), Positives = 111/372 (29%), Gaps = 28/372 (7%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL K +GH I+ + +PV+ GV + D R + L+ AA+ A++ S Sbjct: 2 KLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVSA------ 55 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 T + + + + + + E ++ TA + + YQV Sbjct: 56 ---YGEEDEVSTQTGKDYVAHYLHDMSSLVDIKVEKLECSELPECTADDNDRPFVEYQVS 112 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +++ + P + G + +R + + I +++DFS SM Sbjct: 113 GRTKH-ISWFPGNDVTVGFGE-----SFDVTGSSKARKFQSSQPMDITFILDFSGSMNYD 166 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP---YMVSCNKSLYYMLYPGPL 249 + S + + Y S + + G Sbjct: 167 WEGHAPSYMEEEIPKVPGRYSPPSRLSDLKYVVQMVTDELQVYNNSTAGPKHRVAMTGYN 226 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 +++E I K + K I + GA Sbjct: 227 RRTVNESSNGKFVIRDQRITKYNSDGYDAGDTFYPKKTI--NKQFMVKGAAARVPNGDEK 284 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE--A 367 F+ ++ G TA + A + H + E Sbjct: 285 AEFTDIMYTSDFASFNHKIKSFEAFGGTASLQGIIRASQIV------SYHITNDGEEANP 338 Query: 368 KKYIVLLTDGEN 379 K+ I++L+DGE+ Sbjct: 339 KQLIIILSDGED 350 >gi|262164788|ref|ZP_06032526.1| protein BatA [Vibrio mimicus VM223] gi|262027168|gb|EEY45835.1| protein BatA [Vibrio mimicus VM223] Length = 318 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFITK-REGD------RVGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I + + A+ + TAI + + A T I S ++ ++LL+DG N Sbjct: 153 ISQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L AS Sbjct: 200 TAGVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGQYFRARNQQDLQSIY 276 >gi|262191198|ref|ZP_06049398.1| protein BatA [Vibrio cholerae CT 5369-93] gi|262032938|gb|EEY51476.1| protein BatA [Vibrio cholerae CT 5369-93] Length = 477 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 68/197 (34%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 259 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 311 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I S ++ ++LL+DG N Sbjct: 312 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 358 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 359 TAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 418 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 419 GGHYFRARNQQDLQNIY 435 >gi|229522840|ref|ZP_04412254.1| protein BatA [Vibrio cholerae TM 11079-80] gi|229340057|gb|EEO05065.1| protein BatA [Vibrio cholerae TM 11079-80] Length = 318 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 68/197 (34%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I S ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|153214389|ref|ZP_01949360.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|124115338|gb|EAY34158.1| conserved hypothetical protein [Vibrio cholerae 1587] Length = 318 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 68/197 (34%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I S ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|254225237|ref|ZP_04918850.1| conserved hypothetical protein [Vibrio cholerae V51] gi|125622336|gb|EAZ50657.1| conserved hypothetical protein [Vibrio cholerae V51] Length = 318 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 69/197 (35%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I S ++ ++LL+DG N Sbjct: 153 ANQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L + A+ Sbjct: 200 TAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTSQDLDEKTLQSIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|330878848|gb|EGH12997.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 352 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 52/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTAVGDAIGLALKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + + +VL+TDG N + I A +G++I I + + Sbjct: 193 NS-----------RVLVLVTDGANNAGQIDPITAARLAADEGVKIYPIGIGSDPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SALGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 280 >gi|297579701|ref|ZP_06941628.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297535347|gb|EFH74181.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 318 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 68/197 (34%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I ++ D R+G F D + + Sbjct: 100 DMQSGQQMVDRLTAVKQVLSEFIAK-REGD------RIGLILFADHAYLQTPLTLDRQTV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TAI + + A T I S ++ ++LL+DG N Sbjct: 153 ASQLNQ--AVLKLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L A+ Sbjct: 200 TAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTT 259 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 260 GGHYFRARNQQDLQNIY 276 >gi|194227185|ref|XP_001916970.1| PREDICTED: inter-alpha (globulin) inhibitor H5 [Equus caballus] Length = 905 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 56/277 (20%), Positives = 91/277 (32%), Gaps = 31/277 (11%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG-------IRDEKLSPYMVSC 236 D S ++ + ++ +QNG +G + +K + Sbjct: 175 DDSGP-PPSTVVNQNETFAQVIFKPTVVQQAKIAQNGILGDFILRYDVNRDKSIGDIQVL 233 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D+ N Sbjct: 234 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFN---- 289 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 290 --VIGFSNRIKV-----WKDHLVSVTPDSVRDGKVYIHHMSPTGGTDINGALQRAIRLL- 341 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 N H + I+ LTDG+ T + I N K I I TI + Sbjct: 342 --NNYVAHNDIEDRSVS-LIIFLTDGKPTVGETNTLKILNNTKEATRGQICIFTIGIGDD 398 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC + + F D I Sbjct: 399 VDFKLLEKLSLENCGLTRRVHDEDDAGSQLIGFYDEI 435 >gi|88801581|ref|ZP_01117109.1| batA protein [Polaribacter irgensii 23-P] gi|88782239|gb|EAR13416.1| batA protein [Polaribacter irgensii 23-P] Length = 334 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 59/195 (30%), Gaps = 41/195 (21%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ + + R+G + + + + + RTI + Sbjct: 113 RLEALKKVAIDFVDR----RPND---RIGIVVYAGESFTQTPITSDKNIVKRTISELQWG 165 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G TAI + + + + K + K I+LLTDG N N + Sbjct: 166 QLD--GGTAIGMGLGSGVNRL-----------KESTAKSKVIILLTDGVNNAGNIDPRTA 212 Query: 390 CNKAKSQGIRIMTIAFSVNKT-------QQEKAR------------YFLSNC--ASPNSF 428 A+ I++ TI N + L A+ + Sbjct: 213 TELARELEIKVYTIGIGTNGMADFPWSKDPRTGKLNFRKQQVEIDEKLLQEIATATDGKY 272 Query: 429 FEANSTHELNKIFRD 443 F A L +I+ + Sbjct: 273 FRATDNQSLKEIYDE 287 >gi|170727371|ref|YP_001761397.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169812718|gb|ACA87302.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 330 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 56/165 (33%), Gaps = 32/165 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F D + + + + K I TAI +A+ + +E Sbjct: 128 KLGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGEAIALSVKRFDLVDE 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VLLTDG N + + A +GI+I +I + ++ Sbjct: 186 SN-----------RILVLLTDGSNNSGSISPEQAADIAAKRGIKIYSIGVGADVMERRTL 234 Query: 416 RYFLSNCASP-----------------NSFFEANSTHELNKIFRD 443 F +P +F A + EL +I+++ Sbjct: 235 --FGKERVNPSMDLDEEQLTSLAQTTGGRYFRARNAQELEQIYQE 277 >gi|296206125|ref|XP_002750075.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Callithrix jacchus] Length = 940 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 72/426 (16%), Positives = 138/426 (32%), Gaps = 46/426 (10%) Query: 46 WSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 ++ I A+ + + + ++K + ++ N + Sbjct: 72 MLNRASEDQEIEFQVQIPAAAFITNFTMLIGDKVYQGEITEREKKRDDRVKEKRNKTTEE 131 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 ++ +I R +AV + K+A+ + +LL L + S+ ++ + + + Sbjct: 132 NGEKGT-EIFRASAVIPSKDKAAFFLSYE---ELLQRRLGKYEHSISVRPQQLAGRLSVD 187 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS--------YSS 217 + + + SR + + + P + T + Sbjct: 188 VNILESAGIASLEV-LPLHNSRHRGSGRGEDDSGPPPSTVINQNETFAHILFKPSVVQQA 246 Query: 218 QNGKVGIRDEKLSPYMVS----------CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + GI + + Y VS N + P L P FV SS V Sbjct: 247 KIAQNGILGDFIIRYDVSREQSVGDIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASMV 306 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV--- 324 K +DAL +++ ++ D R F++R+ W H + T Sbjct: 307 GTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKV-----WKDHLISVTPDSVR 355 Query: 325 --KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 K + + G T IN+A+QTA + N+ H + I+ LTDG+ T Sbjct: 356 DGKVYIHHMSPTGGTDINEALQTAIRLL---NKYVAHSDSGDRSVS-LIIFLTDGKPTVG 411 Query: 383 NEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + I N + + I TI + + + L NC E Sbjct: 412 ETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLI 471 Query: 440 IFRDRI 445 F D I Sbjct: 472 GFYDEI 477 >gi|325268973|ref|ZP_08135594.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] gi|324988594|gb|EGC20556.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] Length = 318 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 72/210 (34%), Gaps = 25/210 (11%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + +D L+ + + + + ++ + I N N +G T Sbjct: 80 WDDKETEGIDIMLTMDVSASMLTDDVYPNRMAVAKEVASEF---ISSRPNDN----IGLT 132 Query: 301 FFNDRVISDPSFSWG---VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F + + + L+ + TAI + A + S Sbjct: 133 IFAGEAFTQCPMTLDHAALLNLLHNVRPDLVTSGLMKDGTAIGMGLANAVSRLQDSKAKS 192 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 K ++LLTDG N + + AK GIR+ TI F ++ A Sbjct: 193 -----------KIVILLTDGSNNVGSISPMTAAAIAKKFGIRVYTIGFGRETGEEIGAID 241 Query: 418 F--LSNCA--SPNSFFEANSTHELNKIFRD 443 + L N A + F+ A S EL++I++D Sbjct: 242 YRALQNIAVSTNGEFYRAQSQAELSRIYQD 271 >gi|301058342|ref|ZP_07199375.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447578|gb|EFK11310.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 331 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 68/209 (32%), Gaps = 40/209 (19%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ +A I +T R+G F + + + L+ + + Sbjct: 111 VTRLEAVKKVVADFIGK-------RETDRIGLVVFGEEAFTQSPLTIDKGLLLELVNRMK 163 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI A+ R+K+ K ++LLTDG N Sbjct: 164 I--GMAGDRTAIGSAIAIG-----------GKRLKDLKSKSKILILLTDGRNNAGEISPQ 210 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------------LSNCASP--NSFF 429 A + GI++ TI + + L N A +F Sbjct: 211 AAARAVREFGIKLYTIGVGGKGPAPFRMKTLFGTRLVPQHVDLDEVTLRNVAKTGGGKYF 270 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRITK 458 A ++ EL +I+ I + + +++ + Sbjct: 271 RAANSQELQEIY--DIIDRAEKTDVKVKE 297 >gi|260592520|ref|ZP_05857978.1| BatA protein [Prevotella veroralis F0319] gi|260535566|gb|EEX18183.1| BatA protein [Prevotella veroralis F0319] Length = 318 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/181 (22%), Positives = 69/181 (38%), Gaps = 25/181 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKT 326 + + ++ + I ++ +G T F + + + L+ + Sbjct: 109 RIEVAKEVASDFI----SGRPNDN---IGLTIFAGEAFTQCPMTVDHAALLNLLHNVRTD 161 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI M A + V R+K++ K I+LLTDG N + Sbjct: 162 LVVKGLIQDGTAI--GMGLA---------NSVSRLKDSKAKSKVIILLTDGSNNVGSISP 210 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFR 442 + + AK GIRI TI A + L N A + F+ A S EL+KI++ Sbjct: 211 MTAASIAKKYGIRIYTIGLGKESEGDLGAIDYKTLQNIAVSTNGEFYRAQSQAELSKIYQ 270 Query: 443 D 443 D Sbjct: 271 D 271 >gi|213968792|ref|ZP_03396933.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] gi|213926395|gb|EEB59949.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] Length = 328 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 52/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 111 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTAVGDAIGLALKRLRLRPA 168 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + + +VL+TDG N + I A +G++I I + + Sbjct: 169 NS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDALQ 217 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 218 SALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 256 >gi|28870917|ref|NP_793536.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|301385766|ref|ZP_07234184.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato Max13] gi|302061830|ref|ZP_07253371.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato K40] gi|302134226|ref|ZP_07260216.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato NCPPB 1108] gi|28854166|gb|AAO57231.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] gi|331018299|gb|EGH98355.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 352 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 52/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTAVGDAIGLALKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + + +VL+TDG N + I A +G++I I + + Sbjct: 193 NS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 280 >gi|240137370|ref|YP_002961839.1| hypothetical protein MexAM1_META1p0632 [Methylobacterium extorquens AM1] gi|240007336|gb|ACS38562.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens AM1] Length = 339 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/151 (21%), Positives = 50/151 (33%), Gaps = 11/151 (7%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D+ + ++ + RT+ A ST I D + A + + Sbjct: 143 RIGLVEFADQAYVAAAPTFDTATVARTL--EEATIGLVGRSTGIGDGLGLALKRLAPAQV 200 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--QE 413 K +VLL+DG N + AK G+R+ TIA Sbjct: 201 AAADGAGPPPSRDKVVVLLSDGANNAGQTAPKDVAALAKDLGVRVYTIALGPIDMADNPN 260 Query: 414 KARYF-----LSNCA--SPNSFFEANSTHEL 437 + L A S F +T +L Sbjct: 261 NEQDVVDVETLRAMAETSGGRAFRVKTTDDL 291 >gi|59711129|ref|YP_203905.1| TadG-like protein [Vibrio fischeri ES114] gi|59479230|gb|AAW85017.1| TadG-like protein [Vibrio fischeri ES114] Length = 465 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 61/377 (16%), Positives = 112/377 (29%), Gaps = 38/377 (10%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL K +GH I+ + +PV+ GV + D R + L+ AA+ A++ + Sbjct: 4 KLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLA--------V 55 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV-----RDTAVEMNPRKS 127 + S K + Y+ + N D EV + TA + + Sbjct: 56 SAYGEEDEVSTQTGKDYVAHYM------HDMSNLVDIEVEKLECSELPECTADDNDRPFV 109 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 YQV +++ P + G + +R + + I +++DFS Sbjct: 110 EYQVSGRTKHKSWF-PGNDVTVGFGE-----SFDVTGMSKARKFQSSQPMDITFILDFSG 163 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP---YMVSCNKSLYYML 244 SM + S + + Y S + + Sbjct: 164 SMNYDWEGHAPSYMEEEVPKVPGRYSPPSRLSDLKDVVQMVTDELQVYNNSTTGPKHRVA 223 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 G +++E I K + K I + GA Sbjct: 224 MTGYNRRTVNESSNGKFVIRDQRITKYNSDGYDAGDKFYPKKTI--NKQFMVKGAAARVP 281 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 F+ ++ G TA + A + H + Sbjct: 282 NGDEKAEFTDIMYTSDFASFNHKIKSFEAFGGTASLQGIIRASQIV------SYHITNDG 335 Query: 365 LE--AKKYIVLLTDGEN 379 E K+ I++L+DGE+ Sbjct: 336 EEANPKQLIIILSDGED 352 >gi|83312851|ref|YP_423115.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] gi|82947692|dbj|BAE52556.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] Length = 464 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 65/498 (13%), Positives = 139/498 (27%), Gaps = 99/498 (19%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L+ G+ II A+ + ++ G+ VDV R + + + Sbjct: 8 RLMTCRKGNMAIILAIGLLPIITTIGLGVDVARAYAVKSRM----------------SAA 51 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + ++ A S + ++ + F+ N V+ T ++ A Sbjct: 52 LDAAALAVGSSSGTDAQLSAVAQKFFDANYPTGALGAHPSVAVKVTGDVISASAVAEVDT 111 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + F++ +G+ + + G+ + V+D + SM Sbjct: 112 V-------------FMKVVGLNDVPVHADSTVNRQI------AGLELAMVLDNTGSMTTN 152 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + K+ + + + S SL + Sbjct: 153 NNIQAVRDAANQLTDILFGT-ATVHPYLKIALVPYSAAVNVGSVAPSLITTGDTYAPNDL 211 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP-- 310 L + V + + + A + + + +DN D + N + Sbjct: 212 LGWKGCVVERAGANGVGDTSAA-TAPWTRYKWLPAVDNNYDATKSSTVLANPSNGNASTG 270 Query: 311 ----------SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + L I A++ G T + M + Sbjct: 271 PNLGCPTAITPLTNVKATLTPAIN---AMEAWSRGGTLSDVGMAWGLRVLSPEPPFTEGL 327 Query: 361 MKNNLEAKKYIVLLTDGEN----------------------------------------- 379 + K ++L+TDG+N Sbjct: 328 PWGTPKWSKAVILMTDGDNQFYKLTSTTGGNKVNSAVNSDYGAYGRLDELGRIGTTNATT 387 Query: 380 --TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHE 436 T N ++CN K++ I + T+ F + + CA+ S +F++ S E Sbjct: 388 AKTTINTRLTSVCNAMKAKNIIVYTVTF--TSGINQATKDIYKACATDASKYFDSPSQDE 445 Query: 437 LNKIFRDRIGNEIFERVI 454 L FR I + + Sbjct: 446 LKSAFR-AIATSLSNLRV 462 >gi|86143679|ref|ZP_01062055.1| batA protein [Leeuwenhoekiella blandensis MED217] gi|85829722|gb|EAQ48184.1| batA protein [Leeuwenhoekiella blandensis MED217] Length = 334 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 62/194 (31%), Gaps = 39/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ + I R+G + + + ++ ++ Sbjct: 113 RLEALKEVASQFIADRPSD-------RIGLVEYAGESYTRTPITSDKSIVLSSLNDIQYN 165 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G TAI + T+ + + K++ K I+L+TDG N E Sbjct: 166 SIIE-GGTAIGMGLATSVNRL-----------KDSRAKSKVIILMTDGVNNAGFIEPSTA 213 Query: 390 CNKAKSQGIRIMTIAFSVN-------KTQQEKA-----------RYFLSNCA--SPNSFF 429 A+ GI++ TI N + + + L A + +F Sbjct: 214 SELAQEFGIKVYTIGLGTNGTALSPVALRPDGSFQYGSIPVEIDEALLQEIADKTGGLYF 273 Query: 430 EANSTHELNKIFRD 443 A L +I+ + Sbjct: 274 RATDNESLEEIYAE 287 >gi|218781310|ref|YP_002432628.1| hypothetical protein Dalk_3472 [Desulfatibacillum alkenivorans AK-01] gi|218762694|gb|ACL05160.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length = 308 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 70/185 (37%), Gaps = 24/185 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + V+ + ++ DT R+G F D + + L Sbjct: 102 DFAIDGQRVNRLTAVKKVVHDFVKR-------RDTDRIGLVVFGDYAFTQAPLTLDKGLL 154 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + I TAI DA+ A R+K+ K ++LL+DGEN Sbjct: 155 LNLIENLRI--GMAGRKTAIGDALGVA-----------GKRIKDIPAMSKVVILLSDGEN 201 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELN 438 T + + GI+I TI T+Q ++ A ++ A++T +L+ Sbjct: 202 TAGDMTPQGAAEALAALGIKIYTIGMG---TEQAGSKELAQIAAIGQGKYYHASNTEQLD 258 Query: 439 KIFRD 443 I+++ Sbjct: 259 SIYKE 263 >gi|311746225|ref|ZP_07720010.1| BatA protein [Algoriphagus sp. PR1] gi|126576455|gb|EAZ80733.1| BatA protein [Algoriphagus sp. PR1] Length = 347 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 42/166 (25%), Positives = 66/166 (39%), Gaps = 29/166 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F S + +KL+ +++ + + E TAI A+ +A + Sbjct: 147 RIGMVVFAGEAYSLAPLTND-YKLLTDLIQDISFNMMEAKGTAIGSAIASATN------- 198 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 RMK + A K ++LL+DGE+ N + + A + I+I TIA + Sbjct: 199 ----RMKESESASKVLILLSDGESNAGNVDPLFAAQLASALDIKIYTIAVGKDGMVPYGT 254 Query: 416 RYF--------------LSNCAS--PNSFFEANSTHELNKIFRDRI 445 +F L A FF A+ LN IF DRI Sbjct: 255 DFFGRPQMVESYLDETNLREIAKIGNGEFFRASDGGTLNNIF-DRI 299 >gi|153835956|ref|ZP_01988623.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|260880154|ref|ZP_05892509.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|260895271|ref|ZP_05903767.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|260900622|ref|ZP_05909017.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] gi|149750710|gb|EDM61455.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|308085798|gb|EFO35493.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|308091801|gb|EFO41496.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|308107055|gb|EFO44595.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] Length = 334 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 44/271 (16%), Positives = 80/271 (29%), Gaps = 44/271 (16%) Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + Q + Q + + + +V+C + +++ Sbjct: 39 EETRQAEIKLAYLPESKNSRKPKQWLQKSLAVGVWTLLVVACARPVWFGEPVEFQPKYRD 98 Query: 255 EEHFVDSSSLRH---------VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 VD S I + V+ L+ + +K D R+G F D Sbjct: 99 LMLVVDLSGSMQKEDMNLDGEYIDRLSAVKKVLSDFVAK-RKGD------RLGVVLFGDH 151 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + +I+ I K I TAI D + T + S Sbjct: 152 AYLQTPLTADRQTVIQQI-KQTVIGLVGQ-RTAIGDGIGLGTKTFVDS-----------D 198 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQ 412 ++ ++LL+DG NT + I AK I T+ + Sbjct: 199 APQRVMILLSDGSNTAGVLDPIEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAA 258 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + L+ A + +F A +L KI+ Sbjct: 259 DLDEQTLTKIAEMTGGQYFRARDAEQLEKIY 289 >gi|28901309|ref|NP_800964.1| hypothetical protein VPA1454 [Vibrio parahaemolyticus RIMD 2210633] gi|308125557|ref|ZP_05775735.2| von Willebrand factor type A [Vibrio parahaemolyticus K5030] gi|28809856|dbj|BAC62797.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308112309|gb|EFO49849.1| von Willebrand factor type A [Vibrio parahaemolyticus K5030] Length = 328 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 44/271 (16%), Positives = 80/271 (29%), Gaps = 44/271 (16%) Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + Q + Q + + + +V+C + +++ Sbjct: 33 EETRQAEIKLAYLPESKNSRKPKQWLQKSLAVGVWTLLVVACARPVWFGEPVEFQPKYRD 92 Query: 255 EEHFVDSSSLRH---------VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 VD S I + V+ L+ + +K D R+G F D Sbjct: 93 LMLVVDLSGSMQKEDMNLDGEYIDRLSAVKKVLSDFVAK-RKGD------RLGVVLFGDH 145 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + +I+ I K I TAI D + T + S Sbjct: 146 AYLQTPLTADRQTVIQQI-KQTVIGLVGQ-RTAIGDGIGLGTKTFVDS-----------D 192 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQ 412 ++ ++LL+DG NT + I AK I T+ + Sbjct: 193 APQRVMILLSDGSNTAGVLDPIEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAA 252 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + L+ A + +F A +L KI+ Sbjct: 253 DLDEQTLTKIAEMTGGQYFRARDAEQLEKIY 283 >gi|325927915|ref|ZP_08189139.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] gi|325541755|gb|EGD13273.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] Length = 338 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 66/189 (34%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 121 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLSDSVV 176 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 177 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGALNPLK 223 Query: 389 ICNKAKSQGIRIMTIAFS-----------VNKTQQEKA-RYFLSNCA--SPNSFFEANST 434 AK++G+R+ TIAF + + L A + FF A T Sbjct: 224 AAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDT 283 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 284 EELAGIYAE 292 >gi|78049050|ref|YP_365225.1| hypothetical protein XCV3494 [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037480|emb|CAJ25225.1| putative membrane protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 451 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 66/189 (34%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 234 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLSDSVV 289 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 290 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLNPLK 336 Query: 389 ICNKAKSQGIRIMTIAFS-----------VNKTQQEKA-RYFLSNCA--SPNSFFEANST 434 AK++G+R+ TIAF + + L A + FF A T Sbjct: 337 AAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDT 396 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 397 EELAGIYAE 405 >gi|312877126|ref|ZP_07737097.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311796100|gb|EFR12458.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 900 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 40/176 (22%), Positives = 61/176 (34%), Gaps = 29/176 (16%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I K + + A A ++ ++ D V G F+ F + + V Sbjct: 426 IPKLEIAKSASAKMVEHLESSDGV------GVIAFDHNYYWAYKFG---KLVRKEDVIES 476 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TAI + A T+ K + K +VLLTDG Q E Sbjct: 477 ISSIEVGGGTAIIPPLSEAVKTL-----------KKSKAKNKLVVLLTDGMGEQSGYEIP 525 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 A ++AK I+I TI LS A + F+ ++ EL +F Sbjct: 526 A--DEAKRNNIKITTIGVGKFVNA-----SVLSWIADYTSGRFYLVSNPSELVDVF 574 >gi|325922265|ref|ZP_08184046.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] gi|325547218|gb|EGD18291.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] Length = 335 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 27/162 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R + + + + + + TAI DA+ + + + Sbjct: 144 RVGLLVFGQRAYALTPLTADLTSVRDQLADSVV--GLAGRETAIGDAIALSVKRLREQKQ 201 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--------- 406 ++ +VLLTDG NT + AK++G+R+ TIAF Sbjct: 202 -----------GQRVVVLLTDGVNTAGVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFG 250 Query: 407 --VNKTQQEKARYF-LSNCA--SPNSFFEANSTHELNKIFRD 443 + + L A + FF A T EL I+ + Sbjct: 251 VPIPAGGNDDIDEAGLRKIAEQTGGRFFRARDTEELAGIYAE 292 >gi|325917650|ref|ZP_08179844.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] gi|325536114|gb|EGD07916.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] Length = 335 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 27/162 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R + + + + + + TAI DA+ + + + Sbjct: 144 RVGLLVFGQRAYALTPLTADLTSVRDQLADSVV--GLAGRETAIGDAIALSVKRLREQKQ 201 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--------- 406 ++ +VLLTDG NT + AK++G+R+ TIAF Sbjct: 202 -----------GQRVVVLLTDGVNTAGVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFG 250 Query: 407 --VNKTQQEKARYF-LSNCA--SPNSFFEANSTHELNKIFRD 443 + + L A + FF A T EL I+ + Sbjct: 251 VPIPAGGNDDIDEAGLRKIAEQTGGRFFRARDTEELAGIYAE 292 >gi|89889805|ref|ZP_01201316.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] gi|89518078|gb|EAS20734.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] Length = 337 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 58/194 (29%), Gaps = 39/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + A I K + R+G + + + +R I + Sbjct: 116 RLEATKKVAADFI----KGRPND---RIGVVVYAGESYTKTPITTDEMISLRAINEIAFD 168 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G TAI + TA + + K++ K I+L+TDG N + Sbjct: 169 GVLENG-TAIGMGLATAVNRL-----------KDSEALSKVIILMTDGVNNSGFIDPKIA 216 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKA------------------RYFLSNCASP--NSFF 429 A I++ TI N + A +F Sbjct: 217 SELALEYDIKVYTIGIGTNGNAPSPVAQIGRNKFRMAMMPVEIDEELMKQIAVDTGGKYF 276 Query: 430 EANSTHELNKIFRD 443 A + +L +I+ + Sbjct: 277 RATNNKKLEEIYGE 290 >gi|32471725|ref|NP_864718.1| hypothetical protein RB2055 [Rhodopirellula baltica SH 1] gi|32397096|emb|CAD72400.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 402 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 62/447 (13%), Positives = 139/447 (31%), Gaps = 85/447 (19%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K +G ++ +++PV+L + +++V + A+ A IQ+ ++ Sbjct: 34 KQRSGAVIVLLVIMLPVLLILAAYVINVAYVEAVTADSQVVTDAAVCAAGRVYIQTGDKN 93 Query: 76 SSRAKNSFTFPKQKIE------EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ A + + F +L+++ D E Sbjct: 94 AALAAARDAAERNPVAGKVVPINMSDLEFGISLRESL-DEGYSFQPLSDDDEFGNAVRLT 152 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + LS+ + +PL + + + V+ + + + VID S SM Sbjct: 153 TLSLSNSPQPVFSPL--------FPTMGTNLEIRPQRVA--VSTQSTMDVALVIDRSGSM 202 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 ++ +N P T Y P+ Sbjct: 203 AYANDEAPDPYVNPAAAPPGWT---------------------------------YGDPV 229 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P+ V S +A + + ++ ++D D Sbjct: 230 PPNSRWLDLVASV-------------NAFNGFLA------DSPQYEKLCLATYSDNASRD 270 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + ++ + + G T++ ++ + D H K + + Sbjct: 271 CDLTHTYAEISNQLDAISY--QFNGGGTSVGYGLEHGLAVLT----DATHARKFAV---R 321 Query: 370 YIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +VL+TDG NT + E + ++ G+ + TI FS + Q + + C Sbjct: 322 VMVLMTDGHHNTGKSPESMTY--HLQNHGVTLFTITFS-DDADQSRMSNLANACG--GEN 376 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIR 455 F A +L F+ +I ++ + + Sbjct: 377 FHATDASQLQNAFQ-KIAKKLPSLMTQ 402 >gi|119358220|ref|YP_912864.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355569|gb|ABL66440.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 344 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 50/117 (42%), Gaps = 12/117 (10%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ + + + H + T++ + + + TAI A+ A + + +S Sbjct: 144 RIGLVVFSGKGYTPCPLTLD-HLTLGTVLDNISSEVIQEEGTAIGTAILIAVNRLRAS-- 200 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 +K I+LLTDG+N + + + A GI+I TIA + + Sbjct: 201 ---------ESRQKAIILLTDGQNNAGDIDPLTAAGFALQDGIKIYTIAATAQDARP 248 >gi|152990152|ref|YP_001355874.1| von Willebrand factor A [Nitratiruptor sp. SB155-2] gi|151422013|dbj|BAF69517.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 305 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 80/216 (37%), Gaps = 28/216 (12%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 MV K Y ++ S+ E+ F + + K +VR + + I ++ D Sbjct: 72 IDNMVHLKKKGYDIVLAIDASGSMQEKGFDPTDPQK---TKFDVVRSLVKAFISK-RRND 127 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N +G F ++ + ++ I+ I TAI+DA+ + Sbjct: 128 N------IGVVIFGSFAYIASPLTFN-KEAVKKILDYLDIGVAGSK-TAIDDALIESVRL 179 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + K + K ++LLTDG +T AK G++I TI +K Sbjct: 180 L-----------KESQAKSKIVILLTDGIDTASKTPPDVAVKMAKKYGVKIYTIGIG-DK 227 Query: 410 TQQEKARYFLSNCASPNS--FFEANSTHELNKIFRD 443 ++A FL A +F A L KI+ + Sbjct: 228 RGIDEA--FLRWLAQQGHGYYFYAKDASMLRKIYDE 261 >gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 900 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 40/176 (22%), Positives = 61/176 (34%), Gaps = 29/176 (16%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I K + + A A ++ ++ D V G F+ F + + V Sbjct: 426 IPKLEIAKSASAKMVEHLESSDGV------GVIAFDHNYYWAYKFG---KLVRKEDVIES 476 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TAI + A T+ K + K +VLLTDG Q E Sbjct: 477 ISSIEVGGGTAIIPPLSEAVKTL-----------KKSKAKNKLVVLLTDGMGEQSGYEIP 525 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 A ++AK I+I TI LS A + F+ ++ EL +F Sbjct: 526 A--DEAKRNNIKITTIGVGKFVNA-----SVLSWIAAYTSGRFYLVSNPSELVDVF 574 >gi|330963348|gb|EGH63608.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. actinidiae str. M302091] Length = 352 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 52/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTAVGDAIGLALKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + + +VL+TDG N + I A +G++I I + + Sbjct: 193 NS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 280 >gi|157962424|ref|YP_001502458.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157847424|gb|ABV87923.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 336 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 52/162 (32%), Gaps = 28/162 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + + K I TAI +A+ + Sbjct: 129 RIGLILFADHAYLQSPLTQDRRSVAQYL-KEAQIGL-VGKQTAIGEAIALGVKRFDKVEQ 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N A +GI I TI + ++ Sbjct: 187 SN-----------RVLILLTDGSNNAGAITPEQASQIAAQRGITIYTIGVGADVMERRTL 235 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + L A + +F A +T EL +I++ Sbjct: 236 FGKERVNPSMDLDESQLQEIAKVTGGQYFRARNTEELEQIYQ 277 >gi|332285111|ref|YP_004417022.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] gi|330429064|gb|AEC20398.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] Length = 342 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 51/279 (18%), Positives = 86/279 (30%), Gaps = 29/279 (10%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + F +M S +P G R + + P + Sbjct: 33 LRLPFFDTMAQLTGKSPSRPAAPIGGMQRWLNIVVWLLLVVTLARPQWVEPPLTHTEPVR 92 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 +L LD S S + + + + +V+ +A I K+ D+ R+G Sbjct: 93 DILL---ALDISQSMDSQDFRDAQDRQVSRWTVVKAVVADFID--KRTDD-----RLGLI 142 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F + L + T TAI DA+ + S+ E Sbjct: 143 VFGTGAFPQAPLTRDHKSLRLLLDHTAVGMAGPN--TAIGDAIGMGIRMLDSAQER---- 196 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-- 418 K ++LLTDG +T + N A + + TI + F Sbjct: 197 -------DKVLILLTDGNDTGSAVPPLRAANLAAQHHVTVHTIGIGSPTASGDDQVDFDT 249 Query: 419 LSNC--ASPNSFFEANSTHELNKIFR--DRIGNEIFERV 453 L AS FF+A L+ ++ DRI + + Sbjct: 250 LRGISSASGGQFFQAQDGAALHDVYATLDRITPREVKTL 288 >gi|327403932|ref|YP_004344770.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319440|gb|AEA43932.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 341 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 63/217 (29%), Gaps = 39/217 (17%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 Y +D LS + + + + + + S +K D R+G + Sbjct: 98 YKNGIDIILSIDASGSMLAQDFDPNRLEVAKRVAKKFVDS-RKGD------RVGLVVYEG 150 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + L I TAI + A + S + Sbjct: 151 EAYTACPATLDYKLLKEQISAIEPGHLEP--GTAIGSGLGVAVTRLRSDS---------- 198 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR-------- 416 K I+LLTDG + E + + AK++ R+ TI + Sbjct: 199 -LISKVIILLTDGSSNTGP-EPLEVAELAKAKKCRVYTIGVGADGMAPTPVNTPFGVVYQ 256 Query: 417 --------YFLSNCAS--PNSFFEANSTHELNKIFRD 443 L AS +F A L KI+ + Sbjct: 257 NLPVEIDEGVLKEIASATNGKYFRAQDEKSLEKIYAE 293 >gi|218528855|ref|YP_002419671.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254559548|ref|YP_003066643.1| hypothetical protein METDI1003 [Methylobacterium extorquens DM4] gi|218521158|gb|ACK81743.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254266826|emb|CAX22625.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens DM4] Length = 339 Score = 67.3 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 32/151 (21%), Positives = 50/151 (33%), Gaps = 11/151 (7%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D+ + ++ + RT+ A ST I D + A + + Sbjct: 143 RIGLVEFADQAYVAAAPTFDTATVARTL--EEATIGLVGRSTGIGDGLGLALKRLAPAQV 200 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--QE 413 K +VLL+DG N + AK G+R+ TIA Sbjct: 201 AAADGEGPPPARDKVVVLLSDGANNAGQTAPKDVAALAKDLGVRVYTIALGPIDMADNPN 260 Query: 414 KARYF-----LSNCA--SPNSFFEANSTHEL 437 + L A S F +T +L Sbjct: 261 NEQDVVDVETLRAMAETSGGRAFRVKTTDDL 291 >gi|163850298|ref|YP_001638341.1| von Willebrand factor type A [Methylobacterium extorquens PA1] gi|163661903|gb|ABY29270.1| von Willebrand factor type A [Methylobacterium extorquens PA1] Length = 339 Score = 67.3 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 32/151 (21%), Positives = 50/151 (33%), Gaps = 11/151 (7%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D+ + ++ + RT+ A ST I D + A + + Sbjct: 143 RIGLVEFADQAYVAAAPTFDTATVARTL--EEATIGLVGRSTGIGDGLGLALKRLAPAQV 200 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--QE 413 K +VLL+DG N + AK G+R+ TIA Sbjct: 201 AAADGEGPPPARDKVVVLLSDGANNAGQTAPKDVAALAKDLGVRVYTIALGPIDMADNPN 260 Query: 414 KARYF-----LSNCA--SPNSFFEANSTHEL 437 + L A S F +T +L Sbjct: 261 NEQDVVDVETLRAMAETSGGRAFRVKTTDDL 291 >gi|110598614|ref|ZP_01386881.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339783|gb|EAT58291.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 336 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 51/156 (32%), Gaps = 26/156 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + + H ++ ++ + + TAI A+ A + Sbjct: 140 RIGLVVFRGKGYTQCPLTID-HDVLAMLIDHISPQVIQDEGTAIGSAILIATN------- 191 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI--AFSV------ 407 R K + +K I+L+TDGEN + A GIRI + F Sbjct: 192 ----RFKGSTSLQKVIILITDGENNTGDVGPATAATLAAQNGIRIYVVNAGFKSGGSAGN 247 Query: 408 ----NKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + L A + +F A L Sbjct: 248 LSAESSAHAAMDEASLRGIARTTGGGYFRAEDPSVL 283 >gi|21232653|ref|NP_638570.1| hypothetical protein XCC3224 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767265|ref|YP_242027.1| hypothetical protein XC_0933 [Xanthomonas campestris pv. campestris str. 8004] gi|21114459|gb|AAM42494.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572597|gb|AAY48007.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 335 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 70/192 (36%), Gaps = 34/192 (17%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +V+ + + L+ + ++ D R+G F R + + + + + Sbjct: 121 NVVDRLTAAKAVLSDFLDR-REGD------RVGLLVFGQRAYALTPLTADLTSVRDQLAD 173 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + TAI DA+ + + + ++ +VLLTDG NT Sbjct: 174 SVV--GLAGRETAIGDAIALSVKRLREQRQ-----------GQRVVVLLTDGVNTAGVLN 220 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFEA 431 + AK++G+R+ TIAF + + + L A + FF A Sbjct: 221 PLKAAELAKAEGVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRA 280 Query: 432 NSTHELNKIFRD 443 T EL I+ + Sbjct: 281 RDTEELAGIYAE 292 >gi|114762302|ref|ZP_01441760.1| von Willebrand factor type A domain protein [Pelagibaca bermudensis HTCC2601] gi|114544920|gb|EAU47924.1| von Willebrand factor type A domain protein [Roseovarius sp. HTCC2601] Length = 335 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 66/202 (32%), Gaps = 25/202 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + ++ I++ VR+ + + + + RM F Sbjct: 99 AIDISGSMDARDFATPEGERIQRLSGVREVVRAFVS-------GREGDRMALIVFGTSAY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + +I + +T M + + + + + + Sbjct: 152 LQAPLTDDLETIIALLDRTEV-------------GMAGPHTALGDAIGLSIRTFETSEID 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 ++ ++LL+DG +T + A +G+ I TI E L A Sbjct: 199 QRLLILLSDGSDTASRMSPVNAAEIAADRGVEIYTIGVGDPDATGENRVDLTTLKEVAQR 258 Query: 424 SPNSFFEANSTHELNKIFRDRI 445 + +F A L ++ DRI Sbjct: 259 TGGQYFFAEDAASLEAVY-DRI 279 >gi|260769474|ref|ZP_05878407.1| protein BatA [Vibrio furnissii CIP 102972] gi|260614812|gb|EEX39998.1| protein BatA [Vibrio furnissii CIP 102972] gi|315182004|gb|ADT88917.1| von Willebrand factor type A domain protein [Vibrio furnissii NCTC 11218] Length = 322 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 41/248 (16%), Positives = 77/248 (31%), Gaps = 44/248 (17%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS---------SLRHVI 268 Q G+ + + + ++Y VD S S I Sbjct: 50 QWFTKGLVIALWIGLVTAAARPVWYGDPVTTSPKHRDMMLVVDLSYSMSQQDMKSGDQFI 109 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ L+ I ++ D R+G FF D + + + + + A Sbjct: 110 DRLSAVKQVLSDFIAK-RQGD------RLGLIFFADHAYLQTPLTLDRQTIAQQLNQ--A 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI + + A T I S ++ ++LL+DG NT + + Sbjct: 161 VLRLIGTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTSGVLDPME 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEANS 433 AK I T+ + ++ L A + +F A + Sbjct: 210 AAKIAKKYHTTIYTVGVGAGEMMVKEFFMTRKINTAEDLDEKTLQAIADETGGQYFRARN 269 Query: 434 THELNKIF 441 +L I+ Sbjct: 270 QQDLQHIY 277 >gi|289667993|ref|ZP_06489068.1| hypothetical protein XcampmN_05693 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 310 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 65/189 (34%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 96 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLRDSVV 151 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 152 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLNPLK 198 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQ-------QEKARY-----FLSNCA--SPNSFFEANST 434 AK++G+R+ TIAF + L A + FF A T Sbjct: 199 AAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRARDT 258 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 259 EELAGIYAE 267 >gi|289662175|ref|ZP_06483756.1| hypothetical protein XcampvN_03493 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 335 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 65/189 (34%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 121 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLRDSVV 176 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 177 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLNPLK 223 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQ-------QEKARY-----FLSNCA--SPNSFFEANST 434 AK++G+R+ TIAF + L A + FF A T Sbjct: 224 AAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRARDT 283 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 284 EELAGIYAE 292 >gi|254505681|ref|ZP_05117827.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] gi|219551334|gb|EED28313.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] Length = 322 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 61/190 (32%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + V+ ++ S ++ D R+G F D + + + + Sbjct: 108 YVDRLTAVKKVVSD-FASKREGD------RLGLVLFADHAYLQTPLTLDRKTIAEQVNQL 160 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 E TAI + + A T + S ++ +VLL+DG NT + Sbjct: 161 VLRLIGEK--TAIGEGIGLATKTFVDS-----------DAPQRVMVLLSDGSNTSGVLDP 207 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 + AK I TI + Q+ L + A + +F A Sbjct: 208 LEAAKIAKKYNATIYTIGVGAGEMVVKEFFMTRKVNTAQDLDERTLMDIAQVTGGQYFRA 267 Query: 432 NSTHELNKIF 441 EL I+ Sbjct: 268 RDAKELATIY 277 >gi|224370036|ref|YP_002604200.1| hypothetical protein HRM2_29490 [Desulfobacterium autotrophicum HRM2] gi|223692753|gb|ACN16036.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 598 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 51/144 (35%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + ++ I+ R G F R I + H + Sbjct: 109 TRLERAKREIIDLMGMIQSD-------RAGLVAFAGRAILQCPLTLD-HSAFNLFLNALE 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D +G T + A++TA + E E K I+L+TDGENT + I Sbjct: 161 PDYLPVGGTDLGGAIETALNGFEKEVESE-----------KAIILITDGENTTGDS--IE 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 + KA QG++I I + Sbjct: 208 MAKKAADQGVKIFCIGVGSPEGAP 231 >gi|330973664|gb|EGH73730.1| von Willebrand factor, type A [Pseudomonas syringae pv. aceris str. M302273PT] Length = 352 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 51/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEARIGI-AGKNTALGDAIGLALKRLRMRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + + A +G++I I + + Sbjct: 193 TS-----------RALVLVTDGANNAGQIDPVTAARLAAEEGVKIYAIGIGSDPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|66044963|ref|YP_234804.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] gi|63255670|gb|AAY36766.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] Length = 352 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 51/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEARIGI-AGKNTALGDAIGLALKRLRMRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + + A +G++I I + + Sbjct: 193 TS-----------RALVLVTDGANNAGQIDPVTAARLAAEEGVKIYAIGIGSDPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|327313515|ref|YP_004328952.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326945266|gb|AEA21151.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 318 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 34/154 (22%), Positives = 59/154 (38%), Gaps = 18/154 (11%) Query: 297 MGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G T F + + + L+ + TAI + A + Sbjct: 129 IGLTIFAGEAFTQCPMTLDHAALLNLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRL--- 185 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 K++ K ++LLTDG N + + A+ GIRI TI F ++ Sbjct: 186 --------KDSKAKSKIVILLTDGSNNAGSISPMTAAAIARKFGIRIYTIGFGKETGEEI 237 Query: 414 KARYF--LSNCA--SPNSFFEANSTHELNKIFRD 443 A + L + A + F+ A S EL++I++D Sbjct: 238 GAIDYKTLQDIAVSTNGEFYRAQSQAELSRIYQD 271 >gi|146295744|ref|YP_001179515.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409320|gb|ABP66324.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 909 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 49/290 (16%), Positives = 87/290 (30%), Gaps = 40/290 (13%) Query: 165 ETVSRSYHKEHGVSIQWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S Y ++ + +D S + S+ Q + F +R VK + Sbjct: 312 DSYSADYDMVISDNVDFGLDRLMQYSFVVLCNVSKNQLTDKFLDDCERYVKDLGGGLLVI 371 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV---------DSSSLRHVIKKKHL 273 G + + + + V S I K + Sbjct: 372 GGENSYALGNYSNSVLEKMLPVKMQLKNKEKERNVAVVLVIDHSGSMGGSNLRNINKLEI 431 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A A +I ++ D+V G F+ +G K +++ + Sbjct: 432 AKSAAAKMIDHLESSDSV------GVIAFDHNFYWASK--FGKLKSKNEVIENIST-IQV 482 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G TAI + A + + K + K IVLLTD + + A Sbjct: 483 GGGTAIIPPLTEAVNLL-----------KKSKAKDKVIVLLTD--GYGEEGGYEYPASIA 529 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 K I+I TI + LS A + F+ L +F Sbjct: 530 KRNNIKITTIGVGSSINAP-----ILSWMAAYTSGRFYYVKDASNLIDVF 574 >gi|119383876|ref|YP_914932.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119373643|gb|ABL69236.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 282 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 36/182 (19%), Positives = 66/182 (36%), Gaps = 28/182 (15%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I + V+ + + + ++ D R+G F DR ++ V + R + + Sbjct: 103 QPISRLDAVKRTASRFVAA-RRGD------RIGLVIFGDRAYFAQPLTFDVDAVARAVDE 155 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A +TAI+D + A + +S + +VL++DG +T N + Sbjct: 156 --AQIGISGRATAISDGLGLAMKRLAAS-----------EAPTRVVVLMSDGVDTSGNVQ 202 Query: 386 GIAICNKAKSQGIRIMTIAFS------VNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + A GIRI TIA +++ L A S + F +L Sbjct: 203 AVDAARLAAGHGIRIHTIALGPEDLENQPRSRDAVDTKTLREVAELSGGTAFRVRGMADL 262 Query: 438 NK 439 Sbjct: 263 EA 264 >gi|302188504|ref|ZP_07265177.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. syringae 642] Length = 352 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRIWLDEARIGI-AGKNTALGDAIGLALKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 193 TS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|119946440|ref|YP_944120.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] gi|119865044|gb|ABM04521.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] Length = 327 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 57/162 (35%), Gaps = 28/162 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D ++ + + + + ++ TAI +++ A Sbjct: 131 RLGLILFADHAYLQTPLTFDLKTIQQMVDESEI--GLAGTRTAIGESIAMA--------- 179 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 + R N ++ ++L++DG N + E I +A I I TI + Sbjct: 180 --IKRFVENKNEQRVLILVSDGANNSGSIEPIQAAKQAAKNNITIYTIGMGAEQMIKRGL 237 Query: 411 --------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + L+ A + +F A + EL I++ Sbjct: 238 FGNQRINPSADLDEKTLTEIANLTGGKYFRARNQTELQNIYQ 279 >gi|83816834|ref|YP_446668.1| von Willebrand factor type A domain-containing protein [Salinibacter ruber DSM 13855] gi|83758228|gb|ABC46341.1| von Willebrand factor type A domain protein [Salinibacter ruber DSM 13855] Length = 289 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 62/199 (31%), Gaps = 37/199 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + R+A + + + R+G F + + L R + Sbjct: 71 TRFEAAREAAGAFVE-------GRVSDRVGLIVFAAEAYTQAPLTLDYSFLQRMLEDVEV 123 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ A+ TA + + K++ K +LLTDG N + + Sbjct: 124 GAVED--GTAVGTALATAVNRL-----------KDSEAESKVAILLTDGRNNRGQIDPRT 170 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN-----------------CASPNSFFEA 431 A++ G+R+ I ++ + ++ +F A Sbjct: 171 AAEVARTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESAGVDAEMLRSVSVSTGGQYFSA 230 Query: 432 NSTHELNKIFRDRIGNEIF 450 + L +I+ + E Sbjct: 231 TNRDALERIYAEIDTMEAT 249 >gi|291401974|ref|XP_002717657.1| PREDICTED: inter-alpha trypsin inhibitor heavy chain precursor 5 [Oryctolagus cuniculus] Length = 940 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 51/230 (22%), Positives = 78/230 (33%), Gaps = 23/230 (10%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + E+ + N + P L P FV SS V K +DAL +++ Sbjct: 264 VNREQSIGEIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGAKLRQTKDALFTILH 323 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTA 338 ++ D R F++R+ W + + T K + + G T Sbjct: 324 DLRPQD------RFNIIGFSNRIKV-----WKDNLISVTPNSIRDGKIYIHHMSPTGGTD 372 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG- 397 IN A+QTA + N H + IV LTDG+ T + I N K Sbjct: 373 INGALQTAIRLL---NNYVAHNDIEDRSVS-LIVFLTDGKPTVGETHTLKILNNTKEAAQ 428 Query: 398 --IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + I TI + + + L NC E + F D I Sbjct: 429 GRVCIFTIGIGNDVDFKLLEKLSLENCGLTRRVHEEENAGAQLIGFYDEI 478 >gi|294627092|ref|ZP_06705680.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292598525|gb|EFF42674.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 451 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 66/189 (34%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 234 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLRDSVV 289 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 290 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLNPLK 336 Query: 389 ICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFEANST 434 AK++G+R+ TIAF + + L A + FF A T Sbjct: 337 AAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRARDT 396 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 397 EELAGIYAE 405 >gi|294664114|ref|ZP_06729507.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292606114|gb|EFF49372.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 451 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 66/189 (34%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 234 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLRDSVV 289 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 290 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLNPLK 336 Query: 389 ICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFEANST 434 AK++G+R+ TIAF + + L A + FF A T Sbjct: 337 AAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRARDT 396 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 397 EELAGIYAE 405 >gi|116624819|ref|YP_826975.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227981|gb|ABJ86690.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 837 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 51/241 (21%), Positives = 82/241 (34%), Gaps = 34/241 (14%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 G+ I +K + +SL L P P P + + S +K L R A Sbjct: 360 GEHNIYVDKKGKPEDALERSLPAKLAP-PRSPEGTAVVLIIDKSSSMEGRKIELARLAAI 418 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 V+ +++ ID+V G F++ R +K G T I Sbjct: 419 GVVENLRPIDSV------GVLIFDNSFQWAVPIR---KAEDRATIKKLISGITPDGGTQI 469 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+ AY I+ H IVLLTDG + + + + +A++ + Sbjct: 470 APALTEAYQRILPQTAMYKH-----------IVLLTDGISEEGDSMTLT--KEAQANHVT 516 Query: 400 IMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHE----LNKIFRDRIGNEIFERV 453 I T+ Q+ R FL AS + N L + + G E+ Sbjct: 517 ISTVGL-----GQDVNRAFLEKVASNADGKAYFLNDPSGLEQLLLRDVEEHTGVTAVEKA 571 Query: 454 I 454 I Sbjct: 572 I 572 >gi|37680183|ref|NP_934792.1| hypothetical protein VV1999 [Vibrio vulnificus YJ016] gi|37198930|dbj|BAC94763.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 481 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 66/491 (13%), Positives = 149/491 (30%), Gaps = 67/491 (13%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F ++ TG + +L+ ML ++DV+R + + A A+ + Sbjct: 9 MRFTKGDNMRKQTGGISVFMLVLLMSMLVFAAWVMDVMRIYSVHNQMANATDAAL---AS 65 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK-----------------NFTDR 109 +I + E ++ Y+ +L+ N + Sbjct: 66 AIISEVPESTAVELLHANLTSGAASPYVEEVRLTHLRDEQEESLQVVLDFVPNSLNIAAQ 125 Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET--- 166 E I + ++ K+ +L + P++ ++ + + + Sbjct: 126 ESVPIRTNAKAGISSNKAEIVFMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYV 185 Query: 167 -----VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 S + + I ++R Q S + Sbjct: 186 VSIVPASGNVNTGPMEEIYL--------GSFRRYDHAQVKRENRWSDMFDRASGRTPAVP 237 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 R+ N L + +S + I K ++ + Sbjct: 238 GRQRNAMCRDLDFEGNNPATLGLRYFRNLEKAPQFASNNSKRIIRPIHKPAVLHFDDGTP 297 Query: 282 IRS--IKKIDNVNDT---VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + + N+ A F + +P + R ++ + Sbjct: 298 LDPPVYPSTNPSNNYRPFHEDKAIFDDIECHVNPIVPFITE---RRHFESTVQRLVPGMN 354 Query: 337 TAINDAMQTA-------YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NEE 385 T + M A + I E+ R ++ + KY+V+ +DG + D +++ Sbjct: 355 TNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRYSDETSNKYLVMFSDGNHLIDPAFRDKK 414 Query: 386 GIAICNKAKS--QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 IC + K +G+++MT+ F ++ + +CAS ++ S + K+F Sbjct: 415 MKLICTQLKQPGRGVKVMTVNFGGAASE-----RLMQSCASGPEYYHVASLFSVEKVF-- 467 Query: 444 RIGNEIFERVI 454 +I E+VI Sbjct: 468 ---EQIAEQVI 475 >gi|332185631|ref|ZP_08387379.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] gi|332014609|gb|EGI56666.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] Length = 420 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 68/464 (14%), Positives = 134/464 (28%), Gaps = 73/464 (15%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 ++ AL +PV+ GM VD R A S + + Sbjct: 1 MMFALALPVLTCSIGMGVDYAR--------------AAKAQSKLNAIADAAALLAVSKNA 46 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT-AVEMNPRKSAYQVVLSSRYDLLLN 142 Y R+F + + + T + VV + Sbjct: 47 MRADDATAAYFARSFFSLQSAALVKSDGITLSNVTVQAPTDGNGRRTAVV-----NYRAT 101 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 ++F R +G+ + I K+E + ++D S SM S + Sbjct: 102 SENVFARILGMSTLTISGKSETANAIAPD-----IDFYMLLDVSASMALPTTSSGLNKV- 155 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 ++ + ++ G+ R P +D + + ++ Sbjct: 156 ---AQSNTSRCVFACHTGEKRFRGYDAHGKQTDLYGVALSYGLPLRIDAEGDAVNQLTAT 212 Query: 263 SLRHVIKKKHLVRDALASVIRS--IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + K R A+ + + + + T + + + Sbjct: 213 ARSMASKNGSDYRIAITTFRGARGFSVRQPLTND----LTAAGHKAANLKPPYYASIGCP 268 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + K+ + N+ T +DAM I N + + + ++TDG Sbjct: 269 TSACKSSEVGWNDR-DTGSSDAMDQINAMIPQPGSGV-----NGQDPQAVVFMVTDGMRN 322 Query: 381 QDNEEG--------IAICNKAKSQGIRI---MTIAFSVNKTQQEKARY-----------F 418 + + +G A C+ K +GIRI T Sbjct: 323 EKSPKGARPEVAFDTAKCDMIKHRGIRIAVLYTEYLRDAVKGTTNLERSVEPYLYQVEPA 382 Query: 419 LSNCASPNSFFEANS----THELNKIFRDRIGNEIFERVIRITK 458 L +CASP + + + + LN +F++ + RITK Sbjct: 383 LQSCASPGLYTKVTTDGDISAALNTLFQNAVA------TTRITK 420 >gi|148256121|ref|YP_001240706.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] gi|146408294|gb|ABQ36800.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] Length = 602 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 87/258 (33%), Gaps = 27/258 (10%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 S++ +G+ + A+ + +L G +D R + A++ A + + S L Q Sbjct: 25 SRRFSGDISGNIATLFAIALLPILAFIGAAIDYSRANAARSAMQGALDSTALMLSRDLSQ 84 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + A + T+ K +T + + + + + SA Sbjct: 85 GTITAADVAAKASTYFK----------------ALYTSTDAQSVAVTASYTASTSSSASN 128 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 + L++ + ++ F++ +G + TKA + + + +D + SM Sbjct: 129 IQLNASGQI----VTQFMKLVGFPTMTFNTKATTTWG------DVKMRVALALDNTGSMA 178 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + + Q + + + G V I + +V+ S Y + D Sbjct: 179 YSGKMTALQNAVAGSGGLIDQLSALAKSPGDVYISLIPFAK-VVNVGASNYAQSWIDWTD 237 Query: 251 PSLSEEHFVDSSSLRHVI 268 ++ S + I Sbjct: 238 WQNPPTIQPNNGSYQAAI 255 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 63/452 (13%), Positives = 128/452 (28%), Gaps = 52/452 (11%) Query: 37 GGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIR 96 + +D Y + + + S S P K+ Sbjct: 167 VALALDNTGSMAYSGKMTALQNAVAGSGGLIDQLSALAKSPGDVYISLIPFAKVVNVGAS 226 Query: 97 NFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL------SSRYDLLLNPLS-LFLR 149 N+ + + + P +YQ + S++D++ S F Sbjct: 227 NYAQSWIDWTDWQNPP--------TIQPNNGSYQAAIPNASFTQSQWDMVGPGSSCPFTS 278 Query: 150 SMGIKSWL-IQTKAEAETVSRSYHKEHGVSIQWVID-FSRSMLDYQRDSEGQPLNCFGQP 207 G + A A + + S S S Y + Sbjct: 279 GNGFPYFSCTSGPATASSSASKVPSSGSYSGYICPGYDSASHSYYNGCWNSVQNSTRVNW 338 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + S + V + S + + + + + V + + Sbjct: 339 CTGSYCSCPTTGSNVPNNTCSCTGSGSSTVCKVNTFTHTWIANATSTWTGCVADRTQPND 398 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +A++ + + ++N A + ++ S+ T +K+ Sbjct: 399 ANAVSPASSDVATLFPANQHMENNVQYCSSSA---STKLGQIVPLSYN-----WTSLKSA 450 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----- 382 G T M A ++I + N + I+LL+DG NT+D Sbjct: 451 VNAMEPTGGTNQAIGMAWAVQSLIPNGVLGAPAEDANTTYNRVIILLSDGLNTEDRWPDY 510 Query: 383 -NEEGIA-----------ICNKAKSQG-------IRIMTIAFSVNKTQQEKARYFLSNCA 423 N A +C+ K+ I TI + + + L NCA Sbjct: 511 GNGSTQASGNPIDARQALLCSNLKNTKDSKGNAMYTIYTIQVN-TSSPADPTSTVLQNCA 569 Query: 424 -SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 SP+ F+ S+ ++ F + IG + + + Sbjct: 570 SSPDKFYMLTSSSQIVTTF-NSIGTALSKLRV 600 >gi|91216721|ref|ZP_01253686.1| batA protein [Psychroflexus torquis ATCC 700755] gi|91185190|gb|EAS71568.1| batA protein [Psychroflexus torquis ATCC 700755] Length = 334 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 56/194 (28%), Gaps = 39/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ + I + R+G + + + + I Sbjct: 113 RLEALKKVAINFIEG----RPND---RIGLVIYAGESYTKTPLTTDKSIIFNAI-NDLEY 164 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +N G TAI + T+ + + K++ K I+LLTDGEN + Sbjct: 165 SQNIEGGTAIGMGLATSVNKL-----------KDSKAESKVIILLTDGENNAGFIDPKTA 213 Query: 390 CNKAKSQGIRIMTIAFSVNKTQ------QEKAR------------YFLSNCASP--NSFF 429 A I+ TI N + + L A +F Sbjct: 214 TQLATEYDIKTYTIGVGSNGMALSPVGIKANGQFEYRNIEVKIDEALLKTIAESNGGKYF 273 Query: 430 EANSTHELNKIFRD 443 A + I+ + Sbjct: 274 RATDNQKFEAIYEE 287 >gi|114704798|ref|ZP_01437706.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] gi|114539583|gb|EAU42703.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] Length = 545 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 30/132 (22%), Positives = 54/132 (40%), Gaps = 20/132 (15%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD- 382 V+T G+T + +Q + + ++ ++ E +K +++LTDG NTQ+ Sbjct: 410 VETAVNKLTPSGNTNVTIGVQWGMEALTAAAPLT--GVRTGSEVRKVMIVLTDGLNTQNR 467 Query: 383 ----------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEA 431 + +A CN AK+ GI + T+ E L CA + + + Sbjct: 468 WWGSRDRNKIDARTLAACNNAKAMGIELYTVRLV------EGNEDLLKTCAETEDKYHYV 521 Query: 432 NSTHELNKIFRD 443 S +L F D Sbjct: 522 TSASQLKTTFAD 533 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 66/196 (33%), Gaps = 37/196 (18%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 V + + K+ G F +IT L + ++ G VD+ ++A++ A T+ Sbjct: 19 VMTSVIKEFLKQYRDDRRGQFAVITCLALVPLIAAAGGAVDLWNARRVQNAVQNAVDTSA 78 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + A + E RA F + + DT + Sbjct: 79 LAAVSYSGEEQTEREKRADTL-----------------------FLNNTAGIAIEDTDLS 115 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 Y+ + Y + N FLR +GI + ++++ A + + Sbjct: 116 EEDGAWVYK----AEYKIKTN----FLRVVGIDEFEMESQGAAALA------NSPMDVVL 161 Query: 182 VIDFSRSMLDYQRDSE 197 V+D S SM R E Sbjct: 162 VLDSSGSMAQDNRMVE 177 >gi|307133505|dbj|BAJ19017.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 79/515 (15%), Positives = 153/515 (29%), Gaps = 77/515 (14%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 TK + K+ K+ G + IITALL +L VD + L QA A + Sbjct: 9 TKLLSTVKQFSKNEHGVYAIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLL 68 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR--------------- 109 Q + +Q+IE ++ + R Sbjct: 69 IAEDNQYRKNKDHSDVKRQNVSQQEIEREGRNFSNAKVQAQWKKRNQELVQGVCKLYLRS 128 Query: 110 -EVRDIVRDTAVEM----------------NPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 + + + V + N +A V + + + +++ Sbjct: 129 DDSKGQKNSSPVTIKEPFLAECLEEKTQPKNQNGTAKSVACVVQGSVQRKFWLPWGQTLV 188 Query: 153 IKSWL----IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY-QRDSEGQPLNCFGQP 207 + L + + + + + V D S SM Y R + N Sbjct: 189 SSNQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSGSMKWYIDRKGDAHKPNRRIDA 248 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 V + EK + L + D S + S Sbjct: 249 LVEVVGEVQNILVTPRKIKEKCESLQSNGLLYLLQQVLAKKGDTSGCVLPYYVQQSKTEY 308 Query: 268 I----------KKKHLVRDALASV--IRSIKKIDNVND-TVRMGATFFNDRVI------- 307 I + +++ + +++ +I N N+ + + FN+ Sbjct: 309 ISELLRGRRGRTIREGLKEIERHMDIAKTVNQIKNFNNGEKQSYSFSFNNGDFCLGGNEG 368 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN-EDEVHRMKNNLE 366 + + +W K + V +G TA+ M + + S N E E K N Sbjct: 369 KETTQAWFDQK--KPNVSEALGKIEPLGGTAVTSGMLIGINLMTSKNSEPEAAPSKLNTN 426 Query: 367 AKKYIVLLTDGENTQDNEEGI------AICNKAKSQ---------GIRIMTIAFSVNKTQ 411 ++ +++L+DGE+ Q +E+ + +C + K + G +AF T Sbjct: 427 TRRVLLILSDGEDNQPSEKTLVNLMGAGLCREIKDKMNSLQDPKYGQVEPRVAFIAFGTN 486 Query: 412 -QEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + C ++ S L F+ I Sbjct: 487 LPDNQLNAWKQCVG-KHYYSVFSKQGLLDAFKQII 520 >gi|163754426|ref|ZP_02161548.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] gi|161325367|gb|EDP96694.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] Length = 335 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 59/188 (31%), Gaps = 34/188 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 AL V + + R+G + + + + + +G Sbjct: 115 TALKKVAAEFIEGRPSD---RIGLVVYAGESFTKTPITTDKSIIQNALKDIKYKHGELIG 171 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA + + K++ K I+LLTDG N E A Sbjct: 172 GTAIGMGLATAVNRL-----------KDSKAKSKVIILLTDGVNNAGFIEPQIASELAVE 220 Query: 396 QGIRIMTIAFSVN------------------KTQQEKARYFLSNCA--SPNSFFEANSTH 435 GI+ TI N Q E L A + +F A +T Sbjct: 221 YGIKTYTIGIGTNGMASTPVALNPDGTILFRNMQVEIDEKLLQQIAKTTGGKYFRATNTK 280 Query: 436 ELNKIFRD 443 +L +I+ + Sbjct: 281 KLAEIYDE 288 >gi|21244101|ref|NP_643683.1| hypothetical protein XAC3376 [Xanthomonas axonopodis pv. citri str. 306] gi|21109728|gb|AAM38219.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 323 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 66/189 (34%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 109 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLRDSVV 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 165 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLNPLK 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFEANST 434 AK++G+R+ TIAF + + L A + FF A T Sbjct: 212 AAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRARDT 271 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 272 DELAGIYAE 280 >gi|188990358|ref|YP_001902368.1| hypothetical protein xccb100_0962 [Xanthomonas campestris pv. campestris str. B100] gi|167732118|emb|CAP50310.1| putative membrane protein [Xanthomonas campestris pv. campestris] Length = 335 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 72/192 (37%), Gaps = 34/192 (17%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +V+ + + L+ + ++ D R+G F R + + + + + Sbjct: 121 NVVDRLTAAKAVLSDFLDR-REGD------RVGLLVFGQRAYALTPLTADLTSVRDQLAD 173 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + TAI DA+ + V R++ ++ +VLLTDG NT Sbjct: 174 SVV--GLAGRETAIGDAIALS-----------VKRLREQRHGQRVVVLLTDGVNTAGVLN 220 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFEA 431 + AK++G+R+ TIAF + + + L A + FF A Sbjct: 221 PLKAAELAKAEGVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRA 280 Query: 432 NSTHELNKIFRD 443 T EL I+ + Sbjct: 281 RDTEELAGIYAE 292 >gi|209546922|ref|YP_002278840.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538166|gb|ACI58100.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 462 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 47/374 (12%), Positives = 117/374 (31%), Gaps = 71/374 (18%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ + L + TG+ I+ AL + ML G D +R ++ A+I + Sbjct: 19 RYFHTLRGLRRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIA-A 77 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V I + + + + +++ T E++ Sbjct: 78 VKQINNTADADALKAKVSDWFHAQVDN-----------------------SYTLGEIDID 114 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + ++ + + F++ I + + + + + SY +++ VID Sbjct: 115 TANHNITATASGTVP----TTFMKIANIDTVPVSVGSAVKGPATSY-----LNVYIVIDT 165 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML S + Sbjct: 166 SPSMLLAATTSGQSTM-------------------------------YSGIGCQFACHTG 194 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + + S+ +++ + + DA+ V+ I D+ ++ +++G D Sbjct: 195 DAHTVGKKTYANNYEYSTAKNIKLRADVAGDAVKDVLSLIDTSDSNHERIKVGLYSLGDT 254 Query: 306 VIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + P+ S + + +T + ++ T + + + Sbjct: 255 LTEVLAPTLSTDTARTRLSTASYGLTSATSKAATYFDVSLATLKQKVGAGGDGT-----T 309 Query: 364 NLEAKKYIVLLTDG 377 + K ++LLTDG Sbjct: 310 SGTPLKLVLLLTDG 323 >gi|77465284|ref|YP_354787.1| von Willebrand factor domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389702|gb|ABA80886.1| Von Willebrand domain containing protein [Rhodobacter sphaeroides 2.4.1] Length = 328 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 63/179 (35%), Gaps = 28/179 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ S + ++ D R+G F +R ++ + + R I A Sbjct: 113 TRLEAVKRVARSFVEE-RQGD------RIGLVLFANRAYVAAPLTFDLAAVGRAI--EEA 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 STAI D + A ++ S+ A + IVLL+DG++ + Sbjct: 164 SIGITGRSTAIADGLGLALKSVTESS-----------AASRVIVLLSDGQDNAHQIDARQ 212 Query: 389 ICNKAKSQGIRIMTIAFSVN--KTQQEKARYF----LSNC--ASPNSFFEANSTHELNK 439 + A G+RI TIA + +T+ L AS + +L Sbjct: 213 VAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 271 >gi|126464748|ref|YP_001045861.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126106559|gb|ABN79089.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 328 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 63/179 (35%), Gaps = 28/179 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ S + ++ D R+G F +R ++ + + R I A Sbjct: 113 TRLEAVKRVARSFVEE-RQGD------RIGLVLFANRAYVAAPLTFDLAAVGRAI--EEA 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 STAI D + A + R+ + A + IVLL+DG++ + Sbjct: 164 SIGITGRSTAIADGLGLA-----------LKRVTESAAASRVIVLLSDGQDNAHQIDARQ 212 Query: 389 ICNKAKSQGIRIMTIAFSVN--KTQQEKARYF----LSNC--ASPNSFFEANSTHELNK 439 + A G+RI TIA + +T+ L AS + +L Sbjct: 213 VAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 271 >gi|294508603|ref|YP_003572662.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] gi|294344932|emb|CBH25710.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] Length = 317 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 66/199 (33%), Gaps = 37/199 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + R+A + + + R+G F + + L R + Sbjct: 99 TRFEAAREAAGAFVE-------GRVSDRVGLIVFAAEAYTQAPLTLDYSFLQRMLEDVEV 151 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ A+ TA + + K++ K +LLTDG N + + Sbjct: 152 GAVED--GTAVGTALATAVNRL-----------KDSEAESKVAILLTDGRNNRGQIDPRT 198 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ------EKARY---------FLSNCASP--NSFFEA 431 A++ G+R+ I ++ + + + L + ++ +F A Sbjct: 199 AAEVAQTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESAGVDAEMLRSVSTSTGGQYFSA 258 Query: 432 NSTHELNKIFRDRIGNEIF 450 + L +I+ + E Sbjct: 259 TNRDALERIYAEIDTMEAT 277 >gi|332560892|ref|ZP_08415210.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] gi|332274690|gb|EGJ20006.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] Length = 341 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 61/179 (34%), Gaps = 28/179 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ S + ++ D R+G F +R ++ + + R I A Sbjct: 126 TRLEAVKRVARSFVEE-RQGD------RIGLVLFANRAYVAAPLTFDLAAVGRAI--EEA 176 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 STAI D + A + S A + IVLL+DG++ + Sbjct: 177 SIGITGRSTAIADGLGLALKRVTESG-----------AASRVIVLLSDGQDNAHQIDARQ 225 Query: 389 ICNKAKSQGIRIMTIAFSVN--KTQQEKARYF----LSNC--ASPNSFFEANSTHELNK 439 + A G+RI TIA + +T+ L AS + +L Sbjct: 226 VAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 284 >gi|153831781|ref|ZP_01984448.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148872291|gb|EDL71108.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 334 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 70/234 (29%), Gaps = 44/234 (18%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH---------VIKKKHLVRDALASVI 282 +V+C + ++Y VD S I + V+ L+ + Sbjct: 76 LVVACARPVWYGDPVEFQPKYRDMMLVVDLSGSMQKEDMNDNGEYIDRLTAVKRVLSDFV 135 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 R+G F D + +++ I +T + TAI D Sbjct: 136 EK-------RQGDRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQR--TAIGDG 186 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + T + S ++ ++LL+DG NT + + AK I T Sbjct: 187 IGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDPLEAAEIAKKYNATIYT 235 Query: 403 IAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + + + L+ A + +F A EL I+ Sbjct: 236 VGVGAGEMMVKDFFMTRKVNTASDLDEQTLTKIAEMTGGKYFRARDAKELETIY 289 >gi|313885991|ref|ZP_07819729.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924521|gb|EFR35292.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 326 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 71/186 (38%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V + ++ +G F + + H +I+ +++T I + E G Sbjct: 110 EAARDVASEMIAARPNDN---IGLVVFAGESFTLCPLTVD-HNVIQQMLETTEIGQLEDG 165 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA +T+ + + K I+LLTDG N + A+ Sbjct: 166 -TAIGLGLATAINTL-----------RGSDNKSKVIILLTDGSNNAGDITPSMAAELAQQ 213 Query: 396 QGIRIMTIAFSVNKTQQEKAR----------------YFLSNCA--SPNSFFEANSTHEL 437 GIRI T+A N + + L + A + ++ A +L Sbjct: 214 YGIRIYTVAAGTNGVAKFPVQTAFGTEYVEADVQIDEGTLRHIAEQTGGKYYRATDETKL 273 Query: 438 NKIFRD 443 ++I+++ Sbjct: 274 HEIYKE 279 >gi|91201136|emb|CAJ74195.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 331 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 72/207 (34%), Gaps = 41/207 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++V+ + I + D +G F+ + + L++ + KT Sbjct: 112 RLYVVKQVVKDFINK-RSTDP------IGLVVFSANAYTQCPLTLDYGILLQFLEKTEIG 164 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI A+ ++ D + +N K IVLLTDG N + + Sbjct: 165 LLED--GTAIGSAIASSVDRL-----------RNTKAQSKVIVLLTDGRNNSGQIDPLTA 211 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYF-----------------LSNCA--SPNSFFE 430 A++ I+I TI AR L+ A + ++ Sbjct: 212 AELAQAFNIKIYTIGAGSKGLVPYPARDLFGNRVMRQVKIDIDDESLAEIANITGGRYYR 271 Query: 431 ANSTHELNKIFRDRIGNEIFERVIRIT 457 A T L +I++ + + + IT Sbjct: 272 ATDTGSLKEIYQQI--DALEKTETEIT 296 >gi|332833576|ref|XP_003312497.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Pan troglodytes] Length = 728 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 63 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 116 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 117 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 170 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 171 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 227 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 228 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|312434033|ref|NP_116206.4| inter-alpha-trypsin inhibitor heavy chain H5 isoform 2 [Homo sapiens] Length = 728 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 63 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 116 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 117 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 170 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 171 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 227 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 228 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|310703621|ref|NP_085046.5| inter-alpha-trypsin inhibitor heavy chain H5 isoform 1 precursor [Homo sapiens] Length = 942 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 331 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|187609608|sp|Q86UX2|ITIH5_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor Length = 942 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 331 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|156976371|ref|YP_001447277.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] gi|156527965|gb|ABU73050.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] Length = 334 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 73/234 (31%), Gaps = 44/234 (18%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH---------VIKKKHLVRDALASVI 282 +V+C + ++Y VD S I + V+ L+ + Sbjct: 76 LVVACARPVWYGDPVEFQPKYRDMMLVVDLSGSMQKEDMNDNGEYIDRLTAVKRVLSDFV 135 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ D R+G F D + +++ I +T + TAI D Sbjct: 136 EK-RQGD------RLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQR--TAIGDG 186 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + T + S ++ ++LL+DG NT + + AK I T Sbjct: 187 IGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDPLEAAEIAKKYNATIYT 235 Query: 403 IAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + + E L+ A + +F A EL I+ Sbjct: 236 VGVGAGEMMVKDFFMTRKVNTASELDEQTLTKIAEMTGGKYFRARDAKELETIY 289 >gi|153876525|ref|ZP_02003802.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152067011|gb|EDN66198.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 180 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 32/155 (20%), Positives = 53/155 (34%), Gaps = 18/155 (11%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F D + + + K A+ TAI DA+ A + E Sbjct: 2 FADHAYLQAPLTLDNLAVQSLLQK--AVIGMAGRDTAIGDAIGLAVKKLRERPE------ 53 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY--FL 419 + ++LLTDGEN + + AK IRI TI + L Sbjct: 54 -----GSRILILLTDGENNAGALKPLQAAELAKQYDIRIYTIGVGGKGGMFSRGLNETEL 108 Query: 420 SNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 A + ++F A + LN ++ + I + + Sbjct: 109 KKIAQLTNGAYFPATNLGALNNVY-EHIDKTLQKT 142 >gi|119606788|gb|EAW86382.1| inter-alpha (globulin) inhibitor H5, isoform CRA_d [Homo sapiens] Length = 735 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 331 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|119606787|gb|EAW86381.1| inter-alpha (globulin) inhibitor H5, isoform CRA_c [Homo sapiens] Length = 748 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 83 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 136 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 137 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 190 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 191 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 247 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 248 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 284 >gi|119606785|gb|EAW86379.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Homo sapiens] gi|168275576|dbj|BAG10508.1| inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [synthetic construct] Length = 942 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 331 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|55958059|emb|CAI12954.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958529|emb|CAI16361.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 577 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 152 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 205 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 206 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 259 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 260 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 316 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 317 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 353 >gi|55958058|emb|CAI12953.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958528|emb|CAI16360.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 742 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 63 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 116 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 117 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 170 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 171 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 227 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 228 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|37181977|gb|AAQ88792.1| LLLL311 [Homo sapiens] Length = 694 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 331 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|30314037|gb|AAO49812.1| inter-alpha trypsin inhibitor heavy chain precursor 5 [Homo sapiens] Length = 942 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 331 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|55958060|emb|CAI12955.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958531|emb|CAI16363.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|189442558|gb|AAI67770.1| Inter-alpha (globulin) inhibitor H5 [synthetic construct] Length = 956 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 331 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|49355778|ref|NP_001001851.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 3 precursor [Homo sapiens] gi|119606789|gb|EAW86383.1| inter-alpha (globulin) inhibitor H5, isoform CRA_e [Homo sapiens] Length = 702 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 331 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|18916771|dbj|BAB85539.1| KIAA1953 protein [Homo sapiens] Length = 824 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 159 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 212 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 213 FSIIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 266 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 267 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 323 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 324 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 360 >gi|327541056|gb|EGF27607.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 497 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 59/460 (12%), Positives = 133/460 (28%), Gaps = 48/460 (10%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 F + S G ++ A ++P++ + +++ + + L A A Sbjct: 72 RFALTQRDDSRRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGRA 131 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 EE + A + + E ++ N TD + + + + Sbjct: 132 FS---EEQTVEAAKAAARLTAAMNEVAGEPYQLN-----TDDSANEFEFGVSAQTDGNTG 183 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA-------EAETVSRSYHKEHGVSIQ 180 + D+ N +++ + K ++ ++Q Sbjct: 184 RFYFTKVPTSDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQ 243 Query: 181 ------WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 V+D S SM D++ + +G+ + + + + + R+ P + Sbjct: 244 VDRDISLVLDRSGSM-DWKTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNG--QPQYI 300 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + Y Y + E + + A+ + +R + + Sbjct: 301 R--RVSYNRGYDEYDLYDHAWEEVFGLGPAPN--TPWEDLVLAVDAFLRVLDQTPQNEQV 356 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +N D + + GST I + M + N Sbjct: 357 ---SIASYNSHGTLDCWLLDDFDSVRAAV-----AQLGPNGSTGIGNGMNSGKTAFTHEN 408 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 A K +V++TDG + + S + I T+ F Q+ Sbjct: 409 --------ARPYASKTMVVMTDGNHNYGTQPNTVAQQLMSSSNLNIQTVTFGGGADQETM 460 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ + A+S EL F + I N + + Sbjct: 461 QEVAVTGL---GRHYHADSGDELVSAFEE-IANNLPTILT 496 >gi|262403351|ref|ZP_06079911.1| protein BatA [Vibrio sp. RC586] gi|262350850|gb|EEY99983.1| protein BatA [Vibrio sp. RC586] Length = 248 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 66/197 (33%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ + V+ L+ I + R+G F D + + Sbjct: 30 DMQSGQQMVDRLTAVKQVLSEFIAQ-------REGDRIGLILFADHAYLQTPLTLDRQTV 82 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A+ + TA+ + + A T I S ++ ++LL+DG N Sbjct: 83 TEQLNQ--AVLKLIGTQTAMGEGIGLATKTFIDSA-----------APQRVMILLSDGSN 129 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCASP- 425 T + + N AK I T+ + Q+ L AS Sbjct: 130 TAGVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTT 189 Query: 426 -NSFFEANSTHELNKIF 441 +F A + +L I+ Sbjct: 190 GGQYFRARNQQDLQSIY 206 >gi|188580059|ref|YP_001923504.1| von Willebrand factor type A [Methylobacterium populi BJ001] gi|179343557|gb|ACB78969.1| von Willebrand factor type A [Methylobacterium populi BJ001] Length = 339 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 32/151 (21%), Positives = 51/151 (33%), Gaps = 11/151 (7%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D+ + ++ + RT+ A ST I D + A + + Sbjct: 143 RIGLVEFADQAYVAAAPTFDTAAVARTL--EEATIGLVGRSTGIGDGLGLALKRLAPAQL 200 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--QE 413 + K +VLL+DG N + AK G+R+ TIA Sbjct: 201 ADAEGGGPPPSRDKVVVLLSDGANNAGQTAPKDVAALAKDLGVRVYTIALGPIDMADNPN 260 Query: 414 KARYF-----LSNCA--SPNSFFEANSTHEL 437 + L A S F +T +L Sbjct: 261 NEQDVVDVETLRAMAETSGGRAFRVKTTDDL 291 >gi|45655623|ref|YP_003432.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602594|gb|AAS72069.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 320 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 17/163 (10%) Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 +++ + N + R+G F + G + + I++T + TAI Sbjct: 123 KLLKRFIEKRNND---RLGLVVFAGAAYLQAPLT-GDRESLSEILETIEEETVTEQGTAI 178 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 DA+ + +R++N+ K IVL+TDG + + + + A+ G + Sbjct: 179 GDAIILS-----------TYRLRNSKARSKVIVLITDGVSNTGKIDPVTATDLAEQIGAK 227 Query: 400 IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I ++ E L + + FF A E+ + Sbjct: 228 IYSVGIGKEDGSYEINFEILQELSANTGGRFFRAEDPEEMKAV 270 >gi|330975134|gb|EGH75200.1| von Willebrand factor, type A [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 352 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEARIGI-AGKNTALGDAIGLALKRLRMRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 193 TS-----------RALVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|330945007|gb|EGH46785.1| von Willebrand factor, type A [Pseudomonas syringae pv. pisi str. 1704B] Length = 258 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 41 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEARIGI-AGKNTALGDAIGLALKRLRMRPA 98 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 99 TS-----------RALVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDALQ 147 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 148 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 186 >gi|294653581|ref|NP_714598.2| von Willebrand factor type A domain-containing protein [Leptospira interrogans serovar Lai str. 56601] gi|293630705|gb|AAN51613.2| BatA [Leptospira interrogans serovar Lai str. 56601] Length = 312 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 17/163 (10%) Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 +++ + N + R+G F + G + + I++T + TAI Sbjct: 115 KLLKRFIEKRNND---RLGLVVFAGAAYLQAPLT-GDRESLSEILETIEEETVTEQGTAI 170 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 DA+ + +R++N+ K IVL+TDG + + + + A+ G + Sbjct: 171 GDAIILS-----------TYRLRNSKARSKVIVLITDGVSNTGKIDPVTATDLAEQIGAK 219 Query: 400 IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I ++ E L + + FF A E+ + Sbjct: 220 IYSVGIGKEDGSYEINFEILQELSANTGGRFFRAEDPEEMKAV 262 >gi|189912860|ref|YP_001964749.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913185|ref|YP_001964414.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777536|gb|ABZ95836.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781253|gb|ABZ99550.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 317 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 45/205 (21%), Positives = 70/205 (34%), Gaps = 18/205 (8%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA-SVIRSIKKIDNVNDTVRMGAT 300 Y L P S S+ + R +++ ++R K R+G Sbjct: 81 YKLSPDSTKGVDIMIALDISGSMVNSYDFLPRNRLSVSKDLLREFVK---KRLYDRIGIV 137 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F S L I T + D E G TA+ DA+ + +R Sbjct: 138 VFAGAAYLQSPLSSDRFALDELIAGTSSEDIEEQG-TAVGDALVLS-----------SYR 185 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 +KN+ K I+LLTDG + + K+ GI++ I + Q E L Sbjct: 186 LKNSEAKSKVIILLTDGVSNTGKLDPDTAAYTTKTMGIKVYCIGIGKEEGQYEINYESLQ 245 Query: 421 NCAS--PNSFFEANSTHELNKIFRD 443 +S FF A S L + + Sbjct: 246 KISSNTNGKFFRAESPEVLESVLNE 270 >gi|73949160|ref|XP_544264.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [Canis familiaris] Length = 893 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 56/277 (20%), Positives = 90/277 (32%), Gaps = 31/277 (11%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG-------IRDEKLSPYMVSC 236 D S ++ ++ +QNG +G + E+ + Sbjct: 175 DDSGP-PPSTVINQNDTFAKVTFKPSVVQQAKIAQNGILGDFIIRYDVNREQSIGDIEVL 233 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D+ + Sbjct: 234 NGYFVHYFAPRDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFS---- 289 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 290 --IIGFSNRIKV-----WKDHLVSVTPDNVRDGKIYIHHMSPTGGTDINGALQRAIKLL- 341 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N K + I TI + Sbjct: 342 --NDYVAHNDIEDRSVS-LIVFLTDGKPTVGETHTLKILNNTKEAARGQVCIFTIGIGDD 398 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 399 VDFKLLEKLSLENCGLTRRVLEEEDAGSQLIGFYDEI 435 >gi|32472883|ref|NP_865877.1| signal peptide [Rhodopirellula baltica SH 1] gi|32444120|emb|CAD73562.1| hypothetical protein-signal peptide and transmembrane prediction [Rhodopirellula baltica SH 1] Length = 434 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 59/460 (12%), Positives = 134/460 (29%), Gaps = 48/460 (10%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 F + +S G ++ A ++P++ + +++ + + L A A Sbjct: 9 RFALTQRDESRRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGRA 68 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 EE + A + + E ++ N TD + + + + Sbjct: 69 FS---EEQTVEAAKAAARLTAAMNEVAGEPYQLN-----TDDSANEFEFGVSAQTDGNTG 120 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA-------EAETVSRSYHKEHGVSIQ 180 + D+ N +++ + K ++ ++Q Sbjct: 121 RFYFTKVPTSDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQ 180 Query: 181 ------WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 V+D S SM D++ + +G+ + + + + + R+ P + Sbjct: 181 VDRDISLVLDRSGSM-DWKTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNG--QPQYI 237 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + Y Y + E + + A+ + +R + + Sbjct: 238 R--RVSYNRGYDEYDLYDHAWEEVFGLGPAPN--TPWEDLVLAVDAFLRVLDQTPQNEQV 293 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +N D + + GST I + M + N Sbjct: 294 ---SIASYNSHGTLDCWLLDDFDSVRAAV-----AQLAPNGSTGIGNGMNSGKTAFTHEN 345 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 A K +V++TDG + + S + I T+ F Q+ Sbjct: 346 --------ARPYASKTMVVMTDGNHNYGTQPNTVAQQLMSSSNLNIQTVTFGGGADQETM 397 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ + A+S EL F + I N + + Sbjct: 398 QEVAVTGL---GRHYHADSGDELVSAFEE-IANNLPTILT 433 >gi|294054129|ref|YP_003547787.1| hypothetical protein Caka_0592 [Coraliomargarita akajimensis DSM 45221] gi|293613462|gb|ADE53617.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 339 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 78/231 (33%), Gaps = 17/231 (7%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R + + Y + N+ + + D S + K +V +A Sbjct: 67 RPQAGTSYSLEVNEGIAIQMLVDVSSSMDMSVKNFDGKSTTRMEVAKEMVERFIAGDGED 126 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAM 343 ++ + +G F + ++G H + IV+ I E TA DA+ Sbjct: 127 LQGRPHDL----IGLITFARYADTRSPLTFG-HDALLQIVRHLTIQERPNEDGTAYGDAL 181 Query: 344 QTAYDTIISSNEDEVHRMKNNL---EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A + + E + + K I+LLTDGEN + I AK+ +I Sbjct: 182 ALAAARLKNPQELRHGKRPDAQAEAIESKVIILLTDGENNSGSHLPIEAAGLAKAWDCKI 241 Query: 401 MTIAFSVNKTQQEKARYF------LSNCA--SPNSFFEANSTHELNKIFRD 443 I+ + + L + + + F +A+ L ++ + Sbjct: 242 YAISLGESLDAENPLDALSPAERVLEHISIETGGVFRQAHDFESLLSVYEE 292 >gi|187934443|ref|YP_001887479.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] gi|187722596|gb|ACD23817.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] Length = 1596 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 65/227 (28%), Gaps = 31/227 (13%) Query: 179 IQWVIDFSRSM----LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM- 233 I V+D S SM D +C + + + + Y Sbjct: 93 IVLVLDTSGSMDQEINQLCEDCAYYCKDCDKWIYETRENHKNQKPYINHTIVRRFGGYYC 152 Query: 234 VSCNK-----SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 CNK + P +++ + + K H ++ A + I S+ Sbjct: 153 YDCNKYIENEETHINYRPYANHSFSDKKYCNNHKAYESYTTKIHELKKAAKNFIDSLTST 212 Query: 289 D-----NVNDTVRMGATFFNDRVISDPSFSWGVHKLIR-----TIVKTFAIDENEMGSTA 338 +++G +N+ + + +K + G T Sbjct: 213 KTDGQTPNVKNLKIGIVSYNNSGYINEGLVQVTDSDRKNNGNINELKDTIENLRADGGTN 272 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 D ++ A + NE K ++ + DGE T + + Sbjct: 273 TGDGLRKAAYLLNEENEAN-----------KTVIFMGDGEPTYYSSD 308 >gi|218461471|ref|ZP_03501562.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 459 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 78/477 (16%), Positives = 160/477 (33%), Gaps = 51/477 (10%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +LI G II L+ ML G +D +R L+ AA +A++ A+ + Sbjct: 3 RLIDDRDGAVAIIVILVAVPMLLAVGASIDYIRAYNGRTELQAAADSAVLAAAAKYKSGM 62 Query: 73 -EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E ++ N+F + E + + ++ +V D V T +++ + + Sbjct: 63 PEATIAKTINAFLSANGEFETAVAGKPQVASDESELCLDVADAVPTTFMKL-ANIQSVPI 121 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + S L + + S +I+ + S + S++D Sbjct: 122 SIRSCAALPGVKQLEIALVLDVSSSMIEENRFTPMQTAVAGFLQAFS------SNTSLVD 175 Query: 192 YQRDS--EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + S FG +KSY+ + S Y S K Y++ P+ Sbjct: 176 KTKISIVPFSSRVNFGLANTAWLKSYNGTAAVPKRWTDPESVYTSSGYKLSYWIDGVTPV 235 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN-VNDTVRMGATFFNDRVIS 308 + +++ R ++ + ++ + + V M A + Sbjct: 236 MSTSKNYYWMGCIEPRADVEVRDT-----GAIGDGMGDAPPSTSAFVAMDANPKSGTSFC 290 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE----DEVHRMKNN 364 P + + + K+ + GST ++ + + T+ + D+ + Sbjct: 291 PPPVT-PLTGDFAYL-KSVVKNLTSEGSTRLDAGVVAGWYTLSPKWQGVWGDQSSPAPVS 348 Query: 365 LEAKKYIVLLTDGENT-----QDNEEGI------------------AICNKAKSQGIRIM 401 K +V +TDGE D + I C K GI I Sbjct: 349 DSVHKVMVFMTDGEMNTKYDPNDKFDWICSQTQSSACNAFATAARQTACTAMKKSGIEIY 408 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 T+++S + + NCA+ + F S + ++ + I I +R+T+ Sbjct: 409 TLSYSADADVVN-----IRNCATNTAHFFTASPATIKTVY-ETIAAAIRGDTLRLTQ 459 >gi|84622723|ref|YP_450095.1| hypothetical protein XOO_1066 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84366663|dbj|BAE67821.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 335 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 66/189 (34%), Gaps = 31/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A A + + + D R+G F R + + + + + + Sbjct: 121 KVVDRLTAAKAVLSDFLDRRDGD----RVGLLVFGQRAYALTPLTADLTSVRDQLRDSVV 176 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + + Sbjct: 177 --GLAGRETAIGDAIALSVKRLREQKQ-----------GQRVVVLLTDGVNTAGVLDPLK 223 Query: 389 ICNKAKSQGIRIMTIA-----------FSVNKTQQEKA-RYFLSNCA--SPNSFFEANST 434 AK++G+RI TIA + + L A + FF A T Sbjct: 224 AAELAKAEGVRIHTIAFGGGGGSSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDT 283 Query: 435 HELNKIFRD 443 EL I+ + Sbjct: 284 EELAGIYAE 292 >gi|260061451|ref|YP_003194531.1| aerotolerance operon BatA [Robiginitalea biformata HTCC2501] gi|88785583|gb|EAR16752.1| BatA (Bacteroides aerotolerance operon) [Robiginitalea biformata HTCC2501] Length = 333 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 64/194 (32%), Gaps = 40/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ A I+K N R+G + + + ++ + + Sbjct: 113 RLSALKEVAAEF---IRKRPND----RIGLVAYAGESYTKTPITSDKSIVLGALREITYG 165 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N+ TAI + T+ + + K + K I+LLTDG N E Sbjct: 166 QLND--GTAIGMGLATSVNRL-----------KESTAISKVIILLTDGVNNAGFIEPQTA 212 Query: 390 CNKAKSQGIRIMTIAFSVNKT------------------QQEKARYFLSNCASP--NSFF 429 + A GI+ TI N Q E L++ A+ +F Sbjct: 213 ADLALEYGIKTYTIGLGTNGNALSPIGYNPDGSFRYGMRQVEIDEELLTDIATATGGEYF 272 Query: 430 EANSTHELNKIFRD 443 A +L I+ + Sbjct: 273 RATDNEKLEAIYEE 286 >gi|194334883|ref|YP_002016743.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194312701|gb|ACF47096.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 327 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 64/155 (41%), Gaps = 19/155 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + + H ++ +V+ ++D TA A+ A + + +S Sbjct: 138 RIGLVLFKGKSFTQCPLTLD-HDVLSMLVRAASVDAVPESGTATGSAILIAVNRLRAS-- 194 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM--TIAFSVNKTQQE 413 ++ ++LLTDGE+ + + A +G+RI T++ +++ ++ Sbjct: 195 ---------ESPERVLILLTDGEHNAGEVDPVTAAGIAAGEGVRIYMATVSVPGSRSGED 245 Query: 414 KARYF--LSNCAS---PNSFFEANSTHELNKIFRD 443 LS S F AN + LN+ F + Sbjct: 246 MLASARDLSGEVSRITGGRSFRANDANSLNRTFSE 280 >gi|50949741|emb|CAH10363.1| hypothetical protein [Homo sapiens] Length = 460 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 47/212 (22%), Positives = 70/212 (33%), Gaps = 13/212 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 60 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 113 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F++R+ V K + + G T IN A+Q A + N+ Sbjct: 114 FSIIGFSNRIKVRKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKY 170 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQE 413 H + IV LTDG+ T + I N + + I TI + + Sbjct: 171 VAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRL 229 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + L NC E F D I Sbjct: 230 LEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 261 >gi|163786711|ref|ZP_02181159.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878571|gb|EDP72627.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 335 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 63/194 (32%), Gaps = 39/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ A I + R+G + + + ++R++ Sbjct: 114 RLEALKKVAADFIEG----RPND---RIGLVEYAGEAYTKTPITSDKSIVLRSMRDIKYN 166 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G TAI + T+ + + K++ K I+LLTDG N + Sbjct: 167 TIIE-GGTAIGMGLATSVNRL-----------KDSRAKSKVIILLTDGVNNGGFIDPKIA 214 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKAR------------------YFLSNCA--SPNSFF 429 A GI++ TI N T R L A + +F Sbjct: 215 SELAVEYGIKVYTIGLGTNGTALSPVRINPNGSFQYGRQKVEIDEDLLKEIADVTGGKYF 274 Query: 430 EANSTHELNKIFRD 443 A + +L +I+ + Sbjct: 275 RATNNKKLAQIYDE 288 >gi|332299342|ref|YP_004441263.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] gi|332176405|gb|AEE12095.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] Length = 326 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 71/186 (38%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V + ++ +G F + + H +I+ +++T I + E G Sbjct: 110 EAARDVASEMIAARPNDN---IGLVVFAGESFTLCPLTVD-HNVIQQMLETTEIGQLEDG 165 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA +T+ + + K I+LLTDG N + A+ Sbjct: 166 -TAIGLGLATAINTL-----------RGSDNKSKVIILLTDGSNNAGDITPSMAAELAQQ 213 Query: 396 QGIRIMTIAFSVNKTQQEKAR----------------YFLSNCA--SPNSFFEANSTHEL 437 GIRI T+A N + + L + A + ++ A +L Sbjct: 214 YGIRIYTVAAGTNGVAKFPVQTAFGTEYVEADVQIDEGTLRHIAEQTGGKYYRATDETKL 273 Query: 438 NKIFRD 443 ++I+++ Sbjct: 274 HEIYKE 279 >gi|320323259|gb|EFW79347.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. B076] Length = 352 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 50/159 (31%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRFWLDEAKIGI-AGKNTALGDAIGLGLKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 193 TS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 34/145 (23%), Positives = 56/145 (38%), Gaps = 19/145 (13%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 V+ A G+T I +Q + + + ++ KY++++TDGENTQ+ Sbjct: 435 VRAHAQKLTPAGNTNITIGVQWGMELLSPELPFNTAKPYSDKTNYKYMIVITDGENTQNR 494 Query: 384 EEGIA---------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANS 433 A C AK GI + TI E L +CAS P F++ + Sbjct: 495 WSTSASTINARTLLACQAAKDLGITVYTIRVM------EGNSDMLKSCASRPEYFYDVTA 548 Query: 434 THELNKIFRDRIGNEIFERVIRITK 458 + +L + R+TK Sbjct: 549 SSQLTSTLAKVF---YSIQSTRLTK 570 >gi|301784617|ref|XP_002927724.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Ailuropoda melanoleuca] Length = 898 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 74/217 (34%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D+ + Sbjct: 234 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFS---- 289 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 290 --IIGFSNRIKV-----WKDHLVSVTPDNVRDGKVYIHHMSPTGGTDINGALQRAIKLL- 341 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + I+ LTDG+ T + I N K I I TI + Sbjct: 342 --NDYVAHNDIEDRSVS-LIIFLTDGKPTVGETHTLKILNNTKEAARGQICIFTIGIGND 398 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + L NC E + F D I Sbjct: 399 VDFMLLEKLSLENCGLTRRVLEEDDAGAQLIGFYDEI 435 >gi|281346829|gb|EFB22413.1| hypothetical protein PANDA_017530 [Ailuropoda melanoleuca] Length = 895 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 74/217 (34%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D+ + Sbjct: 232 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFS---- 287 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 288 --IIGFSNRIKV-----WKDHLVSVTPDNVRDGKVYIHHMSPTGGTDINGALQRAIKLL- 339 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + I+ LTDG+ T + I N K I I TI + Sbjct: 340 --NDYVAHNDIEDRSVS-LIIFLTDGKPTVGETHTLKILNNTKEAARGQICIFTIGIGND 396 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + L NC E + F D I Sbjct: 397 VDFMLLEKLSLENCGLTRRVLEEDDAGAQLIGFYDEI 433 >gi|254459074|ref|ZP_05072497.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084345|gb|EDZ61634.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 279 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 41/148 (27%), Positives = 61/148 (41%), Gaps = 18/148 (12%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G + D ++ K I T + + +TAI +A+ Sbjct: 106 GVVLYGDFAFIASPITY--EKEIVTQMLGYLTQGMAGQNTAIGEAIA-----------MG 152 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 V K++ K IVLL+DGE+ + AK QGI+I TIA NK + ++A Sbjct: 153 VRSFKHSKAKTKVIVLLSDGEHNSGSVSPKEATELAKEQGIKIYTIA-MGNKGEADEA-- 209 Query: 418 FLSNCA--SPNSFFEANSTHELNKIFRD 443 L A S FF A+S EL I+ + Sbjct: 210 LLETIAKDSNGEFFSASSAKELKNIYDE 237 >gi|159045656|ref|YP_001534450.1| von Willebrand factor type A domain-containing protein [Dinoroseobacter shibae DFL 12] gi|157913416|gb|ABV94849.1| von Willebrand factor type A domain protein [Dinoroseobacter shibae DFL 12] Length = 328 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 76/200 (38%), Gaps = 24/200 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + ++ +++ V+D + + + ++ D R+ F + Sbjct: 100 AVDISGSMDDRDMTAPDGTRLQRLQAVKDVVGAFVAE-REGD------RISLIVFGAKPF 152 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + ++ + + +TAI DA+ A + S Sbjct: 153 IQAPFTEDLDSVVELLNQ--VQTGMAGPNTAIGDAIGLAIRSFEDS-----------EIE 199 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCAS- 424 ++ ++LL+DG +T I A +GI I TI E+ L + A+ Sbjct: 200 ERLLILLSDGADTASTMTPINAAQIAAQEGITIYTIGVGNPDGSGEERLDPATLEDIATR 259 Query: 425 -PNSFFEANSTHELNKIFRD 443 +F+ A+ L++I+ + Sbjct: 260 GGGAFYFADDVEGLSEIYAE 279 >gi|123443829|ref|YP_001007800.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122090790|emb|CAL13672.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 459 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 64/480 (13%), Positives = 146/480 (30%), Gaps = 64/480 (13%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + K+ G I +++P + + + ++ + + L A + A + + Sbjct: 11 FNHFTLFKKNEQGTILISFMIILPFFIALIFITFEISHYLQRKAKLSDAIEQATLALT-- 68 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 I++ + + N N V + N Sbjct: 69 -IENNAIPDEPQQ-------------IKNNALVLSYANAYLPSKEFSVPIININDNTYYL 114 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y ++ Y + ++ + A + + +V D+S Sbjct: 115 EYNAAVTMAYPAKFLTQTSLTNAITDINITDNGVAIKNKAIEASDLTDVI---FVADYSG 171 Query: 188 SMLDYQRDSE-------GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 SML +E + F + D + + + + + Sbjct: 172 SMLYNFDVNEPNDHERINALRSAFRKLHDIIMNNSNINAIGYIPFSWGTKRIVFENQQQK 231 Query: 241 YYMLYPGPLDPSLSEEHFV--------DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 Y +P + +++ ++ L I + S+ + + ID Sbjct: 232 IYCHFPFSSKIYKPKGNYLSDEIKKSSNALLLLDYIGDIIDYDKTIESITGNAQPIDIPM 291 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 VR V S ++ + + I G T I+ + +A + + Sbjct: 292 SDVRT------KNVCLQASNAYSLEQEQYINNIDNIIKMEPYGWTLISSGILSA-NNLFK 344 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE----EGIAI---------CNKAKSQGIR 399 + HR K +++L+DG +T + +G+ I C + S GI+ Sbjct: 345 KEANNRHR--------KLMIILSDGVDTYQDNFLPNKGLFISKTLVEKGMCERVISSGIQ 396 Query: 400 IMTIAFSVNKTQQEKARYFL--SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + IA + + ++ C ++++EA++ EL + + I V R T Sbjct: 397 MAFIAIAYSPDDDVNEPEYINWRQCVGKDNYYEAHNADELMRDIQQAISKSATSEVGRNT 456 >gi|269962784|ref|ZP_06177125.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269832474|gb|EEZ86592.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 353 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 73/234 (31%), Gaps = 44/234 (18%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH---------VIKKKHLVRDALASVI 282 +V+C + ++Y VD S I + V+ L+ + Sbjct: 95 LVVACARPVWYGDPVEFQPKYRDMMLVVDLSGSMQKEDMNDNGEYIDRLTTVKRVLSDFV 154 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ D R+G F D + +++ I +T + TAI D Sbjct: 155 EK-RQGD------RLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQR--TAIGDG 205 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + T + S ++ ++LL+DG NT + + AK I T Sbjct: 206 IGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDPLEAAEIAKKYNATIYT 254 Query: 403 IAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + + + L+ A + +F A EL I+ Sbjct: 255 VGVGAGEMMVKDFFMTRKVNTASDLDEQTLTKIAEMTGGKYFRARDAKELETIY 308 >gi|154244802|ref|YP_001415760.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] gi|154158887|gb|ABS66103.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] Length = 345 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 56/162 (34%), Gaps = 27/162 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ R + ++R ++ +I +T+I DA+ A T+ Sbjct: 136 RIGLILFSTRAYVQAPLTLD-RNVVRQLLAEASIGMTGR-NTSIGDAIGLAVKTLRDRPA 193 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 + ++LLTDG NT + + A + +RI TI + Sbjct: 194 K-----------DRVLILLTDGANTSGVLDPMEAAAIAAKENVRIHTIGVGADSNFTDIQ 242 Query: 411 -------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L A + +F A + L I+ D Sbjct: 243 PGMLMNPSGDLDEEALKKIAGLTGGQYFRARNDKGLAAIYAD 284 >gi|311695164|gb|ADP98037.1| von Willebrand factor type A domain protein [marine bacterium HP15] Length = 342 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 66/191 (34%), Gaps = 35/191 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + V+ L I ++ D R+G F ++ + +RT+++ Sbjct: 113 INRLQAVKRVLDDFISR-RQGD------RLGLILFGTEPYVQAPLTFDLE-TVRTLMREA 164 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + +TAI DA+ A + + +D ++ +VLLTDG NT Sbjct: 165 GLGM-AGRATAIGDAVGLATKRLRNRPQD-----------QRVVVLLTDGANTAGEITPD 212 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKA-------------RYFLSNCA--SPNSFFEAN 432 A + IR+ TI Q L+ A + +F A Sbjct: 213 KATEIAAAASIRLYTIGIGAESMVQRGLLGSRRVNPSRDLDENLLTRMAQQTGGEYFRAR 272 Query: 433 STHELNKIFRD 443 S EL I+ Sbjct: 273 SLPELELIYES 283 >gi|91223292|ref|ZP_01258558.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] gi|91192105|gb|EAS78368.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] Length = 334 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 64/190 (33%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ + +K D R+G F D + +++ I +T Sbjct: 120 YIDRLTAVKKVLSDFVAK-RKGD------RLGVVLFGDHAYLQTPLTADRKTVMQQINQT 172 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI D + T + S ++ ++LL+DG NT E Sbjct: 173 VIGLVGQR--TAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLEP 219 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 + AK I T+ + + L+ A + +F A Sbjct: 220 LEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEVTGGQYFRA 279 Query: 432 NSTHELNKIF 441 T EL KI+ Sbjct: 280 RDTEELEKIY 289 >gi|187251530|ref|YP_001876012.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971690|gb|ACC98675.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 373 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 70/202 (34%), Gaps = 23/202 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 +D L+ + ++ + + A A+ I + R+G F Sbjct: 144 TEGVDIILAIDTSGSMAAQDFDPNRITAAKVAAANFIA-------NRLSDRIGIVVFASD 196 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + L+ + I TAI DA+ + VH ++ Sbjct: 197 AMLQSPLTLDYESLL-DFLADVRIGMVRTDGTAIGDAIAVS----------SVHLERSPA 245 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--EKARYFLSNCA 423 K I+LLTDGE+ + A GI++ TIA ++ + L A Sbjct: 246 R-SKVIILLTDGESNSGVISPLDAAKTAALYGIKVYTIATISKNSRDSLDFKPDDLEQIA 304 Query: 424 --SPNSFFEANSTHELNKIFRD 443 + ++ A + EL KI+ + Sbjct: 305 KLTGGKYYRAYNEAELTKIYAE 326 >gi|114563846|ref|YP_751360.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114335139|gb|ABI72521.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 334 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + A TAI +A+ A +E Sbjct: 128 RLGLILFADHAYLQAPLTLDRRSVATFLD--DAQIGLVGKQTAIGEAIALAVKRFDKVDE 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + ++LLTDG N N E A + I I TI ++ Sbjct: 186 SN-----------RVLILLTDGSNNAGNIEPEVAAQIAAKRNITIYTIGVGAEILERRTI 234 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L A + +F A ++ EL I+++ Sbjct: 235 FGKERINPSMDLDEDQLKKLAAMTKGRYFRARNSEELASIYQE 277 >gi|197337036|ref|YP_002157821.1| hypothetical protein VFMJ11_A0264 [Vibrio fischeri MJ11] gi|197314288|gb|ACH63737.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 423 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 68/458 (14%), Positives = 147/458 (32%), Gaps = 60/458 (13%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L K GH I+ A+++P + G+ + D R + ++ A++ A + S Sbjct: 2 RNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAH--ND 59 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR----DIVRDTAVEMNPRKS 127 ++ + + T +Q + +Y+ N + TD +V +++ + + Sbjct: 60 PDQPDNGSYTPSTRNRQIVVDYV--NAYISDVDAVTDIKVAKRRCELIPECVAGLYDGDM 117 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y L D+ S F + I+ ++++R Y E ++ DFS Sbjct: 118 RY---LEHEIDVTTRQNSWFPGNEAIEGMGETFSTRGKSLARKYQSEAVDAMF-AADFSG 173 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SMLD S + N R++ S + + Sbjct: 174 SMLDTWSGSSNPKYVDLIEIIRNISAELQKFNDLPENRNKSTMGIS---AFSTFTNSFTS 230 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 S V+ + + + +A++ D + GA Sbjct: 231 DTGIQCSLSQGVNGRNGPATWFRPVKAANTVANIWN-----PKTEDYCKSGAYA----GF 281 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 D + + + L + + G TA A+ ++ + Sbjct: 282 HDVNLTSNFNYLNGQVGSFY-----AGGGTASYQALIRG-----------AQLLRKGNNS 325 Query: 368 KKYIVLLTDGENTQ----DNEEGIAICNKAKSQ------------GIRIMTIAFSVNKTQ 411 ++ +++L+DG + D +C ++ ++ I F N Sbjct: 326 RRLLIVLSDGMDNDTQLADGLVSAGMCRDIQNGLESDRTPDRRPIAAKMAVIGFDYNPFA 385 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + L +C + ++A E+ I + I EI Sbjct: 386 NKA----LKDCVGEKNVYKAEDADEVEDIILELINEEI 419 >gi|34558787|gb|AAQ75132.1| BatA protein [Alvinella pompejana epibiont 6C6] Length = 300 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 31/149 (20%), Positives = 57/149 (38%), Gaps = 19/149 (12%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F S ++ + L+ + +TAI DA+ A + S Sbjct: 127 IGIVIFGSFSFSASPLTYDLKALLEMFDLMSDVGIAGN-NTAIGDAIFEAIKNLES---- 181 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 K I+LLTDG++ + +AK +GI+I T+ + + Sbjct: 182 -------GEAKSKVIILLTDGKHNFGKKSPKEGVVEAKKRGIKIYTVGIGTDYD-----K 229 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRD 443 L A + F A ++ EL ++F++ Sbjct: 230 KLLEKMAKETNAKSFFAKNSKELEEVFKE 258 >gi|285019106|ref|YP_003376817.1| von willebrand factor, type a protein [Xanthomonas albilineans GPE PC73] gi|283474324|emb|CBA16825.1| putative von willebrand factor, type a protein [Xanthomonas albilineans] Length = 343 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 62/189 (32%), Gaps = 33/189 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + A A + + + D R+G F + + + + + + + Sbjct: 129 VDRLTAAKAVLADFLDRRDGD----RIGLLVFGQQAYALTPLTADLATVRDQLRDSVV-- 182 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 TA+ DA+ A + + ++ ++LLTDG NT + + Sbjct: 183 GLAGRETALGDAIALAVKRLREQPQ-----------GERVLILLTDGVNTAGVLDPLKAA 231 Query: 391 NKAKSQGIRIMTIAFSVNKTQ--------QEKA------RYFLSNCASP--NSFFEANST 434 AK++ +R+ TIA + L A FF A T Sbjct: 232 ELAKAEHVRVYTIALGGDGGGMSLFGMPIPGSGGDDEVDEDTLRKIAQDTGGRFFRARDT 291 Query: 435 HELNKIFRD 443 +L I+ + Sbjct: 292 AQLASIYAE 300 >gi|313674519|ref|YP_004052515.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941217|gb|ADR20407.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 345 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/179 (21%), Positives = 62/179 (34%), Gaps = 30/179 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G T F+ S + L I G TAI A+ + Sbjct: 145 RIGLTIFSGEAYSLSPLTTDYKMLKNQITDIDFKMMEASG-TAIGSALAVGTN------- 196 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 RM+ + K ++LL+DG+N N + A + GI+I TIA Sbjct: 197 ----RMRESDSKSKVLILLSDGDNNAGNIDPETSAKLANAYGIKIYTIAIGKEGKVPYGK 252 Query: 416 RYF--------------LSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +F L N A F+ A L ++F I ++ + I+ T+ Sbjct: 253 DFFGRTRYIENSMDVTGLKNIAKIGEGQFYRATDNQALEEVF--SIIDQYEKAEIKETR 309 >gi|157374763|ref|YP_001473363.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] gi|157317137|gb|ABV36235.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] Length = 330 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 52/163 (31%), Gaps = 28/163 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F D + + + + K I TAI +A+ A +E Sbjct: 128 KLGLILFADHAYLQAPLTQDRRSVAQFL-KEAQIGL-VGKQTAIGEAIALAVKRFDRVDE 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV-------- 407 + +VLLTDG N + A +G++I +I Sbjct: 186 SN-----------RILVLLTDGSNNSGSISPEQAAAIAAKRGVKIYSIGVGAEVMERRTL 234 Query: 408 -------NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ + + +F A + EL I+++ Sbjct: 235 FGKERVNPSMDLDETQLTALAQTTGGLYFRARNAQELESIYQE 277 >gi|325860278|ref|ZP_08173400.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482157|gb|EGC85168.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 318 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 59/154 (38%), Gaps = 18/154 (11%) Query: 297 MGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G T F + + + L+ + TAI + A + Sbjct: 129 IGLTIFAGEAFTQCPMTLDHAALLNLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRL--- 185 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 K++ K ++LLTDG N + + A+ GIR+ TI F ++ Sbjct: 186 --------KDSKAKSKIVILLTDGSNNAGSISPMTAAAIARKFGIRVYTIGFGKETGEEI 237 Query: 414 KARYF--LSNCA--SPNSFFEANSTHELNKIFRD 443 A + L + A + F+ A S EL++I++D Sbjct: 238 GAIDYKTLQDIAVSTNGEFYRAQSQAELSRIYQD 271 >gi|260463263|ref|ZP_05811464.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030853|gb|EEW32128.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 661 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 30/229 (13%), Positives = 75/229 (32%), Gaps = 19/229 (8%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + G++ ++TA+ M ++G + +D + + + A A + L + Sbjct: 6 RQFRRDRRGNYALMTAVAMIPLMGGLALAIDFTEMNREKQMVTNALDAANFATARRLTEG 65 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + ++ Y + F NL + T ++ Sbjct: 66 ATD-------------DQLRAYALDFFNANLND---LNPANATLNLTLPSNTAGGGLLKM 109 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 Y P F + +G + + S+ ++ + + V+D S SM Sbjct: 110 TARLNYKPYFYP--AFAQLVGKSATDANQSISFDVTSQ-VRLKNTLEVALVLDNSGSMTT 166 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 S + ++ + + V + + Q + D + +V S+ Sbjct: 167 LGTGSGQKRIDLLKTASKQLVDTLAQQAVMIKQVDRPVQFGLVPFAASV 215 Score = 43.0 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 34/95 (35%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 ++ + + T +K G T + + M + + S R+ Sbjct: 436 YSCTTNAITPLTDVSVTDGATAIKAAIDLMQPNGGTNVPEGMAWGWRVVSSGEPFTQGRL 495 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + K +++LTDG NT ++ + A S+ Sbjct: 496 ETEKGNDKVVIVLTDGANTYYTPSSLSYSDPADSK 530 >gi|120554865|ref|YP_959216.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324714|gb|ABM19029.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 339 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 64/190 (33%), Gaps = 37/190 (19%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + V+ L I ++ D R+G F ++ + + F Sbjct: 113 INRLQAVKRVLDDFIDQ-REGD------RLGLILFGTEPYVQAPLTFDRETVRTLL---F 162 Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G +TAI DA+ + + ++ ++ ++LLTDG NT Sbjct: 163 EAGLGMAGRATAIGDAIGLSVKRLRERPQE-----------QRVVILLTDGANTAGQVSP 211 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKA-------------RYFLSNCA--SPNSFFEA 431 A++ G+R+ TI + Q L+ A + +F A Sbjct: 212 DKATEIAQAAGVRLYTIGIGADTMIQRGLLGSRRVNPSRDLDEELLTRMAEQTGGRYFRA 271 Query: 432 NSTHELNKIF 441 S EL I+ Sbjct: 272 RSLPELEMIY 281 >gi|146343040|ref|YP_001208088.1| hypothetical protein BRADO6230 [Bradyrhizobium sp. ORS278] gi|146195846|emb|CAL79873.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 519 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 72/521 (13%), Positives = 138/521 (26%), Gaps = 104/521 (19%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + + AL + +L G D + L+ AA A I + Sbjct: 16 RFVGDSKANIATTFALALLPILTAIGCGTDYSMAMRLKVKLQSAADAASIASISVNSAGY 75 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + + + N + + +T+ R Sbjct: 76 AAAMAMTSDGSVTAGVNEADNIFKG-------NASTFGGYTLTSETSTVTKTRS-----T 123 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML-- 190 LSS+ + FL +G +S + + + Y + +D S SM Sbjct: 124 LSSQVQFTAAVPTTFLTVIGYQSITVSGSSSSSVTLPLY-----LDFYLTLDVSGSMGLP 178 Query: 191 ---------------DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 +Y++ G L C P + +Q + VS Sbjct: 179 STSAEAQRMQAISPDNYRQYPTGCTLACHFSPQNSACTDSGTQGYPTNNYCLGYAISRVS 238 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK-------------------HLVRD 276 + + + + S + K V Sbjct: 239 QSGYKSLLTTNKNNPKGVQLPSSIVSGLPNSLYNKLPTVANCPTDGTDDCIQLRLDAVGY 298 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR-------------TI 323 A+ + + V + R+G F + S + + T+ Sbjct: 299 AVNQLFTTANTTKKVANQFRIGLYPFIRYLYSYYPLTTNISGSTSDSSTINYAAANLATL 358 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE----- 378 + T G T I+ A+ + I S + N Y+ L+TDG Sbjct: 359 LDTNTNASLGSGGTHIDTALSSVNSLITSVGDGSA---TTNTLP--YVFLVTDGAQDPQV 413 Query: 379 -----------NTQDNEEGIAICNKAKSQGIRIMTI------------AFSVNKTQQEKA 415 N C K++GI I + +F+ ++ Sbjct: 414 KGVPNGSWSGSNHATTINPTTSCTPLKNRGIIISVLYIPYQTINPVNASFAGDEDDYANN 473 Query: 416 R-----YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 L CASP F+ AN+ ++ + + E Sbjct: 474 NIPNIPPSLQACASPGFFYTANTPADITSALNAMFNHAVSE 514 >gi|146337717|ref|YP_001202765.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] gi|146190523|emb|CAL74522.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] Length = 418 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 46/408 (11%), Positives = 113/408 (27%), Gaps = 79/408 (19%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + Y +K G+ ++ A+ +L G +D + L+ A A++ Sbjct: 3 RLLRYLRKFGHDQRGNIAVLFAIACVPVLAFVGAGIDYSMANKLRTKLQMAIDEAVLAGV 62 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 +A + + N + + T + Sbjct: 63 AA---------GKAALDSGATQAAAIAMAQAASSSYFTGNTAKIDATPTINFTTMGRTLS 113 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + + F+R +G + + + + + Y +++ ++D Sbjct: 114 GTGSATSVMN---------TSFMRLVGFPTMTLNASSASSATMQPY-----LNVYLLVDI 159 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML + + A + S Y L Sbjct: 160 SSSMLLPATQAGITQMRNGTGCALACH-------------------ETTNGTDSYSYALK 200 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 +V+ + +V + +++ + + V++G F+++ Sbjct: 201 N-------------------NVLLRYQVVNQGVQNLLTYLNSSAVYKNYVKVGLWSFDNQ 241 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + S + ++ F A + + +SS + Sbjct: 242 LTQLSS----LTSSFSSVAANFPAPGLAYNDAAAATPFDSLIGSFVSSVGTAGDG-STSA 296 Query: 366 EAKKYIVLLTDGENTQDNEEGIAI-------------CNKAKSQGIRI 400 +K +++ TDG N CN KS G+ + Sbjct: 297 TPQKLVIIATDGVNDPTRAWTSQTSLRSQVRVFNTAFCNTFKSNGVTV 344 >gi|332217050|ref|XP_003257666.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like [Nomascus leucogenys] Length = 941 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 46/212 (21%), Positives = 71/212 (33%), Gaps = 13/212 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F++R+ + V K + + G T IN A+Q A + N+ Sbjct: 331 FSIIGFSNRIKVWKDYLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKY 387 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQE 413 H + +V LTDG+ T + I N + + I TI + + Sbjct: 388 VAHSDIGDRSVS-LVVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRL 446 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + L NC E F D I Sbjct: 447 LEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|301165481|emb|CBW25052.1| putative membrane protein (von Willebrand factor type A) [Bacteriovorax marinus SJ] Length = 329 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 31/137 (22%), Positives = 55/137 (40%), Gaps = 19/137 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + +D ++ + ++ D R+G F++R + S + KLI+ +V + Sbjct: 106 RLEVAKDKISDFVA-LRPTD------RIGLIMFSERAFTLLPLSTDL-KLIKQMVGEINV 157 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T I DA+ A + + K I+LLTDG + I Sbjct: 158 GGMLGSGTNIGDALGLA---VARGAQSLAKN--------KVIILLTDGVSNVGFLTPIQA 206 Query: 390 CNKAKSQGIRIMTIAFS 406 +AK QGI++ TI Sbjct: 207 AEEAKKQGIKVYTIGIG 223 >gi|331012285|gb|EGH92341.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 352 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 50/159 (31%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTALGDAIGLGLKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 193 TS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|330989218|gb|EGH87321.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. lachrymans str. M301315] Length = 352 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 50/159 (31%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTALGDAIGLGLKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 193 TS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|257482758|ref|ZP_05636799.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 265 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 50/159 (31%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ + Sbjct: 48 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTALGDAIGLGLKRLRLRPA 105 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 106 TS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDALQ 154 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 155 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 193 >gi|71737462|ref|YP_275714.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|71558015|gb|AAZ37226.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|320329710|gb|EFW85699.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] gi|330882170|gb|EGH16319.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] Length = 352 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 50/159 (31%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTALGDAIGLGLKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 193 TS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|213963729|ref|ZP_03391979.1| BatA protein [Capnocytophaga sputigena Capno] gi|213953609|gb|EEB64941.1| BatA protein [Capnocytophaga sputigena Capno] Length = 333 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 62/188 (32%), Gaps = 36/188 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +AL V K + R+G + + + ++ ++ + + Sbjct: 115 EALKKVASQFVKDRPND---RIGLVVYAGESYTKTPVTTDKGIILSSLAELTYGQVED-- 169 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA + + K + + I+LLTDG N + + A Sbjct: 170 GTAIGMGLATAVNRL-----------KESKAKSRVIILLTDGVNNTGVIDPLIAAELAAE 218 Query: 396 QGIRIMTIAFSVNKT------------------QQEKARYFLSNCA--SPNSFFEANSTH 435 GI++ T+ N Q E + A + +F A + Sbjct: 219 YGIKVYTVGIGTNGMALSPYALNPDGSIMYRMLQVEIDESLMKKIAQVTHGRYFRATNNQ 278 Query: 436 ELNKIFRD 443 +L +I+ + Sbjct: 279 KLQQIYDE 286 >gi|59713412|ref|YP_206187.1| TadG-like protein [Vibrio fischeri ES114] gi|59481660|gb|AAW87299.1| TadG-like protein [Vibrio fischeri ES114] Length = 423 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 70/458 (15%), Positives = 150/458 (32%), Gaps = 60/458 (13%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L K GH I+ A+++P + G+ + D R + ++ A++ A + S Sbjct: 2 RNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAH--ND 59 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR----DIVRDTAVEMNPRKS 127 ++ + + T +Q + +Y+ N + TD +V +++ + + Sbjct: 60 PDQPDNGSYTPSTRNRQIVVDYV--NAYISDIDAVTDIKVAKRRCELISGCVAGLYKGDA 117 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y L D+ S F + I+ ++++R Y E V + + DFS Sbjct: 118 RY---LEHEIDVTTRQNSWFPGNEAIEGMGETFSTRGKSLARKYQSE-AVDVMFAADFSG 173 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SMLD S + N RD+ S + + Sbjct: 174 SMLDTWSGSSNPKYIDLIEIIRNISVELQKFNDLPENRDKSTMGIS---AFSTFTNSFTS 230 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 S V+S + + + +A++ D + GA Sbjct: 231 DTGIQCSLSQGVNSKNKPGNWFRPVKPANTVANIWNE-----KTEDYCKSGAYA----GF 281 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 D + + + L + + G TA A+ + + Sbjct: 282 HDVNLTSNFNSLNGQVGSFY-----AGGGTASYQALIRG-----------AQLLDRGRNS 325 Query: 368 KKYIVLLTDGENTQDNEE----GIAICNKAKS--------QG----IRIMTIAFSVNKTQ 411 ++ +++L+DG + N +C + ++ G ++ I F + Sbjct: 326 RRLLIVLSDGMDNDRNLANGLVSNGMCREIQAGLESDRTPDGRPIAAKMAVIGFDYDPFA 385 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + L +C + ++A E+ I + I E+ Sbjct: 386 NKA----LKDCVGEKNVYKAEDADEVEDIILELINEEV 419 >gi|260777338|ref|ZP_05886232.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] gi|260607004|gb|EEX33278.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] Length = 271 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 59/190 (31%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ I R+G F D + H + + +T Sbjct: 57 YIDRLSAVKQVLSDFISK-------RQGDRLGLVLFADHAYLQTPLTLDRHTVAEQLNQT 109 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI + + A T + S ++ ++LL+DG NT + Sbjct: 110 VLRLI--GTKTAIGEGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP 156 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 I AK I T+ + Q+ L A + +F A Sbjct: 157 IEAAKIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDEKSLMEIAKLTGGQYFRA 216 Query: 432 NSTHELNKIF 441 + EL I+ Sbjct: 217 RDSKELATIY 226 >gi|315649824|ref|ZP_07902907.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315274798|gb|EFU38179.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 1316 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 62/177 (35%), Gaps = 27/177 (15%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND-RVISDPSFSWGVH 317 +D S + K ++ I + + ++G ++ IS S Sbjct: 74 IDRSGSMNDENKMQSAINSAKGFIDLMDLSKH-----KVGIVDYSSANNISSFPLS---- 124 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + VK + G TA DA++ A + +++ D + IVLLTDG Sbjct: 125 -TDKEAVKNYVNGLRANGGTATGDAIKKARELLVNHRPDA----------QPVIVLLTDG 173 Query: 378 ENTQDNEEGIAIC----NKAKSQGIRIMTIAFSVNKTQQE--KARYFLSNCASPNSF 428 + T+ N N+AK +GI TIA + L A+ + Sbjct: 174 DATEPNGNAYNYALTNSNEAKQEGIVFYTIALLNTNANPDTSGPNLLLKQMATTSHH 230 >gi|224370037|ref|YP_002604201.1| hypothetical protein HRM2_29500 [Desulfobacterium autotrophicum HRM2] gi|223692754|gb|ACN16037.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 332 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 57/189 (30%), Gaps = 39/189 (20%) Query: 279 ASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +++ + + NV R+G F + + + + + Sbjct: 108 GAIVNRLDAVKNVVKDFIMKRSGDRIGMVVFGSEAFTQMPLTRDYDTIAFVLSRLKIGAA 167 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TAI DAM + + ++LLTDG++ A + Sbjct: 168 GPS--TAIGDAMGISLKRLEDVKSKS-----------NIVILLTDGKSNSGEITPGAAAD 214 Query: 392 KAKSQGIRIMTIAFSVNKTQQ-----------------EKARYFLSNCA--SPNSFFEAN 432 A+ +G+++ TI + L A + +FF A Sbjct: 215 IARERGVKVYTIGVGQRGKAPFLVNDPLFGQRYVYQMVDMDHEALKEIADKTGGAFFAAA 274 Query: 433 STHELNKIF 441 T L KI+ Sbjct: 275 DTDSLKKIY 283 >gi|207028763|ref|NP_001124799.1| inter-alpha-trypsin inhibitor heavy chain H5 [Pongo abelii] Length = 942 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 48/217 (22%), Positives = 74/217 (34%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D+ + Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFS---- 332 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 333 --IIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|56403909|emb|CAI29739.1| hypothetical protein [Pongo abelii] Length = 694 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 48/217 (22%), Positives = 74/217 (34%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D+ + Sbjct: 29 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFS---- 84 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q A + Sbjct: 85 --IIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 136 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 137 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 193 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 194 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 230 >gi|218509981|ref|ZP_03507859.1| hypothetical protein RetlB5_22275 [Rhizobium etli Brasil 5] Length = 448 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 50/377 (13%), Positives = 121/377 (32%), Gaps = 71/377 (18%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L + TG+ I+ AL + ML G D +R ++ A+I +V I + Sbjct: 14 RSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIA-AVKQINN 72 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E+ + + + ++E E+ + + + Sbjct: 73 TEDTDALKQKVSDWFHAQVENSYALG-----------------------EIEIDTTNHNI 109 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 ++ + + F++ I + + + + + SY +++ V+D S SML Sbjct: 110 TATASGTVP----TTFMKIANIDTVPVSVASAVKGPATSY-----LNVYIVVDTSPSMLL 160 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + + + + GK Sbjct: 161 AATTAGQSTMYSGIKCQFACHTGDTHTIGK------------------------------ 190 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS--D 309 + + D S+ + + + + DA+ V+ I + D+ ++ +++G D + Sbjct: 191 -KTYANNYDYSTEKGIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYGLGDTLTEVLA 249 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 P+ S + + +T + ++ T + + + + K Sbjct: 250 PTLSTDIARTRLADSSYGLTSATSKAATYFDVSLATLKQKVGAGGDGT-----TSGTPLK 304 Query: 370 YIVLLTDGENTQDNEEG 386 ++LLTDG +Q Sbjct: 305 LVLLLTDGVQSQREWVT 321 >gi|254228714|ref|ZP_04922137.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262396564|ref|YP_003288417.1| protein BatA [Vibrio sp. Ex25] gi|151938661|gb|EDN57496.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262340158|gb|ACY53952.1| protein BatA [Vibrio sp. Ex25] Length = 334 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 64/190 (33%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ + +K D R+G F D + +++ I +T Sbjct: 120 YIDRLTAVKKVLSDFVAK-RKGD------RLGVVLFGDHAYLQTPLTADRKTVMQQINQT 172 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI D + T + S ++ ++LL+DG NT E Sbjct: 173 VIGLVGQR--TAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLEP 219 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 + AK I T+ + + L+ A + +F A Sbjct: 220 LEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEMTGGQYFRA 279 Query: 432 NSTHELNKIF 441 T +L KI+ Sbjct: 280 RDTDQLEKIY 289 >gi|312126757|ref|YP_003991631.1| hypothetical protein Calhy_0520 [Caldicellulosiruptor hydrothermalis 108] gi|311776776|gb|ADQ06262.1| protein of unknown function DUF1355 [Caldicellulosiruptor hydrothermalis 108] Length = 909 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 86/296 (29%), Gaps = 40/296 (13%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 ++ ++ S Y ++ + +D S + S+ + F +R VK Sbjct: 306 NVRSLLDSYSADYDMVISDNVDFGLDRLMQYSFVVLCNVSKNHLTDKFLNDCERYVKDLG 365 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV---------DSSSLRHV 267 +G + + + + V S Sbjct: 366 GGLLVIGGENSYALGNYSNSVLEKMLPVKMQLKNKEKERNVAVVLVIDHSGSMGESNLGN 425 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I K + + A A +I ++ D+V G F+ +G K ++ Sbjct: 426 INKLEIAKSAAAKMIDHLESSDSV------GVIAFDHNFYWASK--FGKLKSKNEVI-EN 476 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TAI + A +T+ S K IVLLTD + Sbjct: 477 ISGIQIGGGTAIIPPLTEAVNTLRKSKAK-----------DKVIVLLTD--GYGEEGGYE 523 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + AK I+I TI + LS A + F+ L +F Sbjct: 524 YPASIAKRNNIKITTIGVGSSINAP-----ILSWMAAYTSGRFYYVKDASNLIDVF 574 >gi|323493494|ref|ZP_08098616.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] gi|323312317|gb|EGA65459.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] Length = 393 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 64/438 (14%), Positives = 139/438 (31%), Gaps = 69/438 (15%) Query: 10 YSKKLIKSCTGHFFI-ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +S +L+ G I A+L+P+++ +V + A++ A TA + Sbjct: 2 HSSRLLSKQRGSVAISYLAMLIPMIIAAASTIVIGYQVLLSNRAMQ-AVDTASLAC---- 56 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 ++ ++ + + D I + TA S Sbjct: 57 ------------------------EFRGEYDRSIAQGYLDYYKPKIDKVTAT--LGASSG 90 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 +V L Y + L+ S++ A + + + V+D S S Sbjct: 91 CKVELGYSYSSIFTSLT-----FSDASYVAGVTASQKVYVTEVTDSDPIELVLVLDISGS 145 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M+ + L T++S + + P+ + + L G Sbjct: 146 MMGALDE-----LKSILNRGLTTLRSQQANVAGQDHIKVSIVPFSNGVSVTDAPWLKSG- 199 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +L + A+ + ++ + ++ Sbjct: 200 -------------GTLCVDATVNSGGSFSPANTVANLDVTHDQAPVTTSSSSSDCSLTSV 246 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR--MKNNLE 366 + ++ ++ + + +GSTA + + + + +N Sbjct: 247 ILPLTSNLNDVVDAVNR-----LQTIGSTASYQGLLWGLRQLTPNWQSAWRVGPNRNQDN 301 Query: 367 AKKYIVLLTDG-ENTQDNEEGIA--ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 ++ +VL+TDG ++ +E I +C +AK GI + I F V + F Sbjct: 302 VQRKLVLMTDGMDDNSHLDELINAGLCTRAKDLGIELNFIGFGVQSW---RLEQFTRCAG 358 Query: 424 SPNSFFEANSTHELNKIF 441 S + F AN+T +L+ F Sbjct: 359 SAGAVFSANNTQDLDDYF 376 >gi|316931543|ref|YP_004106525.1| hypothetical protein Rpdx1_0148 [Rhodopseudomonas palustris DX-1] gi|315599257|gb|ADU41792.1| Protein of unknown function DUF2134, membrane [Rhodopseudomonas palustris DX-1] Length = 443 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 68/482 (14%), Positives = 149/482 (30%), Gaps = 97/482 (20%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G+ +I AL++ ++ G VD R + L+ AA A + A + Sbjct: 9 RFHRDRRGNIAVIFALVLIPLISAIGCAVDYSRANALRTKLQAAADAASVGAVSRTSPAY 68 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 S + + + +R F NL N T + + V + +V Sbjct: 69 IAAGSMSTDGAITSG---ADDALRIFNGNL-ANLTGYTLDSV--AATVSKSGEAVTSKVT 122 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 S++ ++F++++ + S + + A Y + ++D S SM Sbjct: 123 FSAQIS------TMFMKAVAVSSMTVGGVSTATASMPKY-----IDFYLLLDNSPSMGVG 171 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ + + NKS D + Sbjct: 172 ATPTDVAAMVA------------------------------ATANKSSDDHCAFACHDVN 201 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD--- 309 ++ ++ + + + + ++R A ++ + + RM F + Sbjct: 202 -NKNNYYNLAKALGITTRIDVLRSATQQLMDTAAATATYTNQFRMAIYDFGASAQTAGLR 260 Query: 310 --PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL-- 365 + S L AID + N+ T Y I+ + + + Sbjct: 261 NLFALS---ASLSSAKTAAGAIDLMTVKGQNDNNDQDTQYTAILPAIDKLIAAPGTGAAG 317 Query: 366 EAKKYIVLLTDGENTQDNEEGI--------------AICNKAKSQGIRI---MTIAFSVN 408 KY+ ++DG + N + A+C K +G+R+ T + Sbjct: 318 SPLKYLFFVSDGVADEYNPACLKPKTGNRCQSPINPALCKTLKDRGVRVAVLYTTYLDLP 377 Query: 409 ------------------KTQQEKARYFLSNCASPNSFFEANS----THELNKIFRDRIG 446 + + + CASP +FE + +N +F+ + Sbjct: 378 SNDWYKKWIAPFNEGPYGPSPNSEIAKNMEACASPGFYFEVSPTQGIAEAMNALFKRAVA 437 Query: 447 NE 448 + Sbjct: 438 DA 439 >gi|221369290|ref|YP_002520386.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] gi|221162342|gb|ACM03313.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] Length = 328 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 61/179 (34%), Gaps = 28/179 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ S + ++ D R+G F +R ++ + + R I A Sbjct: 113 TRLEAVKRVARSFVEE-RQGD------RIGLALFANRAYVAAPLTFDLAAVGRAI--EEA 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 STAI D + A + S A + IVLL+DG++ + Sbjct: 164 SIGITGRSTAIADGLGLALKRVTESG-----------AASRVIVLLSDGQDNAHQIDARQ 212 Query: 389 ICNKAKSQGIRIMTIAFSVN--KTQQEKARYF----LSNC--ASPNSFFEANSTHELNK 439 + A G+RI TIA + +T+ L AS + +L Sbjct: 213 VAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 271 >gi|269965331|ref|ZP_06179451.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] gi|269829977|gb|EEZ84206.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] Length = 334 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 64/190 (33%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ + +K D R+G F D + +++ I +T Sbjct: 120 YIDRLTAVKQVLSDFVAK-RKGD------RLGVVLFGDHAYLQTPLTADRKSVMQQINQT 172 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI D + T + S ++ ++LL+DG NT E Sbjct: 173 VIGLVGQR--TAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLEP 219 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 + AK I T+ + + L+ A + +F A Sbjct: 220 LEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEVTGGQYFRA 279 Query: 432 NSTHELNKIF 441 T EL KI+ Sbjct: 280 RDTEELEKIY 289 >gi|260912478|ref|ZP_05919014.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] gi|260633397|gb|EEX51551.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] Length = 332 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 71/196 (36%), Gaps = 41/196 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +D A I ++ +G T F + + H + +++T Sbjct: 109 RMEAAKDVAAEFIAD----RPNDN---IGLTIFAGEAFTQCPMTTD-HTSLLNMLQTVRT 160 Query: 330 DENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 D G TAI + A + K++ K ++LLTDG N + Sbjct: 161 DIAAKGLIQDGTAIGMGLANAVSRL-----------KDSKAKSKVVILLTDGSNNMGDLS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNS 427 + N AKS GIR+ TI NK + E L + A + + Sbjct: 210 PMTSANIAKSLGIRVYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAASTDGN 269 Query: 428 FFEANSTHELNKIFRD 443 F+ A + EL +I++D Sbjct: 270 FYRATNNQELKQIYKD 285 >gi|304382530|ref|ZP_07365025.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] gi|304336361|gb|EFM02602.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] Length = 332 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 72/190 (37%), Gaps = 37/190 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H+ + +++ D + G Sbjct: 111 EAAKDVAAEFISGRPNDN---IGLTIFAGEAFTQCPMTTD-HQSLLNLLQNVRTDLSARG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TA+ + A + K++ K ++LLTDG N + + + N Sbjct: 167 LIEDGTAVGMGLANAVSRL-----------KDSKAKSKVVILLTDGSNNRGDLSPMTSAN 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQ------EKARYF----------LSNCAS--PNSFFEANS 433 AKS GIR+ TI NK +Y LS AS +F+ A + Sbjct: 216 IAKSLGIRVYTIGVGTNKVAPYPMPVAGGIQYVNIPVEIDTKTLSGIASVTHGNFYRATN 275 Query: 434 THELNKIFRD 443 +EL +I++D Sbjct: 276 NNELKQIYKD 285 >gi|288928458|ref|ZP_06422305.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331292|gb|EFC69876.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] Length = 332 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 71/196 (36%), Gaps = 41/196 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +D A I ++ +G T F + + H + +++T Sbjct: 109 RMEAAKDVAAEFIAD----RPNDN---IGLTIFAGEAFTQCPMTTD-HTSLLNMLQTVRT 160 Query: 330 DENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 D G TAI + A + K++ K ++LLTDG N + Sbjct: 161 DIAAKGLIQDGTAIGMGLANAVSRL-----------KDSKAKSKVVILLTDGSNNMGDLS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNS 427 + N AKS GIR+ TI NK + E L + A + + Sbjct: 210 PMTSANIAKSLGIRVYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAATTDGN 269 Query: 428 FFEANSTHELNKIFRD 443 F+ A + EL +I++D Sbjct: 270 FYRATNNQELKQIYKD 285 >gi|124006869|ref|ZP_01691699.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987550|gb|EAY27259.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 351 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 59/190 (31%), Gaps = 35/190 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + + + R+G F+ S + L R I Sbjct: 132 RLEAAKLVAKNFV-------HGRKYDRIGLVIFSGEAYSVSPLTTDYKLLKRYIEDIRED 184 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G TAI A+ RM+ + K ++L++DG+NT N + I Sbjct: 185 MIQENG-TAIGSALG-----------MGTIRMQESASRSKVVILISDGDNTAGNLDPITA 232 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYF--------------LSNCAS--PNSFFEANS 433 A + I+I TI + F L A F+ A+ Sbjct: 233 SRLATAHNIKIYTILVGRSGKVPYGRDMFGQPQYVNNTVDESVLREIAKIGEGKFYRASD 292 Query: 434 THELNKIFRD 443 L +F + Sbjct: 293 NQALKNVFAE 302 >gi|317502942|ref|ZP_07961034.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] gi|315665941|gb|EFV05516.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] Length = 332 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 70/190 (36%), Gaps = 37/190 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A +V ++ +G T F + + LI ++++ D G Sbjct: 111 EAAKNVASEFISDRPNDN---IGLTIFAGEAFTQCPMTTDHASLIN-MLRSVRTDIAARG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TAI + A + K++ K ++LLTDG N + + Sbjct: 167 LISDGTAIGMGLANAVSRL-----------KDSKAKSKVVILLTDGSNNMGDISPLTSAQ 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQ------EKARYF----------LSNCA--SPNSFFEANS 433 AKS GIR+ TI NK +Y L N A + +++ A S Sbjct: 216 IAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQYVNIPVEIDSKTLKNIAETTDGNYYRATS 275 Query: 434 THELNKIFRD 443 ++L +I++D Sbjct: 276 NNQLKQIYKD 285 >gi|283782262|ref|YP_003373017.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283440715|gb|ADB19157.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 395 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 66/435 (15%), Positives = 127/435 (29%), Gaps = 61/435 (14%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 S+K + G ++ A L+ V++ + +DV L+ A A ++ L + Sbjct: 9 SRKPRLARRGAMLVLIAFLLVVVVCMAAFAIDVSYMQLVRSELRAATDAAAKAGTLALAK 68 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + +S + RN T +V+ +A Q Sbjct: 69 TDGDAASARTAAIQ--------AAARNKVAGRALVLTTDQVQVGRSAAQANGTWSFTANQ 120 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 +S L S S+ + + + + + I VID S SM Sbjct: 121 TPYTSVKILSSMSDSTAAGSVPLFLGTFMGRGSFQPAQSATASQMEQEICLVIDRSHSMC 180 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 E P +T + V+ P Sbjct: 181 FNMSGVEWS-----YPPGTKTTPHTICYPPHATLSRWAALQSSVNLFMDTILETNNTPRV 235 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 ++ + +++ + KK V Sbjct: 236 ALITWGSTIGTNTAEYSYTKK--------------------------------TEVAVAN 263 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + +G T ++ + A T+++ N AKK Sbjct: 264 ELGLSTDYAAVKSKIAARTTKVMLGGTNMSAGID-AGRTLLNGNTVRA-------LAKKT 315 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 ++L+TDG+ Q + I A +GI+I TI F A+ + A + + Sbjct: 316 MILMTDGQWNQG-RDPIDAAEDAADEGIQIHTITFLSGS-----AQNTMRQVAEITGGKY 369 Query: 429 FEANSTHELNKIFRD 443 + +++ EL + FRD Sbjct: 370 YVSSNQAELEEAFRD 384 >gi|149197908|ref|ZP_01874957.1| BatA [Lentisphaera araneosa HTCC2155] gi|149139129|gb|EDM27533.1| BatA [Lentisphaera araneosa HTCC2155] Length = 341 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 66/215 (30%), Gaps = 34/215 (15%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVI----RSIKKIDNVNDTVRMGATFFNDRV 306 +S ++ + + + + + I + + +G F Sbjct: 92 VDVSSSMDINMKYGEERLTRMDVAKIVVEKFIGGDGDELVGRPDDL----IGLITFARYA 147 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + S LI + T AY + ++ ++ + + Sbjct: 148 DTIAPLSLAHEALISIVQDVTINTRPNEDGT--------AYGDATALAAAQLDLLQGDQD 199 Query: 367 AK-KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA----------------FSVNK 409 K K I+LLTDGEN N + + AK GI+I TI+ F V Sbjct: 200 IKSKIIILLTDGENNCGNHLPLQAASLAKEWGIKIYTISIQNKPTPERKKTDQGTFFVPP 259 Query: 410 TQQEKARYFLSNCASPNSFFE-ANSTHELNKIFRD 443 T + S F A+ L ++++ Sbjct: 260 TPSAGDQVLKKMAESTGGVFRLAHDYDSLKSVYKE 294 >gi|298207017|ref|YP_003715196.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] gi|83849651|gb|EAP87519.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] Length = 334 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 60/188 (31%), Gaps = 35/188 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +AL V + + R+G + + + ++ + + E G Sbjct: 115 EALKDVASEFIQGRPND---RVGIVLYAGESYTKTPITSDKSIVLGALNDVKFSEVLENG 171 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + T+ + + K++ K I+LLTDG N + A Sbjct: 172 -TAIGMGLATSVNRL-----------KDSKALSKVIILLTDGVNNSGTIDPKLASELAVE 219 Query: 396 QGIRIMTIAFSVNKTQ------QEKAR------------YFLSNCA--SPNSFFEANSTH 435 GI+ TI N + + L A + +F A + Sbjct: 220 YGIKTYTIGIGSNGMALSPIGIKSNGQFQYGNQKVEIDEDLLKQIATVTGGQYFRATNNQ 279 Query: 436 ELNKIFRD 443 +L I+ + Sbjct: 280 KLEAIYEE 287 >gi|14042009|dbj|BAB55070.1| unnamed protein product [Homo sapiens] Length = 942 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 72/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F +R+ W H + T K + + G T IN A+Q A + Sbjct: 331 FSIIGFPNRIKV-----WKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 385 --NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|255526268|ref|ZP_05393185.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186262|ref|ZP_06854666.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510048|gb|EET86371.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049063|gb|EFG88493.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 422 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 69/176 (39%), Gaps = 18/176 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ IDN++ R F+D S + + +N G+T + Sbjct: 136 FSAALNLIDNMDKNNRFSMYKFDDTAEKIIPMSQVTKQSREEVSGKLKDMQNPKGNTNMR 195 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIAICNKAKSQGIR 399 DA++ AY+ I SS + + M +++L+DG +T D +++ K + I Sbjct: 196 DALEKAYEEIKSSETKDKNAM---------VIMLSDGGDTYDLSKKFDETLKPFKEKNIS 246 Query: 400 IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I TI L A S +++ +L +F ++I + +R+ Sbjct: 247 IYTIG-----MSNGNNFSMLKEIAKESGGNYYNVKEIKDLKNVF-NKIYRDRQQRL 296 >gi|91792882|ref|YP_562533.1| von Willebrand factor, type A [Shewanella denitrificans OS217] gi|91714884|gb|ABE54810.1| von Willebrand factor, type A [Shewanella denitrificans OS217] Length = 330 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 51/162 (31%), Gaps = 28/162 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D + + + A TAI +A+ A +E Sbjct: 129 RLGLILFADHAYLQAPLTQDRRSIATFL--ADAQIGLVGKQTAIGEAIALAVKRFDQVSE 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV-------- 407 + +VLLTDG N N E A + + I T+ Sbjct: 187 SN-----------RVLVLLTDGSNNAGNIEPDVAAEIAAKRNVTIYTVGVGAELMERRTI 235 Query: 408 -------NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 ++A+ + +F A ++ +L +I++ Sbjct: 236 FGKERVNPSMDLDEAQLQRLATMTNGYYFRAKNSEDLAQIYQ 277 >gi|85859126|ref|YP_461328.1| von Willebrand factor type A domain-containing protein [Syntrophus aciditrophicus SB] gi|85722217|gb|ABC77160.1| von Willebrand factor type A domain protein [Syntrophus aciditrophicus SB] Length = 447 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 82/486 (16%), Positives = 132/486 (27%), Gaps = 88/486 (18%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 IK+ G II ALL+ V+LG + VDV RW L ++ I + + Sbjct: 5 IKNQKGAVLIIFALLLIVLLGFTALAVDVGRWYTTRSELSKSVDAGAIAGAKNISN---- 60 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + EE NF E A E +++ + Sbjct: 61 ----PYLGEDGHLRLAEEVARENFSAGYLMTPDSGERSATFTAYADE------DHRIRVE 110 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML---- 190 N LF S + K++ V I V+D S SM Sbjct: 111 GTVSSPGNLAGLFGVDWVATSAM------------GVAKKNEVEIMLVLDRSGSMDGTPM 158 Query: 191 --------------DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY-MVS 235 + +D + L F V ++ + + + + Sbjct: 159 NDLKKAARSFVSFFEETQDQDKMGLVSFATSVKVDVPLGNNYVSSMTSKINAMDAVGATN 218 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 SL P + + + VI A + DN++ V Sbjct: 219 AEDSLSQAGNPAKGGLTDQSGVPGNKRVQQFVIFFSDGNPTAFRGKFK-YNGTDNIDAVV 277 Query: 296 ------------------RMGATFFNDRVIS---DPSFSWGVHKLIRTIVKTFAIDENEM 334 R +N R G K ++ Sbjct: 278 CGTGNDCGTVYTKLGKPEREEWLSYNPRFTGDGKPKPPGTGTSKCTTRYGGSYVNTTKWY 337 Query: 335 GSTAINDAMQTAYDTIISSNED----EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + Y ++E V YI T + Sbjct: 338 VLDD--PDYRLTYRGTTYNSESCFIPTVGSSNTTAPLSTYIC-------TTARGMAVEHA 388 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDRIGNEI 449 + K ++I TI R FLS AS SF F A ++ EL IF ++I +I Sbjct: 389 QELKDNNVKIYTIGLG------NIDRDFLSQIASGPSFEFYAPTSGELQAIF-NKIAKDI 441 Query: 450 FERVIR 455 R+++ Sbjct: 442 KLRLVQ 447 >gi|27365660|ref|NP_761188.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] gi|27361808|gb|AAO10715.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] Length = 465 Score = 65.0 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 65/483 (13%), Positives = 147/483 (30%), Gaps = 67/483 (13%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ TG + +L+ ML + DV+R + + A A+ + +I + E Sbjct: 1 MRKQTGGISVFMLVLLMSMLVFAAWVTDVMRIYSVHNQMANATDAAL---ASAIISEVPE 57 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKK-----------------NFTDREVRDIVRD 117 ++ Y+ +L+ N +E I + Sbjct: 58 STAVELLHANLTSGAASPYVEEVRLTHLRDEQEESLQVALDFVPNSLNIAAQESVPIRTN 117 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET--------VSR 169 ++ K+ +L + P++ ++ + + + S Sbjct: 118 AKAGISSNKAEIVFMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYVVSIVPASG 177 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + + I ++R Q S + R+ Sbjct: 178 NVNTGPMEEIYL--------GSFRRYDHAQVKRENRWSDMFDRASGRTPAVPGRQRNAMC 229 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS--IKK 287 N L + +S + I K ++ + + Sbjct: 230 RDLDFEGNNPATLGLRYFRNLEKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPS 289 Query: 288 IDNVNDT---VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + N+ A F + +P + R ++ +T + M Sbjct: 290 TNPSNNYRPFHEDKAIFDDIECHVNPIVPFITE---RRHFESTVQRLVPGMNTNNAEGMV 346 Query: 345 TA-------YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NEEGIAICNKA 393 A + I E+ R ++ + KY+V+ +DG + D +++ IC + Sbjct: 347 WAMRLLSPYWQGIWDKTRPELPRRYSDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQL 406 Query: 394 KS--QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 K +G+++MT+ F ++ + +CAS ++ S + K+F +I E Sbjct: 407 KQPGRGVKVMTVNFGGAASE-----RLMQSCASGPEYYHVASLFSVEKVF-----EQIAE 456 Query: 452 RVI 454 +VI Sbjct: 457 QVI 459 >gi|288925756|ref|ZP_06419687.1| BatA protein [Prevotella buccae D17] gi|315608294|ref|ZP_07883284.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] gi|288337411|gb|EFC75766.1| BatA protein [Prevotella buccae D17] gi|315250075|gb|EFU30074.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] Length = 332 Score = 65.0 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 67/189 (35%), Gaps = 35/189 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI---RTIVKTFAIDEN 332 +A V ++ +G T F + + L+ + + A Sbjct: 111 EAAKDVAAEFISGRPNDN---IGLTIFAGEAFTQCPMTTDHASLLTLLQDVRTDMATRGL 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 TAI + A + K++ + ++LLTDG N + + Sbjct: 168 INDGTAIGMGLANAVSRL-----------KDSKTKSRVVILLTDGANNAGDISPLTAAQM 216 Query: 393 AKSQGIRIMTIAFSVNKTQQ------EKARYF----------LSNCA--SPNSFFEANST 434 AKS GIR+ TIA +K + +Y L A + +F+ AN+T Sbjct: 217 AKSLGIRVYTIAVGTSKVAPYPIEVGGRVQYISRPADIDTKTLREIAAVTEGNFYSANNT 276 Query: 435 HELNKIFRD 443 +L +I+ D Sbjct: 277 AQLKQIYHD 285 >gi|159043014|ref|YP_001531808.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] gi|157910774|gb|ABV92207.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] Length = 320 Score = 65.0 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 60/182 (32%), Gaps = 29/182 (15%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ A+ + + D R+ F F++ + R I Sbjct: 115 VTRLEAVK-AVGADFARRRAGD------RLALVVFGSEAYFASPFTFDTESVARRI--EE 165 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A +T+I+D + A + R+ + + ++LL+DG N Sbjct: 166 ATIGISGRATSISDGLGLA-----------LKRLSTSTATSRVVILLSDGINNAGATNPR 214 Query: 388 AICNKAKSQGIRIMTIAFS------VNKTQQEKAR-YFLSNCA--SPNSFFEANSTHELN 438 + A G+R+ TIA ++ L + S F +T +L Sbjct: 215 GVAELAARYGVRVHTIALGPKDLTTAEVGERGVVDAATLRAISQISGGESFRVRTTEDLV 274 Query: 439 KI 440 + Sbjct: 275 AV 276 >gi|228471029|ref|ZP_04055873.1| BatA protein [Porphyromonas uenonis 60-3] gi|228307249|gb|EEK16272.1| BatA protein [Porphyromonas uenonis 60-3] Length = 326 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 65/186 (34%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V + ++ +G F + + +++ + T + Sbjct: 110 EAARDVASEMIAARPNDN---IGLVVFAGESFTLCPLTVDHDVILQMLDATEIGQLED-- 164 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA +T+ + + K I+LLTDG N + A+ Sbjct: 165 GTAIGLGLATAINTL-----------RGSDNKSKVIILLTDGSNNAGDITPSMAAELAQQ 213 Query: 396 QGIRIMTIAFSVNKTQQEKAR----------------YFLSNCA--SPNSFFEANSTHEL 437 GIRI T+A N + + L + A + ++ A +L Sbjct: 214 YGIRIYTVAAGTNGVAKFPVQTASGIEYVEADVQIDEGTLRHIAQQTGGKYYRATDETKL 273 Query: 438 NKIFRD 443 ++I+++ Sbjct: 274 HEIYKE 279 >gi|327274976|ref|XP_003222250.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2088 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 85/209 (40%), Gaps = 17/209 (8%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y + FV S K+ ++++ +I +KK D +D V+ GA Sbjct: 786 ICSPYDECRRIERLDIVFVIDGSGSIDPKEYDIMKE---FMISLVKKSDVSHDRVQFGAV 842 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 ++ + + K I++ D++ +T A++ + + + S E H Sbjct: 843 KYSAEPETFFYLNRYTTK--SAIIRAIQNDKSIGETTYTAKALRHS-EGLFS----EEHG 895 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 + + + ++++TDG+ + D E + K ++ GI I I ++ + L+ Sbjct: 896 SRKHRGVPQVLIVITDGD-SHDAAELDEVSKKLRANGIIIYAIGI-----ERARPDELLT 949 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEI 449 S + +F N+ L ++ RI +I Sbjct: 950 MAGSEDKYFYVNTFEGLKHLY-PRISEKI 977 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 50/160 (31%), Gaps = 19/160 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + + K D D + +GA F+ R + S + I++ ST Sbjct: 634 DFMSELVNKSDIGLDRMHVGAIQFSSRNKEEFRLS--QYATKSDIIRAIGRMSLMGQSTL 691 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEA-KKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q D +K + KK ++L+TDGE ++ + +G Sbjct: 692 TGGALQFVL--------DYFRPIKGSRPYVKKILILITDGE---AQDDVKTPAEALRQEG 740 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I + ++ + P F L Sbjct: 741 IIVYSVGVF-----NANRTQLVEISGKPEMVFYVEDFDIL 775 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 56/170 (32%), Gaps = 15/170 (8%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 L D VD S + I+K + + + + +D + +R+G + Sbjct: 211 ALEACDADAVADVAFIVDVGSSQTNIEKIQIF------LEKLVSSLDVKDKCMRIGLVTY 264 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 +++ + K ++++ + G + + + + S + + Sbjct: 265 SNKPQATSLLRMATDKT--HVLQSIQSISPKGGKANLGSVIHFTKEKVFSKSAGS----R 318 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + ++ +++T + ++ + G+ I I T Q Sbjct: 319 KSQGVEQIAIVIT---HRSSEDDVSGAATALRRAGVTIFAIGIDAANTTQ 365 >gi|87306401|ref|ZP_01088548.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] gi|87290580|gb|EAQ82467.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] Length = 578 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 84/251 (33%), Gaps = 18/251 (7%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS--LEEV 75 G ++ A+LM VM+G + VDV + L+++ +A + + LI+ + Sbjct: 19 RRGVIVVLAAVLMIVMMGFMALSVDVGYMFTMQSQLQRSVDSAALAGAGTLIEGEDVATG 78 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV-----RDTAVEMNPRKSAYQ 130 + + + +E+ N +N+ K T + DT+ ++ + Sbjct: 79 TVHEYLTHNPVGLQWKEFTEGNTADNVDKFLTKYGDGLQLTIGEWNDTSGQVVAAEKNPT 138 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 +S R P F +G S+ I ++ A SR I V+D S SM Sbjct: 139 -TVSVRMTYENMPF-FFGHLLGRDSFDITAESIATYQSRD--------IMLVLDLSGSMN 188 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 D + L F + + Y+ + + Y V + Sbjct: 189 DDSEFNSIGKL-GFDHIYSNSQQMYADLGSPIFGNLQFDPQYAVVNGPTPQSSGQAKSSV 247 Query: 251 PSLSEEHFVDS 261 V S Sbjct: 248 TYRGNSVVVKS 258 >gi|13476808|ref|NP_108377.1| hypothetical protein mll8241 [Mesorhizobium loti MAFF303099] gi|14027569|dbj|BAB53838.1| mll8241 [Mesorhizobium loti MAFF303099] Length = 678 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 37/278 (13%), Positives = 90/278 (32%), Gaps = 26/278 (9%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + G++ ++T + M ++G + VD + + + A A + L + Sbjct: 23 RQFRRDRRGNYALMTVVAMVPLMGGLAIAVDFTEMNREKQMVTNALDAANFATARRLTEG 82 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE-VRDIVRDTAVEMNPRKSAYQ 130 + +++ Y + F N D + + T Sbjct: 83 ATD-------------DQLKAYALDFFN----ANLNDIDPASATLNVTLPSNTSGGGLLT 125 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 + Y P F + +G + K +V+ ++ + + V+D S SM Sbjct: 126 MTAQLAYKPYFYP--AFAQLVGKSATDANQKINF-SVTSQVRLKNTLEVALVLDNSGSMT 182 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 S + ++ + + V + + Q + D+ + +V S+ GP + Sbjct: 183 TLGTGSGQKRIDLLKTASKQLVDTLAQQAVMIKQVDKPVQFGLVPFAASV----NVGPAN 238 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + S S + + + +A + I Sbjct: 239 GNASWMDTEGLSPVSNENFDWSTL-NAANKYAQQTNGI 275 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 28/76 (36%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 T +K G T + + M + + S R + K +++LTDG NT Sbjct: 472 ATSIKAAIDLMQPNGGTNVPEGMAWGWRVVSSGEPFTQGRRETEKGNDKVVIVLTDGANT 531 Query: 381 QDNEEGIAICNKAKSQ 396 + + A S+ Sbjct: 532 YYTPSSLGYSDPANSK 547 >gi|153842534|ref|ZP_01993517.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] gi|149745366|gb|EDM56617.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] Length = 223 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 63/190 (33%), Gaps = 35/190 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ + +K D R+G F D + +I+ I +T Sbjct: 12 YIDRLSAVKKVLSDFVAK-RKGD------RLGVVLFGDHAYLQTPLTADRQTVIQQIKQT 64 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI D + T + S ++ ++LL+DG NT + Sbjct: 65 VIGLVGQR--TAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP 111 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEA 431 I AK I T+ + + L+ A + +F A Sbjct: 112 IEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRA 171 Query: 432 NSTHELNKIF 441 +L KI+ Sbjct: 172 RDAEQLEKIY 181 >gi|305665951|ref|YP_003862238.1| BatA protein [Maribacter sp. HTCC2170] gi|88710726|gb|EAR02958.1| batA protein [Maribacter sp. HTCC2170] Length = 332 Score = 64.6 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 66/194 (34%), Gaps = 40/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ A IR + R+G + + + ++ ++ + Sbjct: 112 RLSALKEVAADFIRQ----RPND---RIGLVAYAGEAFTKTPITSDKSIVLNSLREITYG 164 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N+ TAI + T+ + + K + K I+LLTDG N E Sbjct: 165 QLND--GTAIGMGLATSVNRL-----------KESKAISKIIILLTDGVNNSGFIEPQTA 211 Query: 390 CNKAKSQGIRIMT------------IAFSVNKTQQEKARY------FLSNCA--SPNSFF 429 + A GI+ T IA++ + + + R L A + +F Sbjct: 212 ADLAVEYGIKSYTIGLGTNGNALSPIAYNADGSYRYGMRQVEIDEKLLEGIAETTGGKYF 271 Query: 430 EANSTHELNKIFRD 443 A +L I+ + Sbjct: 272 RATDNEKLEAIYDE 285 >gi|226326038|ref|ZP_03801556.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] gi|225205580|gb|EEG87934.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] Length = 275 Score = 64.6 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 48/262 (18%), Positives = 85/262 (32%), Gaps = 22/262 (8%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M + + T S + R + + S L Sbjct: 1 MHFRSSLVPYKTCTTDALDKEETYYYVLSTDAATMYRVKYSDGNWYYIDDSK-EGLGEKL 59 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 D + + ++ R ++ A + + + ++ + FN Sbjct: 60 TDNKERLYYTTNDANDRFYY-----LKQAATNFTTQLAQSSPNSE---IALVTFNKTATE 111 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F V K I +T G T N+ + AY + + N K Sbjct: 112 QFDFK-NVGKDSAYITETI-NAMETSGGTHQNEGLDRAYKILNNDQ--------NTSNLK 161 Query: 369 KYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCASP 425 +Y+VLLTDG + I NK KS +++T+ +++ T + A+ +L A Sbjct: 162 RYVVLLTDGCPNGVTYDQITTSINKIKSTNTKLITVGVGLDETNTGLKAAKDYLQANADD 221 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 N + AN LN IF +G Sbjct: 222 NMAYNANDASHLNTIFTQILGQ 243 >gi|209884898|ref|YP_002288755.1| hypothetical protein OCAR_5764 [Oligotropha carboxidovorans OM5] gi|209873094|gb|ACI92890.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 600 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/149 (21%), Positives = 60/149 (40%), Gaps = 19/149 (12%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNED-EVHRMKNNLEAKKYIVLLTDGENTQD 382 + + N G+T + + T+ ++N+ + N + YIV+L+DG NTQ+ Sbjct: 454 LNSKVNAMNPSGNTNQAIGLFWGWQTLNTANDPFKAPSKDPNWVYQDYIVILSDGLNTQN 513 Query: 383 NEEG--------------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS- 427 +C+ K+ I I TI ++N E L +CAS S Sbjct: 514 RWYTCPNAGPCPTIDGREKTLCDNIKADKITIFTIQVNINSKDPES--QVLKDCASSGSG 571 Query: 428 FFE-ANSTHELNKIFRDRIGNEIFERVIR 455 +F+ S ++ F + + R+ + Sbjct: 572 YFQLITSANDTATAFDNVLNKIAKLRIAQ 600 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 77/239 (32%), Gaps = 41/239 (17%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 +F + K G+ II L+ ++ + G VD R S A++ A +A + Sbjct: 7 KRFASLANGFRKDARGNVAIIFTLVAIPLVALVGAAVDYTRVSSARTAMQSALDSAAL-- 64 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + + R + N+L N T + Sbjct: 65 -------MISKDAATMSDSEITTR------ARQYVNSLYTN------------TETPIQT 99 Query: 125 RKSAYQVVLSSRYDLLLNP----LSLFLRSMG--IKSWLIQTKAEAETVSRSYHKEHGVS 178 + Y S +LLN + F++ +G + I T + + S + Sbjct: 100 FSAVYTPNNGSGATILLNAGGNMPTYFMKIVGTNFSTLPINTASTTKWGSSR------MR 153 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + V+D + SM + + + +K S+ N G + P+ N Sbjct: 154 VALVLDNTGSMDQNGKMTALKKAAA--NATTGLIKKLSAFNTNEGDVYISVVPFAKDVN 210 >gi|284166763|ref|YP_003405042.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284016418|gb|ADB62369.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 853 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/293 (13%), Positives = 96/293 (32%), Gaps = 30/293 (10%) Query: 169 RSYHKEHGVSIQWVIDFSRSMLD--------YQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 V + +V+D S SM Y+ D E + + Sbjct: 544 EYSDTRPPVDVTFVLDRSGSMGPHNPTSWSAYEPDYEIDIGEEWEPIPTDEPFRNTHDWK 603 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 + +RD+ + + ++ + + ++ S + + + Sbjct: 604 SIQVRDDDGTIRTLEHRDFVHPDDWTEIRVHPYHQFGYIPGSIGIYPHPGNDPTNQRVEA 663 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 I ++D D R+G F + S + ++V T G T + Sbjct: 664 TRNVIDELDPSAD--RVGVYDFASSGRALHPLSDDLESAKESVVGT------AYGGTNMA 715 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE--NTQDNEEGIAICNKAKSQGI 398 ++ A + + D+ ++ ++LL+DG+ NT ++E + +++ Sbjct: 716 AGLEAALNDYATRGTDDR---------ERIVILLSDGKNSNTANDERMDELADRSDDLDY 766 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEI 449 + T+ + L A+ ++++ EL +F + + EI Sbjct: 767 TLHTVGLDALEHD-SIPEDKLEGWATETGGNYYQTADPDELLDLFEEIVDEEI 818 >gi|149188995|ref|ZP_01867284.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] gi|148837181|gb|EDL54129.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] Length = 266 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 77/209 (36%), Gaps = 26/209 (12%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 KS M+ L S+ + FVD + +++ V+ L S ++ + D R+ Sbjct: 25 KSGREMMVAVDLSGSMEAKDFVDQQGIN--VRRIDGVKLLLESFLQQ-RTGD------RV 75 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F D + L + + TA+ DA+ A + Sbjct: 76 GLIAFGDDAYLQAPVTEDFDTLSLLLEQMDV--RMAGAGTALGDAIGVAVNHF------- 126 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKA 415 H NN K ++LLTDG++T + + A +G+ I IA N ++ Sbjct: 127 EHSESNN----KVLLLLTDGKDTTSQFPPVDAAHFAGERGVTIYPIAIGDATNVGEEAID 182 Query: 416 RYFLSNCAS--PNSFFEANSTHELNKIFR 442 L+ AS FEA L +++ Sbjct: 183 LEMLARIASYTGGQVFEALDGDALAAVYQ 211 >gi|323499301|ref|ZP_08104278.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] gi|323315689|gb|EGA68723.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] Length = 322 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 60/197 (30%), Gaps = 35/197 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + V+ ++ I D R+G F D + + Sbjct: 101 DMSDGSDYVDRLTAVKKVVSDF-----AIKREGD--RLGVVLFADHAYLQTPLTLDRTTV 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + TAI + + A T I S ++ ++LL+DG N Sbjct: 154 ADQVNQLVLRLIGDK--TAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 200 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--S 424 T + I AK I TI + Q+ L A + Sbjct: 201 TSGVIDPIEAAKIAKKYDATIYTIGVGAGEMMVKEFFMTRKVNTAQDLDEKALMQIAQIT 260 Query: 425 PNSFFEANSTHELNKIF 441 +F A EL I+ Sbjct: 261 GGQYFRARDAKELATIY 277 >gi|157961563|ref|YP_001501597.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157846563|gb|ABV87062.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 328 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 70/199 (35%), Gaps = 24/199 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S E ++++ V+ L S + D+ R+G F + Sbjct: 98 AVDLSGSMEARDFVDPQGEILRRVDGVKALLQSFLLK-------RDSDRIGLIAFGENAY 150 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ L + + + TAI DA+ A + S Sbjct: 151 LQAPFTQDKQILSQLLQQMDV--RMAGAGTAIGDAIGVAVNHFEQS-----------EVE 197 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 K ++LLTDG +T + + A QG+ I IA K E + L A Sbjct: 198 NKVLLLLTDGNDTSSEFPPLDAAHYAGEQGVVIYPIAIGDPKNVGEDSLDIATLERIADL 257 Query: 424 SPNSFFEANSTHELNKIFR 442 + FEA+ L ++++ Sbjct: 258 TQGRVFEADDGQSLIEVYK 276 >gi|323495646|ref|ZP_08100717.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] gi|323319281|gb|EGA72221.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] Length = 419 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 71/464 (15%), Positives = 149/464 (32%), Gaps = 77/464 (16%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K ++ +GH I+ A+++P + G+ + D R + ++ A++ A++ + Sbjct: 2 KALRKQSGHAAILFAMIIPGLFGLFTLASDGARAIQTKARIEDASEIAVLAIAAH--NDD 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA-----VEMNPRKS 127 + S + + ++ +YL + N + + PR Sbjct: 60 NKNSQGSGSGSAVNRKIATDYLEAYLHDVDSVNNLKIHKYNCDQIPECVAGLARGEPRFF 119 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y+V +SR+ +S F I + A+ +R Y E V I +V D+S Sbjct: 120 QYEVEATSRH------VSWFPGDSSIPGFGKTFDAKGAATARKYQSE-AVDILFVADYSG 172 Query: 188 SMLDYQRDSEGQP-------LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 SM + + + ++ N VG+ N+S Sbjct: 173 SMAGGWNGGSNRKYIDLRNIIKVVTDELQKFNDLNNTDNNTVGMTGFNYYTKTKPTNRSN 232 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + + + + + +++ + K+ V + +S R I DN + Sbjct: 233 SCFMTQLVYNNNYNINYTKTVNNIFNEKNNKYCVSHSDSSRFRDIDLTDNYSSF------ 286 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 T TA + Sbjct: 287 ------------------------NTTVNGFYPNHGTASYQGIMRG-----------AQM 311 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIA-------ICNKAK--------SQGIRIMTIAF 405 +K ++ +++L+DG+++ +++ I +C K K S G I Sbjct: 312 LKKGTNPRRLLIVLSDGDDSGTSQKNIHKQLVNAGMCTKIKQELSTGISSSGQSIKARLA 371 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 V L +CA + F+A +T ++ + I EI Sbjct: 372 VVGFDYNVNNNTALRDCAGAENVFKAQNTDDILNKILELITEEI 415 >gi|225621320|ref|YP_002722578.1| von Willebrand factor type A (vWA) domain-containing protein [Brachyspira hyodysenteriae WA1] gi|225216140|gb|ACN84874.1| von Willebrand factor type A (vWA) domain containing protein [Brachyspira hyodysenteriae WA1] Length = 289 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 62/194 (31%), Gaps = 41/194 (21%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + I+ + ++ F R ++ L I K Sbjct: 67 TRLEASKKTMIDFIKK-------RNFDKISLVSFALRASVLSPATFDYTSLEEEIKK--- 116 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 I+ +E GST+I + TA D + S ED K I+LLTDGEN + Sbjct: 117 IEIDEEGSTSIGLGIATAVDMLRSVKEDNE----------KIIILLTDGENNSGEIDPKL 166 Query: 389 ICNKAKSQGIRIMTIAFS--------VNKTQQEKARYFLS-------------NCASPNS 427 A + I+I TI V + + + Sbjct: 167 ASEIASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIRADFTLNEESLIDIAATTGGK 226 Query: 428 FFEANSTHELNKIF 441 +F A + L+ ++ Sbjct: 227 YFNAKNASALDNVY 240 >gi|148975506|ref|ZP_01812377.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] gi|145964934|gb|EDK30185.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] Length = 357 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 50/203 (24%), Positives = 84/203 (41%), Gaps = 15/203 (7%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ LA ++ +K D R+G F D Sbjct: 108 DLSGSMAEQDFTSKQGKKISRLQATKEVLADFAKT-RKGD------RLGLILFGDAAFVQ 160 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS-NEDEVHRMKNNLEAK 368 F+ + + +T ST + DA+ A S + + +N++E + Sbjct: 161 TPFTADQDVWLELLNQTDVA--MAGQSTHLGDAIGLAIKVFEQSGKQMSAEQAQNDIERE 218 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCASP- 425 K +++LTDG +T E I AK++G+RI IA +T E A + AS Sbjct: 219 KVVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVALDMETIKRIASES 278 Query: 426 -NSFFEANSTHELNKIFRDRIGN 447 FEA + EL+ + ++IG Sbjct: 279 GGEAFEALNRDELSTAY-EQIGQ 300 >gi|260434111|ref|ZP_05788082.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417939|gb|EEX11198.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 600 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 61/397 (15%), Positives = 123/397 (30%), Gaps = 55/397 (13%) Query: 12 KKLIKS------CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 K+ ++ +G I+T L+ ++ G VDV+R+ L+ A A++ A+ Sbjct: 17 KRGLQDDSFATSESGAMTILTLFLIMIVFVASGFAVDVMRYDRERAKLQYALDRAVLAAA 76 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + PK + +YL + + E AV R Sbjct: 77 DLDQELC-------------PKDVVIDYLKKEGLDKYLTGDPKVEPDVCGSTAAVLKGYR 123 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + D+ + F++ GI++ + A +V+ +S V+D Sbjct: 124 R------VEANADMDI--EMHFMKWRGIETI----ASAATSVAEESIGNVEIS--LVLDV 169 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SM + ++ + N F + V ++ P + NK Sbjct: 170 SGSMRGSKLENLKKAANLFIDDMFAKTEDGKVSISIVPYSEQVSIPDYL-MNKLNTQGTN 228 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI---KKIDNVNDTVRMGATFF 302 + F + + + + D + G Sbjct: 229 SIANCVDFASADFATTRFTAFDVTDPVTGIVTPGTTLARTIHHDIGDGSDRRPYNGFVSS 288 Query: 303 NDRVISDPSFSWGVHKLIRTIV--KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + + + L + V K N G T+I+ + + S + + Sbjct: 289 TICRPNTSTNHREITILQKDPVALKKEINLLNASGWTSIDVGAKWGVTLLDDSFQPLTKK 348 Query: 361 M----KNNLEAK------------KYIVLLTDGENTQ 381 + K K K ++L+TDGENT+ Sbjct: 349 LVTESKVPSIFKDRPDQNKGYDTMKVMILMTDGENTK 385 Score = 53.4 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 7/72 (9%) Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIG 446 +IC KAK + I I +IAF + + L C S + +++EA + IG Sbjct: 535 SICQKAKDEKIIIFSIAFDA----PDGVKPLLKGCVSDDGAYYEAKDNDKDIISVFSSIG 590 Query: 447 NEIFERVIRITK 458 + I + +R+T+ Sbjct: 591 STI--QNLRLTQ 600 >gi|308270599|emb|CBX27211.1| hypothetical protein N47_A12400 [uncultured Desulfobacterium sp.] Length = 330 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 72/209 (34%), Gaps = 40/209 (19%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + +V+ +A I+ +T R+G F + + L+ + K Sbjct: 110 VTRLTVVKKVVADFIKE-------RETDRIGLVVFGQEAFTQSPLTMDKGLLLSLVDKME 162 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 STAI +A+ A R+K+ K +++LTDG + + Sbjct: 163 I--GMAGDSTAIGNAIAVA-----------GKRLKDLKAKSKIMIILTDGRSNTGDITPE 209 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------------LSNCAS--PNSFF 429 A + GI+I TI K F L A+ +F Sbjct: 210 EAAGAAAALGIKIYTIGVGGTGPAPFKVNTFFGPRIVNQSVDLDEKTLKEIAAIGKGKYF 269 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRITK 458 A + EL I+ I N+ + +++ + Sbjct: 270 RATDSKELANIYE--IINKAEKTEVKVKE 296 >gi|281420095|ref|ZP_06251094.1| BatA protein [Prevotella copri DSM 18205] gi|281405895|gb|EFB36575.1| BatA protein [Prevotella copri DSM 18205] Length = 332 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 43/190 (22%), Positives = 68/190 (35%), Gaps = 37/190 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + L+R + T D G Sbjct: 111 EAAKDVATEFISGRPNDN---IGLTIFAGEAFTQCPMTTDHASLLRLLQATR-TDIAARG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TA+ + A + K++ K ++LLTDG N + Sbjct: 167 LIDDGTAVGMGLANAVSRL-----------KDSKSKSKVVILLTDGSNNMGEISPMTAAE 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQ------EKARYF----------LSNCA--SPNSFFEANS 433 AKS GIR+ TI NK +Y LS+ A + +F+ A + Sbjct: 216 IAKSYGIRVYTIGVGTNKVAPYPMPVAGGVQYVNIPVEIDTKTLSDIAQTTDGNFYRATN 275 Query: 434 THELNKIFRD 443 +EL KI+RD Sbjct: 276 NNELKKIYRD 285 >gi|307941972|ref|ZP_07657325.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307945282|ref|ZP_07660618.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307771155|gb|EFO30380.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307774878|gb|EFO34086.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 412 Score = 64.2 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 62/434 (14%), Positives = 140/434 (32%), Gaps = 98/434 (22%) Query: 38 GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRN 97 + +D+ + A + A + + + +K E R Sbjct: 36 SVGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVAD------GMSKNQARKRAETDARK 89 Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 F K+ + ++ V+ + V +S +++ S +MG + Sbjct: 90 FLTARTKSLDGTTEKFSIKINIVDREAKVVKANVNISGKHE------SYMTHAMGFDNID 143 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 +E+ + +++D S SM + + Q + Sbjct: 144 YTADSESTISFG----QGKYEFIFLVDVSPSMGIGASNRDRQIMQ--------------- 184 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 +G + P+ S +++ + +V+DA Sbjct: 185 --RAIGCQFACHEPWYSSVSRAKSAGARL-----------------------RIDVVKDA 219 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGS 336 L S++ +++ V+ +R G F++ + + G+ K R K E G Sbjct: 220 LKSLVTQLEEATEVD--LRTGLYSFSNYLHIQTGLNKGISKFKREANKIAIHREYLRGGG 277 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-------- 388 T + + S + +K + K++I++++DG N + G Sbjct: 278 TNFHG--------VFSDFNGVLRSLKPKADVKQHIIIISDGVNHLNLRSGTNRHLWNQTP 329 Query: 389 ------------ICNKAKSQGIRIMTIAFSVNKTQQEKARYF------LSNCASP-NSFF 429 C++ K + T+ + + ++A Y + CA+ + F+ Sbjct: 330 NWRPYNYSFNPRWCDEFKKG--EVRTVHTML--VEPDRAHYVRASTSSMRACATSADFFY 385 Query: 430 EANSTHELNKIFRD 443 ANS E++K F+D Sbjct: 386 SANSAAEIDKAFKD 399 >gi|152993961|ref|YP_001359682.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425822|dbj|BAF73325.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 325 Score = 64.2 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 71/220 (32%), Gaps = 26/220 (11%) Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 L V+ + S +L L S+ + FV+ I + V+ L ++ K Sbjct: 80 NLIGEPVTKDVSQRELLISVDLSGSMMTKDFVNKEGKA--IDRLEAVKMVLRDFLKERKG 137 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 ++G F + F+ + L + TA+ D++ A Sbjct: 138 E-------KIGLILFGNAAFVQAPFTQDLDALEHLLDSLRVGMAGP--QTAMGDSIGLAV 188 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 SN + +++++DG++T A G+ + TI Sbjct: 189 KMFRESN-----------VTDRMLIVMSDGDDTGSKVPPKTSAELAAKNGVNVFTIGIGD 237 Query: 408 N--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L A + F+ A + +L I++ Sbjct: 238 PKNAGEHPIDTDTLKEIAAITGGKFYYAWNLDDLQDIYKQ 277 >gi|329963581|ref|ZP_08301060.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528570|gb|EGF55541.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 327 Score = 64.2 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 64/186 (34%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H ++ ++K E G Sbjct: 111 EAAKDVAAEFINGRPNDN---IGITLFAGESFTQCPLTVD-HAVLLNLLKDMKCGLIEDG 166 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + A + K++ K I+LLTDG N + + + AKS Sbjct: 167 -TAIGMGIANAVTRL-----------KDSKAKSKVIILLTDGVNNKGDISPLTAAEIAKS 214 Query: 396 QGIRIMTIAF----SVNKTQQEKA------------RYFLSNCA--SPNSFFEANSTHEL 437 GIRI TI L+ A + ++F A S +L Sbjct: 215 FGIRIYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKL 274 Query: 438 NKIFRD 443 +++ + Sbjct: 275 KEVYEE 280 >gi|320156062|ref|YP_004188441.1| hypothetical protein VVM_02402 [Vibrio vulnificus MO6-24/O] gi|319931374|gb|ADV86238.1| hypothetical protein VVMO6_01216 [Vibrio vulnificus MO6-24/O] Length = 465 Score = 64.2 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 65/483 (13%), Positives = 147/483 (30%), Gaps = 67/483 (13%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ TG + +L+ ML + DV+R + + A A+ + +I + E Sbjct: 1 MRKQTGGISVFMLVLLMSMLVFAAWVTDVMRIYSVHNQIANATDAAL---ASAIISEVPE 57 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKK-----------------NFTDREVRDIVRD 117 ++ Y+ +L+ N +E I + Sbjct: 58 STAVELLHANLTSGAASPYVEEVRLTHLRDEQEESLQVALDFVPNSLNIAAQESVPIRTN 117 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET--------VSR 169 ++ K+ +L + P++ ++ + + + S Sbjct: 118 AKAGISSNKAEIVFMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYVVSIVPASG 177 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + + I ++R Q S + R+ Sbjct: 178 NVNTGPMEEIYL--------GSFRRYDHAQVKRENRWSDMFDKASGRTPAVPGRQRNAMC 229 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS--IKK 287 N L + +S + I K ++ + + Sbjct: 230 RDLDFEGNNPATLGLRYFRNLEKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPS 289 Query: 288 IDNVNDT---VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + N+ A F + +P + R ++ +T + M Sbjct: 290 TNPSNNYRPFHEDKAIFDDIECHVNPIVPFITE---RRHFESTVQRLVPGMNTNNAEGMV 346 Query: 345 TA-------YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NEEGIAICNKA 393 A + I E+ R ++ + KY+V+ +DG + D +++ IC + Sbjct: 347 WAMRLLSPYWQGIWDKTRPELPRRYSDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQL 406 Query: 394 KS--QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 K +G+++MT+ F ++ + +CAS ++ S + K+F +I E Sbjct: 407 KQPGRGVKVMTVNFGGAASE-----RLMQSCASGPEYYHVASLFSVEKVF-----EQIAE 456 Query: 452 RVI 454 +VI Sbjct: 457 QVI 459 >gi|284046349|ref|YP_003396689.1| von Willebrand factor A [Conexibacter woesei DSM 14684] gi|283950570|gb|ADB53314.1| von Willebrand factor type A [Conexibacter woesei DSM 14684] Length = 319 Score = 64.2 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 75/196 (38%), Gaps = 35/196 (17%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + A + + + TV +G FN+ S + ++ I + Sbjct: 110 AAKRAARRFVDEVPR------TVNLGVISFNNTATVLQSPTRNRSDVLTAIDRLAV---- 159 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G TA +A+ TA + + + + N IVL++DG +T + I + Sbjct: 160 -SGGTATGEAIATATEMLRNQPGE------NGRRPPSAIVLISDGTSTNG-RDPIEAAAE 211 Query: 393 AKSQGIRIMTIAFSVNKT--------QQEKARYF------LSNCA--SPNSFFEANSTHE 436 A+ I I T+AF ++ E+ L+ A + F A+S Sbjct: 212 ARRLRIPIYTVAFGTDQGTITVPGRDGVERTERVPPDPTALAQIAEMTGGETFTADSADR 271 Query: 437 LNKIFRDRIGNEIFER 452 L+ +F +R+G+++ R Sbjct: 272 LDTVF-ERLGSQLGTR 286 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 64.2 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 55/132 (41%), Gaps = 16/132 (12%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 T V T+ + G+T I +Q + + + + + KKY++++TDG NTQ Sbjct: 382 TSVNTYLSSLSPGGNTNITLGVQFGMEMLSPAEPYTKATAFGDTDVKKYMIIVTDGANTQ 441 Query: 382 DNEEGIA---------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEA 431 + C AK+QGI + + ++ L CAS + +++ Sbjct: 442 NRWSTSNSAINARTALACTAAKAQGITLFVVRV------EDGDSSLLEACASQSSYYYDL 495 Query: 432 NSTHELNKIFRD 443 + +L K +D Sbjct: 496 SQASDLTKTMQD 507 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/290 (13%), Positives = 94/290 (32%), Gaps = 36/290 (12%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K I G+ II L +++G G VD++R+ L+ A A++ A+ + Sbjct: 10 KFIGDRQGNTLIIFGLCAVILVGAAGGAVDMMRYFDTSSRLQDATDAAVLKATQKI---E 66 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ + + + ++ ++ T + + + + P Sbjct: 67 VSEAAAKTAAAMAFEMNLSDHPELQTASHTFAIETSDNAKVVHYTSEITQRP-------- 118 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 FL+ +G+ I+ + A++ S + +V+D + SM Sbjct: 119 -------------YFLQLLGLGEQTIRVASSAQSESD------PFELLFVLDTTGSMASN 159 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + + + + +++ KVG+ + + + L S Sbjct: 160 NKMTYLKTSVSSVLSSLISTYGDGNEDVKVGVVAFNTQVRLPASTSYSFVDYTQCYLRTS 219 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 V+ + R V + + D + S I + G + Sbjct: 220 ------VNYYACRTVWRAYEALCDQVGSTICATATSKAFYRVYTSGGVTY 263 >gi|319784437|ref|YP_004143913.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170325|gb|ADV13863.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 553 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 70/543 (12%), Positives = 151/543 (27%), Gaps = 122/543 (22%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT------------- 59 K +S +G+F ++ L +PV+L DV + L+ A + Sbjct: 5 KFWRSKSGNFTLVLGLGLPVILTAVAFATDVSTLMRAKSNLQNALDSANLASSHLGDLDI 64 Query: 60 -----------AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFE--------- 99 A I L + ++ +F K + NF Sbjct: 65 TRNDAFNRYFQANIVGHGELDNAQATLTVDKGVNFVKTKAVASADVHLNFAFLFGDSKHI 124 Query: 100 ---NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL-----------SSRYDLLLNPLS 145 + ++ EV ++ +T R +A + S + +P+ Sbjct: 125 VVDASAVESNNQLEVVLVLDNTGSMAGARMTALRTATKSLLDTLEAAKSPTRKIRASPV- 183 Query: 146 LFLRSMGI------KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 F+ ++ + SW+ + + + + Sbjct: 184 PFVTAVNVNGDGFDPSWIDMDGKSSTNGVNFPVIDGKRPNHMALFKQLKKTGWADAGWNG 243 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 A + S D PY + P + + + Sbjct: 244 TGWKGCVEARPGAYNISDTPPDPAKPDTLFVPYFAPDD----PEDAQKPSGSYGNSDKYY 299 Query: 260 DSSSLRHVIKKKHLVRDALA-------SVIRSI-KKIDNVNDTVRMGATFF-----NDRV 306 ++S L V K L + + +++ ++ + + + R + Sbjct: 300 NNSYLNDVSDKTRLAQSGINILGIDLSNLLGNLIELLSPDDRDAREKIAKYVAPAKELIT 359 Query: 307 ISDPSFSWGVHKLIRTIV------------KTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + G ++ T V + E T +++ + + + Sbjct: 360 EIGSPVTVGPNRACPTPVVPLTDDFDKLRKAASQMTEWNGSGTNVSEGLSWGMRVLSPAA 419 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGEN----------TQDNEE------------------- 385 K ++LLTDGEN T+ + Sbjct: 420 PYTDGAPWKTPGISKIVLLLTDGENVVYGASEQEPTKSDYTSYGYLAGGRFGSDNQTTAA 479 Query: 386 ------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELN 438 +C + K++G++I T V ++ R S CAS P++++ N +L Sbjct: 480 RNVDGWTKNVCTQLKNEGVQIYT---MVLQSDTAANRALYSACASDPSNYYAVNDPTKLP 536 Query: 439 KIF 441 +F Sbjct: 537 NVF 539 >gi|294673502|ref|YP_003574118.1| BatA protein [Prevotella ruminicola 23] gi|294472594|gb|ADE81983.1| putative BatA protein [Prevotella ruminicola 23] Length = 332 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 77/229 (33%), Gaps = 41/229 (17%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 ++ + +D L+ + + + + + I ++ Sbjct: 76 TRNSWDSKTVEGIDIMLAMDVSTSMLAEDLRPNRIEAAKQVASEFI----IGRPNDN--- 128 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTIIS 352 +G F + + L+ +++ D G TAI + A + Sbjct: 129 IGLAIFAGESFTQCPMTTDHASLLN-LLQNVRTDIAARGLIEDGTAIGMGLANAVSRL-- 185 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K++ K ++LLTDG N + + AKS GIR+ TI NK Sbjct: 186 ---------KDSKAKSKVVILLTDGSNNRGDISPSTAAEIAKSLGIRVYTIGVGTNKVAP 236 Query: 413 ------EKARYF----------LSNCAS--PNSFFEANSTHELNKIFRD 443 +Y LS AS F+ A +T+EL KI+++ Sbjct: 237 YPMPVAGGVQYVNVPVEIDTKTLSEIASITEGDFYRATNTNELRKIYKE 285 >gi|320106407|ref|YP_004181997.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924928|gb|ADV82003.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 305 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 28/211 (13%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 S+ S + ++ + + ++R ++D F+D V Sbjct: 73 ATTPLSMVMAIDTSESVITQFQTERDAAKRFVKQMLREQDEMD---------LISFSDTV 123 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 F+ ++ I D +T++ DA+ A + + D R Sbjct: 124 DEIVPFTNDAGRMNAGIGNLHKGD-----ATSLYDAIYLASQRLTEAKRDATRR------ 172 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI---AFSVNKTQQEKARYFLSNCA 423 K +V++TDG NT A+ G I I + + + L A Sbjct: 173 --KILVIVTDGGNTTKGMRYQQAVEAAERAGAAIYPIIMVPIEADAGRNTGGEHALIQMA 230 Query: 424 SP--NSFFEANSTHELNKIFRDRIGNEIFER 452 +F H+L+K F + +++ + Sbjct: 231 QDTGGKYFYVLDKHDLDKAF-AHLSDDLRTQ 260 >gi|296228122|ref|XP_002759734.1| PREDICTED: collagen alpha-4(VI) chain-like [Callithrix jacchus] Length = 2348 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/269 (14%), Positives = 80/269 (29%), Gaps = 21/269 (7%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + ++ + + S G+ C+ Sbjct: 157 ALRRAGILVYAVGVKDAVWAELREIASSPQENFTSFVPSLSGLSSLAQKLRQELCDTLAK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGA 299 + P+ E D L + + + +D +D VR+G Sbjct: 217 AASRVDHISPACREGAVADVVFLVDSSTSIGPQNFQKVKNFLYSVVSGLDVSSDRVRVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +ND + H L +++ G T +A++ + ++ Sbjct: 277 AQYNDDIYPAFQL--NQHPLKSMVLEQIQNLPYRTGGTNTGNALEFIRTSYLTEGSGSRA 334 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + + ++L+TDGE N+E ++ K GI + + +V Q+ L Sbjct: 335 KDRVPQ----IVILVTDGE---SNDEVQEAADQLKEDGIVVYVVGINVQDVQE------L 381 Query: 420 SNCASP--NSF-FEANSTHELNKIFRDRI 445 AS F F + + L F I Sbjct: 382 QKIASEPFEKFLFNIENFNILQD-FSGSI 409 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 44/304 (14%), Positives = 95/304 (31%), Gaps = 48/304 (15%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWV--IDFSRS-----MLDYQRDSEGQPLNCFGQPADRT 211 ++ + S + GV+I V D S S M Y PL F Q + Sbjct: 748 NFSQDSVSGPASRLRRAGVTIYAVGTQDVSESKDLEKMASYPPWKHSVPLESFLQLSIIG 807 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 K + ++ + + Y + G + ++ +F+ S + Sbjct: 808 SKLTNQLCPEIVDSQVSV--------RGTSYPVQEGCVHIQKADIYFLIDGSGSIYPEAF 859 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 ++ + VI+ + D V+ G ++D + S S V + Sbjct: 860 LDMKVFMNEVIKMFQIGP---DRVQFGVIQYSDEIKSKFVLS------QYPTVAELKVAI 910 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + + A + + D +Y++++TDG+++ + Sbjct: 911 DNIQQGGGGTTTGEALNNMTQVFADTARIN-----VARYLIVITDGKSSDPVAD---AAE 962 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ G+ I I + L A FF ++ + +I + Sbjct: 963 GLRASGVIIYAIGVREANIDE------LKEIAKDKIFF----------VYEFDLLKDIQK 1006 Query: 452 RVIR 455 V+R Sbjct: 1007 EVVR 1010 >gi|212634559|ref|YP_002311084.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556043|gb|ACJ28497.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 328 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 63/198 (31%), Gaps = 24/198 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S E ++ V+ L + + D R+G F D Sbjct: 98 AVDLSGSMEARDFVDLKGDKTRRIDGVKSLLLDFLAQ-RASD------RVGLIAFGDAAY 150 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ L + + TA+ DA+ A + S+ D Sbjct: 151 LQAPFTEDKGALSLLLKEMDV--RMAGAGTALGDAIGVAVNHFSHSDTDN---------- 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKARYFLSNCA-- 423 K ++LLTDG +T + A QGI I IA N + L A Sbjct: 199 -KVLLLLTDGNDTSSEFPPLEAARYAAQQGIVIYPIAIGDPANVGEDSLDIEMLQQIADI 257 Query: 424 SPNSFFEANSTHELNKIF 441 + FEA +++ Sbjct: 258 TYGQVFEAQDGEAFTQVY 275 >gi|302870768|ref|YP_003839404.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] gi|302573627|gb|ADL41418.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] Length = 900 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 44/176 (25%), Positives = 61/176 (34%), Gaps = 29/176 (16%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I K + + A A +I ++ D V G F+ FS + + V Sbjct: 426 ISKLEIAKSASAKMIEHLESSDGV------GVIAFDHNYYWAYEFS---KLVRKKDVIES 476 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TAI + A T+ S IVLLTDG Q E Sbjct: 477 ISSIEVGGGTAIIPPLSEAVKTLKKSKAKSK-----------LIVLLTDGMGEQGGYEIP 525 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIF 441 A N+AK I+I TI LS AS F+ ++ +EL +F Sbjct: 526 A--NEAKRNNIKITTIGVGKFVNLP-----VLSWIASFTSGRFYLVSNPYELVDVF 574 >gi|148976298|ref|ZP_01813022.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] gi|145964392|gb|EDK29647.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] Length = 401 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 63/332 (18%), Positives = 110/332 (33%), Gaps = 36/332 (10%) Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + PL + T + V S H + V+D S SM Sbjct: 97 SLGYSVTPLLPHFNYERYDQNVTATGGGFKGVVESKHSAIPTELVLVLDVSGSMS----- 151 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 Q L A T++S S+ + + P+ S P L + Sbjct: 152 PNIQSLKSILSNALNTIQSQSNNANDLDSVSISIVPF-----DSGVATHRPPWLSEETAG 206 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRS--IKKIDNVNDTVRMGATFFNDRVISDP-SF 312 + +D S R+ L D LA++ +K + S Sbjct: 207 IYCIDGLSYRNGDFSASLTVDNLATLHSERPVKFTPPSKWLS-------DCNQESPMLPL 259 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS-NEDEVHRMKNNLEAKKYI 371 + ++ +I G T + +I S + ++ + E ++ + Sbjct: 260 TNVFSRVQNSINS-----LTANGGTRSYQGLVWGVRQLIPSWQQAWGMKVSSVPETRRKL 314 Query: 372 VLLTDGENTQDNEEGI---AICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPN 426 VL TDG + D + C A K GI + I + V+ ++ + NCA +P Sbjct: 315 VLFTDGADEGDAFNQLVNAGFCTTAIKQYGIEMNFIGYGVSPSRITQ----FENCAGNPL 370 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 F A +T +LN+ F D + E I +T+ Sbjct: 371 RVFSATNTTQLNEYFSDILAVEYSA-SIHLTR 401 >gi|218462279|ref|ZP_03502370.1| hypothetical protein RetlK5_23628 [Rhizobium etli Kim 5] Length = 347 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 46/292 (15%), Positives = 95/292 (32%), Gaps = 19/292 (6%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALLM +LG GM VD L AA A + + I Sbjct: 8 FISDRSGNFGIMTALLMVPLLGTAGMAVDFAHALSLRTQLYAAADAAAVGS----IAEKS 63 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + A + + + RN + D+ ++ K+A ++ Sbjct: 64 GAVAAAMAMNSNGTVSLGKTDARNIFMSQMSGELAEVQVDL------GIDVTKTANKLNS 117 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + + F++ +G S I A AE Y + ++D + SM Sbjct: 118 QVSFTATVP--TTFMQILGRDSITISGTATAE-----YQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + L A + + N + ++ + ++ + + Sbjct: 171 TPDDVSKLEAKAGCAFACHQMDKTINNYTIAKSLGVAMRIDVVRQATQALTDTAKTERVS 230 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 S++ K + ++ + + K+ N D V + + + Sbjct: 231 SDQ--FRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTDAVDLMTIPYQNY 280 >gi|255531385|ref|YP_003091757.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344369|gb|ACU03695.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 332 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 59/166 (35%), Gaps = 31/166 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ + + H ++ + + E G TAI + TA + + Sbjct: 132 RIGLVIFSGESFTQCPLTID-HDVLINLFSDISNGMVEDG-TAIGMGLATAVNRL----- 184 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K++ K ++LLTDG NT + + AK +R+ TI Sbjct: 185 ------KDSEAKSKVVILLTDGSNTTGSIPPLTAAEIAKQMKVRVYTIGVGTKGYAPYPV 238 Query: 416 R----------------YFLSNCA--SPNSFFEANSTHELNKIFRD 443 + LS A + +F A + +L +I++ Sbjct: 239 KTPFGTQYQQVPVTIDEGVLSKIAGITGGKYFRATNNEKLKEIYQQ 284 >gi|163747459|ref|ZP_02154811.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] gi|161379312|gb|EDQ03729.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] Length = 476 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 80/532 (15%), Positives = 158/532 (29%), Gaps = 160/532 (30%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + G + L++ +M+ +GG+ VD++R ++ A A++ A+ Sbjct: 20 RRFAREEDGLVTLFAILMILLMILLGGVGVDLMRHERERARVQAVADRAVLAAA----DL 75 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + +S A F K + +Y+ ++V + + +V Sbjct: 76 DQTLSPEAVARDYFDKSGLADYI-----------------------SSVTVEEGLNYRRV 112 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + DL ++F+ G + + KA AE K V I V+D S SM + Sbjct: 113 TVDASRDLK----TMFIDKFGQEKLHVPAKATAE------EKVAKVEISMVLDISGSMRE 162 Query: 192 YQ-----------------RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + ++ P V ++ + + V Sbjct: 163 NDKMNNLHDASNVFIDTVIQTDTEDLISISVVPYTAQVNVGKDIMDELNVTQLHSYSHCV 222 Query: 235 SCNKSLY--------------------YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 S + Y D + ++ + + ++ Sbjct: 223 DFEDSDFNLTTISQTRSYEHMQHFEAGYYWNGNDRDRTGHYDNISNPGCPKQSYEEIETF 282 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 A++ I +T WGV + +F + Sbjct: 283 SQNAAALKSRIANFQPRANTA------------IHLGLKWGVA----LLDPSFRAINEAI 326 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN--------------- 379 G A+ Y+ I + K ++L+TDG N Sbjct: 327 GGDAVFRGRPAEYNDIDTL---------------KTVILMTDGVNVTTRRIAPEAYSNRD 371 Query: 380 ---------------------------------TQDNEEGIAICNKAKSQGIRIMTIAFS 406 Q + IC+ AK++GI I +I F Sbjct: 372 HYRHWSDYPFYWWLGRNVRSSEHYRWYRTKYTAGQADNLLDNICDAAKAKGIVIWSIGFE 431 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 V + + NCAS +S F E+ F + I +I + +R+T+ Sbjct: 432 VT----DHGAAVMKNCASSDSHFFRVEGVEIVDAF-EAIARQINQ--LRLTQ 476 >gi|317406818|gb|EFV86930.1| von Willebrand factor type A domain-containing protein [Achromobacter xylosoxidans C54] Length = 252 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 30/150 (20%), Positives = 52/150 (34%), Gaps = 17/150 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + ++ +TAI DA+ A + + +E Sbjct: 30 RLGLIVFGTAAYPQAPLT--LDHAALQLLLRHTAVGMAGPNTAIGDAIGLAIRMLDAVDE 87 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K ++LLTDG +T A IRI TI + + + Sbjct: 88 -----------PDKVLILLTDGNDTGSAVPPQRAATLAAQHHIRIHTIGMGDPQARGDDK 136 Query: 416 --RYFLSNC--ASPNSFFEANSTHELNKIF 441 L + A+ FF+AN L +++ Sbjct: 137 VDFDLLEHIAQATGGRFFQANDRESLQQVY 166 >gi|298488105|ref|ZP_07006142.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298157384|gb|EFH98467.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 352 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 50/159 (31%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTALGDAIGLGLKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 193 TS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|289624057|ref|ZP_06457011.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289650363|ref|ZP_06481706.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 2250] gi|330866187|gb|EGH00896.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 352 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 50/159 (31%), Gaps = 27/159 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEAKIGI-AGKNTALGDAIGLGLKRLRLRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + + Sbjct: 193 TS-----------RVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDALQ 241 Query: 416 RYF------------LSNCA--SPNSFFEANSTHELNKI 440 L A S +F A +L KI Sbjct: 242 SVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|145299821|ref|YP_001142662.1| flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|88866595|gb|ABD57363.1| FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|142852593|gb|ABO90914.1| putative flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] Length = 460 Score = 63.8 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 65/444 (14%), Positives = 140/444 (31%), Gaps = 39/444 (8%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G AL++ +L + G+++++VR Y +L AA A++ ++ + E+ + Sbjct: 7 QGGGLSPAFALMLTGILALTGVVIELVR-GYSGQSLLSAAADAVLYSAADSDTAAEDAVA 65 Query: 78 RAKNSFT-------FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + P E R D V D V R S+ + Sbjct: 66 LVQANLAGRPLQVGPPSLSQSEQGARVILQGHVPALMDLSVIGEGGDMPVAAAARASSAR 125 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV---IDFSR 187 + L ++ IK + E +++ VSI ++ Sbjct: 126 TRIEIALVLDVSNSMSGAPMKAIK-QGLTEFGEVLFGRERRNQDRVVSIIPATGLVNIGD 184 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + +S P +R + +++ R + + Sbjct: 185 HPELFHPESLAFPFGLQTLAHERGWSNLLTRDVPGRQRKAFCARLPEHVDGIDRLAELTP 244 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVR-----DALASVIRSIKKIDNVNDTVRMGATFF 302 L + ++ H K ++ L + ++ + R F Sbjct: 245 GWIRKLEQAPVGETQPRLHYSTKPPAIKQYEDGTPLRAFAPRENPLERYLENRRDKLGIF 304 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII-------SSNE 355 +D H R + +T + + + + Sbjct: 305 DDADCGVSPIQ--AHLSTRAEYRQALDTLYAAFNTNTAEGVMWGWRLLSPQWQGRWGQGA 362 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQD-----NEEGIAICNKAKSQGIRIMTIAFSVNKT 410 E+ R + +K +VL +DGE+ + + + +C + K +GI++ T+AF + Sbjct: 363 AELPRPYGQADNRKIMVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAFEGDA- 421 Query: 411 QQEKARYFLSNCASPNSF-FEANS 433 F++ CAS S ++A S Sbjct: 422 ------RFVAQCASDRSHAYKATS 439 >gi|329928736|ref|ZP_08282585.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] gi|328937517|gb|EGG33935.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] Length = 964 Score = 63.8 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 57/173 (32%), Gaps = 24/173 (13%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP-SFSWGVH 317 +D S + + ++A I + + ++G ++ + + Sbjct: 75 IDKSGSMQTDNRINAAKNAAKGFIDLMDMTKH-----QVGIVGYSSVAETSSLPLTTDTA 129 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 K F G T A+ A + S HR EA+ IV++TDG Sbjct: 130 AA-----KQFIDPIVASGGTETGYAIDQAITLLSS------HR----PEAQPVIVIMTDG 174 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA--RYFLSNCASPNSF 428 E ++ + AK GI TIA + + L A+ NS Sbjct: 175 EAN-SSQAALERAQAAKDAGIVFYTIALLGPNDNPDTSAPNELLKQMATTNSH 226 >gi|167034052|ref|YP_001669283.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166860540|gb|ABY98947.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 324 Score = 63.8 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 67/198 (33%), Gaps = 24/198 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S E + + + V+ + I ++ D+ R+G F Sbjct: 95 AIDISQSMEATDYTDANGAKSDRLSAVKSVVRDFIA--RRKDD-----RIGLIVFGTGAY 147 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + ++ +TA+ DA+ + + E E Sbjct: 148 PQAPLT--LDHASLLLLLDEVGIGMAGPNTALGDAIGLTIKALEKTPEQE---------- 195 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 K ++LLTDG +T + A++ GI + TI + + L A Sbjct: 196 -KVLILLTDGNDTSSAITPDHAAHLAQANGIVVHTIGIGDPQATGDAKVDLTTLQAIART 254 Query: 424 SPNSFFEANSTHELNKIF 441 + FF A+ L +++ Sbjct: 255 TGGQFFRADDRQALQQVY 272 >gi|254882023|ref|ZP_05254733.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294776174|ref|ZP_06741663.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|319640969|ref|ZP_07995677.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] gi|254834816|gb|EET15125.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294449997|gb|EFG18508.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|317387414|gb|EFV68285.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] Length = 332 Score = 63.8 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 68/190 (35%), Gaps = 37/190 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H ++ + + D + G Sbjct: 111 EAAKQVASEFINGRPNDN---IGLTIFAGESFTQCPLTVD-HGVLLNLFNSIKGDIAQRG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TAI + A + K++ K I+LLTDG N + + + Sbjct: 167 LIEDGTAIGMGIANAVTRL-----------KDSKAKSKVIILLTDGSNNRGDISPLTAAE 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYF----------------LSNCA--SPNSFFEANS 433 AK GIRI TI N T + + L+ A + ++F A S Sbjct: 216 IAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEIAGTTNGNYFRATS 275 Query: 434 THELNKIFRD 443 +L +++++ Sbjct: 276 NSKLKEVYQE 285 >gi|150005795|ref|YP_001300539.1| hypothetical protein BVU_3288 [Bacteroides vulgatus ATCC 8482] gi|149934219|gb|ABR40917.1| conserved hypothetical protein BatA [Bacteroides vulgatus ATCC 8482] Length = 332 Score = 63.8 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 68/190 (35%), Gaps = 37/190 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H ++ + + D + G Sbjct: 111 EAAKQVASEFINGRPNDN---IGLTIFAGESFTQCPLTVD-HGVLLNLFNSIKGDIAQRG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TAI + A + K++ K I+LLTDG N + + + Sbjct: 167 LIEDGTAIGMGIANAVTRL-----------KDSKAKSKVIILLTDGSNNRGDISPLTAAE 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYF----------------LSNCA--SPNSFFEANS 433 AK GIRI TI N T + + L+ A + ++F A S Sbjct: 216 IAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEIAGTTNGNYFRATS 275 Query: 434 THELNKIFRD 443 +L +++++ Sbjct: 276 NSKLKEVYQE 285 >gi|37676326|ref|NP_936722.1| hypothetical protein VVA0666 [Vibrio vulnificus YJ016] gi|37200868|dbj|BAC96692.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 362 Score = 63.8 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 74/202 (36%), Gaps = 17/202 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S+ I + ++ LA ++ R+G F D Sbjct: 113 DLSGSMAEQDFTSASGANISRLDATKEVLAEFAKT-------RQGDRLGLILFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A S ++ K K Sbjct: 166 TPFTADQKVWLALLNQTDVA--MAGQSTHLGDAIGLAIKVFEQSESNQAASSKPRQ---K 220 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCASP-- 425 ++LTDG +T E I AK++G+RI IA T E A + AS Sbjct: 221 VAIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPSTVGESALDLQTIERIASESG 280 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 F+A + EL + + D IG Sbjct: 281 GKAFQALNRDELARAY-DDIGK 301 >gi|150007595|ref|YP_001302338.1| hypothetical protein BDI_0948 [Parabacteroides distasonis ATCC 8503] gi|255013876|ref|ZP_05286002.1| hypothetical protein B2_08207 [Bacteroides sp. 2_1_7] gi|256839782|ref|ZP_05545291.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|149936019|gb|ABR42716.1| conserved hypothetical protein BatA [Parabacteroides distasonis ATCC 8503] gi|256738712|gb|EEU52037.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 328 Score = 63.8 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 56/192 (29%), Gaps = 38/192 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +D AS I N N +G F+ + + + + Sbjct: 110 RLEAAKDVAASF---INGRPNDN----IGLVVFSAESFTQCPLT--TDHTVLLNLFKDIQ 160 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 TAI + A I K++ K I+LLTDG N + Sbjct: 161 SGMIQDGTAIGLGLANAVSRI-----------KDSHAKSKVIILLTDGSNNAGEIAPVTA 209 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCASP--NSFFEA 431 AK+ G+R+ TI E L AS +F A Sbjct: 210 AEIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVEIDEATLKQIASTTGGQYFRA 269 Query: 432 NSTHELNKIFRD 443 L +I+ + Sbjct: 270 TDNASLKEIYSE 281 >gi|85716351|ref|ZP_01047324.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] gi|85696867|gb|EAQ34752.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] Length = 542 Score = 63.8 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 33/149 (22%), Positives = 57/149 (38%), Gaps = 17/149 (11%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK-NNLEAKKYIVLLTDGENT 380 + +K G+T + + + ++ ++N K + + YIVLL+DG NT Sbjct: 396 STLKNQIDAMTPSGNTNQSIGLAWGWQSLSTTNGPIAAPGKESGYVYQDYIVLLSDGLNT 455 Query: 381 QD--------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 Q+ + +C K K G+ I TI +V L NCAS Sbjct: 456 QNRWYSCPPSGPCPTIDARQALLCQKVKDSGVTIFTIQVNVGSKDP--LSQVLQNCASDG 513 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVIR 455 +F S E F++ + R+ + Sbjct: 514 NFQMITSATETADAFQNILTQISQLRLAK 542 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 70/204 (34%), Gaps = 31/204 (15%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 +G G VD R + +++ A +A++ VS A + T Q+I Sbjct: 1 MGFVGAAVDYTRANAARSSMQAALDSAVLM-----------VSKDAAANPTMTSQQITNA 49 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 + R F + + D+ + + +A +++ S + + + F++ G Sbjct: 50 VQRYFTSL----YNDKSAFGVTVSATYTPSSSSAAAKILASGQGAIQ----TDFMKIAGF 101 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + + + + V+D + SM S ++ + A + Sbjct: 102 P--QLSFGTSSTSTWGNSRMRVA----LVLDNTGSM------SSNGKMSALQRAAKDMID 149 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCN 237 S S+ K G + P+ N Sbjct: 150 SLSAFAKKTGDVYISIIPFSKDVN 173 >gi|237808477|ref|YP_002892917.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] gi|237500738|gb|ACQ93331.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] Length = 316 Score = 63.8 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 73/184 (39%), Gaps = 28/184 (15%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVK 325 + + +V+ L S I+ ++ D R+G F D F+ W L+ V Sbjct: 111 VDRLSMVKSYLQSFIKQ-RQGD------RIGIILFADHAYLMVPFTQDWQAAGLLLDEVN 163 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + TAI +A+ A +H K K ++LL+DG+++ + + Sbjct: 164 IGLAGKF----TAIGEAITLAVKK-------TLHEPKPIQN--KTLILLSDGKDSINTIQ 210 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----LSNCA--SPNSFFEANSTHELNK 439 AK+ G++I TI + T E L A + +F A S +L++ Sbjct: 211 PTDAAALAKASGLKIYTIGIGSDSTDAEAESDLDETTLEEIANMTGGQYFRARSEQDLSE 270 Query: 440 IFRD 443 I++ Sbjct: 271 IYQQ 274 >gi|114707049|ref|ZP_01439948.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] gi|114537599|gb|EAU40724.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] Length = 317 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 67/181 (37%), Gaps = 28/181 (15%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ A+A + + D R+G F + + ++ V + R + + Sbjct: 112 VTRLDAVK-AVAKEFVTSRAGD------RVGLILFAEFAYTAAPLTFDVAAVSRIVDE-- 162 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A STAI + A + R+K + + ++LL+DG +T N Sbjct: 163 ATIGISGRSTAIAGGLGLA-----------LKRLKRSDAQSRVVILLSDGSDTSGNVLPR 211 Query: 388 AICNKAKSQGIRIMTIAFS------VNKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 A+ G+ + TIA +T+ L + A S F +T +L Sbjct: 212 DSARLAEQLGVTVHTIALGPEDMETAPQTRDAVDTATLRDIAELSGGRTFRVRNTDDLRA 271 Query: 440 I 440 + Sbjct: 272 V 272 >gi|262381906|ref|ZP_06075044.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298375541|ref|ZP_06985498.1| BatA protein [Bacteroides sp. 3_1_19] gi|301310439|ref|ZP_07216378.1| BatA protein [Bacteroides sp. 20_3] gi|262297083|gb|EEY85013.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298268041|gb|EFI09697.1| BatA protein [Bacteroides sp. 3_1_19] gi|300832013|gb|EFK62644.1| BatA protein [Bacteroides sp. 20_3] Length = 328 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 56/192 (29%), Gaps = 38/192 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +D AS I N N +G F+ + + + + Sbjct: 110 RLEAAKDVAASF---INGRPNDN----IGLVVFSAESFTQCPLT--TDHTVLLNLFKDIQ 160 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 TAI + A I K++ K I+LLTDG N + Sbjct: 161 SGMIQDGTAIGLGLANAVSRI-----------KDSHAKSKVIILLTDGSNNAGEIAPVTA 209 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCASP--NSFFEA 431 AK+ G+R+ TI E L AS +F A Sbjct: 210 AEIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVEIDEATLKQIASTTGGQYFRA 269 Query: 432 NSTHELNKIFRD 443 L +I+ + Sbjct: 270 TDNASLKEIYSE 281 >gi|196231436|ref|ZP_03130294.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224289|gb|EDY18801.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 341 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 60/188 (31%), Gaps = 35/188 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 D + V + + + R+G F R + LI+ + + + Sbjct: 122 DVVKQVTQKFIEARPND---RIGMIAFAARPYLVSPLTLDHGWLIQNLDRVKLGLVED-- 176 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI A+ + +I + + + +VLLTDG+N + A + Sbjct: 177 GTAIGSAIASCTTRLIERKDSK----------SRIVVLLTDGDNNAGKVSPLTAAEAASA 226 Query: 396 QGIRIMTIAFSVNKTQQ------------------EKARYFLSNCA--SPNSFFEANSTH 435 G+++ TI + L A + F+ A T Sbjct: 227 LGVKVYTIGAGTKGFAPMPVGRDVFGRKVYQNVKVDVDEDTLKKIADMTKAKFYRATDTK 286 Query: 436 ELNKIFRD 443 L +I+ + Sbjct: 287 SLTQIYEE 294 >gi|307548796|dbj|BAJ19118.1| TadG [Aggregatibacter actinomycetemcomitans] gi|307548811|dbj|BAJ19132.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 77/520 (14%), Positives = 148/520 (28%), Gaps = 87/520 (16%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 K K+ +++ G + IITALL +L VD + L QA A + Sbjct: 9 AKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLL 68 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR--------------- 109 Q + +Q I+ ++ + R Sbjct: 69 IAEDNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLYLRS 128 Query: 110 -EVRDIVRDTAVEM----------------NPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 + + V + N +A + + + + +++ Sbjct: 129 DDSNGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKFWLPWGQTLV 188 Query: 153 IKSWL----IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD---YQRDSEGQPLNCFG 205 S L + + + + + V D SRSM RD E P N Sbjct: 189 SSSQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPNRRI 248 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY------------MLYPGPLDPSL 253 V + IRD+ +SPY S + Y + Sbjct: 249 DALREVVSNIQDILLPKAIRDD-ISPYNRIGFVSFAAGARQKDETDNCVLPYYSKQNKQA 307 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 ++ +S + ++ D + I I + N + + Sbjct: 308 EISNYFNSGQISQGFEELSRSMD-IEKTINQITQFKNGEKKSY--PFSLSSLSSRNFCLE 364 Query: 314 WGVHKLIRTI--------VKTFAIDENEMGSTAINDAMQTAYDTIISSNED-EVHRMKNN 364 K V + +G TA+ + + + +N+D E K N Sbjct: 365 NNKGKATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDTNKDPEAAPNKLN 424 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAI------CNKAKSQGIRIMT------------IAFS 406 ++ +++L+DGE+ + ++ + C K K +I + IAF Sbjct: 425 TNTRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKE---KINSLQDSNYPQVEARIAFV 481 Query: 407 VNKTQQEKARYFL-SNCASPNSFFEANSTHELNKIFRDRI 445 + + C ++ NS L F+ I Sbjct: 482 ALGFNPPQDQLIAWKKCVG-KQYYPVNSKQGLLDAFKQII 520 >gi|32474857|ref|NP_867851.1| chloride channel [Rhodopirellula baltica SH 1] gi|32445397|emb|CAD75398.1| conserved hypothetical protein-putative chloride channel [Rhodopirellula baltica SH 1] Length = 900 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 59/174 (33%), Gaps = 29/174 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L +DA + + + D +G F+ + R + Sbjct: 479 KIELAKDAAQAAVELLGPKDA------IGVIAFDGDSYTVSELR---STSDRGAISDAIS 529 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + AM AY+ ++ + H ++L+TDG ++ + +G+A Sbjct: 530 TIEASGGTNMYPAMADAYEALLGATAKLKH-----------VILMTDGVSSPGDFQGVA- 577 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIF 441 + I + T+A ++ L A ++ + + ++F Sbjct: 578 -GDMSASRITLSTVALGQGSSE-----DLLEELAQIGGGRYYFCDDPQSVPQVF 625 >gi|21675084|ref|NP_663149.1| hypothetical protein CT2278 [Chlorobium tepidum TLS] gi|21648324|gb|AAM73491.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 332 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 46/177 (25%), Positives = 70/177 (39%), Gaps = 24/177 (13%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + +++ IDN R+G F+ + + H+++ + +T A + T Sbjct: 122 AGARTAAMRFIDNRPAD-RIGLVVFSGGSFTRCPLTLD-HEVLGRLAETVAPGFFDEPGT 179 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 AI A+ TA + + +S+ E K +VL+TDGEN A + G Sbjct: 180 AIGTAILTATNRLKASSSKE-----------KALVLITDGENNAGEVTPETAARLAANYG 228 Query: 398 IRIMTI-------AF--SVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 IRI T+ AF + N K R L A S F A L K FRD Sbjct: 229 IRIYTVFAGKEARAFENTSNTALNRKGRSELETVARISGGRMFSAGDVFGLMKSFRD 285 >gi|188580137|ref|YP_001923582.1| hypothetical protein Mpop_0869 [Methylobacterium populi BJ001] gi|179343635|gb|ACB79047.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 477 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 80/493 (16%), Positives = 143/493 (29%), Gaps = 81/493 (16%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA---SVPLIQ 70 L G I+ AL + LG+ G+ +D + L AA A++ + I Sbjct: 18 LASDRGGSINIMFALALLPTLGLVGLGIDYGMAITSKTRLDNAADAAVLAGVVTAKEYIA 77 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 S + T + + + N T + V + +++ Sbjct: 78 SNAKQGDATAAGLTAGRNQATKAFAIN---------TGKVPFATVSVSRLDVTRSGQTLT 128 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 + + + F + +G+ S A SY + ++D S SM Sbjct: 129 ATVIYTATIQ----NTFGKILGLSSTTFTNTITASADLASY-----LDFYLMVDVSGSMG 179 Query: 191 DYQRDSEGQPL---------NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 ++ + L NC K +++ GK+ +R + ++ + K Sbjct: 180 LPTAAADAEKLASITKEDQGNCQFACHFPGRKGWNNAAGKIQLRSDAVNNAVCELLKRAA 239 Query: 242 YMLYPGPLDPSLS--EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN--DTVRM 297 + P S L +R A K + + + Sbjct: 240 TPVVPNQYRIGFYPFINRLATLSPLSDTTTSMTALRTAA-----QCDKTWPLAFTNLLDT 294 Query: 298 G---ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI-------NDAMQTAY 347 G N+ S K + + T + +T D MQ + Sbjct: 295 GSTQLFTGNNPTTGTGSGGTHFEKALPQMKATIQPYGDGSSTTNSKPFVFLITDGMQNS- 353 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI----CNKAKSQG--IRIM 401 S+N D + K Y G D + I C + K G I I+ Sbjct: 354 -QSYSTNNDARTFPGSPSLFKGY------GNAGWDGSQPAQIDPSKCKELKDAGAIISIL 406 Query: 402 TIAFSVNKTQQEKARYF----------------LSNCASPNSFFEANSTHELNKIFRDRI 445 I ++ K + L CAS F+ ANS ++ Sbjct: 407 YIPYNQVKNYTNDSYIVWENNRVNGFSPTLADPLRKCASQGFFYTANSADDITASLGAMF 466 Query: 446 GNEIFERVIRITK 458 + RV RIT+ Sbjct: 467 DQAL--RVARITQ 477 >gi|310823567|ref|YP_003955925.1| Bata protein [Stigmatella aurantiaca DW4/3-1] gi|309396639|gb|ADO74098.1| BatA protein [Stigmatella aurantiaca DW4/3-1] Length = 302 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 57/162 (35%), Gaps = 33/162 (20%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + + ++R ++K E G TAI DA+ T+ + + Sbjct: 99 RIGLVVFAGAAYTQAPLTLD-YGVVREVLKQIRTRVLEDG-TAIGDALATSLNRL----- 151 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-EK 414 +++ + +VL+TDG+N + + A+S I I TI + Sbjct: 152 ------RDSEAKSRVVVLITDGDNNAGKISPLDAASMAESLKIPIYTILVGKGGKVPFPQ 205 Query: 415 ARY-----------------FLSNCAS--PNSFFEANSTHEL 437 + L + AS ++ A L Sbjct: 206 GQDLFGNTVWRDTEIPINPELLQDIASRTGGEYYRATDPEGL 247 >gi|163800205|ref|ZP_02194106.1| hypothetical protein 1103602000595_AND4_05979 [Vibrio sp. AND4] gi|159175648|gb|EDP60442.1| hypothetical protein AND4_05979 [Vibrio sp. AND4] Length = 334 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 73/234 (31%), Gaps = 44/234 (18%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH---------VIKKKHLVRDALASVI 282 +++C + ++Y VD S I + V+ L+ + Sbjct: 76 LVIACARPVWYGDPVEFQPKYRDMMLLVDLSGSMQKEDMNDNGEYIDRLTAVKRVLSDFV 135 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ D R+G F D + +++ I + A+ TAI D Sbjct: 136 EK-RQGD------RLGVVLFGDHAYLQTPLTADRRTVMQQINQ--AVIGLVGERTAIGDG 186 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + T + S ++ ++LL+DG NT E + A+ I T Sbjct: 187 IGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLEPLEATEIAQKYNATIYT 235 Query: 403 IAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + + + L+ A + +F A EL I+ Sbjct: 236 VGVGAGEMMVKDFFMTRKVNTASDLDEQTLTKIAEMTGGKYFRARDAKELEAIY 289 >gi|115379116|ref|ZP_01466240.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] gi|115363899|gb|EAU63010.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] Length = 284 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 57/162 (35%), Gaps = 33/162 (20%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + + ++R ++K E G TAI DA+ T+ + + Sbjct: 81 RIGLVVFAGAAYTQAPLTLD-YGVVREVLKQIRTRVLEDG-TAIGDALATSLNRL----- 133 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-EK 414 +++ + +VL+TDG+N + + A+S I I TI + Sbjct: 134 ------RDSEAKSRVVVLITDGDNNAGKISPLDAASMAESLKIPIYTILVGKGGKVPFPQ 187 Query: 415 ARY-----------------FLSNCAS--PNSFFEANSTHEL 437 + L + AS ++ A L Sbjct: 188 GQDLFGNTVWRDTEIPINPELLQDIASRTGGEYYRATDPEGL 229 >gi|320102039|ref|YP_004177630.1| VWFA-like domain-containing protein [Isosphaera pallida ATCC 43644] gi|319749321|gb|ADV61081.1| VWFA-related domain protein [Isosphaera pallida ATCC 43644] Length = 784 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 72/191 (37%), Gaps = 29/191 (15%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D+S + +++A+ + ++ + FN V +G Sbjct: 535 LDTSGSMLQDNRIGALKEAVGVFLGTL---PPGSKV---AVIEFNSFV---NPLVFGPAN 585 Query: 319 LIRTI----VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 I T VK+ G T+ DA+ A + I N ++ ++ L Sbjct: 586 EIFTTRFDDVKSQVNRFRANGGTSYYDAVDRALELIA------------NQTGRRAVLAL 633 Query: 375 TDGENTQDNEEGI-AICNKAKSQGIRIMTIAFS-VNKTQQEKARYFLSNCASPNSFFEAN 432 TDGE+T G+ ++ KA++ G+ + T+ ++ + + + + +F A Sbjct: 634 TDGEDTSSRLAGLDSVILKARNLGLPVHTLGVGREDEIEVGELQRLARE--TRGRYFPAR 691 Query: 433 STHELNKIFRD 443 +L IF + Sbjct: 692 DATKLRVIFAE 702 >gi|283852082|ref|ZP_06369356.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283572472|gb|EFC20458.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 442 Score = 63.1 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 58/487 (11%), Positives = 141/487 (28%), Gaps = 123/487 (25%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 + G+ VD+ R + L+ A A + S+ L + K++ Sbjct: 1 MAAAGVAVDLSRVYVAHNQLQNAVDAAALAGSLQL-----------PDDPDVTNGKVKAA 49 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 + N N T +++ + V + ++ ++ + + +GI Sbjct: 50 VTANLALNDPD------------ATDIQVTSGGATRSVCVDAKANVDMT----LTKVIGI 93 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + + + + + + V+D + SM DS Sbjct: 94 --------GDTTVTAEACAGYNDIELVLVLDSTGSMKGSPIDSAKDAARDLVNLIMPA-- 143 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 S SS K+G+ + + + G + + +++ L+ + Sbjct: 144 STSSTRSKIGLVPFQGKVRIDGSDPVTAERNPDGVGPGCRNADGTLNTGKLKVEYSRTA- 202 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK--TFAIDE 331 + G + F D+ S S + TI+ Sbjct: 203 --TSTNIFYG----------YTLSGVSTFTDKTCSGMSPIRALSSDKNTILNNIEAINAG 250 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE--------NTQDN 383 T I++ ++ + S E + + +K +++LTDG+ N Sbjct: 251 AVTSGTLISEGIKWGRKVL--SPEAPYVEGSTDKKVRKIMIVLTDGDTEDGRCGGNFASA 308 Query: 384 EEGIAI-------------------------------------------------CNKAK 394 + + + AK Sbjct: 309 SKTVNTYWTNAYFGQGLKPDTATSPYATLSTATATLAQIPDCKDGGKLNQFVLDEADAAK 368 Query: 395 SQ---GIRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFFEANSTHELNKIFRDRIG 446 + + I ++ F + + + AS + +++A S+ + +F+ +IG Sbjct: 369 NDLNYPVEIFSVRFGASD---ATDKSLMQKIASSKPGTTDHYYDAPSSTGIQDMFK-KIG 424 Query: 447 NEIFERV 453 ++ +R+ Sbjct: 425 QQLGQRL 431 >gi|189501234|ref|YP_001960704.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496675|gb|ACE05223.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 331 Score = 63.1 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 63/160 (39%), Gaps = 24/160 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R G F + + + H+L+ +V+ ++D TAI A+ + + +S Sbjct: 136 RFGLVLFRGKSFTLCPLTLD-HRLLGMLVRQVSVDAISDKGTAIGSAILVGTNRLRAS-- 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF---------- 405 + ++ ++LLTDGE+ + A+S+GIRI I Sbjct: 193 ---------VSKERVLLLLTDGEHNSGEVGPVTASEIAQSEGIRIYVIGVRNEEEAGSPE 243 Query: 406 SVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 S++ ++ L A + +F A+ + L F + Sbjct: 244 SMDAEREGVDEQVLGTVAGMTGGRYFRASDENSLKDAFGE 283 >gi|212635209|ref|YP_002311734.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556693|gb|ACJ29147.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 360 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 65/180 (36%), Gaps = 20/180 (11%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L R+G F D F+ + +R + ++ Sbjct: 135 YISRLDAVKRVLHEFAEQ-------RQGDRLGLILFGDAAYLQAPFTADLASWLRLLDES 187 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 ST + DA+ A + S + K ++LLTDG +T + Sbjct: 188 RVA--MAGQSTHVGDALGLAIKVMSSDEIKSSQKN-------KVVLLLTDGNDTDSSVPP 238 Query: 387 IAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + A +GIR+ IA +Q + A + F+A ST ELNK+++ Sbjct: 239 LEAAKIAAKKGIRVHVIAIGDPQTVGEQAMDMEVIEGVAALTGGKAFKAISTQELNKVYQ 298 >gi|296481522|gb|DAA23637.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] Length = 940 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 74/217 (34%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P L P FV SS V K +DAL +++ ++ D+ N Sbjct: 277 DGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFN---- 332 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q + Sbjct: 333 --IVGFSNRIKV-----WKDHLVSVTPNSIRDGKVYIHHMSPSGGTDINGALQRGIQLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + +V LTDG+ T + I N + + I T+ + Sbjct: 385 --NDYVAHNDIEDRSVS-LVVFLTDGKPTVGETHTLKILNNTREAARGRVCIFTVGIGAD 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E + F D I Sbjct: 442 VDFKLLEKLSLENCGLTRRVHEDHDARAQLIGFYDEI 478 >gi|108762540|ref|YP_633801.1| BatA protein [Myxococcus xanthus DK 1622] gi|108466420|gb|ABF91605.1| batA protein [Myxococcus xanthus DK 1622] Length = 336 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 58/167 (34%), Gaps = 33/167 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + + +++ +VK E G TAI DA+ T+ + + Sbjct: 133 RIGLVVFAGAAYTQAPLTLD-YGVLKEVVKQLRTRVLEDG-TAIGDALATSLNRL----- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 +++ + +VL+TDG+N + N A++ + I TI Sbjct: 186 ------RDSEAKSRVVVLITDGDNNSGKISPMDSANMAQALKVPIYTILVGKGGKVPFPQ 239 Query: 413 ---------------EKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + + A + ++ A +L + + Sbjct: 240 GTDLFGNTVWRDTEIPINPELMQDIADRTGGEYYRATDPEQLREGLQ 286 >gi|307721534|ref|YP_003892674.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306979627|gb|ADN09662.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 303 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 43/240 (17%), Positives = 76/240 (31%), Gaps = 30/240 (12%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS---------S 263 K S N + + L+ + S + Y +D+S Sbjct: 41 KKTSLINKEKLLYSLILTLLIFSLASPIIYDQKTSSKRKGRDLVFALDTSGSMAESGFNP 100 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 +K +++ L S I + DNV G + F S+ + + + Sbjct: 101 ENVQNRKFDALKELLRSFITK-RYNDNV------GVSIFGTYAYPAIPLSYDMGSVAFLL 153 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 F STAI + + A + +K +K I+L+TDG Sbjct: 154 D--FFDVGIAGDSTAIGEGLAMA-----------LKILKKGEAKEKVIILITDGYQNSGA 200 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 KAK Q ++I TI ++ + FEA + L I+++ Sbjct: 201 VSVKEAVQKAKKQHVKIYTIGIGDRSAFDANLLQLIAK-NTDAKMFEAKNVKMLQDIYKE 259 >gi|303235701|ref|ZP_07322308.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484148|gb|EFL47136.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 322 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 66/185 (35%), Gaps = 29/185 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKT 326 + + +D I N N +G T F + + L+ I Sbjct: 109 RIEVAKDVAKEFIA---SRPNDN----IGLTIFAGEAFTQCPMTTDHASLLNLLAGIRAD 161 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +++ TAI + A + K+ E K ++LLTDG N + Sbjct: 162 LSVNHLIQDGTAIGMGLANAVGRL-----------KDVKEGSKVVILLTDGSNNVGDISP 210 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKA------RYFLSNCA--SPNSFFEANSTHELN 438 + + A+ G+R+ TI + + L + A + F+ A S EL+ Sbjct: 211 LTAASIARKFGVRVYTIGLGTDGKDIQGRPVGEIDYKTLQDIAMQTDGEFYRAQSRAELS 270 Query: 439 KIFRD 443 +I++D Sbjct: 271 QIYKD 275 >gi|325279872|ref|YP_004252414.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311681|gb|ADY32234.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 330 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 51/166 (30%), Gaps = 31/166 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + L+ + + TAI + A + + S Sbjct: 131 KIGLVVFAGESFTQCPLTTDQAVLVNLLRE--VKSGMIQDGTAIGLGLANAVNRLKDSPG 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 K ++LLTDG N Q + AK+ GIR+ TI Sbjct: 189 KS-----------KVVILLTDGINNQGAIAPVTAAELAKAFGIRVYTIGVGTYGEAPYPV 237 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L A + +F A +L +I+ + Sbjct: 238 PTPFGVQLQNMPVEIDEGVLQQIANVTGGKYFRATDNDKLQQIYSE 283 >gi|120437734|ref|YP_863420.1| von Willebrand factor(vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579884|emb|CAL68353.1| membrane protein containing von Willebrand factor(vWA) type A domain [Gramella forsetii KT0803] Length = 335 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 55/169 (32%), Gaps = 33/169 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + ++ + + E G TAI + TA + I Sbjct: 132 RIGLVVYAGESFTKTPITSDKAIVLDALEDIEYNNVLENG-TAIGSGLATAVNRI----- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---- 411 K++ K I+LLTDG N + A GI++ TI N Sbjct: 186 ------KDSDAESKVIILLTDGVNNAGFIDPSTASELAVEFGIKVYTIGVGSNGMALSPV 239 Query: 412 ---QEKAR------------YFLSNC--ASPNSFFEANSTHELNKIFRD 443 R L A+ +F A + +L +I+ + Sbjct: 240 GVNPANGRLRFGNVQVEIDEDLLKEIAAATGGKYFRATNNEKLEEIYAE 288 >gi|332877593|ref|ZP_08445337.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684442|gb|EGJ57295.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 333 Score = 63.1 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 59/188 (31%), Gaps = 36/188 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +AL V K + R+G + + + KLI + Sbjct: 115 EALKKVASQFVKDRPND---RIGLVIYAGESYTKTPVT--TDKLIILNALSEITYGQIED 169 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA + + K + + I+LLTDG N + A Sbjct: 170 GTAIGMGLATAVNRL-----------KESKAKSRVIILLTDGVNNTGFIDPQTAAELAAE 218 Query: 396 QGIRIMTIAFSVNKT------------------QQEKARYFLSNCA--SPNSFFEANSTH 435 GI++ T+ N Q + + A + +F A + Sbjct: 219 YGIKVYTVGIGTNGMALSPYALNADGSIIYRMQQVDIDEPLMKKIAQVTKGRYFRATNNQ 278 Query: 436 ELNKIFRD 443 +L +I+ + Sbjct: 279 KLQQIYDE 286 >gi|256820507|ref|YP_003141786.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256582090|gb|ACU93225.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 333 Score = 63.1 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 60/188 (31%), Gaps = 36/188 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +AL V K + R+G + + + ++ + + + Sbjct: 115 EALKKVASQFVKDRPND---RIGLVIYAGESYTKTPVTTDKGIILNALSELTYGQIED-- 169 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA + + K + + I+LLTDG N + A Sbjct: 170 GTAIGMGLATAVNRL-----------KESKAKSRVIILLTDGVNNTGFIDPQTAAELAAE 218 Query: 396 QGIRIMTIAFSVNKT------------------QQEKARYFLSNCA--SPNSFFEANSTH 435 GIR+ TI N T Q E + A + +F A Sbjct: 219 YGIRVYTIGIGSNGTALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQ 278 Query: 436 ELNKIFRD 443 +L +I+ + Sbjct: 279 KLQQIYDE 286 >gi|10334988|gb|AAD46685.2| TadG [Aggregatibacter actinomycetemcomitans] gi|26000721|gb|AAN75217.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 63.1 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 77/520 (14%), Positives = 148/520 (28%), Gaps = 87/520 (16%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 K K+ +++ G + IITALL +L VD + L QA A + Sbjct: 9 AKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLL 68 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR--------------- 109 Q + +Q I+ ++ + R Sbjct: 69 IAEDNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLYLRS 128 Query: 110 -EVRDIVRDTAVEM----------------NPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 + + V + N +A + + + + +++ Sbjct: 129 DDSNGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKFWLPWGQTLV 188 Query: 153 IKSWL----IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD---YQRDSEGQPLNCFG 205 S L + + + + + V D SRSM RD E P N Sbjct: 189 SSSQLHDGRVGINSGETYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPNRRI 248 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY------------MLYPGPLDPSL 253 V + IRD+ +SPY S + Y + Sbjct: 249 DALREVVSNIQDILLPKAIRDD-ISPYNRIGFVSFAAGARQKDETDNCVLPYYSKQNKQA 307 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 ++ +S + ++ D + I I + N + + Sbjct: 308 EISNYFNSGQISQGFEELSRSMD-IEKTINQITQFKNGEKKSY--PFSLSSLSSRNFCLE 364 Query: 314 WGVHKLIRTI--------VKTFAIDENEMGSTAINDAMQTAYDTIISSNED-EVHRMKNN 364 K V + +G TA+ + + + +N+D E K N Sbjct: 365 NNKGKATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDTNKDPEAAPNKLN 424 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAI------CNKAKSQGIRIMT------------IAFS 406 ++ +++L+DGE+ + ++ + C K K +I + IAF Sbjct: 425 TNTRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKE---KINSLQDSNYPQVEARIAFV 481 Query: 407 VNKTQQEKARYFL-SNCASPNSFFEANSTHELNKIFRDRI 445 + + C ++ NS L F+ I Sbjct: 482 ALGFNPPQDQLIAWKKCVG-KQYYPVNSKQGLLDAFKQII 520 >gi|288802180|ref|ZP_06407620.1| BatA protein [Prevotella melaninogenica D18] gi|288335147|gb|EFC73582.1| BatA protein [Prevotella melaninogenica D18] Length = 318 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 60/154 (38%), Gaps = 18/154 (11%) Query: 297 MGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G T F + + + L+ + + TAI + Sbjct: 129 IGLTIFAGEAFTQCPMTLDHAALLNLLHNVRTDLVTNGLMQDGTAIGLGLA--------- 179 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + V R+K++ K ++LLTDG N + + AK GIR+ TI + Sbjct: 180 --NSVSRLKDSKAKSKVVILLTDGSNNVGSISPMTAATIAKKFGIRVYTIGLGRETGEDI 237 Query: 414 KARYF--LSNCA--SPNSFFEANSTHELNKIFRD 443 A + L + A + F+ A S EL+KI++D Sbjct: 238 GAIDYKTLQDIAVLTNGEFYRAQSQAELSKIYQD 271 >gi|296125842|ref|YP_003633094.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296017658|gb|ADG70895.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 328 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 64/203 (31%), Gaps = 43/203 (21%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + I+ + ++ F R ++ L I Sbjct: 106 TRLEASKKTMIDFIKK-------RNFDKISLVAFALRASVLSPSTFDYTLLEEEIKN--- 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 I +E GST+I + TA D + S D K I+LLTDGEN + Sbjct: 156 IKIDEEGSTSIGLGIATAVDMLRSVKGDNE----------KIIILLTDGENNSGEIDPKL 205 Query: 389 ICNKAKSQGIRIMTIAFS--------VNKTQQEKARYFLSN-----------CASP--NS 427 A + I+I TI V + + AS Sbjct: 206 ASEIASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIRADFSLNEEALIDIASTTGGK 265 Query: 428 FFEANSTHELNKIFR--DRIGNE 448 +F A + L+ ++ DRI + Sbjct: 266 YFNAQNASALDNVYNTIDRIEKK 288 >gi|170746808|ref|YP_001753068.1| hypothetical protein Mrad2831_0362 [Methylobacterium radiotolerans JCM 2831] gi|170653330|gb|ACB22385.1| conserved hypothetical protein; putative vWFA domain protein [Methylobacterium radiotolerans JCM 2831] Length = 437 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 59/475 (12%), Positives = 145/475 (30%), Gaps = 98/475 (20%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALL-MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + ++ +G + AL+ +PVM VD R + + L A A+ Sbjct: 8 LNRALEMVATAFFRARSGQVAVTFALVTLPVMFATA-AAVDYGRRNAAKTQLDAALDGAV 66 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + ++ + + + + V V Sbjct: 67 LAVMSQKTNTIPTTT------LQNMETQFRTEAAK---------VPGVTVTSFTPGAPVN 111 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 ++ + L++ Y + + M I + + + A + Y ++ Sbjct: 112 -----TSKTLSLTASYTATVKTSLASM--MQIPAMPVSGTSSATRNTSQY-----INYYL 159 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 ++D S SM D++ Q + C + + Sbjct: 160 LLDNSPSMGLAATDADVQ----------------------------NMKIATNGCAFACH 191 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIK-KKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + + + +++ +LR+ IK + ++R+A+++++ + +M Sbjct: 192 QHTFDKKGNITGDDQNDNYHIALRNNIKLRIQVLREAVSALVDQANVSMLLPQQFQMEMW 251 Query: 301 FFND-----RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 FND ++ + + I +A T A+ TI +S + Sbjct: 252 TFNDSVTQTKLQAMTPTLNNIKNAAPNIDIAYAYYNQSDNQTDFERAIARMNTTIPASGD 311 Query: 356 DEVHRMKNNLEAKKYIVLLTDG-ENTQDNEE-----------------GIAICNKAKSQG 397 + +++ L+TDG E+T + + C+ K++ Sbjct: 312 GL-----TPDKPIRFLFLVTDGVEDTGGSVTNQSAGFQIQSNRFIGPLSPSTCSALKNKN 366 Query: 398 IRI----------MTIAFSVNKTQQEKAR--YFLSNCASPNSFFEANSTHELNKI 440 ++I F + +++ L CAS +F + ++ Sbjct: 367 VKIGIIYTQYLPIYDNDFYNRYVRPYESQIGPSLQACASDGMYFPVTTNGDITAA 421 >gi|261880541|ref|ZP_06006968.1| BatA protein [Prevotella bergensis DSM 17361] gi|270332764|gb|EFA43550.1| BatA protein [Prevotella bergensis DSM 17361] Length = 332 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 74/205 (36%), Gaps = 39/205 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H + ++ D G Sbjct: 111 EAAKDVASEFVSGRPSDN---IGLTIFAGESFTQCPLTID-HGSLLNLLNNVRTDIAARG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TA+ + A + K + K ++LLTDG N + + Sbjct: 167 LIQDGTAVGMGLANAVSRL-----------KGSKAKSKVVILLTDGSNNMGDISPLTAAQ 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQ------EKARYF----------LSNCA--SPNSFFEANS 433 A+S GIR+ TI NK +Y LS+ A + +F+ A + Sbjct: 216 IARSLGIRVYTIGVGTNKVAPYPMPVAGGVQYVNMPVEIDTKTLSDIAAITEGNFYRATN 275 Query: 434 THELNKIFRDRIGNEIFERVIRITK 458 EL +I+RD +++ + + +TK Sbjct: 276 NRELKQIYRDI--DKLEKTKMNVTK 298 >gi|70730213|ref|YP_259952.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344512|gb|AAY92118.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 332 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 74/198 (37%), Gaps = 24/198 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + + + I + V++ + I+ ++ D+ R+G F Sbjct: 97 AIDLSQSMQTQDFNDANGQRIDRLSAVKEVVQGFIQ--RRKDD-----RLGLIVFGSGAF 149 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + + +++ + +TAI DA+ A + ++E Sbjct: 150 AQAPLT--LDHASLSLLLEDSGIGMAGPNTAIGDAIGLALKLLEQAHE-----------P 196 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 +K ++LLTDG +T A ++G+ I TI + E L A Sbjct: 197 EKVLILLTDGNDTSSAITPQHAAAMAAARGVVIHTIGIGDPSAEGEAKVDLSALEQIART 256 Query: 424 SPNSFFEANSTHELNKIF 441 + +F A L++++ Sbjct: 257 TGGRYFRAEDRSALDQVY 274 >gi|170751925|ref|YP_001758185.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] gi|170658447|gb|ACB27502.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] Length = 568 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 62/204 (30%), Gaps = 65/204 (31%) Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII-------SSNEDEVHRMKNN 364 + + +KT + GST I++ + T+ N Sbjct: 373 LTNNTN-----ALKTLINNMAPSGSTNIHEGFMWGWRTLSPNSVFADGQPYASSANSSNA 427 Query: 365 LEAKKYIVLLTDGENTQDNEE--------------------------------------- 385 K I+L+TDG N+ Sbjct: 428 TNINKIIILMTDGTNSWGTNSSAPTGSLYFAAGYFRNANGTTPNPRLTTAYQNTNIADGN 487 Query: 386 ---------GIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYFLSNCAS-PNSFFEANST 434 C K+ I I TI FSV + + L NCAS P+ F+ ANS+ Sbjct: 488 TARKALDALTAEACANTKAVNISIYTIGFSVPTDPIDSAGQTLLRNCASSPDQFYLANSS 547 Query: 435 HELNKIFRDRIGNEIFERVIRITK 458 +L K F+ + +R+T+ Sbjct: 548 DDLIKAFKSI---QASIGALRLTQ 568 >gi|218708116|ref|YP_002415737.1| hypothetical protein VS_0028 [Vibrio splendidus LGP32] gi|218321135|emb|CAV17085.1| Conserved hypothetical protein, putative exported, TadG [Vibrio splendidus LGP32] Length = 435 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 59/373 (15%), Positives = 124/373 (33%), Gaps = 54/373 (14%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K ++ +GH ++ A+++P + GV + D R + L++A++ A++ Sbjct: 2 KHAMRKQSGHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLA-------- 53 Query: 72 LEEVSSRAKNSFTFPKQKIEEYL--IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 VS++ + ++ I+ YL + + + K E+ + + T R Y Sbjct: 54 ---VSAKDEQDHQLAERYIQHYLYDMDSILDIEVKKLGCDEMPECIAATE-RGEARYFEY 109 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 +V + + P + + G + +R Y + + I +++DFS SM Sbjct: 110 RVAGQTLHKSWF-PGNDVISGFGD-----SFNVTGSSKARRYQSQ-PIDITFIVDFSESM 162 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D LN + + N P+ V+ + Sbjct: 163 NDSWSGGRHSKLNDLKDIIEDVADELGAYND-----LYPEHPHRVALTGFNRRTINKDKN 217 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI---RSIKKIDNVNDTVRMGATFFNDRV 306 D + + V S + +A ++ N +D R +F Sbjct: 218 DNLVVRDQRVVSREGEYDKDDTVNFNKTIAQQFIVKGEASRVPNGDDDARFYDLYFTTDF 277 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 S G TA + A + S ++++ Sbjct: 278 SSFTK---------------KVKKFKAGGGTASLQGIIRAGQIVTSMSKNQ--------- 313 Query: 367 AKKYIVLLTDGEN 379 K+ I++L+DGE+ Sbjct: 314 -KQLIIILSDGED 325 >gi|189461337|ref|ZP_03010122.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] gi|189431866|gb|EDV00851.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] Length = 332 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 68/190 (35%), Gaps = 37/190 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H ++ + + D + G Sbjct: 111 EAAKQVAAEFINGRPNDN---IGLTVFAGEAFTQCPLTVD-HGVLLNLFNSIKGDIAQRG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TAI + A + K++ K I+LLTDG N + + + Sbjct: 167 MIEDGTAIGMGLANAISRL-----------KDSKAKSKVIILLTDGSNNRGDISPLTAAE 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYF----------------LSNCA--SPNSFFEANS 433 AK GIR+ TI N T + + L+ A + ++F A S Sbjct: 216 IAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVNVPVEIDEQTLTQIAGTTNGNYFRATS 275 Query: 434 THELNKIFRD 443 +L +++R+ Sbjct: 276 NSKLEEVYRE 285 >gi|212693197|ref|ZP_03301325.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|237709939|ref|ZP_04540420.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725394|ref|ZP_04555875.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753591|ref|ZP_06088946.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664302|gb|EEB24874.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|229436081|gb|EEO46158.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456032|gb|EEO61753.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235305|gb|EEZ20829.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 332 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 68/190 (35%), Gaps = 37/190 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H ++ + + D + G Sbjct: 111 EAAKQVASEFINGRPNDN---IGLTIFAGESFTQCPLTVD-HGVLLNLFNSIKGDIAQRG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TAI + A + K++ K I+LLTDG N + + + Sbjct: 167 LIEDGTAIGMGIANAVTRL-----------KDSKAKSKVIILLTDGSNNRGDISPLTAAE 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYF----------------LSNCA--SPNSFFEANS 433 AK GIRI TI N T + + L+ A + ++F A S Sbjct: 216 IAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEIAGTTNGNYFRATS 275 Query: 434 THELNKIFRD 443 +L +++++ Sbjct: 276 NSKLKEVYQE 285 >gi|302346571|ref|YP_003814869.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302150280|gb|ADK96541.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 318 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 60/154 (38%), Gaps = 18/154 (11%) Query: 297 MGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G T F + + + L+ + + TAI + Sbjct: 129 IGLTIFAGEAFTQCPMTLDHAALLNLLHNVRTDLVTNGLMQDGTAIGLGLA--------- 179 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + V R+K++ K ++LLTDG N + + AK GIR+ TI + Sbjct: 180 --NSVSRLKDSKAKSKVVILLTDGSNNVGSISPMTAATIAKKFGIRVYTIGLGRETGEDI 237 Query: 414 KARYF--LSNCA--SPNSFFEANSTHELNKIFRD 443 A + L + A + F+ A S EL+KI++D Sbjct: 238 GAIDYKTLQDIAVLTNGEFYRAQSQAELSKIYQD 271 >gi|163801617|ref|ZP_02195515.1| hypothetical protein 1103602000597_AND4_09192 [Vibrio sp. AND4] gi|159174534|gb|EDP59336.1| hypothetical protein AND4_09192 [Vibrio sp. AND4] Length = 367 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 51/201 (25%), Positives = 79/201 (39%), Gaps = 18/201 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + + ++ L+ +++ +K D R+G F D Sbjct: 113 DLSGSMAEQDFTSKAGENISRLNAAKEVLSDFVKT-RKGD------RLGLILFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DAM A E R+ +K Sbjct: 166 TPFTPDQKVWLELLNQTDVA--MAGQSTHLGDAMGLAIKVF----EQSKSRIGVEENKEK 219 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 ++LTDG +T E I AK++G+RI IA +T E A + A S Sbjct: 220 VAIVLTDGNDTGSFVEPIEAAKVAKAKGVRIHVIAMGDPQTLGEAALDMKTIRRIAKESG 279 Query: 426 NSFFEANSTHELNKIFRDRIG 446 FEA + EL K + D IG Sbjct: 280 GKAFEAMNRDELAKAY-DDIG 299 >gi|150375951|ref|YP_001312547.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150030498|gb|ABR62614.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 334 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 62/213 (29%), Gaps = 26/213 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 LD S S + S ++ + V+ +A + R+G F D Sbjct: 99 ALDLSQSMDTRDFSDPQGNLQARVDAVKTVVADFVDR-------RPYDRLGLVAFGDAPY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + + T M + + + + + Sbjct: 152 PLVPFTMDHATVRSML-------------TGALPGMAGPKTALGDALGLSIKLFQQSQAP 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCASP 425 K +V+LTDG +T A IRI T+ Q E+ L A+ Sbjct: 199 DKVLVVLTDGNDTASKMPPDKAAEIASQNHIRIHTVGIGNPDAQGEEKLDTETLQKIATA 258 Query: 426 --NSFFEANSTHELNKIFR--DRIGNEIFERVI 454 +F L +I+ D I + + Sbjct: 259 TGGRYFFGQDQQALAEIYTLLDSITPANQKTLS 291 >gi|293391324|ref|ZP_06635658.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951858|gb|EFE01977.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] Length = 525 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 68/504 (13%), Positives = 144/504 (28%), Gaps = 68/504 (13%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII-- 62 K K+ ++ G + IITALL +L + VD + L QA A + Sbjct: 9 AKLFNSIKQFFQNEHGVYAIITALLAFQLLLLVAFTVDGTGILLDKARLAQATDQAALLL 68 Query: 63 -------------------TASVPLIQ---SLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 S I + + + K + + ++ YL + +N Sbjct: 69 IAEDNKYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSDDKN 128 Query: 101 NLKKNFTDREVRDI------VRDTAVEMNPRKSAYQVVLSSRYD--LLLNPLSLFLRSMG 152 K + + N + V+ L + S Sbjct: 129 GQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSSSR 188 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM---LDYQRDSEGQPLNCFGQPAD 209 + + + + + + V D S SM +D + S ++ Sbjct: 189 LHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPIDKRIPSSSIRIDALRDVVK 248 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 ++ + ++ + + P +++ + S + Sbjct: 249 DIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKTEKNDCVLPYYAQQSKKEEISNLYRNN 308 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI------ 323 K L + + I+ ++ ++ + K Sbjct: 309 KLDQASKLLDQYMDIERTINQIDQF-NGSNISYDFINTTKKCLGKSEGKETTRAWFDKKN 367 Query: 324 --VKTFAIDENEMGSTAINDAMQTAYDTIISSNED-EVHRMKNNLEAKKYIVLLTDGENT 380 V + + G TA+ M + + +N+D E K N ++ +++L+DGE+ Sbjct: 368 LGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTRRILLVLSDGEDN 427 Query: 381 QDNEEGI------AICNKAKSQGIRIMT------------IAFSVNKTQQEKAR-YFLSN 421 + E + +CNK K +I + +AF + + Sbjct: 428 RPTEGTLVKLMSAGLCNKIKR---KIDSLQDTKYPKVEARVAFVALGYNPPQDQVNVWKQ 484 Query: 422 CASPNSFFEANSTHELNKIFRDRI 445 C ++ S L FR I Sbjct: 485 CVG-KQYYTVFSKQGLLDAFRQII 507 >gi|32452632|gb|AAP43994.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 525 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 68/504 (13%), Positives = 144/504 (28%), Gaps = 68/504 (13%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII-- 62 K K+ ++ G + IITALL +L + VD + L QA A + Sbjct: 9 AKLFNSIKQFFQNEHGVYAIITALLAFPLLLLVAFTVDGTGILLDKARLAQATDQAALLL 68 Query: 63 -------------------TASVPLIQ---SLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 S I + + + K + + ++ YL + +N Sbjct: 69 IAEDNKYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSDDKN 128 Query: 101 NLKKNFTDREVRDI------VRDTAVEMNPRKSAYQVVLSSRYD--LLLNPLSLFLRSMG 152 K + + N + V+ L + S Sbjct: 129 GQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSSSR 188 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM---LDYQRDSEGQPLNCFGQPAD 209 + + + + + + V D S SM +D + S ++ Sbjct: 189 LHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPIDKRIPSSSIRIDALRDVVK 248 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 ++ + ++ + + P +++ + S + Sbjct: 249 DIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKTEKNDCVLPYYAQQSKKEEISNLYRNN 308 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI------ 323 K L + + I+ ++ ++ + K Sbjct: 309 KLDQASKLLDQYMDIERTINQIDQF-NGSNISYDFINTTKKCLGKSEGKETTRAWFDKKN 367 Query: 324 --VKTFAIDENEMGSTAINDAMQTAYDTIISSNED-EVHRMKNNLEAKKYIVLLTDGENT 380 V + + G TA+ M + + +N+D E K N ++ +++L+DGE+ Sbjct: 368 LGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTRRILLVLSDGEDN 427 Query: 381 QDNEEGI------AICNKAKSQGIRIMT------------IAFSVNKTQQEKAR-YFLSN 421 + E + +CNK K +I + +AF + + Sbjct: 428 RPTEGTLVKLMSAGLCNKIKR---KIDSLQDTKYPKVEARVAFVALGYNPPQDQVNVWKQ 484 Query: 422 CASPNSFFEANSTHELNKIFRDRI 445 C ++ S L FR I Sbjct: 485 CVG-KQYYTVFSKQGLLDAFRQII 507 >gi|312886236|ref|ZP_07745850.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301261|gb|EFQ78316.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 335 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 56/167 (33%), Gaps = 32/167 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ + + H ++ + E G TAI + TA + + Sbjct: 134 RIGLVIFSGESFTQCPLTID-HDVLINLYHDIKNGMIEDG-TAIGMGLATAVNRL----- 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + + K ++LLTDG N + I AK GIR+ T+ Sbjct: 187 ------RGSEAKSKVVILLTDGVNNAGSIPPITAAEIAKQFGIRVYTVGIGTQGYAPYPV 240 Query: 413 --------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L+ A + +F A + L +I++ Sbjct: 241 PSPYGGVVYQRMEVQIDEPTLTKIAAITGGKYFRATNNDALTRIYKQ 287 >gi|88798929|ref|ZP_01114511.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] gi|88778409|gb|EAR09602.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] Length = 322 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 60/175 (34%), Gaps = 27/175 (15%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F + ++ + + TAI D + A + Sbjct: 130 IGLVVFGSFADVQAPLTPDLNAIQSLLADLR--PGMADSRTAIGDGLALAVRQL------ 181 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + + + +VLL+DGEN A ++ IR+ TI F + + + Sbjct: 182 -----RESTTEDRVVVLLSDGENNSGEIRPDEATAVAAAENIRVYTIGFG-SAGRDSLLQ 235 Query: 417 YF-----------LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 F L A + ++ A S+ EL ++FRD E ++ + + Sbjct: 236 SFGLRSSSLDEQTLREIAEQTQGRYYRATSSAELAEVFRDIERLEPSDQKTDLQR 290 >gi|308050346|ref|YP_003913912.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] gi|307632536|gb|ADN76838.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] Length = 322 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 58/165 (35%), Gaps = 30/165 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D+ ++ + R + + A+ TAI DA+ + Sbjct: 129 RLGLILFADQAYLQAPLTFDRFAVARFLDE--AVLGLVGQQTAIGDAIALGVKRFNDLEQ 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VLLTDGEN + A+ G+++ TI + ++ Sbjct: 187 SS-----------RVLVLLTDGENNAGRFTPAQAVSLARQSGVKLYTIGIGSAEIRRRGL 235 Query: 416 ----------------RYFLSNCASP-NSFFEANSTHELNKIFRD 443 + F+ S +F A ST EL I+++ Sbjct: 236 LGTRTVNPSSDLDQAEKSFIQLSESTGGRYFRARSTEELESIYQE 280 >gi|325297739|ref|YP_004257656.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317292|gb|ADY35183.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 332 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 40/189 (21%), Positives = 66/189 (34%), Gaps = 35/189 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDEN 332 +A V ++ +G T F + + + L +I A Sbjct: 111 EAAKQVAAQFINGRPNDN---IGLTIFAGEAFTQCPLTIDHGVLLNLFGSIKGDIAQRGL 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 TAI M A + + R+K++ K I+LLTDG N + + + Sbjct: 168 IEDGTAI--GMGLA---------NSISRLKDSKAKSKVIILLTDGSNNRGDISPLTAAEI 216 Query: 393 AKSQGIRIMTIAFSVNKTQQ------EKARYF----------LSNCASP--NSFFEANST 434 AK GIR+ TI N T +Y L AS ++F A S Sbjct: 217 AKQFGIRVYTIGVGTNGTAPYPMPTYAGVQYVNVPVEIDEQTLIQIASTTNGNYFRATSN 276 Query: 435 HELNKIFRD 443 +L +++ + Sbjct: 277 SKLKEVYEE 285 >gi|32477945|ref|NP_870939.1| hypothetical protein RB13237 [Rhodopirellula baltica SH 1] gi|32448502|emb|CAD78017.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 388 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 60/443 (13%), Positives = 129/443 (29%), Gaps = 81/443 (18%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K ++S +G ++ +L+PVML V ++VV L+ + A A L + Sbjct: 26 KARLRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVT 85 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + YL R + + + E P K V Sbjct: 86 GDKAEAIEAAERLLEANP---YLDRTLSIG-DADIIFGKSNRTEENRRYEFTPDKKVNSV 141 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 L + + P+ + I+ I+ ++ + + I V+D S SM Sbjct: 142 SLRA-FGADDVPMLFPTMGVPIEFRPIK---------QAVATQVELDIAIVLDRSGSMAF 191 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + P + + +N + ++ ++ S + Sbjct: 192 SHDEVAKNGSPSSAPPGWK-MGHAVPENARWLDTVAAVNGFLDIMEDSSHDERVSLSTYS 250 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 S+ + + +A ++ + Sbjct: 251 DKSKADVKLTG----DYTEIRAAMNAHSTKFK---------------------------- 278 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 G+T I + + ++ + ++ I Sbjct: 279 ----------------------GGATNIGSGI------LEGGATLGDKKLARSWASRVLI 310 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 V ++DG + E I + ++ I I T+ FS QE + +S F A Sbjct: 311 V-MSDGIHNTG-IEPIPAAQQVANEKIMIFTVTFSDEANVQEMEKVAVSG---GGQHFHA 365 Query: 432 NSTHELNKIFRDRIGNEIFERVI 454 + +L + FR +I + + Sbjct: 366 KDSQQLTEAFR-KIAKSLPTLIT 387 >gi|110639040|ref|YP_679249.1| BatA-like protein [Cytophaga hutchinsonii ATCC 33406] gi|110281721|gb|ABG59907.1| BatA-like protein, aerotolerance-related protein [Cytophaga hutchinsonii ATCC 33406] Length = 351 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 57/160 (35%), Gaps = 27/160 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ ++ + L + + + G TAI A+ TA + + Sbjct: 152 RIGIVIFSGEAVTLSPLTNDYVLLKNQLNDLKQNKDLQSG-TAIGTALGTAINRL----- 205 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 KN ++ IVL++DGENT + I + I+I I + T Q Sbjct: 206 ------KNAETKERIIVLISDGENTSGLMDPITAADLCLEYNIKIYCIGLGKDGTHQFKD 259 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L N + + F+ A L+ + Sbjct: 260 DNGTIQYVESKLDENTLKNISATTKGKFYRAYDKKSLDDV 299 >gi|78186535|ref|YP_374578.1| hypothetical protein Plut_0657 [Chlorobium luteolum DSM 273] gi|78166437|gb|ABB23535.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 356 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 56/164 (34%), Gaps = 14/164 (8%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI- 69 S++L G I+ A+++PV+LG + VD+ R + L+ AA A + + L Sbjct: 8 SRRLQSQRGGT-AILFAIVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGARSLSD 66 Query: 70 --------QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + S +I++ I N+ V Sbjct: 67 PGGQPYNWSAASIKALDVARSNVANGGQIQDAAIETGYWNILNPALGMRPAG---TPGVP 123 Query: 122 MNPRKSAYQVVLSSRYDLLLNPL-SLFLRSMGIKSWLIQTKAEA 164 A +V + PL LF +GI IQ A A Sbjct: 124 ATGDVPAVRVTTAISATQNNGPLQLLFAPILGITERSIQASAIA 167 >gi|327262385|ref|XP_003216005.1| PREDICTED: vitrin-like [Anolis carolinensis] Length = 748 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 60/421 (14%), Positives = 130/421 (30%), Gaps = 57/421 (13%) Query: 32 VMLGVGG-MLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSLEEVSSRAKNSFTFPK 87 V + G + R+ +H LK AQ+ + + PL IQ ++ S+ Sbjct: 364 VCFLIDGSWSIGKRRFQIQKHFLKDVAQSLDVGVAGPLMGIIQYGDDPSTEFNLKTYANS 423 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 + ++ + + + N R +A P Sbjct: 424 KDLKNAIEKILQKGGLSNVGKALSFVNKNFFEDSNGNRGAA--------------PNVAI 469 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + G + ++ + +E G++I +V + + + F Sbjct: 470 VMVDGWPTDKVEEASR-------LARESGINIFFVTIE----GPDENEKQNVVEANFVDK 518 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 A Y S N K+ +V ++ S +D SS Sbjct: 519 AVCRTNGYYSINVPSWFSLHKVVQPLVKRICDSDHLACSKTCLNSADIGFVIDGSSSVGT 578 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + +++ +A++ + + D R+GA + F + + ++ Sbjct: 579 GNFRTVLQF-VANISKEFEISDTD---TRIGAVQYT--YEQRLEFGFEKQSTKQDVLNAI 632 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T+ A+ A++ + +K+ +K ++L+TDG + D + Sbjct: 633 KRINYWSGGTSTGAAINYAFEQLF---------IKSKPNKRKIMILITDGRSYDDVQGP- 682 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDR 444 A G+ ++ + A+ L A+ F + L + F R Sbjct: 683 --ATAAHQNGVITYSVGIA------WAAQDELEAIATDPDKEHSFFVDEFDSLYR-FVPR 733 Query: 445 I 445 I Sbjct: 734 I 734 >gi|307941757|ref|ZP_07657112.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307775365|gb|EFO34571.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 358 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 78/243 (32%), Gaps = 30/243 (12%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 IITA + V++ G+ VD R + + + TA + A+V Sbjct: 2 IITAFVFFVLIVAIGVGVDYSRALTLKTRVLGSLDTAALAAAVEF--------------S 47 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 ++ + + + + + +V + + ++L Sbjct: 48 KLGSEQDARKAAKKAFDAQVSQLNLHGA----KLKKLNIVTDDETMKVSVDAVFELP--- 100 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 + ++ G K+ + T+++A +E + I ID + SM R + L Sbjct: 101 -TTLMQIAGFKTLEVATRSDAVGGG----QEVILDIVMCIDATGSMGATLRSVQRNAL-S 154 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP---GPLDPSLSEEHFVD 260 F +K Q + +R Y Y + D ++FVD Sbjct: 155 FEANLKNRLKELGRQVDIIRVRPIYYWDYDYDGWSRSYGLKKSTFLKLPDQRTQFKNFVD 214 Query: 261 SSS 263 S S Sbjct: 215 SES 217 >gi|325286051|ref|YP_004261841.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321505|gb|ADY28970.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 332 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 59/188 (31%), Gaps = 36/188 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 AL V K + R+G + + + ++ ++ + + Sbjct: 114 TALKKVAADFIKKRPND---RIGLVAYAAESYTKTPITSDKSIVLSSLRQITHGQLED-- 168 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA + + K++ K I+LLTDG N E + A Sbjct: 169 GTAIGMGLATAVNRL-----------KDSKSKSKVIILLTDGVNNSGFIEPKTAADLAVE 217 Query: 396 QGIRIMTIAFSVNKT------------------QQEKARYFLSNCA--SPNSFFEANSTH 435 I+ TI N Q E L A + +F A Sbjct: 218 YKIKTYTIGLGTNGNALTPIAFNPDRTYRYGMRQVEIDEKLLEEIATVTGGQYFRATDNE 277 Query: 436 ELNKIFRD 443 +L+ I+ + Sbjct: 278 KLSAIYNE 285 >gi|224024929|ref|ZP_03643295.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] gi|224018165|gb|EEF76163.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] Length = 332 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 67/189 (35%), Gaps = 35/189 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDEN 332 +A V ++ +G T F + + + L ++I A Sbjct: 111 EAAKQVAAEFINGRPNDN---IGLTIFAGEAFTQCPLTVDHGVLLNLFQSIKCDIAQKGL 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 M TA+ + A + K++ K I+LLTDG N + + + Sbjct: 168 IMDGTALGMGLANAVSRL-----------KDSKAKSKVIILLTDGVNNRGDISPLTAAEI 216 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYF----------------LSNCA--SPNSFFEANST 434 AK GIR+ TI N T + + L+ A + ++F A S Sbjct: 217 AKQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVQVPVEIDEQTLTQIAGTTNGNYFRATSN 276 Query: 435 HELNKIFRD 443 +L +++++ Sbjct: 277 SKLKEVYQE 285 >gi|114764812|ref|ZP_01443994.1| hypothetical protein 1100011001322_R2601_10469 [Pelagibaca bermudensis HTCC2601] gi|114542698|gb|EAU45721.1| hypothetical protein R2601_10469 [Roseovarius sp. HTCC2601] Length = 477 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 69/504 (13%), Positives = 142/504 (28%), Gaps = 145/504 (28%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ +G I+ L +M+ GG+ +D++ ++ A++ A+ L L+ Sbjct: 29 FRRAESGSMTIMAVALSLLMMIFGGIGIDMMYAELQRTKIQNTLDRAVLAAA-DLDNELD 87 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 K + + L+ +V ++ + V Sbjct: 88 AQGVVEDYMS---KMSLADALV-----------------------SVNVDEGLNYRTVTA 121 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + N F++ +GI++ +A S++ + + V + V+D S SM D Sbjct: 122 DGYRTMPSN----FMQLIGIENM------QAGGHSQAMERINKVEVSMVLDISGSMDDGD 171 Query: 194 RDSEGQPLNCF---------------------------GQPADRTVKSYSSQNGKVGIRD 226 + +E Q G + + + Sbjct: 172 KMAELQTAASDFVDTLLDDGSEDLVSISLVPYSEHVNAGPEILSYLNVNYMHDDSYCLEM 231 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + + SL Y D+ R+ ++ ++ I Sbjct: 232 PNSAFNSAALDLSLTYDQMQHFQWNYSGSNSLTDTVCPRYAYEQIRPWSQDAGALKTQIS 291 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI-NDAMQT 345 ++ R G + F W L+ + A G+ + Sbjct: 292 QLQP-----RAGTSIF-------MGMKW-ASALLDPSTRPIASGMIADGTVDAVFEGRPV 338 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG---------------------------- 377 AY + + K IVL+TDG Sbjct: 339 AYS---------------DTDVLKTIVLMTDGQHDRSFRIQNWAYNDENEVEHWSQYNLW 383 Query: 378 --------------------ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + + ++C AK QGI I +I F V+ + Sbjct: 384 HYLNYYVNSWNRSSFYYQKYDAATGDTLLSSVCTAAKRQGILIWSIGFEVS----DHGAN 439 Query: 418 FLSNCASPNSFFEANSTHELNKIF 441 + +CAS + F E+++ F Sbjct: 440 VMESCASSPAHFFRVEGVEISEAF 463 >gi|325954650|ref|YP_004238310.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437268|gb|ADX67732.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 338 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 68/194 (35%), Gaps = 45/194 (23%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+A ++ D R+G ++ ++ + LIR I + + + Sbjct: 117 KAVAVKFSKERQAD------RLGLVSYSGEALTRVPLTTDREVLIREINALESGELED-- 168 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAI----- 389 TAI + TA + I K++ K I+L+TDG E+ + + I Sbjct: 169 GTAIGIGLATAINHI-----------KDSKAKSKVIILMTDGVESINPTNDLMYISPQTA 217 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ------------------EKARYFLSNCA--SPNSFF 429 A S+GI++ TI + L N A + +F Sbjct: 218 AEMATSRGIKVYTIGIGTRGLAPFPTAYDMYGNYIFDMMPVDIDEKLLQNIADLTGGLYF 277 Query: 430 EANSTHELNKIFRD 443 A L KI+++ Sbjct: 278 RATDNQSLQKIYQE 291 >gi|315923825|ref|ZP_07920054.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] gi|315622858|gb|EFV02810.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] Length = 969 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/334 (12%), Positives = 98/334 (29%), Gaps = 63/334 (18%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 G + A + S S + + V D S SM + +G + Sbjct: 51 GTYKLSLSVTGTASSTSESSKADVVI----VFDISNSMDEETNTYVEYATGRYGSVSSDA 106 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 S++ Y + + L + + + + Sbjct: 107 PTGSSTRRRLYRRSTNNWGYYQYTEITNDTTSGTVYYLGDNYQYHEY--TGKRYSQKTRL 164 Query: 272 HLVRDALASVIRSIKKIDNVND--TVRMGATFFNDRVISDPSFSWGVHKL-IRTIVKTFA 328 + + A ++I + +N + +VR+ F+ +W + +IV + Sbjct: 165 DVAKSATNTMIDQL-LANNATNPGSVRISLVSFDTFASDAT--AWSTSSENLHSIVNGYK 221 Query: 329 IDENEM-----GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--- 380 ++ G T DA+Q A + +A+K+++ ++DG T Sbjct: 222 TPQSSHLGGHRGGTNWEDALQKA------------DGTQPRADAQKHVIFVSDGNPTFRI 269 Query: 381 -------------------------QDNEEGIAICNKAKS---QGIRIMTIAFSVNKTQQ 412 N A + AK G T+ + Sbjct: 270 SSINGNPDDQYNDVHGHGDDDYYHSHPNYNYDAAKDDAKKIVDGGAAFYTVGTFGDA--- 326 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + + + +++++A+ L F++ + Sbjct: 327 ARMQNLATEAGASDNYYKADDEAALKAAFKNIVA 360 >gi|315223608|ref|ZP_07865462.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] gi|314946389|gb|EFS98384.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] Length = 340 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 60/188 (31%), Gaps = 36/188 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +AL V K + R+G + + + ++ + + + Sbjct: 122 EALKKVASQFVKDRPND---RIGLVIYAGESYTKTPVTTDKGIILNALSELTYGQIED-- 176 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA + + K + + I+LLTDG N + A Sbjct: 177 GTAIGMGLATAVNRL-----------KESKAKSRVIILLTDGVNNTGFIDPQTAAELAAE 225 Query: 396 QGIRIMTIAFSVNKT------------------QQEKARYFLSNCA--SPNSFFEANSTH 435 GIR+ TI N T Q E + A + +F A Sbjct: 226 YGIRVYTIGIGSNGTALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQ 285 Query: 436 ELNKIFRD 443 +L +I+ + Sbjct: 286 KLQQIYNE 293 >gi|301064759|ref|ZP_07205139.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441134|gb|EFK05519.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 332 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 66/205 (32%), Gaps = 39/205 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +V+ + I + R+G F R + L R + Sbjct: 113 RVDVVKKVVFRFIGE----RPDD---RIGLVAFAGRPYMVSPLTLDHDWLGRRLQTIHPG 165 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI A+ ++ + + K ++LLTDG N + Sbjct: 166 MVED--GTAIGSAIGSSINRLRDQKAKS-----------KVVILLTDGMNNAGKILPVTA 212 Query: 390 CNKAKSQGIRIMTIA------FSVNKTQQEKARYF-----------LSNCA--SPNSFFE 430 A++ GI+I TI V T + + L A + ++ Sbjct: 213 AEAAETLGIKIYTIGAGSRGEVPVPITDKFGNQKIVRAKVDIDEATLEKVAQMTGAKYYR 272 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A T L KI+ + E +R IR Sbjct: 273 ATDTDSLKKIYSEINKLETTKRKIR 297 >gi|320158501|ref|YP_004190879.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933813|gb|ADV88676.1| BatA [Vibrio vulnificus MO6-24/O] Length = 362 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 72/202 (35%), Gaps = 17/202 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S+ I + ++ LA ++ R+G F D Sbjct: 113 DLSGSMAEQDFTSASGANISRLDATKEVLAEFAKT-------RQGDRLGLILFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A S + K K Sbjct: 166 TPFTADQKVWLALLNQTDVA--MAGQSTHLGDAIGLAIKVFEQSEPSQAASSKPRQ---K 220 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCASP-- 425 ++LTDG +T E I AK++G+RI IA T E A + AS Sbjct: 221 VAIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPSTVGESALDLQTIERIASESG 280 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 F+A + EL + D IG Sbjct: 281 GKAFQALNRDELASAY-DDIGK 301 >gi|194335401|ref|YP_002017195.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307878|gb|ACF42578.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 336 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 52/164 (31%), Gaps = 30/164 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + + L + + + G TAI A+ A + + Sbjct: 140 RIGLVVFRGKGYTQCPLTLDHEVLAMLLDRLSPGVIQDDG-TAIGTAILIAVNRL----- 193 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI--AFSVNKTQQE 413 K + K ++L+TDGEN + A G+RI I F V + + + Sbjct: 194 ------KASESLHKVLILVTDGENNAGEVGPGTAASIAARSGVRIYVINAGFKVVEDRID 247 Query: 414 ----------KARYFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 K L A + +F F I Sbjct: 248 PPEESGRYIQKDEESLQGIARTTGGGYFRVEDP----AAFDQTI 287 >gi|315649108|ref|ZP_07902201.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275543|gb|EFU38898.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 983 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 57/176 (32%), Gaps = 30/176 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVHKLIRTIVK 325 K L +++ + ++ D V G F+D+ V+ + V Sbjct: 422 TKIELAKESAMRTVELLRSKDTV------GVVAFDDQPWWVVPPQKLGN------KEEVL 469 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T I A+ +A + ++ H I+L+TDG+ + N Sbjct: 470 SSIQSIPSAGGTNIYPAVSSALEEMLKIKSQRRH-----------IILMTDGQ-SAMNSG 517 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + I + ++A + + A+ ++ L +F Sbjct: 518 YQDLTDTMVENKITMSSVAVGTDA-DTHLLQSLAE--AAKGRYYFVEDETTLPAVF 570 >gi|269968855|ref|ZP_06182838.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] gi|269826535|gb|EEZ80886.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] Length = 356 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 81/202 (40%), Gaps = 15/202 (7%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ L+ ++S +K D R+G F D Sbjct: 113 DLSGSMAEPDFTSRTGEKISRLDAAKEVLSEFVQS-RKGD------RLGLVLFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A + ++ ++ + +K Sbjct: 166 TPFTADQKVWLELLNQTDVA--MAGQSTHLGDAIGLAI-KVFEQSDKSSGALEQDQNREK 222 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 ++LTDG +T E I AK++G+R+ IA +T E A + A S Sbjct: 223 VAIVLTDGNDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMETIHRIAKESG 282 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 FEA + EL+ + D IG Sbjct: 283 GEAFEALNRDELSAAY-DEIGK 303 >gi|312131680|ref|YP_003999020.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311908226|gb|ADQ18667.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 328 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 56/167 (33%), Gaps = 16/167 (9%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F + + L + G TA+ A+ + + + Sbjct: 142 RIALVAFAGETATLSPLTTDYTALKEYLASINTNIIRTSG-TALGMALSSCVNKLRDVAG 200 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + ++++DG+NT AKS G+R+ TIA +++ Sbjct: 201 KS-----------RIAIIISDGDNTAGTIPPETAVELAKSFGVRVYTIAIGKPGSEEGVD 249 Query: 416 RYFLSNCAS--PNSFFEANSTHELNKIFR--DRIGNEIFERVIRITK 458 L A FF+A L+KIF D + I E + + Sbjct: 250 EKTLRMLAGGPNGRFFKAADNSSLSKIFDEIDHLEKTISETASEVNR 296 >gi|218131126|ref|ZP_03459930.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|317476996|ref|ZP_07936238.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986646|gb|EEC52980.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|316906789|gb|EFV28501.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 327 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 64/186 (34%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H ++ ++K E G Sbjct: 111 EAAKDVAAEFINGRPNDN---IGITLFAGESFTQCPLTVD-HAVLLNLIKDVKCGLIEDG 166 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 M A + V R+K++ K I+LLTDG N + + + AKS Sbjct: 167 ---TAVGMGIA---------NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKS 214 Query: 396 QGIRIMTIAF----SVNKTQQEKA------------RYFLSNCA--SPNSFFEANSTHEL 437 GIR+ TI L+ A + ++F A S +L Sbjct: 215 FGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKL 274 Query: 438 NKIFRD 443 +++ + Sbjct: 275 KEVYEE 280 >gi|188994393|ref|YP_001928645.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] gi|188594073|dbj|BAG33048.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] Length = 327 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 72/201 (35%), Gaps = 35/201 (17%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S S++ + K + + A I I N N +G F + + H ++ Sbjct: 97 SGSMQAMDFKPNRLEAAKDVAISFINNRPNDN----IGMVTFAGESFTQCPLTTD-HTVL 151 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 +V+ + + G TAI + TA + + K++ + ++LLTDG N Sbjct: 152 LNMVQDLQMGVLDDG-TAIGMGLATAVNRL-----------KDSKAKSRVVILLTDGSNN 199 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY----------------FLSNCA- 423 + + A++ GIR+ T+ + L A Sbjct: 200 MGDITPRMAADIARTFGIRVYTVGVGTRGEAPFPIQTEFGVRIQNVPVDIDEPTLDGIAE 259 Query: 424 -SPNSFFEANSTHELNKIFRD 443 S +F A LN+I+++ Sbjct: 260 VSGGKYFRAVDNETLNEIYKE 280 >gi|320333536|ref|YP_004170247.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319754825|gb|ADV66582.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 509 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 63/191 (32%), Gaps = 30/191 (15%) Query: 270 KKHLVRDALASVIRSIKKID----NVNDTVRMGATFFND--RVISDPSFSWGVHKLIRTI 323 + ++ AL + + + + R+ F+ + Sbjct: 337 RIDALKTALRGLSGADTTLTGRYATFANRERVTLIPFSSAPGAPRTTELTPATRGAALKQ 396 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQ 381 ++ G T I A+Q AY+ ++ A +Y IVL+TDGE T+ Sbjct: 397 LRAQVDALTPDGGTNIYGALQAAYEQARAA------------PAGRYTSIVLMTDGERTE 444 Query: 382 DNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 A +++ ++ T+ F + + L+ F+ + +L Sbjct: 445 GPSADQFRATYAALPERARQVKTFTVLFGDSDATEMNRIATLTG----GRTFDGQN--DL 498 Query: 438 NKIFRDRIGNE 448 F+D G + Sbjct: 499 RAAFKDIRGYQ 509 >gi|254482897|ref|ZP_05096133.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214036769|gb|EEB77440.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 330 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 68/199 (34%), Gaps = 26/199 (13%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S E + + + V+ L + ++ R+G F Sbjct: 98 AVDLSGSMEARDFTLPSGVTVDRLDAVKQVLKELAA-------NRESDRLGLIVFGAAAY 150 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ H++ + ++ I ST DA+ + S+ D Sbjct: 151 LQTPFT-DDHQVWQQLLDETEIGM-AGPSTVFGDAIGLSIKLFSDSDSDN---------- 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-----LSNC 422 + +++LTDG +T + A + G+RI TIA T E A +S Sbjct: 199 -RVLIMLTDGNDTGSTVPPVDAAKVAAANGVRIYTIAIGDPATVGEDALDMDTITRVSKI 257 Query: 423 ASPNSFFEANSTHELNKIF 441 A F A E+ + + Sbjct: 258 A-DGRTFRALDQDEMRQAY 275 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 22/85 (25%), Positives = 40/85 (47%), Gaps = 7/85 (8%) Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 +T E + ++ +C+ AK +GI + T+ F + R L CAS +S + Sbjct: 481 MTHKEASAKDQRTDHVCDAAKDEGIIVYTVGFEAPYS----GRRVLKRCASSDSHYYDAD 536 Query: 434 THELNKIFRDRIGNEIFERVIRITK 458 E++ F I + I R +R+T+ Sbjct: 537 GLEISDAFTS-IASSI--RKLRLTQ 558 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 61/397 (15%), Positives = 127/397 (31%), Gaps = 83/397 (20%) Query: 12 KKLIKSCTGHF-----FIITALLMPVML---GVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + +K F ++T + + + L G G+ VD++R+ AL+ A++ Sbjct: 17 RNRLKDRANAFAREEDGLMTVMALFLFLALVGAAGIGVDLMRYEQKRAALQYTMDRAVLA 76 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A+ ++VS K + EYL ++V + Sbjct: 77 AA----DLDQQVSPETVVRSYLEKAGLLEYL-----------------------SSVTVQ 109 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 +V ++ +L + F++ G S I + AE + V I V+ Sbjct: 110 EGLGYRKVSATATAELP----THFMKLSGYDSLTIPAASTAEESIGN------VEISLVL 159 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN--KSLY 241 D S SM R + A + + + G + PY N + Sbjct: 160 DVSGSMNSNSRLYN-------LKNAAKEFVDHMLSATEPGTVSISIVPYATQVNAGADIL 212 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK-KIDNVNDTVRMGAT 300 +F+D + + + + + D ++ V Sbjct: 213 SYYNVSTEHNYSHCVNFIDDEFSQPGLSRVTPLERTMH--FDPFSYTKDPISTPV--CPV 268 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI-----------NDAMQTAYDT 349 + + FS +T++ + G+T+I + Q+ Sbjct: 269 RASTEI---LPFSND-----QTVLNNYIDGLTGRGNTSIDIGTKWGVVMLDPGTQSVISG 320 Query: 350 IISSNEDEVH-----RMKNNLEAKKYIVLLTDGENTQ 381 +IS N+ ++ + K +++++DGENT Sbjct: 321 LISDNKVPASFQGRPSAYDSGDVLKVLIVMSDGENTN 357 >gi|222080976|ref|YP_002540339.1| hypothetical protein Arad_7191 [Agrobacterium radiobacter K84] gi|221725655|gb|ACM28744.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 649 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 33/263 (12%), Positives = 76/263 (28%), Gaps = 22/263 (8%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + Y + G I+T +PV++G + ++ ++ A A ++ Sbjct: 8 LLHYLRLFCDDRKGGIAIMTVFCLPVVIGFAALSIEYGYGLLVRDQNQRTADLASYAGAL 67 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + ++ + +R + N +V TA + R Sbjct: 68 AYSNANS-------------EDQMTDAALRVAKLN-----GVDAANVVVSLTASPKDSRV 109 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 A V +++ L L P+ + I + + AE+ + + S Sbjct: 110 QAVHVDVTTTNTLFLAPILGVDSKLNIATEAYSSLGAAESGCIIALDKSASGVTL----S 165 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + P + + S+ + + S +V + S + Sbjct: 166 GGVQANASSCYVASNSNLVAPCGTKITAKSASYYEGSSQPCPWSTNIVRADGSPAPVTKQ 225 Query: 247 GPLDPSLSEEHFVDSSSLRHVIK 269 DP S I+ Sbjct: 226 YTSDPLEGNAGVDKLSGRFDTIR 248 >gi|218670347|ref|ZP_03520018.1| hypothetical protein RetlG_01180 [Rhizobium etli GR56] Length = 125 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 19/65 (29%), Positives = 30/65 (46%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 +TA+L PV+LG GM V V + L++AA +A + + L + S + Sbjct: 1 MTAILAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGTIQTSQAEAFARN 60 Query: 85 FPKQK 89 F K Sbjct: 61 FVAGK 65 >gi|299139026|ref|ZP_07032203.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599180|gb|EFI55341.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 318 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 71/189 (37%), Gaps = 28/189 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + +AS++R +ID F D V SF+ V K+ + + Sbjct: 110 REAAKKFMASLLRKQDQID---------LMDFADDVDELVSFTSDVQKIDSGLGR----- 155 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +TA+ DA+ A + + R + +VL+TDGENT + A Sbjct: 156 IHHGDATALYDAVYLASQRLGETPTSAGQR--------RVLVLITDGENTTHHGSYDAAL 207 Query: 391 NKAKSQGIRIMT---IAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRI 445 +A+ G I + S + + + L A ++ H+L F+ + Sbjct: 208 EQAQRAGAMIYALIIVPVSADAGRNTGGEHALIQLARDTGGKYYYVEDKHDLAPAFQ-HV 266 Query: 446 GNEIFERVI 454 +++ + Sbjct: 267 SDDLRTQYT 275 >gi|118081959|ref|XP_417299.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Gallus gallus] Length = 955 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 73/219 (33%), Gaps = 24/219 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K ++AL ++++ ++ D+ N Sbjct: 293 NGYFVHYFAPMDLPPLPKNVVFVLDSSASMVGTKLRQTKEALFTILQDLRPEDHFN---- 348 Query: 297 MGATFFNDRV-----ISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 F++R+ + + + I + G T IN A+QT + Sbjct: 349 --IIGFSNRIKVWQQDRLVPVTPNNIRDAKKYIHNM-----SPTGGTNINSALQTGAKLL 401 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSV 407 N+ + I+ LTDG T + I + K + TI Sbjct: 402 ---NDYIAQNNIDARSVS-LIIFLTDGRPTVGETQSSKILSNTKDAIRDKFCLFTIGIGN 457 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + R L NC F E K F D IG Sbjct: 458 DVDYKLLERMALENCGMVRHFQEDEDAASHLKGFYDEIG 496 >gi|262275460|ref|ZP_06053270.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] gi|262220705|gb|EEY72020.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] Length = 453 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 64/462 (13%), Positives = 147/462 (31%), Gaps = 74/462 (16%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 +I + P++ GV + V+ R+ + + A + + +++ + Sbjct: 16 VIFVIAYPLLFGVFVLAVESTRYLQTHARIGDGVEVASLAVA---ANISSDITENKTLAK 72 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDR--EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + + + I + N+++ D + V E + Y+V LSS ++ Sbjct: 73 NYVDGFVPDGTISLADINIERKSCDEIYGSQCGVAGVYDEEGLVFTQYKVTLSSEFE-SW 131 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD-YQRDSEGQP 200 P F + G + + T ++ + + +V DFS SM + R+ + + Sbjct: 132 YPEDDF--APGFE----EIVELGGTAVARKYQGFTIDVAFVADFSGSMQQTWNREIKYKG 185 Query: 201 LNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + R +++++ KV + + N S +Y + S Sbjct: 186 VVNVISDITRKLETFNDHTEQELNGKKVANKVAFIGYNFYPHNGSTFYSNVDYKANYSRL 245 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP-SFS 313 + ++ + + RD + + K+ + V ++D + Sbjct: 246 SYKWQENIPEINYRR---TARDPINN-----KRTPIIGRYVNNTIPLYSDDSYFYTLDLT 297 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + TI + TA + + A + N +K I++ Sbjct: 298 DNFTQFRNTISTFYPDY-----GTASYEGIIEA-----------AKIVNNGENIRKLIIV 341 Query: 374 LTDGENTQDNEEG---------------IAICNKA-----------KSQGIRIMTIAFSV 407 L+DGE++ + +C ++ +I I F Sbjct: 342 LSDGEDSINENNPYDNRYPGFIAPLIYQSGLCQNIINDLESKEINGRNVEAKIFVIGFGY 401 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + L CA + A+S E+ I E+ Sbjct: 402 DLEKNPG----LKICAGEENVQSADSYQEIFDTVLQLISEEV 439 >gi|34541234|ref|NP_905713.1| batA protein [Porphyromonas gingivalis W83] gi|34397550|gb|AAQ66612.1| batA protein [Porphyromonas gingivalis W83] Length = 327 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 72/201 (35%), Gaps = 35/201 (17%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S S++ + K + + A I I N N +G F + + H ++ Sbjct: 97 SGSMQAMDFKPNRLEAAKDVAISFINNRPNDN----IGMVTFAGESFTQCPLTTD-HTVL 151 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 +V+ + + G TAI + TA + + K++ + ++LLTDG N Sbjct: 152 LNMVQDLQMGVLDDG-TAIGMGLATAVNRL-----------KDSKAKSRVVILLTDGSNN 199 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY----------------FLSNCA- 423 + + A++ GIR+ T+ + L A Sbjct: 200 MGDITPRMAADIARTFGIRVYTVGVGTRGEAPFPIQTEFGVRIQNVPVDIDEPTLDGIAE 259 Query: 424 -SPNSFFEANSTHELNKIFRD 443 S +F A LN+I+++ Sbjct: 260 VSGGKYFRAVDNETLNEIYKE 280 >gi|156616286|ref|NP_001096077.1| collagen alpha-6(VI) chain isoform 1 [Mus musculus] gi|189082903|sp|Q8C6K9|CO6A6_MOUSE RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2265 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 71/187 (37%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS ++ ++++D +I +KK D + VR GA + D Sbjct: 810 VFVIDSSGSIDYQEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYL--DE 864 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +V D G+T +A+ + + +H+ + ++++TD Sbjct: 865 LGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----QVLIVITD 919 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE + D E+ + +GI ++ + + + L+ S + ++ + Sbjct: 920 GE-SHDAEKLNTTAKALRDKGILVLAVGIAGANS-----WELLAMAGSSDKYYFVETFGG 973 Query: 437 LNKIFRD 443 L IF D Sbjct: 974 LKGIFSD 980 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 42/150 (28%), Gaps = 19/150 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D V++G F+ + I + +T A+ Sbjct: 655 ADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFS 712 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++L+TDGE + + +G+ I ++ + Sbjct: 713 PDKGARP-------NVRKFLILITDGEAQDIVRDP---AIALRKEGVIIYSVGVFGSNVT 762 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKI 440 Q L + P F + L I Sbjct: 763 Q------LEEISGKPEMVFYVENFDILQHI 786 >gi|118094354|ref|XP_422360.2| PREDICTED: similar to calcium-activated chloride channel [Gallus gallus] Length = 928 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 44/274 (16%), Positives = 88/274 (32%), Gaps = 28/274 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S+ + N + +Y S + D+ + +V+ + + Sbjct: 241 SLPSVVEFCDKNTHNSEAPNMQNKMCNYKSTWEIIMESDDFRNSSVVNSLVPPFETTFEL 300 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + +D S + + +R A + I +I + R+G F Sbjct: 301 LQTQDRAVSLVLDVSGSMNTNNRITNLRTAAEVFLIQIIEIGS-----RVGIVTFESSAY 355 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 R + + G T I ++ + I ++ Sbjct: 356 EKSPLLQITSVATRQRLVQN-LPTTAGGGTKICAGIEKGLEIITNA---------IGTTY 405 Query: 368 KKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 IVLLTDGE + +++C K K G I TIA + ++ F SN Sbjct: 406 GSEIVLLTDGE-----DSTMSLCREKVKESGAIIHTIALG--PSAAKELEEF-SNITGGL 457 Query: 427 SFF--EANSTHELNKIFRDRIG--NEIFERVIRI 456 + + + +L + F + +I E+ I++ Sbjct: 458 QLYAVDVDVPSKLVEAFSEITTGSGDISEQSIQL 491 >gi|167644155|ref|YP_001681818.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] gi|167346585|gb|ABZ69320.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] Length = 562 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 77/556 (13%), Positives = 157/556 (28%), Gaps = 119/556 (21%) Query: 13 KLIKSCTGHFFIITALL-MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS--VPLI 69 +L G + ALL +P+ + G+ +D+ R S + L+ A A + A+ Sbjct: 16 RLGADERGAIAVQFALLLIPIAVLTFGL-IDISRASVQKRQLQDALDAATLMAARSTATT 74 Query: 70 QSLEEVSSRAKNSFTFPK---------------------QKIEEYLIRNFENNL------ 102 + + A + I+ +I+ +NL Sbjct: 75 NADLDTIGDAALATEMAGLGVTFGPGNSSFVLGDNNTVVGTIQNVVIKPIISNLWSSTNT 134 Query: 103 --------KKNFTDREVRDIVRDT---AVEMNPRKSAYQVVLSSRYDL------------ 139 ++ EV ++ +T A + S ++++ L Sbjct: 135 PVSATATVMRSINHLEVALVLDNTGSMASSLGSGGSKITALITASKSLVDVLSAAAARAT 194 Query: 140 -------LLNPLSLFLR-SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + P S+ + ++ + + ++ Sbjct: 195 EADAVKISVVPFSMTVNIGSTYQTQTSWLTGTQPAAYGVDNFATSQNRFTLLSNLGLTWG 254 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 +S P + + + + + + + + + + Y Sbjct: 255 GCVESRPAPFDVTDDAPNPAIGASMFVPFFAPDEPDDNTVNISASSSTKYRDARRYSTSY 314 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG-ATFFNDR--VIS 308 ++ + D+ + R + + + K + + G A N + S Sbjct: 315 PITNTYLTDTVTPTGTATNAWSTRSTVVAKYATSNKA-TLLSLAKTGTAYGPNAGCGMTS 373 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE--DEVHRMKN-NL 365 + K R VK G+T + + + T+ + D V Sbjct: 374 LMRLTNVKAKADRDTVKGKLDQMIASGNTNVAMGLIWGWHTLSKNAPFADGVDPATTVGK 433 Query: 366 EAKKYIVLLTDGENTQDNEEGIAI------------------------------------ 389 K IVLLTDG+NT D Sbjct: 434 RTTKVIVLLTDGDNTNDTYNNPNASIYTGYGYITQGRLLNASNSPLGATSTATNRRDAID 493 Query: 390 ------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFR 442 C AK+ G++I I V+ R L +CAS P +++ +L +F Sbjct: 494 SREARACTNAKAAGVQIYAIGVGVSSHS----RGILQDCASKPEMYYDVTDAAQLASVFN 549 Query: 443 DRIGNEIFERVIRITK 458 G + +RITK Sbjct: 550 TIAG---SIQNLRITK 562 >gi|86147193|ref|ZP_01065509.1| TadG-like protein [Vibrio sp. MED222] gi|85835077|gb|EAQ53219.1| TadG-like protein [Vibrio sp. MED222] Length = 435 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 59/373 (15%), Positives = 123/373 (32%), Gaps = 54/373 (14%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K ++ GH ++ A+++P + GV + D R + L++A++ A++ Sbjct: 2 KHAMRKQFGHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLA-------- 53 Query: 72 LEEVSSRAKNSFTFPKQKIEEYL--IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 VS++ + ++ I+ YL + + + K E+ + + T R Y Sbjct: 54 ---VSAKDEQDHQLAERYIQHYLYDMDSILDIEVKKLGCDEIPECIAATE-RGEARYFEY 109 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 +V + + P + + G + +R Y + + I +++DFS SM Sbjct: 110 RVAGQTLHKSWF-PGNDVISGFGD-----SFNVTGSSKARRYQSQ-PIDITFIVDFSESM 162 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D LN + + N P+ V+ + Sbjct: 163 NDSWSGGRHSKLNDLKDIIEDVADELGAYND-----LYPEHPHRVALTGFNRRTINKDKN 217 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI---RSIKKIDNVNDTVRMGATFFNDRV 306 D + + V S + +A ++ N +D R +F Sbjct: 218 DNLVVRDQRVVSREGEYDKDDTVNFNKTIAQQFIVKGEASRVPNSDDDARFYDLYFTTDF 277 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 S G TA + A + S ++++ Sbjct: 278 SSFTK---------------KVKKFKAGGGTASLQGIIRAGQIVTSMSKNQ--------- 313 Query: 367 AKKYIVLLTDGEN 379 K+ I++L+DGE+ Sbjct: 314 -KQLIIILSDGED 325 >gi|332884779|gb|EGK05035.1| hypothetical protein HMPREF9456_03188 [Dysgonomonas mossii DSM 22836] Length = 327 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 62/193 (32%), Gaps = 38/193 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + I N + R+G F + + H+++ ++ Sbjct: 108 TRLEAAKKVASEFI-------NDRQSDRIGLVIFAGESFTQCPLTTD-HRVLLNLLSEVK 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TAI + + + + K++ + ++LLTDG N + Sbjct: 160 FGMIEDG-TAIGLGLANSVNRL-----------KDSQSKSRVVILLTDGSNNAGQIAPLT 207 Query: 389 ICNKAKSQGIRIMTIAFS------VNKTQQEKARYF----------LSNCASP--NSFFE 430 A S GIR+ TI + L+ AS S+F Sbjct: 208 AAELAASYGIRVYTIGIGSRGTSVARVMTPYGMQSMNVSGDFDERTLTEIASKTGGSYFR 267 Query: 431 ANSTHELNKIFRD 443 A L+ I+ + Sbjct: 268 ATDNTSLSGIYDE 280 >gi|330995094|ref|ZP_08319011.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576670|gb|EGG58173.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 332 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 60/189 (31%), Gaps = 35/189 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDEN 332 +A V ++ +G T F + + + L I A Sbjct: 111 EAAKQVASEFIAGRPNDN---IGLTIFAAEAFTQCPMTTDHAVLLNLFHGIKTDMAQRGM 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 TA+ + A + K++ K I+LLTDG N + + Sbjct: 168 IQDGTAVGMGIANAVSRL-----------KDSKAKSKVIILLTDGTNNAGDISPLTAAEI 216 Query: 393 AKSQGIRIMTIAFSVNKTQQ------EKARYF----------LSNCA--SPNSFFEANST 434 AKS GIR+ TI N +Y L+ A + F+ A Sbjct: 217 AKSFGIRVYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATDN 276 Query: 435 HELNKIFRD 443 +L +++D Sbjct: 277 KKLEDVYKD 285 >gi|326424188|ref|NP_762140.2| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|319999572|gb|AAO07130.2| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 362 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 72/202 (35%), Gaps = 17/202 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S+ I + ++ LA ++ R+G F D Sbjct: 113 DLSGSMAEQDFTSASGAKISRLDATKEVLADFAKT-------RQGDRLGLILFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A S + K K Sbjct: 166 TPFTADQKVWLALLNQTDVA--MAGQSTHLGDAIGLAIKVFEQSEPSQAAFSKPRQ---K 220 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCASP-- 425 ++LTDG +T E I AK++G+RI IA T E A + AS Sbjct: 221 VAIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPSTVGESALDLQTIERIASESG 280 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 F+A + EL + D IG Sbjct: 281 GKAFQALNRDELASAY-DDIGK 301 >gi|156523144|ref|NP_001095986.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] gi|187609595|sp|A2VE29|ITIH5_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|126010782|gb|AAI33545.1| ITIH5 protein [Bos taurus] Length = 940 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 73/217 (33%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P L P FV SS V K +DAL +++ ++ D+ N Sbjct: 277 DGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFN---- 332 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+Q + Sbjct: 333 --IVGFSNRIKV-----WKDHLVSVTPNSIRDGKVYIHHMSPSGGTDINGALQRGIQLL- 384 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 N+ H + +V LTDG+ T I N + + I T+ + Sbjct: 385 --NDYVAHNDIEDRSVS-LVVFLTDGKPTVGETHTFKILNNTREATRGRVCIFTVGIGAD 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E + F D I Sbjct: 442 VDFKLLEKLSLENCGLTRRVHEDHDARAQLIGFYDEI 478 >gi|326911082|ref|XP_003201891.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Meleagris gallopavo] Length = 951 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 73/219 (33%), Gaps = 24/219 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K ++AL ++++ ++ D+ N Sbjct: 289 NGYFVHYFAPMDLPPLPKNVVFVLDSSASMVGTKLRQTKEALFTILQDLRPEDHFN---- 344 Query: 297 MGATFFNDRV-----ISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 F++R+ + + + I + G T IN A+QT + Sbjct: 345 --IIGFSNRIKVWQQDRLVPVTPNNIRDAKKYIHNM-----SPTGGTNINSALQTGAKLL 397 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSV 407 N+ + I+ LTDG T + I + K + TI Sbjct: 398 ---NDYIAQNNIDARSVS-LIIFLTDGRPTVGETQSSKILSNTKDAIRDKFCLFTIGIGN 453 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + R L NC F E K F D IG Sbjct: 454 DVDYKLLERMALENCGMVRHFQEDEDAASHLKGFYDEIG 492 >gi|222528098|ref|YP_002571980.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222454945|gb|ACM59207.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 902 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 42/176 (23%), Positives = 61/176 (34%), Gaps = 29/176 (16%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I K + + A A +I ++ D V G F+ +G ++++ Sbjct: 426 IPKLEIAKSASAKMIEHLESSDGV------GVIAFDHNYYWAYK--FGKISKKEDVIESI 477 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TAI + A T+ S IVLLTDG Q E Sbjct: 478 -SSIEVGGGTAIIPPLSEAVKTLKKSKAKSK-----------LIVLLTDGMGEQGGYEIP 525 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIF 441 A N+AK I+I TI LS AS F+ ++ EL +F Sbjct: 526 A--NEAKRNNIKITTIGVGKYVNA-----TVLSWIASFTSGRFYLVSNPSELVDVF 574 >gi|149437045|ref|XP_001515975.1| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Ornithorhynchus anatinus] Length = 949 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 50/218 (22%), Positives = 73/218 (33%), Gaps = 23/218 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K ++AL +++ ++ DN N Sbjct: 287 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGAKLKQTKEALFTILHDLRPEDNFN---- 342 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F+ R+ W + T K + + G T IN A+QT + Sbjct: 343 --IVGFSSRIKV-----WKDQLVPVTPNSIRDGKVYIHHMSPSGGTNINGALQTGIRLL- 394 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVN 408 N+ H + IV LTDG T + I N K + + TI + Sbjct: 395 --NDFVAHNDIDARSVS-LIVFLTDGRPTVGEIQTPKILNNTKEAARDRVCLFTIGIGDD 451 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + L NC F K F D IG Sbjct: 452 VDFKLLEKLSLENCGMTRRFQVEADAAAQLKGFYDEIG 489 >gi|91225506|ref|ZP_01260628.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] gi|91189869|gb|EAS76142.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] Length = 356 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 81/202 (40%), Gaps = 15/202 (7%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ L+ ++S +K D R+G F D Sbjct: 113 DLSGSMAEPDFTSRTGEKISRLDAAKEVLSEFVQS-RKGD------RLGLVLFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A + ++ ++ + +K Sbjct: 166 TPFTADQKVWLELLNQTDVA--MAGQSTHLGDAIGLAI-KVFEQSDQSRGALEQDQNREK 222 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 ++LTDG +T E I AK++G+R+ IA +T E A + A S Sbjct: 223 VAIVLTDGNDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESG 282 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 FEA + EL+ + D IG Sbjct: 283 GEAFEALNRDELSAAY-DEIGK 303 >gi|103487755|ref|YP_617316.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] gi|98977832|gb|ABF53983.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] Length = 666 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/161 (21%), Positives = 57/161 (35%), Gaps = 37/161 (22%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN--EDEVHRMKNNLEAKKYIVLLTDGE 378 R T+ +G T + M + + DE N+ ++IV +TDG Sbjct: 508 RATFNTYVQSLQPLGGTYHDAGMVWGARLLSPTGLFADENATAPNDRPISRHIVFMTDGA 567 Query: 379 NTQD----------------------------NEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + N +C A+ +GI I ++F V Sbjct: 568 MAPNMGNLTFQGYEFLMHRVGGTSDSDLRDRHNNRFTQLCRAARQRGITIWVVSFGVGSN 627 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 L+NCAS FEA++ ELN+ F+ I +I + Sbjct: 628 DS------LNNCASSGQAFEADNAAELNEQFQ-AIARQISK 661 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 79/234 (33%), Gaps = 32/234 (13%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++ +K LI G+ F++TA + ++G G VD+ R + L+QA ++ Sbjct: 7 SRLCAGTKSLISDQRGNAFMLTAAAIIPVIGFVGSAVDIGRAYMTQLRLQQACDAGVLAG 66 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + +++A+ N NF E + T + + Sbjct: 67 RRAMGGASYDEAAQAE-------------------ANKMFNFNFPEAKY--GATGILFSS 105 Query: 125 RK-SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 R +A V + L + + G + + + A+ + V + V+ Sbjct: 106 RALNASDVEGQASAVLP----TELMFMFGKEEFRLSADCTAKLEISN------VDVMLVL 155 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 D + SM + + ++ + G + PY + N Sbjct: 156 DVTGSMAQTNAGDSVNRITALKDATMDFFDTLTNADVGDGRLRFGVVPYSSTAN 209 >gi|255535987|ref|YP_003096358.1| aerotolerance operon BatA [Flavobacteriaceae bacterium 3519-10] gi|255342183|gb|ACU08296.1| BatA (Bacteroides aerotolerance operon) [Flavobacteriaceae bacterium 3519-10] Length = 334 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 61/189 (32%), Gaps = 37/189 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 AL ++ + + R+G ++ + + L+ + ++ Sbjct: 115 TALKNIAKKFVDKRPGD---RIGLVTYSGEAFTKVPVTSDHAVLLEELENLNPLELQP-- 169 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAK 394 TAI + + A + S K I+L+TDG NT +N + AK Sbjct: 170 GTAIGEGLSVAVSHLRHSKAKS-----------KIIILMTDGVNTIENAMPAQVGAQLAK 218 Query: 395 SQGIRIMTIAFSVNKTQQEKARY------------------FLSNCA--SPNSFFEANST 434 S IR+ +I N + L A + +F A S Sbjct: 219 SNDIRVYSIGIGTNGYALMPTQTDIFGDLVFTEVEVKIDEPVLREIAQTTGGKYFRATSN 278 Query: 435 HELNKIFRD 443 L +++ + Sbjct: 279 QSLEEVYEE 287 >gi|56460106|ref|YP_155387.1| von Willebrand factor type A (vWA) domain-containing protein [Idiomarina loihiensis L2TR] gi|56179116|gb|AAV81838.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Idiomarina loihiensis L2TR] Length = 327 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 76/190 (40%), Gaps = 35/190 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + +V+ L+ I ++ D R+G F D ++ + + + + ++ Sbjct: 110 VNRLTMVKHVLSDFIER-REGD------RLGLILFADTAYLQTPMTYDRNTVKQMLNESV 162 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ + V R +++ ++ + +VLLTDG+NT N Sbjct: 163 L--GLVGERTAIGDAIALS-----------VKRFRDDEKSNRVLVLLTDGQNTAGNLPPE 209 Query: 388 AICNKAKSQGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEAN 432 A++ + I IA + ++ + + A + +F A Sbjct: 210 QALELAQAYDVTIYPIAVGAEEVVVDSFFGQRRVNPSRDLDVPLMQSIAKQTGGKYFRAR 269 Query: 433 STHELNKIFR 442 ST+EL +I++ Sbjct: 270 STNELEEIYQ 279 >gi|260901770|ref|ZP_05910165.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|308108909|gb|EFO46449.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|328470487|gb|EGF41398.1| protein BatA [Vibrio parahaemolyticus 10329] Length = 356 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 80/202 (39%), Gaps = 15/202 (7%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ L ++S +K D R+G F D Sbjct: 113 DLSGSMAEPDFTSRTGEKISRLDAAKEVLTEFVQS-RKGD------RLGLVLFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A + ++ ++ + +K Sbjct: 166 TPFTADQKVWLELLNQTDVA--MAGQSTHLGDAIGLAI-KVFEQSDKSRGALEQDQNREK 222 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 ++LTDG +T E I AK++G+R+ IA +T E A + A S Sbjct: 223 VAIVLTDGNDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESG 282 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 FEA + EL+ + D IG Sbjct: 283 GEAFEALNRDELSAAY-DEIGK 303 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 8/82 (9%) Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 G++T+D+ +A+C+ AK +GI + TI F + + L CAS S + E Sbjct: 510 GDSTKDSR-TLAVCDAAKEKGIVVFTIGFEAPW----RGQQVLQQCASSASHYYDVDGLE 564 Query: 437 LNKIFRDRIGNEIFERVIRITK 458 ++ F I + I R +R+T+ Sbjct: 565 ISDAFAS-IASAI--RQLRLTE 583 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 62/402 (15%), Positives = 133/402 (33%), Gaps = 65/402 (16%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F+ + +G +++ +M +GG+ +D+VR L+ A++ A+ Sbjct: 20 FVDELRAFRSDESGVLAKPMIMILVLMFALGGLGMDLVRMERDRTNLQYTLDRAVLAAA- 78 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 ++ P+ + +Y+ ++ ++ EV + ++ Sbjct: 79 -------DLDQPLD-----PEAVVIDYMSKSGLSD-YTTVVVPEVSPTAKRVKASVDTEF 125 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 +A + D + NP + L + + + + A + V I V+D S Sbjct: 126 TAGWMNSIFYEDYMRNPDTYELEPI---TLPLLASSTAVESIGN------VEISLVLDVS 176 Query: 187 RSMLDYQR-----------------DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 SM R ++E ++ P V ++ ++ + DE Sbjct: 177 GSMRSNNRLVNLKRAAKEFVQTMDDNTEDGKMSISIVPYSTQVSMPAAFLDEMRVSDEHS 236 Query: 230 SPYMVSCNKSLYYMLYPG-----PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 ++ + S + S ++ D +++ DA + Sbjct: 237 YSNCINFDGSDFNTTGLNLSREYERTMHFSVWNYYDYRDDDEHVRQPTCASDADNPERTA 296 Query: 285 IKKIDNVNDT-VRMGATFFNDRVISDPSFSWGVHKL---IRTIVKTFAIDENEMGSTAIN 340 + DNV + A ++ D WG L ++ ++ T A D N S Sbjct: 297 LLMSDNVAQLQSYIDAFEHSENTSIDLGMKWGTALLDPSVQPVIATLANDANPNQSIEAR 356 Query: 341 DAMQ-TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 A + +Y + K IV++TDG+NT Sbjct: 357 YANRPVSYQDTETL---------------KVIVMMTDGQNTA 383 >gi|153836342|ref|ZP_01989009.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|149750244|gb|EDM60989.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] Length = 356 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 80/202 (39%), Gaps = 15/202 (7%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ L ++S +K D R+G F D Sbjct: 113 DLSGSMAEPDFTSRTGEKISRLDAAKEVLTEFVQS-RKGD------RLGLVLFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A + ++ ++ + +K Sbjct: 166 TPFTADQKVWLELLNQTDVA--MAGQSTHLGDAIGLAI-KVFEQSDKSRGALEQDQNREK 222 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 ++LTDG +T E I AK++G+R+ IA +T E A + A S Sbjct: 223 VAIVLTDGNDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESG 282 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 FEA + EL+ + D IG Sbjct: 283 GEAFEALNRDELSAAY-DEIGK 303 >gi|28900543|ref|NP_800198.1| hypothetical protein VPA0688 [Vibrio parahaemolyticus RIMD 2210633] gi|260365425|ref|ZP_05777962.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] gi|260877490|ref|ZP_05889845.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|260894838|ref|ZP_05903334.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|28808923|dbj|BAC62031.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308085296|gb|EFO34991.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|308090935|gb|EFO40630.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|308114289|gb|EFO51829.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] Length = 356 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 80/202 (39%), Gaps = 15/202 (7%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ L ++S +K D R+G F D Sbjct: 113 DLSGSMAEPDFTSRTGEKISRLDAAKEVLTEFVQS-RKGD------RLGLVLFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A + ++ ++ + +K Sbjct: 166 TPFTADQKVWLELLNQTDVA--MAGQSTHLGDAIGLAI-KVFEQSDKSRGALEQDQNREK 222 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 ++LTDG +T E I AK++G+R+ IA +T E A + A S Sbjct: 223 VAIVLTDGNDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESG 282 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 FEA + EL+ + D IG Sbjct: 283 GEAFEALNRDELSAAY-DEIGK 303 >gi|311106403|ref|YP_003979256.1| von Willebrand factor type A domain-containing protein 2 [Achromobacter xylosoxidans A8] gi|310761092|gb|ADP16541.1| von Willebrand factor type A domain protein 2 [Achromobacter xylosoxidans A8] Length = 340 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 63/200 (31%), Gaps = 24/200 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S E + + V+ +A I R+G F Sbjct: 97 AIDISQSMETEDFVAPDGRREDRLSGVKAVVADFIDR-------RQDDRLGLIVFGTAAY 149 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + ++ +TAI DA+ A + E + Sbjct: 150 PQAPLT--QDHATLKLLLGQVSTRMAGPNTAIGDAIGVAIKQFEHAGEHDQ--------- 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 ++LLTDG +T + A ++ I + T+ + + E+ F L A Sbjct: 199 --VLILLTDGNDTGSAVPPDRAASMAAARHIVVHTVGIGDPQAEGEEKVDFDALRAIAAK 256 Query: 424 SPNSFFEANSTHELNKIFRD 443 + FF A L +++ + Sbjct: 257 TGGRFFPAQDQASLRQVYAE 276 >gi|255261929|ref|ZP_05341271.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255104264|gb|EET46938.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 478 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 78/472 (16%), Positives = 163/472 (34%), Gaps = 41/472 (8%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + K+ + G F I + + +ML GM +D++R+ + L+ A++ A+ L Sbjct: 23 FVKEFARDEDGAFIIFSLFMFVLMLLTAGMALDLMRYETHRARLQGTLDRAVLAAA-DLD 81 Query: 70 QSLEEVS------SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 Q+L + ++A S +++ L + + T + + Sbjct: 82 QTLSPAAVVTDYFAKAGLSSFLTSTTVDQGLNYRIISAQGNMTMPTTFMRLSGQTELAIR 141 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV- 182 +A + V + L+++ R+ + + + +TV E +S+ + Sbjct: 142 GDATAEERVSNVEISLVVDISGSMGRNNKLSTLRTASHTFIDTVI-RPETEDLISLNIIP 200 Query: 183 ----IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 ++ + D + + + + V R K + Sbjct: 201 YTAQVNAGPDIFDQLTVDQKHNFSHCIDFEPADFNTAALDVPPVSTRTYKQMQHFQYGWS 260 Query: 239 SLYYMLYPGPLDPSLSEEHFV-DSSSLRHVIKKKHLVRDALASVIRS--IKKIDNVNDTV 295 S Y P+ F D++SL+ + + + + +D + Sbjct: 261 SSYVNNPGCPMQSYERIVPFSQDATSLKSTVTSLRARANTAIHLGMKWGVSMLDPTFRPI 320 Query: 296 RMGATFFNDRVISDP---SFSWGVHKLIRTI--------VKTFAIDENEMGSTAINDAMQ 344 N++V + ++ + ++TI V T+ I +E ST A Sbjct: 321 VTAMIA-NNKVDPEFAGRPVAYNDPETLKTIVLMTDGQNVDTYRIS-DEFYSTPSQIAHW 378 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 Y +N + N KK+ TQ + +IC+ AK++GI + TI Sbjct: 379 DRYQLFFFTNNYIDRDIDQNYYYKKFT-------ATQADTMLQSICDAAKAEGILVWTIG 431 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 F V+ + CAS S F EL++ F I +I + + + Sbjct: 432 FEVSNHAAGEMLD----CASSPSHFFRVEGVELSEAFAS-IARQINQLRLVL 478 >gi|332879552|ref|ZP_08447247.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682518|gb|EGJ55420.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 332 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 60/189 (31%), Gaps = 35/189 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDEN 332 +A V ++ +G T F + + + L I A Sbjct: 111 EAAKQVASEFIAGRPNDN---IGLTIFAAEAFTQCPMTTDHAVLLNLFHGIKTDMAQRGM 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 TA+ + A + K++ K I+LLTDG N + + Sbjct: 168 IQDGTAVGMGIANAVSRL-----------KDSKAKSKVIILLTDGTNNAGDISPLTAAEI 216 Query: 393 AKSQGIRIMTIAFSVNKTQQ------EKARYF----------LSNCA--SPNSFFEANST 434 AKS GIR+ TI N +Y L+ A + F+ A Sbjct: 217 AKSFGIRVYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATDN 276 Query: 435 HELNKIFRD 443 +L +++D Sbjct: 277 KKLEDVYKD 285 >gi|126731725|ref|ZP_01747530.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707891|gb|EBA06952.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 321 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 58/184 (31%), Gaps = 29/184 (15%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I + V + + + D R+ F F++ + R I Sbjct: 113 QPITRLEAV-TTVGAEFARRRAGD------RVALIVFGSEAYYAAPFTFDTEAIARRI-- 163 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A +T I+DA+ A + S+ D + ++LL+DG N Sbjct: 164 EEATIGISGRATNISDALGLALKRMAGSDADT-----------RVVILLSDGANNAGATN 212 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKA-------RYFLSNCA--SPNSFFEANSTHE 436 + A G+R+ TIA + + L + S F +T + Sbjct: 213 PRGVAQLAAQMGVRVHTIAMGPKSVDEAEEGERGVVDAETLDAISKVSGGETFRVRTTED 272 Query: 437 LNKI 440 L + Sbjct: 273 LIAV 276 >gi|78189841|ref|YP_380179.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172040|gb|ABB29136.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 329 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 64/183 (34%), Gaps = 27/183 (14%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S + + R A +V+ + D R+G F + + H ++ Sbjct: 112 SQTDTQSRFEIARQAARNVVEQ-RSND------RIGLVVFRGEAYTLSPLTRD-HTVLSL 163 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ + + TAI A+ A + + +S + ++LLTDGEN Sbjct: 164 LLDNLSSRIIQDDGTAIGSALLVALNRLQASESELQ-----------MVILLTDGENNAG 212 Query: 383 NEEGIAICNKAKSQGIRIM--TIAFSVNKTQQEKARYF----LSNCA--SPNSFFEANST 434 + A +G+R +AF K + L A + S+F N+ Sbjct: 213 EVSPLTAAALAARRGVRFYVLNVAFESVKDENAPRSALYAAELQEVARRTGGSYFTVNNK 272 Query: 435 HEL 437 EL Sbjct: 273 TEL 275 >gi|254229828|ref|ZP_04923234.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262395606|ref|YP_003287459.1| protein BatA [Vibrio sp. Ex25] gi|151937664|gb|EDN56516.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262339200|gb|ACY52994.1| protein BatA [Vibrio sp. Ex25] Length = 356 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 80/202 (39%), Gaps = 15/202 (7%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ L ++S +K D R+G F D Sbjct: 113 DLSGSMAEPDFTSRTGEKISRLDAAKEVLTEFVQS-RKGD------RLGLVLFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A + ++ ++ + +K Sbjct: 166 TPFTVDQKVWLELLNQTDVA--MAGQSTHLGDAIGLAI-KVFEQSDKSRGALEQDQNREK 222 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 ++LTDG +T E I AK++G+R+ IA +T E A + A S Sbjct: 223 VAIVLTDGNDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESG 282 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 FEA + EL+ + D IG Sbjct: 283 GEAFEALNRDELSAAY-DEIGK 303 >gi|126731955|ref|ZP_01747758.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707487|gb|EBA06550.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 318 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 55/153 (35%), Gaps = 21/153 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F DR + V + R I +TAI D + A + Sbjct: 132 RVGLVVFGDRAYVAAPQTHDVASVARLIDGLQI--GVSGKATAIADGLGLAIRRL----- 184 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS-VNKTQQEK 414 K + I+LL+DG++T + +A A+ G+R+ TIA + + Sbjct: 185 -RERDAK-----SRVILLLSDGQDTTGMVDPVAAAQTARDLGMRVYTIALGPADLSDDPG 238 Query: 415 AR-----YFLSNCASP--NSFFEANSTHELNKI 440 AR L A F +T +L + Sbjct: 239 ARDAVDADTLRRIAQAAGGETFRVRTTDDLQAV 271 >gi|304393172|ref|ZP_07375100.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] gi|303294179|gb|EFL88551.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] Length = 692 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 59/188 (31%), Gaps = 57/188 (30%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + + G+T + + + T+ R + + KK ++++TDG NT Sbjct: 506 KNTTQAKLTSMQASGATNVQMGVAWGWRTLSPGEPFTEGRPYDAEDNKKIMIIMTDGNNT 565 Query: 381 QD-------------------------------------------------NEEGIAICN 391 +E C Sbjct: 566 YYPTNIYGNQYAQDNKSFYGGHGHSVKGRIFDGYDGEANPGHNSQTFTKAMDEHLTETCT 625 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-----FFEANSTHELNKIFRDRIG 446 AK+ GI I +IAF V + L +CAS + +F+AN+ L F ++I Sbjct: 626 NAKNAGITIYSIAFDVPNGS--SVKATLEDCASSDVGGGKLYFDANNNAALIDTF-EKIA 682 Query: 447 NEIFERVI 454 + + I Sbjct: 683 ERLADLRI 690 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 82/246 (33%), Gaps = 13/246 (5%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+ G+ F+ TAL +PVML G D + A I A+ L + Sbjct: 24 KRFRDDERGNVFVFTALSLPVMLMAIGAGADYAELYRARVNFQSAVDAGAIAAAKNLAAT 83 Query: 72 LEEVSSRAKNSFTFPKQ--KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + +S+ F + E +R + N D V+ ++ + + S Sbjct: 84 GQVQTSKDIGEEVFRSNLSHLGEKAVREGQINFDMGDGDCAVQGVITTATLPHDRFFSLS 143 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V S + N + G + +++ + E + + + I V+D S SM Sbjct: 144 FVDQSQQKGFGANKI-----VKGQEEFILSASSTVECGNDT------IEIALVLDNSGSM 192 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + + T +++ + + + + + N++ +M G Sbjct: 193 RWNGKIGTLRQASNSLVETLHTTMGSANKAIQFSVVPFAATVNVGTNNRNEPWMDTQGRS 252 Query: 250 DPSLSE 255 Sbjct: 253 STHWEM 258 >gi|149199796|ref|ZP_01876826.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] gi|149137084|gb|EDM25507.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] Length = 333 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 75/205 (36%), Gaps = 26/205 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S E + + + V++ L+ + ++ D R+G FF Sbjct: 98 AVDLSGSMETKDFKNKSGENVTRLDSVKEVLSEFLAE-REGD------RVGLVFFGSAAF 150 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + + + T + DA+ + +I +E E Sbjct: 151 IQMPFTEDLEICQELMDEAQVRMAGP--QTMLGDAIGLSI-SIFDQSELE---------- 197 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKARYFLSNCAS- 424 K ++LLTDG +T A+ +GI I T+A +Q L + +S Sbjct: 198 DKVLILLTDGNDTGSLVAPEKAAQIARDKGIVIHTVAVGDPAAAGEQALDEATLRSISSL 257 Query: 425 -PNSFFEANSTHELNKIFR--DRIG 446 ++ A + EL I+ D+IG Sbjct: 258 TKGKYYWAGNREELAGIYDEIDKIG 282 >gi|293361343|ref|XP_236593.5| PREDICTED: collagen type VI alpha 6 [Rattus norvegicus] Length = 2264 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 72/187 (38%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS ++ ++++D +I +KK D + VR GA + D Sbjct: 810 VFVIDSSGSIDYQEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYL--DE 864 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ D+ G+T +A+ + + +H+ + ++++TD Sbjct: 865 LGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----QVLIVITD 919 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE + D E+ A + +GI ++ + + L+ S + ++ + Sbjct: 920 GE-SHDAEKLNATAKALRDKGILVLAVGIAG-----ANTWELLAMAGSSDKYYFVETFGG 973 Query: 437 LNKIFRD 443 L IF D Sbjct: 974 LKGIFSD 980 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 42/150 (28%), Gaps = 19/150 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D V++G F+ + I + +T A+ Sbjct: 655 ADRVQIGVVQFSHENREEFQL--NTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFS 712 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++L+TDGE ++ + G+ I ++ + Sbjct: 713 PEKGARP-------NVRKFLILITDGEAQDIVKDP---AVALRKDGVIIYSVGVFGSNVT 762 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKI 440 Q L + P F + L I Sbjct: 763 Q------LEEISGKPEMVFYVENFDILQHI 786 Score = 43.0 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 57/165 (34%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 A + S+ +D + +R+G +++ S S GV+K +++ +G Sbjct: 249 AFLGESVSALDIKENCMRVGLVAYSNETRVISSLSMGVNKTE--VLQRIQDLSPHVGQAY 306 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ + S+ + N + VL+T + ++ + +G+ Sbjct: 307 TGAALRKTRKEVFSAQ----RGSRKNQGVPQIAVLVT---HRASDDNVTKAAVNLRREGV 359 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + T+ +Q L AS + A F + Sbjct: 360 TVFTMGVEGANPEQ------LEKIAS----YPAEQFTSKLSNFSE 394 >gi|293349448|ref|XP_002727144.1| PREDICTED: collagen type VI alpha 6-like [Rattus norvegicus] Length = 2264 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 72/187 (38%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS ++ ++++D +I +KK D + VR GA + D Sbjct: 810 VFVIDSSGSIDYQEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYL--DE 864 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ D+ G+T +A+ + + +H+ + ++++TD Sbjct: 865 LGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----QVLIVITD 919 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE + D E+ A + +GI ++ + + L+ S + ++ + Sbjct: 920 GE-SHDAEKLNATAKALRDKGILVLAVGIAG-----ANTWELLAMAGSSDKYYFVETFGG 973 Query: 437 LNKIFRD 443 L IF D Sbjct: 974 LKGIFSD 980 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 42/150 (28%), Gaps = 19/150 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D V++G F+ + I + +T A+ Sbjct: 655 ADRVQIGVVQFSHENREEFQL--NTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFS 712 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++L+TDGE ++ + G+ I ++ + Sbjct: 713 PEKGARP-------NVRKFLILITDGEAQDIVKDP---AVALRKDGVIIYSVGVFGSNVT 762 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKI 440 Q L + P F + L I Sbjct: 763 Q------LEEISGKPEMVFYVENFDILQHI 786 Score = 43.0 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 57/165 (34%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 A + S+ +D + +R+G +++ S S GV+K +++ +G Sbjct: 249 AFLGESVSALDIKENCMRVGLVAYSNETRVISSLSMGVNKTE--VLQRIQDLSPHVGQAY 306 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ + S+ + N + VL+T + ++ + +G+ Sbjct: 307 TGAALRKTRKEVFSAQ----RGSRKNQGVPQIAVLVT---HRASDDNVTKAAVNLRREGV 359 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + T+ +Q L AS + A F + Sbjct: 360 TVFTMGVEGANPEQ------LEKIAS----YPAEQFTSKLSNFSE 394 >gi|255037594|ref|YP_003088215.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254950350|gb|ACT95050.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 339 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 53/160 (33%), Gaps = 24/160 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F +S + L + + TAI A+ A + + + Sbjct: 143 RIGLIVFAGEAVSLCPLTTDYELLYGFLDEVTP-SLIPTPGTAIGSALAVAVNRMRDTAG 201 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK- 414 + K +L++DG+NT N A + G+++ TI+ K+ + Sbjct: 202 ES-----------KVAILISDGDNTSGNLGPTTSAQLANAFGVKVYTISVGKPKSASKAD 250 Query: 415 ---------ARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 L N A +F A L +F+ Sbjct: 251 TTASAGALMDEGELQNIAGIGNGKYFRATDNTALESVFKQ 290 >gi|148253748|ref|YP_001238333.1| hypothetical protein BBta_2249 [Bradyrhizobium sp. BTAi1] gi|146405921|gb|ABQ34427.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 432 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 59/490 (12%), Positives = 141/490 (28%), Gaps = 104/490 (21%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M+ +++ + ++ +G+ II AL + +L G +D + L + A Sbjct: 1 MLSGHRYLGLLSRFRRNESGNIAIIFALALLPILTFVGSAIDYSMAVRAKAKLSASLDAA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ + + ++ ++ + T + V D+ Sbjct: 61 MLA-ATGYTAMRGTAADAKTSATNMYNGQMSSH-----------KLTSNSLNITVTDSVT 108 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 +A VV ++ F+ G + + + A +Y + Sbjct: 109 ARTVTGTASVVVNTA-----------FMYMFGFPTMTVTASSSASASFPTY-----MDFY 152 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 ++D S S ++ L Sbjct: 153 VLVDNSPSQGLGATTADMTTLQNATS----------------------------DKCAFA 184 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + Y +L + + + V + +VR A S+ + V++ RM Sbjct: 185 CHDTYTSSTKKTLQTNSYYQIAKNKGVTMRIDVVRSATQSLTDTATSSQVVSNQYRMAVY 244 Query: 301 FFND-----RVISDPSFSWGVHKLIRTIVKTFAIDENEMG-----STAINDAMQTAYDTI 350 + + S S + + ++ + G T + AM Sbjct: 245 SLGSDCGSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGAMS------ 298 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG----------------IAICNKAK 394 + ++ +K++ ++DG C+ K Sbjct: 299 GMNGVIPAQGDGSSTSPQKWLFFVSDGVADYSYPTTCSKTVLSGGRCQEPLNTTTCDTLK 358 Query: 395 SQGIRI---MTIAFSVNKTQ---------QEKARYFLSNCASPNSFFEANSTHE----LN 438 ++GI+I T ++ ++ + +CASP ++E +S+ L Sbjct: 359 ARGIKIAVLYTTYLAITNNSWYTTYIAPWRDSISGIMKSCASPGYYYEVDSSGSIGAALT 418 Query: 439 KIFRDRIGNE 448 +F+ I + Sbjct: 419 ALFQQAIASA 428 >gi|260463262|ref|ZP_05811463.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030852|gb|EEW32127.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 644 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 77/255 (30%), Gaps = 24/255 (9%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +K G++ ++T L M ++G + +D A A I + ++ Sbjct: 3 RKFANDRRGNYALMTVLAMVPLMGALAIGIDYTEMVRERQNALNALDAAGIATAQQIVAG 62 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + A Y FE NL + T N ++ Sbjct: 63 ATDAEAIA-------------YAKNFFEANLAHIDPANTTLAV---TLPNNNTGGGTLKL 106 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + Y P + L G S T A ++ + + V+D S SM + Sbjct: 107 CGTLTYKPYFLPTAKIL--AGGTSGNATTM--AFNTCSEVRLKNTLEVSLVLDNSGSMKE 162 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + S + A + V + Q + + + +V S+ P + Sbjct: 163 LGKGSNKVRFDLLKDAAKQLVDQLAGQAQLMKQVSKPVQFSLVPFAASV----NVDPGNA 218 Query: 252 SLSEEHFVDSSSLRH 266 S + S + H Sbjct: 219 SAAWMDTTGISPIHH 233 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 53/162 (32%), Gaps = 14/162 (8%) Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + VKT G+T + + M + T+ S+ R + K Sbjct: 415 KPLTDVSTTAGASAVKTAIDAMAADGATNVPEGMAWGWRTLSSTAPFTEGRPETERGNDK 474 Query: 370 YIVLLTDGENTQDNEEGI-----------AICNKAKSQGIRIMTIAFSVNKT-QQEKARY 417 +++LTDG NT + + N + + + R Sbjct: 475 VLIVLTDGANTYYTPDSVIAQTYSGTNYNYGANDLAGNKAIYSALGYVTPYSNGYSYGRM 534 Query: 418 FLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 FL +S + + AN T +N+ F N V+ +T Sbjct: 535 FLGTSSSVIKSDYSNANYTKAMNEHFTTLCNNAKAANVMVMT 576 >gi|261415414|ref|YP_003249097.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371870|gb|ACX74615.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325633|gb|ADL24834.1| BatA protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 367 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 58/175 (33%), Gaps = 38/175 (21%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKT--FAIDENEMGSTAINDAMQTAYDTIISS 353 R+G + F R + + L+ + + A D TAI D + A + Sbjct: 157 RIGLSAFGARSFTQCPLTMDYGSLLEILKASDDLARDTLVNNRTAIGDGLMNALARL--- 213 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-- 411 K + + ++LLTDG + + AKS G+++ T+ + Sbjct: 214 --------KMSDAKSRVVILLTDGRDNASVVPPVRAAEVAKSLGVKVYTVGVGKKSGKIL 265 Query: 412 -------------------QEKA--RYFLSNCASP--NSFFEANSTHELNKIFRD 443 E+ L AS F+ A + EL KI+ + Sbjct: 266 AFQQNPWTGEISWGERDITPEEGIDEDVLKAIASKTGGRFYRAENKAELEKIYSE 320 >gi|167763116|ref|ZP_02435243.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] gi|167699456|gb|EDS16035.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] Length = 327 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 63/186 (33%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ G T F + + H ++ ++K E G Sbjct: 111 EAAKDVAAEFINGRPNDNV---GITLFAGESFTQCPLTVD-HAVLLNLIKDVKCGLIEDG 166 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 M A + V R+K++ K I+LLTDG N + + + AKS Sbjct: 167 ---TAVGMGIA---------NAVTRLKDSKAKSKVIILLTDGTNNRGDISPLTAAEIAKS 214 Query: 396 QGIRIMTIAF----SVNKTQQEKA------------RYFLSNCA--SPNSFFEANSTHEL 437 GIR+ TI L+ A + ++F A S +L Sbjct: 215 FGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKL 274 Query: 438 NKIFRD 443 +++ + Sbjct: 275 KEVYEE 280 >gi|194221587|ref|XP_001495285.2| PREDICTED: similar to collagen type VI alpha 6 [Equus caballus] Length = 2301 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 70/188 (37%), Gaps = 18/188 (9%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + ++++D + +KK D + VR GA + D Sbjct: 845 VFVIDSSGSIDYDEYNIMKD---FMTDLVKKADVGKNQVRFGALKYADDPEVLFYL--DT 899 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ D+ G+T +A+ + + + R + ++++TD Sbjct: 900 LGTKWEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGSRLQRGVP-----QVLIVITD 954 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANSTH 435 GE + D ++ A + +GI I+ + N + L+ S + +F + Sbjct: 955 GE-SHDADKLNATAKALRDKGILILAVGIAGANPVE------LLAMAGSSDKYFFVETFG 1007 Query: 436 ELNKIFRD 443 L IF D Sbjct: 1008 GLKGIFSD 1015 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 57/171 (33%), Gaps = 16/171 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + A + ++ D + VR+GA F+ + G I + Sbjct: 1051 KKMKAFLASVVQDFDVTVNRVRIGAAQFSHNYQPEFPL--GTFTGEEEISLQIEKIQQIF 1108 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I A++ D R+ + +++LTDG Q +E + Sbjct: 1109 GYTHIGAALRRVGRYF---RPDMGSRINAGTP--QVLLVLTDG---QSQDEVAQAAEDLR 1160 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 +GI I ++ + + + + + EL K+ + RI Sbjct: 1161 RKGINIYSVGI-----GDVDDQQLVQITGTADKKLTVHDFDELRKV-KKRI 1205 >gi|153806291|ref|ZP_01958959.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] gi|149130968|gb|EDM22174.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] Length = 327 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L+ A + ++F A S +L +++ + Sbjct: 235 PVGNTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEE 280 >gi|160889563|ref|ZP_02070566.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|317480055|ref|ZP_07939167.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861080|gb|EDO54511.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|316903797|gb|EFV25639.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 327 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 64/186 (34%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H ++ ++K E G Sbjct: 111 EAAKDVAAEFINGRPNDN---IGITLFAGESFTQCPLTVD-HAVLLNLLKDMKCGLIEDG 166 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + A + K++ K I+LLTDG N + + + AKS Sbjct: 167 -TAIGMGIANAVTRL-----------KDSKAKSKVIILLTDGVNNKGDISPLTAAEIAKS 214 Query: 396 QGIRIMTIAF----SVNKTQQEKA------------RYFLSNCA--SPNSFFEANSTHEL 437 GIR+ TI L+ A + ++F A S +L Sbjct: 215 FGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFRATSNSKL 274 Query: 438 NKIFRD 443 +++ + Sbjct: 275 KEVYEE 280 >gi|312883763|ref|ZP_07743482.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] gi|309368512|gb|EFP96045.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] Length = 396 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 64/390 (16%), Positives = 127/390 (32%), Gaps = 46/390 (11%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 I AS +I ++S+RA + I ++ L +++ D I + Sbjct: 27 IAAASTIVIGYQVQLSNRAMQAADAA--SIACEFKGEYDQALTQSYLDYYQPKIDKVRG- 83 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS-- 178 Q+ +S ++ L SL + + ++Y E VS Sbjct: 84 ---------QIRTNSGCNMSL-GYSLSTIFTSLTLSDTSFVVSSTANEKAYVTEDVVSDP 133 Query: 179 --IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + V+D S SM D + ++ + S + KV I + + Sbjct: 134 LELVIVLDISTSMYGAINDLKAILKRGIVSLKEQQNNAQSEDHIKVSIIPFSTGVSVNNA 193 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 L+ + + + S + + + S+K Sbjct: 194 PW----------LNDARTFCVDGTTESEDKFYAARTVANLDITHDQISVKLSQPNKWRES 243 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 A F + + ++ T+ G TA + + + + Sbjct: 244 CSAASFT------LPLTADLDQVTNTVDSLR-----TEGGTASYQGLIWGLRQLTPNWQK 292 Query: 357 --EVHRMKNNLEAKKYIVLLTDGENTQ---DNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 EV +N + ++ +VL+TDG + D+ +C++AK GI + + F VN + Sbjct: 293 AWEVGPNRNVDKVERKLVLMTDGNDYGRYFDDLINAGLCDRAKDYGIALNFVGFGVNGS- 351 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIF 441 + F P F A+ T +L+ F Sbjct: 352 --RLEQFTRCAVDPKGVFSASDTQDLDHYF 379 >gi|332232505|ref|XP_003265445.1| PREDICTED: collagen alpha-5(VI) chain [Nomascus leucogenys] Length = 2526 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 66/157 (42%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRMRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + N K+ ++++TDG+ + D+++ ++ + +GI I + Sbjct: 899 HA-NALF----TEEHGSRINQNVKQMLIVITDGK-SHDHDQLNDTASELRDKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + N+ ++ +L +F Sbjct: 953 V-----GKANQKELEGMAGNKNNAIYVDNFDKLKDVF 984 >gi|116625272|ref|YP_827428.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228434|gb|ABJ87143.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 323 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 79/227 (34%), Gaps = 33/227 (14%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 +KS + D LS D S +K R A++ K + ++ Sbjct: 80 DKSPQEISQFSSEDAPLSVGVVFDCSGSMG--QKLDKSRQAVSQFF---KLANPEDEFF- 133 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 FND F+ + ++ + G TA+ DA+ A Sbjct: 134 --LVQFNDSASLIQPFTRNLEEIQNHL-----AFTQSKGRTALLDAVYLA---------- 176 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-- 414 +H MK +K ++L++DG + I N K ++I I + + + Sbjct: 177 -LHEMKKAKNPRKALLLISDGGDNSSRYTEPEIKNLVKEADVQIYAIGIYESAAGRGRTP 235 Query: 415 ----ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L+ A + ++ ++ +EL + +IG E+ + I Sbjct: 236 EESSGPALLTEIAEQTGGRQYQVDNLNELPDV-AAKIGVELRNQYIL 281 >gi|126657060|ref|ZP_01728231.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] gi|126621603|gb|EAZ92313.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] Length = 328 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 66/196 (33%), Gaps = 24/196 (12%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S E + I + V+ + I ++ D R+G F + Sbjct: 101 DLSQSMEAKDFQDQQGNKIDRLEAVKLVVDDFIER-REGD------RIGLILFGTKAYLQ 153 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + R ++ I T + DA+ A T S + + Sbjct: 154 VPFTQDLE-TARFLLDEAQIGM-AGAQTMLGDAIGLAIQTFEDSKTEN-----------R 200 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA--RYFLSNCA--SP 425 ++LLTDG +T A + I I TIA +T+ + L A + Sbjct: 201 VLILLTDGNDTGSQVPPDQAAKFAAQRNIVIYTIAIGNPETEGTEKIDEETLQLIADQTG 260 Query: 426 NSFFEANSTHELNKIF 441 FF + L +I+ Sbjct: 261 GQFFRGSDRQGLIQIY 276 >gi|288941617|ref|YP_003443857.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288896989|gb|ADC62825.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 341 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 55/164 (33%), Gaps = 30/164 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R ++ + +++ + TAI DA+ A + E Sbjct: 136 RLGLILFGTRAYLQTPLTFD-GATVAAMLRDSVVGLAGRE-TAIGDAIGLAVKRLREQPE 193 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK- 414 ++ ++LLTDG+NT + + A G+R+ TI + Sbjct: 194 -----------GQRVLILLTDGDNTAGALDPLEAAELAAQAGVRVYTIGIGGGELGVRSL 242 Query: 415 -ARYFLSNCASP--------------NSFFEANSTHELNKIFRD 443 L AS F A+S +L ++ + Sbjct: 243 FGMRLLRQ-ASDFDPATLERIAEITGGRAFTADSRQQLEAVYDE 285 >gi|296272313|ref|YP_003654944.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296096487|gb|ADG92437.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 301 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 36/147 (24%), Positives = 59/147 (40%), Gaps = 21/147 (14%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F V++ S+ + + I+K I TA+ D++ T+ + + Sbjct: 130 IGLVVFGTSVLTASPLSFDKNS-QKEIIKYIDIGI-VGEQTAMFDSLATSINIL------ 181 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 KN+ I+LLTDGE+ I AK I+I TI E R Sbjct: 182 -----KNSKAKSNIIILLTDGEDNASKIPPQIILKLAKKYKIKIYTIGIG------ESNR 230 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIF 441 LS + + F ANS +L +++ Sbjct: 231 QMLSTISQETGAKSFLANSKDDLVEVY 257 >gi|291514852|emb|CBK64062.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 341 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 51/143 (35%), Gaps = 19/143 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + A+ + +++ R+G F + +++ R + Sbjct: 111 RLERTKYAIGKLFEGLQQD-------RVGLVVFAGEPKVQLPITSD-YRMARAFARRIDP 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI A++ A E + I+L+TDGEN D+ IA+ Sbjct: 163 SLVSVQGTAIGKALEQALLAFSGDTE---------QSHGRVIILITDGENHDDDA--IAV 211 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ 412 +A G++I TI + Sbjct: 212 AERAAQMGVKIFTIGIGTPEGAP 234 >gi|161788949|dbj|BAF95091.1| double von Willebrand factor A domains [Mus musculus] Length = 2309 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 36/245 (14%), Positives = 77/245 (31%), Gaps = 17/245 (6%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + ++ + + G+ C+ Sbjct: 157 ALRRAGILVYAIGVKDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTLAK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGA 299 Y P+ SE D L L + + + +D +D V++G Sbjct: 217 AAQYTEQESPACSEASPADIVFLVDSSTSIGLQNFQKVKHFLHSVVLGLDVRSDQVQVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++D + L ++ MG T+ A++ + S E Sbjct: 277 VQYSDNIYPAFPLK--QSSLKSAVLDRIRNLPYSMGGTSTGSALE--FIRANSLTEMSGS 332 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 R K+ + + +VL+TDGE ++E + ++ K G+ + + ++ Q+ L Sbjct: 333 RAKDGVP--QIVVLVTDGE---SSDEVQDVADQLKRDGVFVFVVGINIQDVQE------L 381 Query: 420 SNCAS 424 AS Sbjct: 382 QKIAS 386 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 55/155 (35%), Gaps = 21/155 (13%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D VR G ++D++IS + + + + + E G T A+ + Sbjct: 884 DRVRFGVVQYSDKIISQFFLT--QYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQN 941 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + ++ +Y++++TDG++T + G+ I I T + Sbjct: 942 TAR---------VDVARYLIVITDGQSTDP---VAEAAQGLRDIGVNIYAIGVRDANTTE 989 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 L AS FF L I ++ I + Sbjct: 990 ------LEEIASKKMFF-IYEFDSLKSIHQEVIRD 1017 >gi|269926132|ref|YP_003322755.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] gi|269789792|gb|ACZ41933.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] Length = 643 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 78/259 (30%), Gaps = 33/259 (12%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV---SCNKSLYYMLYP 246 + + V + S+ N K + + + + + +YP Sbjct: 30 ANSGNTVRVSIREVSTTSQPKIVMTLSANNSKGLPVTDLSADDFIVKENGKEQSDIAVYP 89 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 +P + +S +DA +I + D ++G F+ Sbjct: 90 FYQNPDPIDVVLALDTSASMNDDAFTAAQDAAYGLINGLSPED------KVGLITFDKTA 143 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + ++ +I K TA+ + A + + Sbjct: 144 RVIEPLAQDHARVQESIQKLSRSV-----GTALYQGLSLAAQEVAKGQNTKA-------- 190 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP- 425 IVL+TDG NT N KA+ G + T+ F Q L A+ Sbjct: 191 ----IVLMTDGFNTSRNTTLEEAVAKAQEVGASVFTVGFGKKVDTQG-----LQKIANET 241 Query: 426 -NSFFEANSTHELNKIFRD 443 +F A + +L ++F D Sbjct: 242 GGEYFSAPTNAQLRRVFAD 260 >gi|189485266|ref|YP_001956207.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287225|dbj|BAG13746.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 333 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 56/194 (28%), Gaps = 39/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + ++ R+G F+ + + L I I Sbjct: 112 RMEAAKKVIRDFMKE-------RKYDRIGLVIFSGLAFTQCPLTTDKDSLAEFINN-INI 163 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + + TAI A+ T+ + + K++ + I+L+TDG N + + Sbjct: 164 GDTGLDGTAIGSAIMTSVNRL-----------KDSRAKSRIIILVTDGNNNMGEIDPLTA 212 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARY------------------FLSNCA--SPNSFF 429 A+S I+I + + L A + +F Sbjct: 213 SKIARSYDIKIYAVGVGSLDGAIYEVDDPFLGKREIKYRKDAINESVLKEVAYNTSGGYF 272 Query: 430 EANSTHELNKIFRD 443 A I + Sbjct: 273 RAQDVKSFENIMKQ 286 >gi|270296687|ref|ZP_06202886.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270272674|gb|EFA18537.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 327 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 64/186 (34%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + H ++ ++K E G Sbjct: 111 EAAKDVAAEFINGRPNDN---IGITLFAGESFTQCPLTVD-HAVLLNLLKDMKCGLIEDG 166 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + A + K++ K I+LLTDG N + + + AKS Sbjct: 167 -TAIGMGIANAVTRL-----------KDSKAKSKVIILLTDGVNNKGDISPLTAAEIAKS 214 Query: 396 QGIRIMTIAF----SVNKTQQEKA------------RYFLSNCA--SPNSFFEANSTHEL 437 GIR+ TI L+ A + ++F A S +L Sbjct: 215 FGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFRATSNSKL 274 Query: 438 NKIFRD 443 +++ + Sbjct: 275 KEVYEE 280 >gi|218261917|ref|ZP_03476585.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] gi|218223693|gb|EEC96343.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] Length = 328 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 63/221 (28%), Gaps = 38/221 (17%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + +D L+ + + + +D ++ I N N +G Sbjct: 81 WQNSSTEGIDIVLAMDISTSMMAQDLKPNRLEASKDVASAF---INGRPNDN----IGLV 133 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F + + + + TAI + A I Sbjct: 134 VFAAESFTQCPLT--TDHTVLLNLFKDVQPGIIQDGTAIGLGLANAVSRI---------- 181 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY--- 417 K++ K I+LLTDG N Q + AK+ G+R+ TI + Sbjct: 182 -KDSQAKSKVIILLTDGVNNQGEIAPVTAAEIAKTFGVRVYTIGVGTQGKAPYPFQTAFG 240 Query: 418 -------------FLSNCA--SPNSFFEANSTHELNKIFRD 443 L A + +F A L +I+ + Sbjct: 241 VQYMDVDVEIDEPTLKQIAATTGGQYFRATDNASLKEIYSE 281 >gi|118443684|ref|YP_877685.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] gi|118134140|gb|ABK61184.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] Length = 1252 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 57/310 (18%), Positives = 106/310 (34%), Gaps = 63/310 (20%) Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I V+D S SM NC + + V I+D+ Sbjct: 95 IVLVMDTSTSMKCLVEPESYDIDNCVP----------TKEGHIVYIKDKSYLVNTTFLRG 144 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S + + Y + + + + +K+ ++ A S I+ +KK +N + + +G Sbjct: 145 SRHKLFYITMGTTNYYIQG--NKCYRQSSYNEKNRLKHAQESAIKFVKKFENDKN-ISIG 201 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ R I + + ++ +I G+T I +++A + NED Sbjct: 202 LVSFDTRAIEQKELTSSLSEVKSSINNL---KVAYNGATNIEAGLKSAQKILKKGNEDA- 257 Query: 359 HRMKNNLEAKKYIVLLTDG----------------------ENTQDNEE----------- 385 KY++L++DG +NT N Sbjct: 258 ---------DKYVILMSDGFPTAFDYAGEKFEENFNEHEVQDNTFINFGYNDYRGYAMKH 308 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I + K GI I FS + EK A+ + EA +T LN + ++I Sbjct: 309 SINQADSLKKVGINSFIIGFS-DGANSEKLNKIAK--AAGGEYEEARNTDALNGAY-NKI 364 Query: 446 GNEIFERVIR 455 ++ +I+ Sbjct: 365 ETKVKAPLIK 374 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 40/133 (30%), Gaps = 18/133 (13%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV--HKLIRTI 323 + V+ + K ++ + ++ + S V Sbjct: 721 QDRSRIDSVKKVANDFVDKFK----DDENTEIAIVRYSSKADVVLDNSNKVFLSSKDNET 776 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 +K +T I D ++ +Y + ++D KY++L+TDG T Sbjct: 777 IKKRINSLKADVATNIGDGIRKSYSILDKCDKDSE----------KYMILMTDGVPTAYT 826 Query: 384 --EEGIAICNKAK 394 I N K Sbjct: 827 CYANTIKTLNNCK 839 >gi|333030669|ref|ZP_08458730.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741266|gb|EGJ71748.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 328 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 24/108 (22%), Positives = 41/108 (37%), Gaps = 19/108 (17%) Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + V R+K++ K I+LLTDG N + + AK+ GIR+ T+ T Sbjct: 174 SNAVARLKDSQAKSKVIILLTDGSNNAGDISPLTSAEIAKTYGIRVYTVGIGTRGTAPYP 233 Query: 415 ARYF-----------------LSNCA--SPNSFFEANSTHELNKIFRD 443 + L + A + +F A L I+++ Sbjct: 234 IQTMTGAIQRIQVEVDIDEPTLKDIARTTGGVYFRATDNTSLQDIYQE 281 >gi|328541712|ref|YP_004301821.1| hypothetical protein SL003B_0088 [polymorphum gilvum SL003B-26A1] gi|326411464|gb|ADZ68527.1| hypothetical protein SL003B_0088 [Polymorphum gilvum SL003B-26A1] Length = 454 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 69/474 (14%), Positives = 141/474 (29%), Gaps = 80/474 (16%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + + +L+ +M+ +GG +D R ++ A A++ + L S+ Sbjct: 18 RDARASILPMVGVLVALMVVIGGAGLDYGRAIMLRASISHALDAAVLAVARQLSVSIMTD 77 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 S + I++ N + T ++ ++ A ++ +A Sbjct: 78 SE--------LDKAIKDAFAANMASAGLSGATLGDLTYVLDPDAGTISATATAL------ 123 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + + +G ++ I A+A + V + V+D + SM + Sbjct: 124 -----VPTYFIHVGGLGPENVAIAASADAT------YSRFDVELAMVVDVTGSMRNSMAS 172 Query: 196 SEGQPLNCFGQPADRTVKSYSSQ-----------------NGKVGIRDEKLSPYMVSCNK 238 + K +S+ KV D + Sbjct: 173 LRTAAQSVVDILIPDGTKKSASKVRIALVPYSQGVNLGEYAPKVSNGDAGTQNCVTERMG 232 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + Y + + S E F S+ + + ++ +I K+ + Sbjct: 233 NEKYTDATYNYNGT-SSEFFGGGSNSCASTPQMEPLTSKRNTLTSAISKLKD-------- 283 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA-------YDTII 351 N R +WG + L + D T D ++ A ++ Sbjct: 284 ----NGRTAGQTGIAWGWYALSPKWSNLWPNDSVPGSYTDS-DILKFALIMTDGDFNEYY 338 Query: 352 SSNEDEVHR--------MKNNLEAKKYIVLL----TDGENTQDNEEGIAICNKAKSQGIR 399 + + K V G + + +C K GI+ Sbjct: 339 DKATAQSNCKWQFNWSTFKWEQVCDSSYVWTAYSEAAGYSNVSSTRAKTLCAAIKQTGIQ 398 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFE 451 + +I F N + +CAS +FF A S EL F +I N+I Sbjct: 399 VYSIYFGSNANS--AGAKVMKDCASSTKETFFMATSDSELIAAF-AKIANKIQN 449 >gi|126738776|ref|ZP_01754472.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] gi|126719957|gb|EBA16664.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] Length = 530 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 58/380 (15%), Positives = 111/380 (29%), Gaps = 36/380 (9%) Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 + R+F + + +N TD + + A +++ + + + + F + Sbjct: 174 KVAARDFIDTMVENTTDGRMSISIVPYATQVSVSDELFD-------EYTTSGTNNFANCI 226 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR-DSEGQPLNCFGQPADR 210 ++ T A + T R S S +DY+ D Q Sbjct: 227 NFETSDYSTTALSTTSERERTMHFSPWYTSNTRASGSPIDYEICDDRSSREILPLQKDAT 286 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 T+KS+ + G + + M L P + + S Sbjct: 287 TLKSFITNLTAWG--NTSIDIGMKWGVALLDPSARPAISSLASGASVPSEFSVRPVDYSD 344 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG--ATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ + V D R G +++ ++ Sbjct: 345 PDTLKIIVLMTDGQNTSQYYVEDDHRAGDSNVWYDFSANRYS--TYNPDNGYYWRDGYSY 402 Query: 329 IDENEMGSTAI----------NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G +++ Y I + Y V G Sbjct: 403 WYSSPYGGNNAQQLSYPDLFAYTSLKYLYKYIYADWMGSYSARSEWY----YGVYDYHG- 457 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 N+ N +C+ AK+QGI + TI F L +CAS +S + E+ Sbjct: 458 NSTKNTRTSNVCSAAKAQGIIVYTIGFEA----PSNGVAVLQDCASSDSHYFDVDGLEIR 513 Query: 439 KIFRDRIGNEIFERVIRITK 458 F + I I R +R+T+ Sbjct: 514 DAF-ESIATSI--RKLRLTQ 530 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 59/388 (15%), Positives = 106/388 (27%), Gaps = 64/388 (16%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + +G T ML VGG+ VD++R L+ A++ A+ Sbjct: 22 NLRSFRSDESGALAYPTIAFFLAMLAVGGVGVDLMRLERDRTVLQYTLDRAVLAAA---- 77 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 L++ A + + + + VE Sbjct: 78 -DLDQTQEPAVVVQDYLNKA--------------------GLGEYYEAPEVETGLGYKKV 116 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + +D + L+ G + + AE G+ I V+D S SM Sbjct: 117 KATIDATFD------AHLLQFAGGSDLPVYASSTAEESID------GLEISLVLDVSGSM 164 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 R S + A R +N G + PY + S Sbjct: 165 NSNSRLSN-------LKVAARDFIDTMVENTTDGRMSISIVPYATQVSVSDELF---DEY 214 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 S + + L + N + Sbjct: 215 TTSGTNNFANCINFETSDYSTTALSTTSERERTMHFSPWYTSNTRASGSPIDYEICDDRS 274 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK- 368 + K T+ K+F + G+T+I+ M+ + S + + + Sbjct: 275 SREILPLQKDATTL-KSFITNLTAWGNTSIDIGMKWGVALLDPSARPAISSLASGASVPS 333 Query: 369 ---------------KYIVLLTDGENTQ 381 K IVL+TDG+NT Sbjct: 334 EFSVRPVDYSDPDTLKIIVLMTDGQNTS 361 >gi|29346317|ref|NP_809820.1| aerotolerance protein BatA [Bacteroides thetaiotaomicron VPI-5482] gi|29338212|gb|AAO76014.1| BatA [Bacteroides thetaiotaomicron VPI-5482] Length = 327 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L+ A + ++F A S +L +++ + Sbjct: 235 PVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEE 280 >gi|270007560|gb|EFA04008.1| hypothetical protein TcasGA2_TC014157 [Tribolium castaneum] Length = 805 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 61/475 (12%), Positives = 152/475 (32%), Gaps = 86/475 (18%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 ++A + ++ + + + E+ ++ ++ K++ ++ + RD Sbjct: 28 QEATFSVVLPETAYISGFVMEIDGKSYKAYVKEKEEAKQIYNEAVGRGQAAAHVEANARD 87 Query: 114 IVRDT-AVEMNPRKSA----------------YQVVLSSRYDLLLNPLSLFLRSMGIKSW 156 R T ++ + P+K A Y+VV++ + L++ + +S Sbjct: 88 SNRFTVSLNIEPQKKAVFTLTYEELLQRQNEQYEVVINIHPGQPVKDLNVEVHID--ESR 145 Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 ++ + + ++ D + S+ + ++++ F +R + + Sbjct: 146 PLKFVKSPPLRTGNEISKND-------DKTASLAEIKQNNSTSATVKFNPNIERQKQLAT 198 Query: 217 SQNGKV------------GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 K + + ++ + + P ++ + FV +S Sbjct: 199 GLGTKEENGLAGQFVVQYDVERDPKGGEVLLKDGYFVHFFAPSEVEALPKQVIFVLDTSG 258 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP-------------- 310 + +++A+ S++ +KK D V + V + V Sbjct: 259 SMDGNRIKQLKEAMNSILSELKKED-VFNIVEFSSIVKVWNVDKVQVDYEVGEDPWPLYD 317 Query: 311 --------------SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 ++ + K N G T I A++ + + E+ Sbjct: 318 SPEAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLNAYGGTDIKSALEVGLKLVKKNKEN 377 Query: 357 EVHRMKNNLEAKKYIVLLTDGE--NTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQ 411 + + IV LTDGE + N E I + G I +++F + Sbjct: 378 KEDAHQP------IIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSFGDGADR 431 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN--------EIFERVIRITK 458 + + L N +EA + F +I + + V ++TK Sbjct: 432 EFLQKISLKNLGFARHIYEAADASLQLQEFYKQISSPLLNNVNFKYVSNVTKLTK 486 >gi|253568262|ref|ZP_04845673.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298385671|ref|ZP_06995229.1| BatA protein [Bacteroides sp. 1_1_14] gi|251842335|gb|EES70415.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298261812|gb|EFI04678.1| BatA protein [Bacteroides sp. 1_1_14] Length = 327 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L+ A + ++F A S +L +++ + Sbjct: 235 PVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEE 280 >gi|222081474|ref|YP_002540837.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] gi|221726153|gb|ACM29242.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] Length = 329 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 80/252 (31%), Gaps = 27/252 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 R +EG + P S R + + P + ML LD S Sbjct: 48 RPTEGAIIAHRTWPQLILESLAWSLLVLALARPQFVEPPIEKVEPQRDLMLG---LDLSQ 104 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S + + ++ + VR + + + R+G F D F+ Sbjct: 105 SMDTKDFRAPDGNLEARVDAVRKVVGDFVAR----RPGD---RIGLIAFGDAPYPLAPFT 157 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 H+L+R I+ T++ DA+ A + + +K +++ Sbjct: 158 MD-HELVREIISGTLPGI-AGPRTSLGDAVGLA-----------IKMFEKTTVPEKVLIV 204 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCA--SPNSFF 429 LTDG +T + AK G+ + T+ + E L A + +F Sbjct: 205 LTDGNDTASKMPPLKAAEIAKRNGVVVHTVGIGDPQATGEDRLDATALEKIAETTGGRYF 264 Query: 430 EANSTHELNKIF 441 +L + Sbjct: 265 FGGDQAQLAAAY 276 >gi|198274642|ref|ZP_03207174.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] gi|198272089|gb|EDY96358.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] Length = 332 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 66/195 (33%), Gaps = 39/195 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKT 326 + + A I N N +G T F + + + L +++ Sbjct: 109 RIEAAKQVAAEF---INGRPNDN----IGLTIFAGEAFTQCPLTVDHGVLLNLFQSVSCD 161 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TA+ + A + K++ K ++LLTDG N + + Sbjct: 162 MVQRGMIEDGTALGMGLANAVSRL-----------KDSKAKSKVVILLTDGVNNRGDISP 210 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------------LSNCA--SPNSF 428 + AK GIR+ TI N T + + +S A + ++ Sbjct: 211 LTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVQMPVEIDEQTMSQIAGTTNGNY 270 Query: 429 FEANSTHELNKIFRD 443 F A S +L +++R+ Sbjct: 271 FRATSNTKLKEVYRE 285 >gi|154492261|ref|ZP_02031887.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] gi|154087486|gb|EDN86531.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] Length = 328 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 63/221 (28%), Gaps = 38/221 (17%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + +D L+ + + + +D ++ I N N +G Sbjct: 81 WQNSSTEGIDIVLAMDISTSMMAQDLKPNRLEASKDVASAF---INGRPNDN----IGLV 133 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F + + + + TAI + A I Sbjct: 134 VFAAESFTQCPLT--TDHTVLLNLFKDVQPGIIQDGTAIGLGLANAVSRI---------- 181 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY--- 417 K++ K I+LLTDG N Q + AK+ G+R+ TI + Sbjct: 182 -KDSQAKSKVIILLTDGVNNQGEIAPVTAAEIAKTFGVRVYTIGVGTQGKAPYPFQTAFG 240 Query: 418 -------------FLSNCA--SPNSFFEANSTHELNKIFRD 443 L A + +F A L +I+ + Sbjct: 241 VQYMDVDVEIDEPTLKQIAATTGGQYFRATDNASLKEIYSE 281 >gi|91201135|emb|CAJ74194.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 333 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 88/230 (38%), Gaps = 22/230 (9%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 ++I +++ F R M D R + L G R + + +G+ + + Sbjct: 18 LAIGYILFFQRRMRDLNRFASLDLLKMAGFSLSRKKQWLKAVLMIIGV-LFLIITLIEPK 76 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + +D ++ + + + + + + +++ ++ R Sbjct: 77 WGYHWEEVEKKGIDIMIAVDTSRSMLADDVKPNRLEVAKREIEDLLKILEGD-------R 129 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F R + + R + ++ +G TAI +A+ Y I + E+ Sbjct: 130 VGLIAFAGRAFTYCPLTSDYS-AFRLFLNDLNVNIIPVGGTAIAEAI---YKGIDAFGEN 185 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 E + K ++++TDGEN + + + +KAK +GI I T+ Sbjct: 186 ENNH--------KAMIIITDGENHE--TDPLKAASKAKEKGIVIYTVGVG 225 >gi|115525407|ref|YP_782318.1| hypothetical protein RPE_3406 [Rhodopseudomonas palustris BisA53] gi|115519354|gb|ABJ07338.1| conserved hypothetical protein [Rhodopseudomonas palustris BisA53] Length = 580 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 53/154 (34%), Gaps = 28/154 (18%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL-EAKKYIVLLTDGENTQDN 383 T +G T + + +++ +K+ + IVL++DG NT D Sbjct: 429 NTTVDGLFPVGGTNQPIGLVWGWQSLVGGGPFPTPPVKDEQYTYQDIIVLMSDGLNTVDR 488 Query: 384 E------------------EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 C K+ GI++ T+ + N + + L NCASP Sbjct: 489 WYGNGWDTNTSVDNRMYASATTGTCVNVKAAGIKVYTVHVNTNGSPES---TLLKNCASP 545 Query: 426 N-----SFFEANSTHELNKIFRDRIGNEIFERVI 454 F S LN F + I ++ + + Sbjct: 546 ADDGGKEFQMVTSASGLNAAF-NSIATKLTDLRV 578 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 63/187 (33%), Gaps = 27/187 (14%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + + I +G+ ++ + ++ G VD R A++ A + + Sbjct: 7 TQLRKSAARFIADRSGNIAVLFGIACVPLITFVGAAVDYSRAVAARTAMQSALDSTALMV 66 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + ++ S ++ + ++ + L N + V + T Sbjct: 67 A-------KDYSLNKISASEIDGK------AKSIFSALYTNKSANSVEVVAVLTP--NTG 111 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + S +V + + + F++ + I I + S + + V+D Sbjct: 112 KGSTIKVDGTGKV------PTDFMKLVNISQIDIGASSTTTWGSTR------LRVALVLD 159 Query: 185 FSRSMLD 191 + SM D Sbjct: 160 TTGSMND 166 >gi|226226933|ref|YP_002761039.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090124|dbj|BAH38569.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 326 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 60/186 (32%), Gaps = 36/186 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ ++ + ++ I + I ++ Sbjct: 127 RVGLVAFSGEALTQVPLTTDYPVVLAAIDNLQVGQLEDG-------------TAIGTAIA 173 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +R++N+ + +VLLTDGEN + + A + GIRI TI + Sbjct: 174 TAANRLRNSPGRSRVMVLLTDGENNRGAIDPRTAAQAAGTFGIRIYTIGVGTDGMAAVPV 233 Query: 416 -----------------RYFLSNCASP--NSFFEANSTHELNKIFR--DRIGNEIFERV- 453 L+ A+ +F A L I+ DR+ E Sbjct: 234 GRGLFGLRYENRPVKIDEALLTEIANSTGGRYFRAKDAAALQSIYEQIDRLERSAVEARA 293 Query: 454 -IRITK 458 IR T+ Sbjct: 294 YIRYTE 299 >gi|322435250|ref|YP_004217462.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321162977|gb|ADW68682.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 515 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 63/453 (13%), Positives = 128/453 (28%), Gaps = 59/453 (13%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ G I L+M V+ + G++VDV + L+ ++ A + + + + Sbjct: 11 FLRDQRGQVLPIAGLMMFVITAMIGLVVDVGHIYLCQRELQASSDAAALAGAEIIPTATT 70 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + AK + + + + ++ + S + + Sbjct: 71 AAAVYAKATAYSSTTGAANVYKNMTNITMVSGYPILKCLSTMQTQGISCVGPLSYNSIQV 130 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + L F R +G S I + A S ++ V+D + S + Y Sbjct: 131 MQQAVVPL----YFARIIGRSSMTISATSTAAKGGASSR---PYNVALVLDTTYSEISYD 183 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 D + C Q + V S + + Sbjct: 184 SDCGNSQMLCTLQGVQILLNQLDPCGTSVTTCSVTSGQATNSVVRVGIFTFPQMVTSTVS 243 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S+ ++ V A + + +D +D R+G T + + + Sbjct: 244 SDYDCSSNTPANTVYTFPIPGAGTYAPSSSTYRVLDFQSDY-RVGDTS--TSLNQASNLT 300 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY-DTIISSNEDEVHRMKNNLEAKKYIV 372 V A ++ A + T Y TI ++ +H +++ ++ Sbjct: 301 KAVGGFSGCTGIYPATSASQSNFQATSGQYGTYYPSTIYAAQSSLIHEQTLFPDSQNVMI 360 Query: 373 LLTDGENTQDNEE--------------------------------------GIAICNKAK 394 ++ DG T I N A Sbjct: 361 IIGDGNATAPQTNNGYPVMSTTASVSATPGASTLAGTSSGLYPSWNGECGQAITAANFAT 420 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 SQG R+ T+A+ CAS S Sbjct: 421 SQGTRVYTVAYGSPSAG----------CASDQS 443 >gi|187251529|ref|YP_001876011.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971689|gb|ACC98674.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 335 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 47/117 (40%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R G F + + + V L + + N G TA+ A+Q A + + Sbjct: 131 RTGIVAFTSKAYTQCPITNDVEALKYFVNQLRPEMLNAKG-TALAPAVQRAAEMLSK--- 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK ++LLTDGE+ + + AI A+ +GI+I+ + + + Sbjct: 187 ---------YPGKKALILLTDGEDHEPEQIEEAI-KTAQKEGIKIIAVGIGTEEGEP 233 >gi|296228120|ref|XP_002759672.1| PREDICTED: collagen alpha-5(VI) chain [Callithrix jacchus] Length = 2614 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 62/157 (39%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D + I++ + + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDNPEILFYL--NTYSNRSAIIENLRMRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + N K+ ++++TDGE + D E K + +GI I + Sbjct: 899 HA-NALF----TEEHGSRINQNVKQMLIVITDGE-SDDRVELNDTAAKLRDKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + N+ ++ +L I+ Sbjct: 953 V-----GKADQKELEGMAGNKNNTIYVDNFDKLKDIY 984 >gi|255009407|ref|ZP_05281533.1| aerotolerance-related membrane protein [Bacteroides fragilis 3_1_12] gi|313147166|ref|ZP_07809359.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] gi|313135933|gb|EFR53293.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] Length = 327 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 43/106 (40%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF----SVNKTQ 411 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPV 234 Query: 412 QEKA------------RYFLSNCA--SPNSFFEANSTHELNKIFRD 443 L+ A + ++F A S +L +++ + Sbjct: 235 PVGGTVQYINTPVEIDEKTLTQIAGITDGNYFRATSNSKLKEVYEE 280 >gi|325919992|ref|ZP_08181973.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] gi|325549526|gb|EGD20399.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] Length = 142 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 25/116 (21%), Positives = 45/116 (38%), Gaps = 13/116 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R + + + + + + TAI DA+ + + + Sbjct: 34 RVGLLVFGQRAYALTPLTADLTSVRDQLADSVV--GLAGRETAIGDAIALSVKRLREQKQ 91 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ +VLLTDG NT + AK++G+R+ TIAF + Sbjct: 92 -----------GQRVVVLLTDGVNTAGVLNPLKAAELAKAEGVRVHTIAFGGSGGD 136 >gi|91082539|ref|XP_973726.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Tribolium castaneum] Length = 842 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 61/475 (12%), Positives = 152/475 (32%), Gaps = 86/475 (18%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 ++A + ++ + + + E+ ++ ++ K++ ++ + RD Sbjct: 87 QEATFSVVLPETAYISGFVMEIDGKSYKAYVKEKEEAKQIYNEAVGRGQAAAHVEANARD 146 Query: 114 IVRDT-AVEMNPRKSA----------------YQVVLSSRYDLLLNPLSLFLRSMGIKSW 156 R T ++ + P+K A Y+VV++ + L++ + +S Sbjct: 147 SNRFTVSLNIEPQKKAVFTLTYEELLQRQNEQYEVVINIHPGQPVKDLNVEVHID--ESR 204 Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 ++ + + ++ D + S+ + ++++ F +R + + Sbjct: 205 PLKFVKSPPLRTGNEISKND-------DKTASLAEIKQNNSTSATVKFNPNIERQKQLAT 257 Query: 217 SQNGKV------------GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 K + + ++ + + P ++ + FV +S Sbjct: 258 GLGTKEENGLAGQFVVQYDVERDPKGGEVLLKDGYFVHFFAPSEVEALPKQVIFVLDTSG 317 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP-------------- 310 + +++A+ S++ +KK D V + V + V Sbjct: 318 SMDGNRIKQLKEAMNSILSELKKED-VFNIVEFSSIVKVWNVDKVQVDYEVGEDPWPLYD 376 Query: 311 --------------SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 ++ + K N G T I A++ + + E+ Sbjct: 377 SPEAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLNAYGGTDIKSALEVGLKLVKKNKEN 436 Query: 357 EVHRMKNNLEAKKYIVLLTDGE--NTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQ 411 + + IV LTDGE + N E I + G I +++F + Sbjct: 437 KEDAHQP------IIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSFGDGADR 490 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN--------EIFERVIRITK 458 + + L N +EA + F +I + + V ++TK Sbjct: 491 EFLQKISLKNLGFARHIYEAADASLQLQEFYKQISSPLLNNVNFKYVSNVTKLTK 545 >gi|149018699|gb|EDL77340.1| rCG25821 [Rattus norvegicus] Length = 1513 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 72/187 (38%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS ++ ++++D +I +KK D + VR GA + D Sbjct: 810 VFVIDSSGSIDYQEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYL--DE 864 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ D+ G+T +A+ + + +H+ + ++++TD Sbjct: 865 LGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----QVLIVITD 919 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE + D E+ A + +GI ++ + + L+ S + ++ + Sbjct: 920 GE-SHDAEKLNATAKALRDKGILVLAVGIAG-----ANTWELLAMAGSSDKYYFVETFGG 973 Query: 437 LNKIFRD 443 L IF D Sbjct: 974 LKGIFSD 980 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 42/150 (28%), Gaps = 19/150 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D V++G F+ + I + +T A+ Sbjct: 655 ADRVQIGVVQFSHENREEFQL--NTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFS 712 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++L+TDGE ++ + G+ I ++ + Sbjct: 713 PEKGARP-------NVRKFLILITDGEAQDIVKDP---AVALRKDGVIIYSVGVFGSNVT 762 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKI 440 Q L + P F + L I Sbjct: 763 Q------LEEISGKPEMVFYVENFDILQHI 786 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 57/165 (34%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 A + S+ +D + +R+G +++ S S GV+K +++ +G Sbjct: 249 AFLGESVSALDIKENCMRVGLVAYSNETRVISSLSMGVNKTE--VLQRIQDLSPHVGQAY 306 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ + S+ + N + VL+T + ++ + +G+ Sbjct: 307 TGAALRKTRKEVFSAQ----RGSRKNQGVPQIAVLVT---HRASDDNVTKAAVNLRREGV 359 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + T+ +Q L AS + A F + Sbjct: 360 TVFTMGVEGANPEQ------LEKIAS----YPAEQFTSKLSNFSE 394 >gi|156616288|ref|NP_766515.2| collagen alpha-6(VI) chain isoform 2 [Mus musculus] Length = 1182 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 71/187 (37%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS ++ ++++D +I +KK D + VR GA + D Sbjct: 810 VFVIDSSGSIDYQEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYL--DE 864 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +V D G+T +A+ + + +H+ + ++++TD Sbjct: 865 LGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----QVLIVITD 919 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE + D E+ + +GI ++ + + + L+ S + ++ + Sbjct: 920 GE-SHDAEKLNTTAKALRDKGILVLAVGIAGANS-----WELLAMAGSSDKYYFVETFGG 973 Query: 437 LNKIFRD 443 L IF D Sbjct: 974 LKGIFSD 980 Score = 43.4 bits (100), Expect = 0.081, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 42/150 (28%), Gaps = 19/150 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D V++G F+ + I + +T A+ Sbjct: 655 ADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFS 712 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++L+TDGE + + +G+ I ++ + Sbjct: 713 PDKGARP-------NVRKFLILITDGEAQDIVRDP---AIALRKEGVIIYSVGVFGSNVT 762 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKI 440 Q L + P F + L I Sbjct: 763 Q------LEEISGKPEMVFYVENFDILQHI 786 >gi|148689169|gb|EDL21116.1| mCG140660 [Mus musculus] Length = 2242 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 36/245 (14%), Positives = 77/245 (31%), Gaps = 17/245 (6%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + ++ + + G+ C+ Sbjct: 157 ALRRAGILVYAIGVKDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTLGK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGA 299 Y P+ SE D L L + + + +D +D V++G Sbjct: 217 AAQYTERESPACSEASPADIVFLVDSSTSIGLQNFQKVKHFLHSVVLGLDVRSDQVQVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++D + L ++ MG T+ A++ + S E Sbjct: 277 VQYSDNIYPAFPLK--QSSLKSAVLDRIRNLPYSMGGTSTGSALE--FIRANSLTEMSGS 332 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 R K+ + + +VL+TDGE ++E + ++ K G+ + + ++ Q+ L Sbjct: 333 RAKDGVP--QIVVLVTDGE---SSDEVQDVADQLKRDGVFVFVVGINIQDVQE------L 381 Query: 420 SNCAS 424 AS Sbjct: 382 QKIAS 386 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 55/155 (35%), Gaps = 21/155 (13%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D VR G ++D++IS + + + + + E G T A+ + Sbjct: 884 DRVRFGVVQYSDKIISQFFLT--QYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQN 941 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + ++ +Y++++TDG++T + G+ I I T + Sbjct: 942 TAR---------VDVARYLIVITDGQSTDP---VAEAAQGLRDIGVNIYAIGVRDANTTE 989 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 L AS FF L I ++ I + Sbjct: 990 ------LEEIASKKMFF-IYEFDSLKSIHQEVIRD 1017 >gi|301620566|ref|XP_002939640.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Xenopus (Silurana) tropicalis] Length = 1179 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 68/434 (15%), Positives = 127/434 (29%), Gaps = 55/434 (12%) Query: 46 WSYYEHALKQAA------QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFE 99 K+A +A I+ + V+ + +K+ + + + Sbjct: 70 MVNTHSEAKEAIFDLDLPDSAFISNFSMTVNGKTYVADVKE---KHQAKKMYDEARKQGK 126 Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD-LLLNPLSLFLRSMGIKSWLI 158 + R+ + +V + + ++ Y+ LL L + ++ + SW + Sbjct: 127 TA--AHVGTRDRETVKFRVSVNVEAGE---EITFELTYEELLRRHLGKYEYAVSVPSWQV 181 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 E ++ S + SR M + R P + + + + Sbjct: 182 VQNLSVE-ITISERTGIEYVRVLPLRTSRLMTNSVRGESKMPPSTQVEKGTYCARVTYTP 240 Query: 219 N-------GKVGIRDEKLSPYMVSC----------NKSLYYMLYPGPLDPSLSEEHFVDS 261 + G+ + + Y VS N + P L P + FV Sbjct: 241 TPTEQAAHSRPGVTADFVLQYDVSLKDLAGDVQIYNGYFVHYFAPRGLPPIQKDVIFVID 300 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI- 320 S K + A+ ++ + + D FN SD W + I Sbjct: 301 VSGSMFGTKIKQTKSAMHVILNDLHRDD-----------SFNIITFSDVVHVWRPGQSIP 349 Query: 321 -----RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + K + G T IN A+ A ++ I+ LT Sbjct: 350 ATAQNKKSAKDYVNKIEADGWTDINAALMAAASIFNQTSHKPEKETSTKKIP--LIIFLT 407 Query: 376 DGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 DGE T I + A+ I + +AF + R L N +E + Sbjct: 408 DGEATSGVLATSRILSNAQKAMGGTISLFCLAFGEDADYNLMRRLSLENRGIARRIYEYS 467 Query: 433 STHELNKIFRDRIG 446 K F D I Sbjct: 468 DATLQLKGFYDEIA 481 >gi|327272012|ref|XP_003220780.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Anolis carolinensis] Length = 955 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 64/340 (18%), Positives = 106/340 (31%), Gaps = 45/340 (13%) Query: 125 RKSAYQVVLSSRYDLLLNPLSL---FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 R YQ +S R L+ L + L + GI S + ++ + S ++ Sbjct: 178 RLGKYQYTVSIRPQQLVGKLRVEVNILENSGINSLEVPPLQKSRSKSAGEGQDDA----- 232 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG-------IRDEKLSPYMV 234 S + + L ++ ++NG +G + E + Sbjct: 233 ------SPPPSTVVGQTKTLAKITFSPTVVQQAKIARNGILGDFTIRYDVNRELSVGDVQ 286 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 N + P L P FV SS V K +DAL ++++ ++ D+ N Sbjct: 287 VFNGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILQDLRPEDHFN-- 344 Query: 295 VRMGATFFNDRV-----ISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 F++R+ + + I + G T IN A+Q + Sbjct: 345 ----IIGFSNRIKVWQHDQLVPVTPNNIRDAKVYIHNM-----SPSGGTNINGALQISTK 395 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAF 405 + N+ I+ LTDG T E I N K + TI Sbjct: 396 IL---NDYIAQNDIEARSVS-LIIFLTDGRPTFGEIEPAKIINNTKEAIRNKFCLFTIGI 451 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + R L NC E E K F I Sbjct: 452 GNDVDYKLLERLALENCGMMRRVREEEDAAEQLKGFYYEI 491 >gi|323138635|ref|ZP_08073702.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] gi|322396123|gb|EFX98657.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] Length = 547 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 59/189 (31%), Gaps = 63/189 (33%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-- 377 ++ V+ G+T +++ + T+ + R + +K +V +TDG Sbjct: 350 KKSDVQNKINQLVANGATNLHEGFMWGWRTLSPNAPFSGGRAYQAPKNRKIMVFMTDGFN 409 Query: 378 ------------------------------------------------ENTQDNEEGI-- 387 N + + I Sbjct: 410 SWNSRVNTATGSTYDTLGYYSYNGAENERFPDGSQGNGVNYRSLLAAAANNSSSYQTISR 469 Query: 388 --------AICNKAKSQGIRIMTIAFSVNKTQQE-KARYFLSNCAS-PNSFFEANSTHEL 437 C AK+ GI + TI FSV+ + + + CA+ + +F+A +L Sbjct: 470 AMQDELTRQACTNAKTAGIEVFTIGFSVSGDPIDAQGLALMKECATNEDHYFKAEDASQL 529 Query: 438 NKIFRDRIG 446 N F +IG Sbjct: 530 NAAF-SQIG 537 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 66/212 (31%), Gaps = 28/212 (13%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 +I L + ++ + G D R++ AL+QAA +A++T + + +S ++ + Sbjct: 1 MIFGLSLMPVMLMLGATADYTRFTTTRAALQQAADSAVLTVASKMTESTTNAQAKDQ--- 57 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 + N R IV V + R V +++ + Sbjct: 58 ----------------AQVVLNAQPRMTTAIVTGATVSEDKR----TVCATAKVTIQ--- 94 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 + F++ + + T ++ I V+D S SM + Sbjct: 95 -NSFMQMAQLATL-TPTVKSCANLAGGADPGTTYEIALVLDNSGSMNSSSDGQSKISILK 152 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + S N K + V Sbjct: 153 SAANSFVDTMFSKSNNVKFSVVPFSSGVAAVD 184 >gi|156408065|ref|XP_001641677.1| predicted protein [Nematostella vectensis] gi|156228817|gb|EDO49614.1| predicted protein [Nematostella vectensis] Length = 1418 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 66/165 (40%), Gaps = 10/165 (6%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + S++ I+ + + V + + + + F+ + + F+ Sbjct: 30 KVFVNSLLSHIR-VSYKSTYVSVVLFGTSATIDINYIFNPHPNNHKCNFRRDFSNLRFRS 88 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA Q AYD I H+ + K + LLTDG+ + + I + K Sbjct: 89 GMTNMHDAFQAAYDIIFGKY--SGHKRPTHQV-KTAVFLLTDGQWNWNGDPW-PIAKRLK 144 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 +GI I TI + L + ASPN++F N + + Sbjct: 145 DRGIEIFTIGVTNGVNVNT-----LRSLASPNNYFHYNDFTQFRE 184 >gi|26344185|dbj|BAC35749.1| unnamed protein product [Mus musculus] Length = 1182 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 71/187 (37%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS ++ ++++D +I +KK D + VR GA + D Sbjct: 810 VFVIDSSGSIDYQEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYL--DE 864 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +V D G+T +A+ + + +H+ + ++++TD Sbjct: 865 LGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----QVLIVITD 919 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE+ D E+ + +GI ++ + + + L+ S + ++ + Sbjct: 920 GESN-DAEKLNTTAKALRDKGILVLAVGIAGANS-----WELLAMAGSSDKYYFVETFGG 973 Query: 437 LNKIFRD 443 L IF D Sbjct: 974 LKGIFSD 980 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 42/150 (28%), Gaps = 19/150 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D V++G F+ + I + +T A+ Sbjct: 655 ADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFS 712 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++L+TDGE + + +G+ I ++ + Sbjct: 713 PDKGARP-------NVRKFLILITDGEAQDIVRDP---AIALRKEGVIIYSVGVFGSNVT 762 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKI 440 Q L + P F + L I Sbjct: 763 Q------LEEISGKPEMVFYVENFDILQHI 786 >gi|53713710|ref|YP_099702.1| hypothetical protein BF2419 [Bacteroides fragilis YCH46] gi|60681981|ref|YP_212125.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|253565658|ref|ZP_04843113.1| BatA [Bacteroides sp. 3_2_5] gi|265764034|ref|ZP_06092602.1| BatA [Bacteroides sp. 2_1_16] gi|4838138|gb|AAD30858.1|AF116251_1 BatA [Bacteroides fragilis] gi|52216575|dbj|BAD49168.1| conserved hypothetical protein BatA [Bacteroides fragilis YCH46] gi|60493415|emb|CAH08201.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|251945937|gb|EES86344.1| BatA [Bacteroides sp. 3_2_5] gi|263256642|gb|EEZ27988.1| BatA [Bacteroides sp. 2_1_16] gi|301163419|emb|CBW22970.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 327 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPV 234 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L+ A + ++F A S +L +++ + Sbjct: 235 RVGGTTQYINTPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEE 280 >gi|118088945|ref|XP_419902.2| PREDICTED: similar to collagen XXI [Gallus gallus] Length = 964 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 73/190 (38%), Gaps = 26/190 (13%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +++ L ++ R+ +++G ++D + + G H+ ++K Sbjct: 61 EIIKSWLVNITRNFDIGP---KFIQVGVVQYSDYPVLEIPL--GTHESTENLIKEMESIH 115 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G+T A+Q AYD + + K++ K V+LTDG + +E + Sbjct: 116 YLGGNTKTGRAIQFAYDHLFA---------KSSRFLTKIAVVLTDG---KSQDEVKDVAA 163 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNE 448 +A+ I + I + E L A S F +++I ++ I + Sbjct: 164 EARKNKITLFAIGVGSEIEEDE-----LKAIANKPSSTYVFYVEDYIAISRI-KEVIKQK 217 Query: 449 IFERVIRITK 458 + E + T+ Sbjct: 218 LCEESVCPTR 227 >gi|254504856|ref|ZP_05117007.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] gi|222440927|gb|EEE47606.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] Length = 455 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 73/478 (15%), Positives = 147/478 (30%), Gaps = 104/478 (21%) Query: 14 LIKSCTGHFFIITALL-MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +G+ I+TAL +P+ML G +DVVR + + L+ +A + A+ + + Sbjct: 1 MGADRSGNVAILTALAFVPLMLITIG-SLDVVRMTTAQAKLQSTLDSATLAAAS--LSNT 57 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ ++ N D ++ T ++ +A V Sbjct: 58 ADIEDTVDE-------------------YIQANLPDTAPWTTLKLTMGDVTDSLNAKSVE 98 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +++ D+ + L+ GI +T A +V++ + VS+ V+D S SM Sbjct: 99 ITATVDIEMT----ILKLAGID----KTSVLASSVAQQAAQNIEVSV--VLDISSSMGGS 148 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + S + F K E S ++ ++ + + Sbjct: 149 KITSLREAAKGFIDTML-----------KEDEDKEYTSLSIIPFGGTVNIGDFYDTYAVN 197 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 S +DS S + K++ I+ +D M A N R Sbjct: 198 SSTPGVIDSPSSANYYVNKNVPYGKFMFSTEREGCIEYTDDDFDMAAIPANSRPQVPDFT 257 Query: 313 SW--------------GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 W ++ T +K D + T ++ + S ++ Sbjct: 258 KWVATNPWCPSEDSAMVLNSNNTTDLKALIDDMDLSDGTGMDIGALWGAKVLSGSMRGQL 317 Query: 359 HRMKNNLEAK-------KYIVLLTDGE---------------------------NTQDNE 384 ++ A K V++TDG + Sbjct: 318 GGDFSDRPADFNDEDTLKVAVIMTDGAITAQFRPRDYTTTGKIKNKTQQTIVSKGNINTA 377 Query: 385 EGIA---------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEAN 432 A +C +++ TI F +N L CAS ++++ Sbjct: 378 STKADDAVAYFKRVCEYLNDNNVQVYTIGFQINSGSLPD--QLLKYCASSLSNYYFVE 433 >gi|294141682|ref|YP_003557660.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578720|dbj|BAG66046.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328151|dbj|BAJ02882.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 334 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 69/193 (35%), Gaps = 35/193 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + +++D ++ I +K D ++G F D + + + + K Sbjct: 105 QAVDRFTMIQDVVSDFIER-RKGD------KLGLILFADHAYLQAPLTQDRRSVAQFL-K 156 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 I TAI +A+ ++ + +VLLTDG N + Sbjct: 157 EAQIGL-VGKQTAIGEAIALGVKRFDMVDKSN-----------RILVLLTDGSNNSGSIS 204 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ-------------EKARYFLSNCA--SPNSFFE 430 A +G++I I + ++ + LS+ A + +F Sbjct: 205 PEQAAAIAAKRGVKIYAIGVGADVMERRSIFGTERVNPSMDLDEAQLSSLAKITGGLYFR 264 Query: 431 ANSTHELNKIFRD 443 A S+ +L +I+++ Sbjct: 265 ARSSQDLQQIYQE 277 >gi|282879637|ref|ZP_06288368.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306585|gb|EFA98614.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 332 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 66/189 (34%), Gaps = 35/189 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDEN 332 +A +V ++ +G T F + + + L+R + A Sbjct: 111 EAAKNVAAEFISGRPNDN---IGLTIFAGEAFTQCPMTTDHTSLLNLLRNVRTDIAARGL 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 TA+ + A + K++ K ++LLTDG N + + Sbjct: 168 ISDGTAVGMGLANAVSRL-----------KDSKTKSKVVILLTDGSNNMGDISPMTSAQI 216 Query: 393 AKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFEANST 434 AKS IR+ TI NK E LS+ A + +F+ A + Sbjct: 217 AKSLDIRVYTIGVGTNKVAPYPMSVGGGTQYINIPVEIDSKTLSDIAAVTEGNFYRATNN 276 Query: 435 HELNKIFRD 443 +L +I++D Sbjct: 277 QQLKQIYKD 285 >gi|330952765|gb|EGH53025.1| von Willebrand factor, type A [Pseudomonas syringae Cit 7] Length = 262 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 44/124 (35%), Gaps = 14/124 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 135 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEARIGI-AGKNTALGDAIGLALKRLRMRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL+TDG N + I A +G++I I + ++ Sbjct: 193 TS-----------RALVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDP-DKDAL 240 Query: 416 RYFL 419 + L Sbjct: 241 QSVL 244 >gi|327399949|ref|YP_004340788.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315457|gb|AEA46073.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 527 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 82/246 (33%), Gaps = 33/246 (13%) Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 P Q A TV + V + +E + S + P + + Sbjct: 24 PTGATAQIAPPTVSKTVNPTDIVLLTEETTITLTIQGTGSEWTTSVP------IDVVFAL 77 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 DSS L + A S + + + G +++ + + + Sbjct: 78 DSSGSMGWNDPSGLRKTAAKSFVDKLNSTTDQA-----GVVSWDNNIDFTQTLTNNFS-- 130 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +VK+ + G T +N + A + + + I+ L++G+ Sbjct: 131 ---LVKSKIDAVDSSGGTDLNVGLNAAISLLDTGKQANS---------SWVIIFLSNGQG 178 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHEL 437 T + + A ++G + TI +++ ++ L + A+ ++ + + L Sbjct: 179 TYSHSTAV----VAANKGYTVYTIGLAISPGSTAESN--LKDIANTTGGKYYSSPNATNL 232 Query: 438 NKIFRD 443 + +F D Sbjct: 233 DAVFND 238 >gi|255693880|ref|ZP_05417555.1| BatA protein [Bacteroides finegoldii DSM 17565] gi|260620309|gb|EEX43180.1| BatA protein [Bacteroides finegoldii DSM 17565] Length = 327 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPMTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L+ A + ++F A S +L +++ + Sbjct: 235 PVGNTVQYVSMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEE 280 >gi|163735880|ref|ZP_02143308.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] gi|161390816|gb|EDQ15157.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] Length = 320 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 57/154 (37%), Gaps = 22/154 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F FS+ V + R I A +T+I+D + A Sbjct: 136 RVALVVFGSEAYFAAPFSFDVEAIARQI--ESAQIGVSGRATSISDGLGIA--------- 184 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF------SVNK 409 + RM+N+ A + ++LL+DG N + A G+R+ TIA S + Sbjct: 185 --LKRMENSEAASRVVILLSDGVNNAGATNPRGVAELAAQMGVRVHTIALGPKDLSSADP 242 Query: 410 TQQEKAR-YFLSNCA--SPNSFFEANSTHELNKI 440 ++ L + S F +T +L + Sbjct: 243 GERGVVDAATLRAISEISGGESFRVRTTEDLVAV 276 >gi|14042827|dbj|BAB55409.1| unnamed protein product [Homo sapiens] Length = 397 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 63/185 (34%), Gaps = 23/185 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV---- 324 K +DAL +++ ++ D R F++R+ W H + T Sbjct: 4 TKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKV-----WKDHLISVTPDSIRD 52 Query: 325 -KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 K + + G T IN A+Q A + N+ H + IV LTDG+ T Sbjct: 53 GKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPTVGE 108 Query: 384 EEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + I N + + I TI + + + L NC E Sbjct: 109 THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKPSLENCGLTRRVHEEEDAGSQLIG 168 Query: 441 FRDRI 445 F D I Sbjct: 169 FYDEI 173 >gi|149773091|emb|CAO01895.1| collagen type VI alpha 6 [Mus musculus] Length = 1162 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 71/187 (37%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS ++ ++++D +I +KK D + VR GA + D Sbjct: 790 VFVIDSSGSIDYQEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYL--DE 844 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +V D G+T +A+ + + +H+ + ++++TD Sbjct: 845 LGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----QVLIVITD 899 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE + D E+ + +GI ++ + + + L+ S + ++ + Sbjct: 900 GE-SHDAEKLNTTAKALRDKGILVLAVGIAGANS-----WELLAMAGSSDKYYFVETFGG 953 Query: 437 LNKIFRD 443 L IF D Sbjct: 954 LKGIFSD 960 Score = 43.0 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 42/150 (28%), Gaps = 19/150 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D V++G F+ + I + +T A+ Sbjct: 635 ADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFS 692 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++L+TDGE + + +G+ I ++ + Sbjct: 693 PDKGARP-------NVRKFLILITDGEAQDIVRDP---AIALRKEGVIIYSVGVFGSNVT 742 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKI 440 Q L + P F + L I Sbjct: 743 Q------LEEISGKPEMVFYVENFDILQHI 766 >gi|242247116|ref|NP_081039.2| collagen alpha-4(VI) chain precursor [Mus musculus] gi|189082905|sp|A2AX52|CO6A4_MOUSE RecName: Full=Collagen alpha-4(VI) chain; Flags: Precursor Length = 2309 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 72/228 (31%), Gaps = 11/228 (4%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + ++ + + G+ C+ Sbjct: 157 ALRRAGILVYAIGVKDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTLGK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGA 299 Y P+ SE D L L + + + +D +D V++G Sbjct: 217 AAQYTERESPACSEASPADIVFLVDSSTSIGLQNFQKVKHFLHSVVSGLDVRSDQVQVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++D + L ++ MG T+ A++ + S E Sbjct: 277 VQYSDNIYPAFPLK--QSSLKSAVLDRIRNLPYSMGGTSTGSALE--FIRANSLTEMSGS 332 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 R K+ + + +VL+TDGE ++E + ++ K G+ + + ++ Sbjct: 333 RAKDGVP--QIVVLVTDGE---SSDEVQDVADQLKRDGVFVFVVGINI 375 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 21/155 (13%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D VR G ++D++IS + + + + + G T A+ ++ Sbjct: 884 DRVRFGVVQYSDKIISQFFLT--QYASMAGLSAAIDNIQQVGGGTTTGKAL----SKMVP 937 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 ++ + +Y++++TDG++T + G+ I I T + Sbjct: 938 VFQNTARI-----DVARYLIVITDGQSTDP---VAEAAQGLRDIGVNIYAIGVRDANTTE 989 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 L AS FF L I ++ I + Sbjct: 990 ------LEEIASKKMFF-IYEFDSLKSIHQEVIRD 1017 >gi|123718334|emb|CAJ77150.1| procollagen type VI alpha 4 [Mus musculus] Length = 762 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 72/228 (31%), Gaps = 11/228 (4%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + ++ + + G+ C+ Sbjct: 157 ALRRAGILVYAIGVKDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTLGK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGA 299 Y P+ SE D L L + + + +D +D V++G Sbjct: 217 AAQYTERESPACSEASPADIVFLVDSSTSIGLQNFQKVKHFLHSVVSGLDVRSDQVQVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++D + L ++ MG T+ A++ + S E Sbjct: 277 VQYSDNIYPAFPLK--QSSLKSAVLDRIRNLPYSMGGTSTGSALE--FIRANSLTEMSGS 332 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 R K+ + + +VL+TDGE ++E + ++ K G+ + + ++ Sbjct: 333 RAKDGVP--QIVVLVTDGE---SSDEVQDVADQLKRDGVFVFVVGINI 375 >gi|319952789|ref|YP_004164056.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421449|gb|ADV48558.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 332 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 58/194 (29%), Gaps = 40/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ A IKK N R+G + + + ++ + + Sbjct: 112 RLASLKKVAADF---IKKRPND----RIGLVVYAGESYTKTPITSDKGIVLNALKEITYG 164 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI + T+ + + K + K I+LLTDG N E Sbjct: 165 SLED--GTAIGMGLATSVNRL-----------KESKALSKVIILLTDGINNSGFIEPQTA 211 Query: 390 CNKAKSQGIRIMTIAFSVNKTQ------------------QEKARYFLSNCA--SPNSFF 429 A I+ TI N E L A + ++F Sbjct: 212 AELAVEYDIKTYTIGLGTNGNALSPIAINSDGSFRYGMKPVEIDEGLLEQIAKTTGGAYF 271 Query: 430 EANSTHELNKIFRD 443 A + L I+ + Sbjct: 272 RATNNESLASIYDE 285 >gi|189347765|ref|YP_001944294.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341912|gb|ACD91315.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 325 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 65/171 (38%), Gaps = 18/171 (10%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 DA+ SV R + + R+G F + + + H++ ++ + D Sbjct: 123 DAVKSVAREFVTRHSND---RIGVVVFKGKGYTLSPLTLD-HRVTGMLIDNVSPDVIRDE 178 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TA+ A+ A + + +S D +K I+L +DG + + + + A + Sbjct: 179 GTAVGTAVLIAVNRLRASQSD-----------QKVIILFSDGVSNAGEIDPVTAASFAAA 227 Query: 396 QGIRIMTIAFSVNKTQQEKARY-FLSNCA--SPNSFFEANSTHELNKIFRD 443 QGIRI T + L A + +F A ++ L + F Sbjct: 228 QGIRIYTAGAGSASSASSALDEGELRRVALTAGGRYFRAGTSASLAEAFES 278 >gi|297462925|ref|XP_608567.5| PREDICTED: collagen type VI alpha 6-like [Bos taurus] Length = 2343 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 62/166 (37%), Gaps = 15/166 (9%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D ++ D+ G+T Sbjct: 826 DFMINLVKKADVGKNHVRFGALKYADDPEVLFYLDN--LDTKWEVISVLQNDQPLGGNTY 883 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D ++ A + +GI Sbjct: 884 TAEALGFSDHMFTEARGSRLHKGVP-----QVLIVITDGE-SHDADKLNATAKALRDKGI 937 Query: 399 RIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ + N + L+ S + +F + L IF D Sbjct: 938 LVLAVGIAGANPVE------LLAMAGSSDKYFFVETFGGLKGIFSD 977 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 64/172 (37%), Gaps = 19/172 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++ LASVI+ D N+ VR+GA F+ + G+ I + Sbjct: 1015 MKEFLASVIQDF---DISNNRVRIGAAQFSHTYQPEFPL--GMFIGKEEISFQIENIKQI 1069 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T I A++ +H + +++LTDG Q +E + Sbjct: 1070 FGYTHIGAALRQVGHYFRPDMGSRIHAGTP-----QVLLVLTDG---QSQDEVAQAAEEL 1121 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + +GI I ++ + + + N ++ EL K+ + RI Sbjct: 1122 RHKGIDIYSVGI-----GDVDDQQLVQITGTANKKLTVHNFDELKKV-KKRI 1167 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 50/163 (30%), Gaps = 19/163 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + SI +D +R+G +++ + S GV+K + V A Sbjct: 248 LEESISALDIKEHCMRVGLVAYSNETKVISTLSRGVNK---SEVLQDIQSLAPQAGKAYT 304 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A I H + N + VL+T ++ + + QG+ + Sbjct: 305 GA---ALRKIRKEVFSAQHGSRKNQGVPQIAVLVT---HSPSQDNVTKAAVNLRRQGVIV 358 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 TI Q L AS A + F D Sbjct: 359 FTIGVEGASDTQ------LEKIAS----HPAEQYVSQLRSFSD 391 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 70/232 (30%), Gaps = 21/232 (9%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 R+ ++ N + + VS L G +D ++ + + S Sbjct: 387 RSFSDLAAHNQTFLKKLRNQITHTVSVISERTETLKAGCVDTEEADIYLLIDGSGSTQAT 446 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 ++ L+ V VR+GA + DR + S +K + K Sbjct: 447 DFQEMKTFLSEVAGMFNIAP---QKVRVGAVQYADRWDLEFEISKYTNK--HDVRKAIEN 501 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+ A+ + + + R+ + +V+LT+G + Sbjct: 502 IRQMGGNRNTGAALNFTLGLLQRAKQQRGGRVPCH------LVVLTNGASRDSVSGP--- 552 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKI 440 N+ + I + I +E + L A + + L I Sbjct: 553 ANRLSEELIHVYAIGV------REANQTQLREIAGEEKRVYYVHDFDALKDI 598 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 63/181 (34%), Gaps = 26/181 (14%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S L + IK K +++ ++ I+ D V++G F+D + + Sbjct: 628 SIGLENFIKMKTFMKNLVSK--SQIR-----ADRVQIGVVQFSDVNKEEFQLNRYTS--- 677 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + + +G T + + T S + + +K+++L+TDGE Sbjct: 678 QEEISDAIDRMAHIGETTLMGSALTFVSQYFSPAKGARPNV------RKFLILITDGEAQ 731 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNK 439 ++ + +GI I ++ + Q L + P F + L Sbjct: 732 DIVKDP---AVALREEGIIIYSVGVFGSNVTQ------LEEISGRPEMVFYVENFDILKH 782 Query: 440 I 440 I Sbjct: 783 I 783 >gi|297471452|ref|XP_002685218.1| PREDICTED: collagen, type VI, alpha 1-like [Bos taurus] gi|296490817|gb|DAA32930.1| collagen, type VI, alpha 1-like [Bos taurus] Length = 2268 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 62/166 (37%), Gaps = 15/166 (9%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D ++ D+ G+T Sbjct: 826 DFMINLVKKADVGKNHVRFGALKYADDPEVLFYLDN--LDTKWEVISVLQNDQPLGGNTY 883 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D ++ A + +GI Sbjct: 884 TAEALGFSDHMFTEARGSRLHKGVP-----QVLIVITDGE-SHDADKLNATAKALRDKGI 937 Query: 399 RIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ + N + L+ S + +F + L IF D Sbjct: 938 LVLAVGIAGANPVE------LLAMAGSSDKYFFVETFGGLKGIFSD 977 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 64/172 (37%), Gaps = 19/172 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++ LASVI+ D N+ VR+GA F+ + G+ I + Sbjct: 1015 MKEFLASVIQDF---DISNNRVRIGAAQFSHTYQPEFPL--GMFIGKEEISFQIENIKQI 1069 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T I A++ +H + +++LTDG Q +E + Sbjct: 1070 FGYTHIGAALRQVGHYFRPDMGSRIHAGTP-----QVLLVLTDG---QSQDEVAQAAEEL 1121 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + +GI I ++ + + + N ++ EL K+ + RI Sbjct: 1122 RHKGIDIYSVGI-----GDVDDQQLVQITGTANKKLTVHNFDELKKV-KKRI 1167 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 50/163 (30%), Gaps = 19/163 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + SI +D +R+G +++ + S GV+K + V A Sbjct: 248 LEESISALDIKEHCMRVGLVAYSNETKVISTLSRGVNK---SEVLQDIQSLAPQAGKAYT 304 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A I H + N + VL+T ++ + + QG+ + Sbjct: 305 GA---ALRKIRKEVFSAQHGSRKNQGVPQIAVLVT---HSPSQDNVTKAAVNLRRQGVIV 358 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 TI Q L AS A + F D Sbjct: 359 FTIGVEGASDTQ------LEKIAS----HPAEQYVSQLRSFSD 391 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 70/232 (30%), Gaps = 21/232 (9%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 R+ ++ N + + VS L G +D ++ + + S Sbjct: 387 RSFSDLAAHNQTFLKKLRNQITHTVSVISERTETLKAGCVDTEEADIYLLIDGSGSTQAT 446 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 ++ L+ V VR+GA + DR + S +K + K Sbjct: 447 DFQEMKTFLSEVAGMFNIAP---QKVRVGAVQYADRWDLEFEISKYTNK--HDVRKAIEN 501 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+ A+ + + + R+ + +V+LT+G + Sbjct: 502 IRQMGGNRNTGAALNFTLGLLQRAKQQRGGRVPCH------LVVLTNGASRDSVSGP--- 552 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKI 440 N+ + I + I +E + L A + + L I Sbjct: 553 ANRLSEELIHVYAIGV------REANQTQLREIAGEEKRVYYVHDFDALKDI 598 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 63/181 (34%), Gaps = 26/181 (14%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S L + IK K +++ ++ I+ D V++G F+D + + Sbjct: 628 SIGLENFIKMKTFMKNLVSK--SQIR-----ADRVQIGVVQFSDVNKEEFQLNRYTS--- 677 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + + +G T + + T S + + +K+++L+TDGE Sbjct: 678 QEEISDAIDRMAHIGETTLMGSALTFVSQYFSPAKGARPNV------RKFLILITDGEAQ 731 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNK 439 ++ + +GI I ++ + Q L + P F + L Sbjct: 732 DIVKDP---AVALREEGIIIYSVGVFGSNVTQ------LEEISGRPEMVFYVENFDILKH 782 Query: 440 I 440 I Sbjct: 783 I 783 >gi|260430630|ref|ZP_05784603.1| von Willebrand factor, type A [Citreicella sp. SE45] gi|260418659|gb|EEX11916.1| von Willebrand factor, type A [Citreicella sp. SE45] Length = 318 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 59/153 (38%), Gaps = 21/153 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F DR + + V + ++ T I +TAI D + A Sbjct: 132 RVGLVVFGDRAYVAAAPTHDV-GAVAQVIGTLQIGV-SGKATAIADGLGLA--------- 180 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF------SVNK 409 + R++ + I+LL+DG++T + +A A+ G+R+ TIA S Sbjct: 181 --IRRLRGREAESRVIILLSDGQDTTGAVDPVAAAQAAQELGMRVYTIALGPADLASSPD 238 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A + F ST +L + Sbjct: 239 ARDAVDSETLRRIAEVAGGETFRVRSTEDLEAV 271 >gi|224372482|ref|YP_002606854.1| von Willebrand factor, type A [Nautilia profundicola AmH] gi|223588580|gb|ACM92316.1| von Willebrand factor, type A [Nautilia profundicola AmH] Length = 288 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 71/200 (35%), Gaps = 28/200 (14%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +D+S K + A++ +K D +G F + Sbjct: 69 TNTHKKGYNIVIDLDTSGSMAEFNKIDAAK-AVSLDFAKKRKNDA------LGLVVFGNI 121 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 ++ K I+K + G TAI DA+ + + ++N Sbjct: 122 AYIASPLTFD-KKTFEDILKRIYVSI-AGGKTAIYDALFLSSNLFKNAN----------- 168 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-- 423 +K I+LLTDG + + K K + I++ +IA + L + Sbjct: 169 -GEKIIILLTDGMDNMSITPLDVVIKKLKKEHIKVYSIAIGGDAD-----LSVLKKISKE 222 Query: 424 SPNSFFEANSTHELNKIFRD 443 + F+ A+S +L KI+ D Sbjct: 223 TNGKFYIASSLEDLKKIYSD 242 >gi|291613312|ref|YP_003523469.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291583424|gb|ADE11082.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 321 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 56/155 (36%), Gaps = 17/155 (10%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + ++ + + + + A+ TAI DA+ A + + Sbjct: 130 QVGLILFGTQPYLQAPLTTDLNTVGQFLDE--AMIGVAGTQTAIGDAIGLAIKRLRDAT- 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + + + ++LLTDG N A + G+RI TI ++T Sbjct: 187 -----NVSGRKGETVLILLTDGSNDAGAMPPDEAAKMAAAAGLRIYTIGVGSDQTDPFGM 241 Query: 413 ----EKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + L A + +F A L +++ Sbjct: 242 GGANDLDEDTLKLIAKTTGGEYFRATDVENLQQVY 276 >gi|281423276|ref|ZP_06254189.1| BatA protein [Prevotella oris F0302] gi|281402612|gb|EFB33443.1| BatA protein [Prevotella oris F0302] Length = 332 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 67/189 (35%), Gaps = 35/189 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDEN 332 +A +V ++ +G T F + + + L++++ A Sbjct: 111 EAAKNVASEFIADRPNDN---IGLTIFAGEAFTQCPMTTDHVSLINLLQSVRTDIAARGL 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 TA+ + A + K++ K ++LLTDG N + + Sbjct: 168 ISDGTAVGMGLANAVSRL-----------KDSKAKSKVVILLTDGSNNMGDISPMTSAQI 216 Query: 393 AKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFEANST 434 A+S GIR+ TI NK E L + A + +F+ A + Sbjct: 217 ARSFGIRVYTIGIGTNKVAPYPMPVAGGIQYVNIPVEIDSKTLKDIAATTEGNFYRATNN 276 Query: 435 HELNKIFRD 443 +L +I++D Sbjct: 277 RQLKQIYKD 285 >gi|299140484|ref|ZP_07033622.1| BatA protein [Prevotella oris C735] gi|298577450|gb|EFI49318.1| BatA protein [Prevotella oris C735] Length = 332 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 67/189 (35%), Gaps = 35/189 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDEN 332 +A +V ++ +G T F + + + L++++ A Sbjct: 111 EAAKNVASEFIADRPNDN---IGLTIFAGEAFTQCPMTTDHVSLINLLQSVRTDIAARGL 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 TA+ + A + K++ K ++LLTDG N + + Sbjct: 168 ISDGTAVGMGLANAVSRL-----------KDSKAKSKVVILLTDGSNNMGDISPMTSAQI 216 Query: 393 AKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFEANST 434 A+S GIR+ TI NK E L + A + +F+ A + Sbjct: 217 ARSFGIRVYTIGIGTNKVAPYPMPVAGGIQYVNIPVEIDSKTLKDIAATTEGNFYRATNN 276 Query: 435 HELNKIFRD 443 +L +I++D Sbjct: 277 RQLKQIYKD 285 >gi|223939937|ref|ZP_03631805.1| von Willebrand factor type A [bacterium Ellin514] gi|223891428|gb|EEF57921.1| von Willebrand factor type A [bacterium Ellin514] Length = 342 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 23/117 (19%), Positives = 42/117 (35%), Gaps = 15/117 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ ++ +G T I A+ AY + + Sbjct: 135 RVGLVAFAGQAFLQCPLTFDYDAFRDALLAIDEQTI-PVGGTDIGRALDEAYRAME---K 190 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 ++ H K +VL+TDGE+ + + GI +GI + TI Sbjct: 191 NDRH---------KILVLITDGEDLE--KAGIKTAQALAEKGIVVYTIGVGTAAGSP 236 >gi|71278376|ref|YP_269691.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71144116|gb|AAZ24589.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 364 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 72/197 (36%), Gaps = 24/197 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 S + + + + + + V+ L + ++S + R+G F D Sbjct: 129 DSSATKSSTNDTGKGEKVNRLVAVKHVLNAFVKS-------REHDRLGLILFGDAPYLQA 181 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 F+ + + ++ STA DA+ A + + + Sbjct: 182 PFTDDIATWQALLNESDI--GMAGQSTAFGDAIGLAIS-----------VFQQSDTQNRV 228 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPN 426 +++LTDG +T + A ++ I+I TIA +++ L A + Sbjct: 229 LIVLTDGNDTASKVPPVEAAKVAAARDIKIYTIAIGDPSAVGEEKVDLEVLQAMAEITQG 288 Query: 427 SFFEANSTHELNKIFRD 443 F+A ++ EL K++ + Sbjct: 289 KSFQALNSEELLKVYAE 305 >gi|183583553|ref|NP_694996.5| collagen alpha-5(VI) chain [Homo sapiens] Length = 2526 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 64/157 (40%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + K+ ++++TDGE + D+++ + +++GI I + Sbjct: 899 HA-NALF----TEEHGSRIKQNVKQMLIVITDGE-SHDHDQLNDTALELRNKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + N+ ++ +L +F Sbjct: 953 V-----GKANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 >gi|282858824|ref|ZP_06267969.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588393|gb|EFB93553.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 318 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 54/154 (35%), Gaps = 18/154 (11%) Query: 297 MGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G T F + + + L+ + + TAI + A + +S Sbjct: 129 IGLTIFAGEAFTQCPLTIDHATLINLLNNVRADLVVKGLIQDGTAIGMGLANAVGRLKAS 188 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-- 411 N K ++LLTDG N + + AK IR+ TI ++ Sbjct: 189 NAKS-----------KIVILLTDGSNNVGSISPMTAATIAKKFNIRVYTIGLGTEQSGNY 237 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L A + F+ A S EL +I+ D Sbjct: 238 NDIDYTTLKQIALTTNGEFYRAQSQTELLQIYND 271 >gi|237716505|ref|ZP_04546986.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262408103|ref|ZP_06084651.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645097|ref|ZP_06722823.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809498|ref|ZP_06768201.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|298484179|ref|ZP_07002345.1| BatA protein [Bacteroides sp. D22] gi|229444152|gb|EEO49943.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262354911|gb|EEZ04003.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639603|gb|EFF57895.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443316|gb|EFG12080.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|295084189|emb|CBK65712.1| von Willebrand factor type A domain. [Bacteroides xylanisolvens XB1A] gi|298269683|gb|EFI11278.1| BatA protein [Bacteroides sp. D22] Length = 327 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPMTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L+ A + ++F A S +L +++ + Sbjct: 235 PVGNTVQYVSMPVEIDEKTLTEIAGTTDGNYFRATSNSKLKEVYEE 280 >gi|85374104|ref|YP_458166.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] gi|84787187|gb|ABC63369.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] Length = 623 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 38/317 (11%), Positives = 90/317 (28%), Gaps = 51/317 (16%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 T E+GVS + + S +++ + ++ Y+ + + Sbjct: 303 TTQPMQSTENGVSYRVRLQGSNCIVERREYNDYVQTFDEVTVVPTLANLYNYLPVAMDVS 362 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS---------LRHVIKKKHLVRD 276 + + + + +D S + + +++ ++ Sbjct: 363 NWRAEALGCMEERKSTVLTDFSSVDLSANLDLDINTVPVASDQDTQWRPRYPDMIYVRSK 422 Query: 277 ALAS--VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 D + ++ G N P+ + + + + ++ Sbjct: 423 EADDKGSFSPAPVYDTKKEFIQTG----NWWFSGCPAPAQKLKAMTSGELDSYLDSLTPH 478 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRM-KNNLEAKKYIVLLTDGEN-------------- 379 G+T + M + K ++++ LTDG+ Sbjct: 479 GATYHDGGMIWGGRLLSQYGLFAAENSSKPGRTTSRHLIFLTDGQTEPYDLAYGSYGIDP 538 Query: 380 ------TQDNEEGIA---------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 TQ + +A CN+ K G + +AF + + CA Sbjct: 539 IDERRWTQTSSLTLAQTVEERFLFACNEVKKLGATVWVVAFGTAANDK------MKTCAG 592 Query: 425 PNSFFEANSTHELNKIF 441 +FEA + +LN F Sbjct: 593 SGRYFEAANASQLNDAF 609 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 71/197 (36%), Gaps = 35/197 (17%) Query: 12 KKLIKSCTGH-FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L +G+ +I A L+P +L + G VD+ R E L+QA + ++ A L Sbjct: 2 RRLASDRSGNTLALIAAGLLP-LLAMAGSGVDMSRAYLAESRLQQACDSGVLAARKALGT 60 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + ++ ++ T + E+ NF++ + + Sbjct: 61 EIATLTDIPTDAGTRGQ----EFFNSNFQDG-------------------NYGTQNRTFN 97 Query: 131 VVLSSRYDL----LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 +VL + Y + ++ + + G ++ + +A V + V+D + Sbjct: 98 MVLENDYSVSGTATVDVPTSVMTVFGFTKIPVKVECQARISFSD------VDVMMVLDVT 151 Query: 187 RSMLDYQRDSEGQPLNC 203 SM ++ Sbjct: 152 GSMKHTNSGDTLSKIDS 168 >gi|332817900|ref|XP_526306.3| PREDICTED: collagen alpha-5(VI) chain isoform 2 [Pan troglodytes] Length = 2526 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 64/157 (40%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + K+ ++++TDGE + D+++ + +++GI I + Sbjct: 899 HA-NALF----TEEHGSRIKQNVKQMLIVITDGE-SHDHDQLNDTALELRNKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + N+ ++ +L +F Sbjct: 953 V-----GKANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 >gi|14042797|dbj|BAB55397.1| unnamed protein product [Homo sapiens] Length = 397 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 63/185 (34%), Gaps = 23/185 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV---- 324 K +DAL +++ ++ D R F++R+ W H + T Sbjct: 4 TKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKV-----WKDHLISVTPDSIRD 52 Query: 325 -KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 K + + G T IN A+Q A + N+ H + IV LTDG+ T Sbjct: 53 GKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPTVGE 108 Query: 384 EEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + I N + + I TI + + + L NC E Sbjct: 109 THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIG 168 Query: 441 FRDRI 445 F D I Sbjct: 169 FYDEI 173 >gi|149922008|ref|ZP_01910450.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] gi|149817173|gb|EDM76653.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] Length = 996 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 69/195 (35%), Gaps = 29/195 (14%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 P+L+ +D S + LV++A + + +D ++ +G F++ S Sbjct: 524 EQPTLALILVIDKSGSMSSGDRLDLVKEAAR---ATARTLDPSDE---IGVIAFDN---S 574 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 R + + + G T A++ AY + S H Sbjct: 575 PQVLVRLQPAANRLRISSSIRRLSAGGGTNAMPALREAYLQLAGSKALVKH--------- 625 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PN 426 ++LL+DGE+ ++ A+ + I + ++ + FL A Sbjct: 626 --VILLSDGESPENGIN--ALLGDMRQSDITVSSVGV-----GDGAGKDFLIRVAERGRG 676 Query: 427 SFFEANSTHELNKIF 441 +F + ++ +IF Sbjct: 677 RYFYSEDGTDVPRIF 691 >gi|307565272|ref|ZP_07627765.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307345941|gb|EFN91285.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 318 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 75/215 (34%), Gaps = 27/215 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + S + +D L+ + + + + + ++ + I ++ Sbjct: 76 SYSAWDNKDSEGIDIMLAMDISASMLTNDVIPNRLEVAKEVASDFI----SGRPNDN--- 128 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTIIS 352 +G T F + + LI ++ + D G TAI + A + Sbjct: 129 IGLTIFAGEAFTQCPLTTDHASLIN-LLNSVRTDLVVKGLIQDGTAIGMGLINAVGRL-- 185 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKT 410 K++ K ++LLTDG N + + AK IR+ TI N Sbjct: 186 ---------KSSKAKSKVVILLTDGSNNVGSISPMTAAEIAKKFNIRVYTIGLGTEQNNG 236 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L A + F+ A S EL++I++D Sbjct: 237 YSDIDYTTLRQIANVTNGKFYSAQSQTELSQIYKD 271 >gi|20089145|ref|NP_615220.1| hypothetical protein MA0247 [Methanosarcina acetivorans C2A] gi|19914014|gb|AAM03700.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 589 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 79/240 (32%), Gaps = 36/240 (15%) Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 +T G+ +E V+ S P + +DSS Sbjct: 40 PAVSKTASPALINIAGSGVNEETTVTIEVTGAGSTSTSAVP------MDVVFAIDSSGSM 93 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 L + A S + + + G ++D + + +VK Sbjct: 94 QSNDPSGLRKTAAKSFVDKMDSSRDTA-----GVVSWDDSIDFSLPLTNDFP-----LVK 143 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T + GST +N ++ A D + ++ R +N++E I+ LTDG+ T Sbjct: 144 TNIDSVDSSGSTNLNVGLEEAIDILDANP-----RTENSVE---VIIFLTDGQGTY---- 191 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRD 443 + +A +G I +I L + A+ +++ + L IF D Sbjct: 192 LHSTAQEAADKGYVIYSIGLGGVNPTP------LQDMATTTGGAYYSSPDATSLQAIFDD 245 >gi|296228118|ref|XP_002759733.1| PREDICTED: collagen alpha-6(VI) chain [Callithrix jacchus] Length = 2267 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 63/166 (37%), Gaps = 15/166 (9%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D G ++ D+ G+T Sbjct: 829 DFMIGLVKKADVGKNRVRFGALKYADDPEVLFYL--GDFDTKLEVISVLQNDQPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 ++A+ + + + N + ++++TDGE + D ++ A + +GI Sbjct: 887 TSEALGFSDHMFTEAQGSRL-----NKGVPQVLIVITDGE-SHDADKLNATAKALRDKGI 940 Query: 399 RIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ + N + L+ S + +F + L IF D Sbjct: 941 LVLAVGIAGANPVE------LLAMAGSSDKYFFVETFGGLKGIFSD 980 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/126 (19%), Positives = 47/126 (37%), Gaps = 10/126 (7%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ D + VR+GA F+ + G + I + G+T I Sbjct: 1022 LVSVVQDFDVSLNRVRIGAAQFSHNYRQEFPL--GTFIGEKEISFQIENIQQLGGNTHIG 1079 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 DA++ D R+ + +++LTDG Q +E + +GI I Sbjct: 1080 DALRQVGHYF---RPDMGSRINTGTP--QVLLVLTDG---QSQDEVAQAAEALRHRGIDI 1131 Query: 401 MTIAFS 406 ++ Sbjct: 1132 YSVGIG 1137 >gi|189082691|sp|A8TX70|CO6A5_HUMAN RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; AltName: Full=von Willebrand factor A domain-containing protein 4; Flags: Precursor gi|158828630|gb|ABW81241.1| collagen XXIX alpha 1 [Homo sapiens] Length = 2615 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 64/157 (40%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + K+ ++++TDGE + D+++ + +++GI I + Sbjct: 899 HA-NALF----TEEHGSRIKQNVKQMLIVITDGE-SHDHDQLNDTALELRNKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + N+ ++ +L +F Sbjct: 953 V-----GKANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 >gi|332817903|ref|XP_003310057.1| PREDICTED: collagen alpha-6(VI) chain [Pan troglodytes] Length = 2263 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 73/189 (38%), Gaps = 20/189 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWG 315 FV SS + ++++D +I +KK D + VR GA + D +G Sbjct: 811 VFVIDSSGSIDYDEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDDFG 867 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ D+ GST +A+ + + + N + ++++T Sbjct: 868 TK---LEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRL-----NKGVPQVLIVIT 919 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANST 434 DGE + D ++ A + +GI ++ + N + L+ S + +F + Sbjct: 920 DGE-SHDADKLNATAKALRDKGILVLAVGIDGANPME------LLAMAGSSDKYFFVETF 972 Query: 435 HELNKIFRD 443 L IF D Sbjct: 973 GGLKGIFSD 981 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 15/166 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++ D + VR+GA F+D + G + I + Sbjct: 1017 KKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQIF 1074 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T I A++ D R+ + +++LTDG Q +E + Sbjct: 1075 GNTHIGAALREVEHYF---RPDMGSRINTGTP--QVLLVLTDG---QSQDEVAQAAEALR 1126 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +GI I ++ + + + ++ EL K+ Sbjct: 1127 HRGIDIYSVGI-----GDVDDQQLIQITGTAEKKLTVHNFDELKKV 1167 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 68/230 (29%), Gaps = 27/230 (11%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 S SQN I+D + + P D + ++ S K Sbjct: 193 SMFSQNMTHIIKDVTKYKEGAVDDIFVEACQGPSMADVVFLLDMSINGSEENFDYLK--- 249 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + S+ +D + +R+G +++ S S G++K + V + + Sbjct: 250 -----GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGINK---SEVLQHIQNLSP 301 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 A A A + + + N + VL+T + + Sbjct: 302 RTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVT---HRDSEDNVTKAAVNL 355 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + +G+ I T+ Q L AS A K F D Sbjct: 356 RREGVTIFTLGIEGASDTQ------LEKIAS----HPAEQYVSKLKTFAD 395 >gi|190339201|gb|AAI63867.1| Matn4 protein [Danio rerio] Length = 944 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 55/165 (33%), Gaps = 20/165 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + D R+G ++ RV ++ S ++ I K E Sbjct: 729 ELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 783 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + +V TDG + ++ Sbjct: 784 YMEKGTMTGLALKHMVENSFSEAEGARPAEKN--IPRVGLVF-TDG---RSQDDIQEWAK 837 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 KAK GI + + + L AS FF + Sbjct: 838 KAKEAGITMYAVGVGKAVEDE------LREIASDPVEKHFFYSAD 876 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 63/174 (36%), Gaps = 25/174 (14%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTA 338 +I I ++D R+G ++ +V + S ++++ I + + + M A Sbjct: 58 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLA 117 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I AM A+ + + H V++TDG + + + A+ GI Sbjct: 118 IRYAMNVAFSAEEGARPNVPHVA----------VIVTDG---RPQDRVAEVAAAARESGI 164 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 I + + L ASP + F S +L F + +++ Sbjct: 165 EIYAVGVARADMTS------LRAMASPPFEDHVFLVESF-DLIHQFGLQFQDKL 211 >gi|114589213|ref|XP_516745.2| PREDICTED: hypothetical protein [Pan troglodytes] Length = 1859 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 73/189 (38%), Gaps = 20/189 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWG 315 FV SS + ++++D +I +KK D + VR GA + D +G Sbjct: 1043 VFVIDSSGSIDYDEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDDFG 1099 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ D+ GST +A+ + + + N + ++++T Sbjct: 1100 TK---LEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRL-----NKGVPQVLIVIT 1151 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANST 434 DGE + D ++ A + +GI ++ + N + L+ S + +F + Sbjct: 1152 DGE-SHDADKLNATAKALRDKGILVLAVGIDGANPME------LLAMAGSSDKYFFVETF 1204 Query: 435 HELNKIFRD 443 L IF D Sbjct: 1205 GGLKGIFSD 1213 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 15/166 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++ D + VR+GA F+D + G + I + Sbjct: 1249 KKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQIF 1306 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T I A++ D R+ + +++LTDG Q +E + Sbjct: 1307 GNTHIGAALREVEHYF---RPDMGSRINTGTP--QVLLVLTDG---QSQDEVAQAAEALR 1358 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +GI I ++ + + + ++ EL K+ Sbjct: 1359 HRGIDIYSVGI-----GDVDDQQLIQITGTAEKKLTVHNFDELKKV 1399 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 68/230 (29%), Gaps = 27/230 (11%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 S SQN I+D + + P D + ++ S K Sbjct: 413 SMFSQNMTHIIKDVTKYKEGAVDDIFVEACQGPSMADVVFLLDMSINGSEENFDYLK--- 469 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + S+ +D + +R+G +++ S S G++K + V + + Sbjct: 470 -----GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGINK---SEVLQHIQNLSP 521 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 A A A + + + N + VL+T + + Sbjct: 522 RTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVT---HRDSEDNVTKAAVNL 575 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + +G+ I T+ Q L AS A K F D Sbjct: 576 RREGVTIFTLGIEGASDTQ------LEKIAS----HPAEQYVSKLKTFAD 615 >gi|56797871|emb|CAG27569.1| matrilin-4 [Danio rerio] Length = 644 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 55/165 (33%), Gaps = 20/165 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + D R+G ++ RV ++ S ++ I K E Sbjct: 429 ELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 483 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + +V TDG + ++ Sbjct: 484 YMEKGTMTGLALKHMVENSFSEAEGARPAEKN--IPRVGLVF-TDG---RSQDDIQEWAK 537 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 KAK GI + + + L AS FF + Sbjct: 538 KAKEAGITMYAVGVGKAVEDE------LREIASDPVEKHFFYSAD 576 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 63/174 (36%), Gaps = 25/174 (14%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTA 338 +I I ++D R+G ++ +V + S ++++ I + + + M A Sbjct: 45 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLA 104 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I AM A+ + + H V++TDG + + + A+ GI Sbjct: 105 IRYAMNVAFSAEEGARPNVPHVA----------VIVTDG---RPQDRVAEVAAAARESGI 151 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 I + + L ASP + F S +L F + +++ Sbjct: 152 EIYAVGVARADMTS------LRAMASPPFEDHVFLVESF-DLIHQFGLQFQDKL 198 >gi|56797865|emb|CAG27566.1| matrilin-4 [Danio rerio] Length = 726 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 55/165 (33%), Gaps = 20/165 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + D R+G ++ RV ++ S ++ I K E Sbjct: 511 ELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 565 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + +V TDG + ++ Sbjct: 566 YMEKGTMTGLALKHMVENSFSEAEGARPAEKN--IPRVGLVF-TDG---RSQDDIQEWAK 619 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 KAK GI + + + L AS FF + Sbjct: 620 KAKEAGITMYAVGVGKAVEDE------LREIASDPVEKHFFYSAD 658 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 64/174 (36%), Gaps = 25/174 (14%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTA 338 +I I ++D R+GA ++ +V + S ++++ I + + + M A Sbjct: 45 MIDIIHELDIGLAATRIGAVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLA 104 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I AM A+ + + H V++TDG + + + A+ GI Sbjct: 105 IRYAMNVAFSAEEGARPNVPHVA----------VIVTDG---RPQDRVAEVAAAARESGI 151 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 I + + L ASP + F S +L F + +++ Sbjct: 152 EIYAVGVARADMTS------LRAMASPPFEDHVFLVESF-DLIHQFGLQFQDKL 198 >gi|56797863|emb|CAG27565.1| matrilin-4 [Danio rerio] Length = 944 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 55/165 (33%), Gaps = 20/165 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + D R+G ++ RV ++ S ++ I K E Sbjct: 729 ELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 783 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + +V TDG + ++ Sbjct: 784 YMEKGTMTGLALKHMVENSFSEAEGARPAEKN--IPRVGLVF-TDG---RSQDDIQEWAK 837 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 KAK GI + + + L AS FF + Sbjct: 838 KAKEAGITMYAVGVGKAVEDE------LREIASDPVEKHFFYSAD 876 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 63/174 (36%), Gaps = 25/174 (14%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTA 338 +I I ++D R+G ++ +V + S ++++ I + + + M A Sbjct: 58 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLA 117 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I AM A+ + + H V++TDG + + + A+ GI Sbjct: 118 IRYAMNVAFSAEEGARPNVPHVA----------VIVTDG---RPQDRVAEVAAAARESGI 164 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 I + + L ASP + F S +L F + +++ Sbjct: 165 EIYAVGVARADMTS------LRAMASPPFEDHVFLVESF-DLIHQFGLQFQDKL 211 >gi|47087209|ref|NP_998714.1| matrilin-2 [Danio rerio] gi|45827653|gb|AAS78465.1| matrilin-4-like protein [Danio rerio] Length = 821 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 55/165 (33%), Gaps = 20/165 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + D R+G ++ RV ++ S ++ I K E Sbjct: 606 ELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 660 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + +V TDG + ++ Sbjct: 661 YMEKGTMTGLALKHMVENSFSEAEGARPAEKN--IPRVGLVF-TDG---RSQDDIQEWAK 714 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 KAK GI + + + L AS FF + Sbjct: 715 KAKEAGITMYAVGVGKAVEDE------LREIASDPVEKHFFYSAD 753 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 63/174 (36%), Gaps = 25/174 (14%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTA 338 +I I ++D R+G ++ +V + S ++++ I + + + M A Sbjct: 58 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLA 117 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I AM A+ + + H V++TDG + + + A+ GI Sbjct: 118 IRYAMNVAFSAEEGARPNVPHVA----------VIVTDG---RPQDRVAEVAAAARESGI 164 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 I + + L ASP + F S +L F + +++ Sbjct: 165 EIYAVGVARADMTS------LRAMASPPFEDHVFLVESF-DLIHQFGLQFQDKL 211 >gi|118375014|ref|XP_001020694.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89302461|gb|EAS00449.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 610 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 72/207 (34%), Gaps = 27/207 (13%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P+L +D+S K V++ + ++ + + D R+ FN Sbjct: 149 STSSRPNLDLVCIIDNSESMSGCSKIENVKNTILQLLEMLNEND------RLSLITFNSY 202 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V L + ++ G T I ++ A+ + S + KN++ Sbjct: 203 AKQLCGLK-KVSNLNKETLQAITNSIKAYGGTNITSGLEIAFQILQSRKK------KNSV 255 Query: 366 EAKKYIVLLTDGENTQDNEEGIA----ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + I LL+DG++ + + + + + I + +F + + Sbjct: 256 SS---IFLLSDGQDDGADTKIKNLLKITYQQLQEESFTIHSFSFGSDHDCP-----LMQK 307 Query: 422 CAS--PNSFFEANSTHELNKIFRDRIG 446 A SF+ ++++ F D +G Sbjct: 308 IAQIKDGSFYFVEKNDQVDEFFIDALG 334 >gi|332817898|ref|XP_003310056.1| PREDICTED: collagen alpha-5(VI) chain isoform 1 [Pan troglodytes] Length = 2615 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 64/157 (40%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + K+ ++++TDGE + D+++ + +++GI I + Sbjct: 899 HA-NALF----TEEHGSRIKQNVKQMLIVITDGE-SHDHDQLNDTALELRNKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + N+ ++ +L +F Sbjct: 953 V-----GKANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 >gi|33601708|ref|NP_889268.1| hypothetical protein BB2732 [Bordetella bronchiseptica RB50] gi|33576145|emb|CAE33224.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 336 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 42/265 (15%), Positives = 76/265 (28%), Gaps = 27/265 (10%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + F +M S +P G+ + R + + P + Sbjct: 33 LRLPFFDAMATLTGKSPTRPGVQRGRAQLWVNVAVWLLLALALARPQWVEPPLTHVEPMR 92 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 +L +D S S + + + V+ + I K + R+G Sbjct: 93 DILL---AVDISQSMDSEDFRDAQGRPASRWQAVQAVVGDF---IDKRPDD----RLGLI 142 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F + L + +T TA+ DA+ + + E Sbjct: 143 VFGAGAYPQAPLTRDHAALRLLLQRTAVGMAGPN--TALGDAIGLGIRMLDHARER---- 196 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-- 418 K ++LLTDG +T A + + TI E F Sbjct: 197 -------DKILILLTDGNDTASAVPPARAAELAAQHRVVVHTIGIGDPAASGEDRVDFDA 249 Query: 419 LSNCA--SPNSFFEANSTHELNKIF 441 L + A + FF A L +++ Sbjct: 250 LRDIARIAGGRFFRARDQASLQEVY 274 >gi|119599630|gb|EAW79224.1| hypothetical protein FLJ35880 [Homo sapiens] Length = 2531 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 64/157 (40%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 786 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 843 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + K+ ++++TDGE + D+++ + +++GI I + Sbjct: 844 HA-NALF----TEEHGSRIKQNVKQMLIVITDGE-SHDHDQLNDTALELRNKGITIFAVG 897 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + N+ ++ +L +F Sbjct: 898 V-----GKANQKELEGMAGNKNNTIYVDNFDKLKDVF 929 >gi|114571147|ref|YP_757827.1| hypothetical protein Mmar10_2603 [Maricaulis maris MCS10] gi|114341609|gb|ABI66889.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 520 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 40/176 (22%), Positives = 65/176 (36%), Gaps = 40/176 (22%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + RT++ D G+T I + + I ++ E K +V+LT Sbjct: 349 LTSTERTVLNAIG-DMGASGTTNIPNGVGWGIRLISPGAPFTEGSAWDDDEYIKAMVILT 407 Query: 376 DGEN-----------------------------------TQDNEEGIAICNKAKSQGIRI 400 DG+N + ++ A C A+S GIR+ Sbjct: 408 DGDNVMRGRNTDQMSDYEAYGFVADGRLGRRSSSSNVLSNELDDRTEAACAYARSLGIRV 467 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 TI F VN + R + NCAS P+ +F++ S+ L F G+ R+ R Sbjct: 468 YTITFQVNSSS---TRSLMQNCASNPSLYFDSPSSEALEDAFEMIAGDLTNLRLSR 520 >gi|160882770|ref|ZP_02063773.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|237720676|ref|ZP_04551157.1| BatA [Bacteroides sp. 2_2_4] gi|260170239|ref|ZP_05756651.1| aerotolerance protein BatA [Bacteroides sp. D2] gi|293373990|ref|ZP_06620331.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145608|ref|ZP_07038676.1| BatA protein [Bacteroides sp. 3_1_23] gi|315918602|ref|ZP_07914842.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111794|gb|EDO13539.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|229449511|gb|EEO55302.1| BatA [Bacteroides sp. 2_2_4] gi|292631066|gb|EFF49703.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516099|gb|EFI39980.1| BatA protein [Bacteroides sp. 3_1_23] gi|313692477|gb|EFS29312.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 327 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPMTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L+ A + ++F A S +L +++ + Sbjct: 235 PVGNTVQYVSMPVEIDEKTLTEIAGTTDGNYFRATSNSKLKEVYEE 280 >gi|145219382|ref|YP_001130091.1| hypothetical protein Cvib_0567 [Prosthecochloris vibrioformis DSM 265] gi|145205546|gb|ABP36589.1| conserved hypothetical protein [Chlorobium phaeovibrioides DSM 265] Length = 356 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 38/85 (44%), Gaps = 4/85 (4%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 S++L + G I+ AL++PV+LG + VD+ R + L+ AA A + + L Sbjct: 8 SRRLHRQRGGT-AILFALVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGAHSLSD 66 Query: 71 SLEEV---SSRAKNSFTFPKQKIEE 92 + + S+ + + + Sbjct: 67 AGGQPYNWSAAVNAAQNVVQSNVAN 91 >gi|116623631|ref|YP_825787.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226793|gb|ABJ85502.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 589 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 43/256 (16%), Positives = 77/256 (30%), Gaps = 21/256 (8%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 + T L+ V++ + G+ +D R L A + A+ L + Sbjct: 18 MFTLLVSSVLIPMVGLAIDGGRGYLVRLKLSSAVDGGALAAARLLGSGSNAAQQLSMAKA 77 Query: 84 TFPKQKIEEYLIRNFENNL--KKNFTDREVRDIVRDTAVEMNPRKSAYQV-VLSSRYDLL 140 T + + + F +L N D V S Y+V ++ + Sbjct: 78 TAAQFVNANFPAKFFGASLSGAANVCVDPGTDSSDPCGVGNGSGISTYKVRTVAVKATAT 137 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 + L F+R +G+ + + A V + V+D S SM Y P Sbjct: 138 MPTL--FMRIIGMPTVTVSGSGTAS--------RRDVRVILVMDRSSSMGTYYSGINQTP 187 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY-------MVSCNKSLYYMLYPGPLDPSL 253 A + V S+S G + L Y + + Y Y P Sbjct: 188 -PSINDMALKFVNSFSGAGEFGGRDEVGLVVYGGSGIVAYPPRDITKDYTDYTKFTPPDN 246 Query: 254 SEEHFVDSSSLRHVIK 269 + + + I Sbjct: 247 NFKASGNIPKYIADIT 262 >gi|126341666|ref|XP_001379908.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2347 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 73/187 (39%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +++++ +I+ +KK D D VR GA ++ F Sbjct: 817 VFVIDSSGSIDYNEYNIMKE---FMIKLVKKADVAKDRVRFGALKYS--YDPTILFYLDE 871 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ D + G T A+ + + + N + + ++++TD Sbjct: 872 FDTRSKVISLLQNDSPKGGDTYTAKALAFSEHMFTEARGSRI-----NQKVPQVLIVITD 926 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE + D + A + +GI I+ + + T+ L+ S + +F + Sbjct: 927 GE-SHDANQLNATAKALRDKGILILAVGIAGANTE-----ELLAMAGSTDKYFFVETFGG 980 Query: 437 LNKIFRD 443 L IF++ Sbjct: 981 LKGIFQN 987 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 60/184 (32%), Gaps = 24/184 (13%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + ++ + D V +G F+ ++ G + + + Sbjct: 1020 ENFKKMKDFLVSVVDDFDIGPSRVHIGLAQFSHVYRAEFFL--GSFTSEGEVSTQIEMTQ 1077 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G+T I A++ + N+ ++ +++LTDG Q +E Sbjct: 1078 QVFGNTHIGAALKQVEQYFRPEMGSRI-----NVGIQQVLLVLTDG---QSQDEVAKAAE 1129 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + +GI I ++ + + + + ++ EL +I + Sbjct: 1130 DLRRKGIDIYSLGI-----GDVDEQQLIQISGTSDKKLTIDNFDEL---------KKIKK 1175 Query: 452 RVIR 455 R++R Sbjct: 1176 RIVR 1179 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 52/151 (34%), Gaps = 23/151 (15%) Query: 293 DTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D V++G F+D + + W H++ I + ID T A++ Sbjct: 663 DQVQVGIVQFSDVNKEEFQLNRYWTQHEIFDAIDRMSNIDRE----TLTGSALKFV---- 714 Query: 351 ISSNEDEVHRMKNNLE-AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 D H K +K+++L+TDGE+ ++ + G+ I ++ Sbjct: 715 ----SDYFHPSKGARPGVRKFLILITDGESQDPVKDP---AMALRQDGVIIYSVGVYGAN 767 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKI 440 Q + P F + L I Sbjct: 768 ETQ-----LVEISGKPEMIFYVETFDILKHI 793 >gi|300853773|ref|YP_003778757.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] gi|300433888|gb|ADK13655.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] Length = 419 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 61/172 (35%), Gaps = 18/172 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + +DN+N R+ F+D + L + + E G+T + +A+ Sbjct: 140 LNLMDNMNTQNRVSIYKFDDTSKRIIPMTEVSESLKKNAEEELKQYEIPAGNTNMGEAID 199 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIAICNKAKSQGIRIMTI 403 +AY+ I S+ ++LL+DGE+ N++ K I I TI Sbjct: 200 SAYNEINSTKRPGRKAA---------VILLSDGEDNFGLNKKFDETLKPFKDSNISIYTI 250 Query: 404 AFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERV 453 E L A ++ + +L F +I +R+ Sbjct: 251 G-----MSNENNFTTLKKIAKDTHGEYYNVKNASDLKGTF-SKIYYATQQRL 296 >gi|297671961|ref|XP_002814089.1| PREDICTED: collagen alpha-5(VI) chain-like [Pongo abelii] Length = 2586 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 65/157 (41%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D + V+ GA ++D+ + I++ + + G+T A++ Sbjct: 841 VKKADVGRNRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRMRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + K+ ++++TDG+ + D+++ ++ + +GI I + Sbjct: 899 HA-NALF----TEKHGSRIKQNVKQVLIVITDGK-SHDHDQLNDTASELRDKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + N+ ++ +L +F Sbjct: 953 V-----GKANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 52/157 (33%), Gaps = 21/157 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + KI D ++G F+D+ + S + I T A+ Sbjct: 655 LTKIQVGADKTQIGVVQFSDKTKEEFQLSRYF--TQQEISDAIDRMSLINEGTLTGKALN 712 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 H L AKK+++L+TDG D + I + + + I ++ Sbjct: 713 FVSQYFT-------HSKGARLGAKKFLILITDGVAQDDVRDPARI---LRGKDVTIFSVG 762 Query: 405 -FSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNK 439 ++ N++Q L + S F + L Sbjct: 763 VYNANRSQ-------LEEISGDGSLVFHVENFDHLKA 792 >gi|163751139|ref|ZP_02158369.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161329095|gb|EDQ00167.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 334 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 67/193 (34%), Gaps = 35/193 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + +++D ++ I +K D ++G F D + + + + + Sbjct: 105 QTVDRFTMIQDVVSDFIER-RKGD------KLGLILFADHAYLQAPLTQDRRSVAQFLQE 157 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A TAI +A+ ++ + +VLLTDG N + Sbjct: 158 --AQIGLVGKQTAIGEAIALGVKRFDMVDKSN-----------RILVLLTDGSNNSGSIS 204 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ---------------EKARYFLSNCASPNSFFE 430 A +G++I I + ++ ++A+ + +F Sbjct: 205 PEQAAAIAAKRGVKIYAIGVGADVMERRSIFGTERVNPSMDLDEAQLISLAKTTGGLYFR 264 Query: 431 ANSTHELNKIFRD 443 A S+ +L I+++ Sbjct: 265 ARSSQDLQLIYQE 277 >gi|297583258|ref|YP_003699038.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] gi|297141715|gb|ADH98472.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] Length = 978 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 55/150 (36%), Gaps = 18/150 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R G FN S ++ + + G T ++ M+ A + + + Sbjct: 98 RGGIISFNTEADMLQEMSDN----RYDLLDALSALPDPSGGTDLSQGMRAANEQFVQTKG 153 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K+ +VL+TDG +T + E +A+ GI I T+ T ++A Sbjct: 154 AN----------KQIMVLITDGADTINLAEVYNQVREARMNGITIFTLGLGSLATGLDEA 203 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRD 443 L + A + + + + + + +D Sbjct: 204 --LLQDIADQTRGQYRQVPNATVIESVLQD 231 >gi|254506100|ref|ZP_05118244.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] gi|219550918|gb|EED27899.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] Length = 415 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 62/461 (13%), Positives = 144/461 (31%), Gaps = 75/461 (16%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + IK +GH ++ A+++P + G+ + D R + ++ A++ A++ + Sbjct: 2 RKIKKQSGHAALLFAMIIPGLFGIFTLATDGARALQTKARIEDASEIAVLAIAAH--NDD 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN-----PRKS 127 + S A + +Q +YL ++ + + + + PR Sbjct: 60 NQDSQGAGSGSRVNRQIATDYLNAYLRDSTQLTGLKVKKYNCDQIAECRAGLARGEPRFF 119 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y++ +SS D P + + G T + ++ V I +V D+S Sbjct: 120 QYEIEVSSVQDTWF-PGNDSIEGFG------DTFSAKGAAVARKYQSEAVDIIFVSDYSG 172 Query: 188 SMLDYQRDSEGQP-------LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 SM + + + ++ N VG+ V N+S Sbjct: 173 SMAWNWSGGRNRKYIDLRNIIQEVTDELQKFNDLNNTDNNTVGLTAFNYYTKTVPSNRSN 232 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + + ++P+ ++ ++ V +S + + DN + Sbjct: 233 HCFM-TQLVNPNGRFSASQTVRNIFVEKNNRYCVNHGDSSRFQDLPLTDNYS-------- 283 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 FN+ V S TA + Sbjct: 284 SFNNSVRSFYP----------------------NHGTASFQGIIRG-----------AQM 310 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEE----GIAICNKAKS--------QGIRIMTIAFSVN 408 ++ ++ +++L+DGE+ + +C+ + G ++ V Sbjct: 311 LRKGRNPRRLLIVLSDGEDGDPSRHMQLVNAGMCSTIVNTLSGDLTPDGHKVKARLAVVG 370 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 L C + ++A + ++ + I EI Sbjct: 371 FDYDVNKNRALQKCVGAENVYKAQNRDDILNKILELITEEI 411 >gi|329954838|ref|ZP_08295855.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526942|gb|EGF53953.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 327 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 62/186 (33%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ G T F + + H ++ ++K E G Sbjct: 111 EAAKDVAAEFINGRPNDNV---GITLFAGESFTQCPLTVD-HAVLLNLIKDVKCGLIEDG 166 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 M A + V R+K++ K I+LLTDG N + + AKS Sbjct: 167 ---TAVGMGIA---------NAVTRLKDSKAKSKVIILLTDGTNNRGEISPLTAAEIAKS 214 Query: 396 QGIRIMTIAF----SVNKTQQEKA------------RYFLSNCA--SPNSFFEANSTHEL 437 GIR+ TI L+ A + ++F A S +L Sbjct: 215 FGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKL 274 Query: 438 NKIFRD 443 +++ + Sbjct: 275 KEVYEE 280 >gi|169826904|ref|YP_001697062.1| hypothetical protein Bsph_1324 [Lysinibacillus sphaericus C3-41] gi|168991392|gb|ACA38932.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 825 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 59/168 (35%), Gaps = 31/168 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVHKLIRTIVK 325 K L ++A A + ++ D +G F+DR +I + + TI+ Sbjct: 382 SKLELAKEAAARSVEMLRDEDT------LGFIAFDDRPWEIIETGPL-NNKEEAVDTILS 434 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T I ++ AY+ + H I+LLTDG++ N + Sbjct: 435 -----VTPGGGTEIYGSLAKAYENLADMKLQRKH-----------IILLTDGQSQPGNYD 478 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + K GI + T+A + LS S F+ Sbjct: 479 DLI--EQGKDNGITLSTVAIGQDAD--ANLLEALSEMGS-GRFYNVID 521 >gi|225012026|ref|ZP_03702463.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003581|gb|EEG41554.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 334 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 54/168 (32%), Gaps = 32/168 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + +++ ++ + TAI + T+ + + Sbjct: 132 RIGLVVYAGESYTLTPITSD-KGIVKGSLREISYQGLIEDGTAIGMGLATSVNRL----- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K++ K I+LLTDG N + A GI+ TI N T + Sbjct: 186 ------KDSRAKSKVIILLTDGVNNSGFIDPKIATELAVEFGIKTYTIGLGSNGTARAPV 239 Query: 416 ------------------RYFLSNCASP--NSFFEANSTHELNKIFRD 443 L A+ +F A +L +I+ + Sbjct: 240 GILPNGSFQYAMTKVEIDEALLQEIATATGGIYFRATDNKKLEEIYEE 287 >gi|218672263|ref|ZP_03521932.1| hypothetical protein RetlG_11787 [Rhizobium etli GR56] Length = 256 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 69/207 (33%), Gaps = 17/207 (8%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALLM ++G GM VD L AA A + + I Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGS----IAEKS 63 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + A + + RN + D+ ++ K+A ++ Sbjct: 64 SAVAAAMAMNGNGTISLGKTDARNIFMSQVSGELAEVHVDL------GIDVTKTANKLNS 117 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + + F++ G S I A AE Y + ++D + SM Sbjct: 118 QVSFTATVP--TTFMQIFGRDSITISGTATAE-----YQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNG 220 S+ L A + S N Sbjct: 171 TPSDVSKLEAKTGCAFACHQMDKSTNN 197 >gi|56797869|emb|CAG27568.1| matrilin-4 [Danio rerio] Length = 685 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 55/165 (33%), Gaps = 20/165 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + D R+G ++ RV ++ S ++ I K E Sbjct: 470 ELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKAEMNVE 524 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + +V TDG + ++ Sbjct: 525 YMEKGTMTGLALKHMVENSFSEAEGARPAEKN--IPRVGLVF-TDG---RSQDDIQEWAK 578 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 KAK GI + + + L AS FF + Sbjct: 579 KAKEAGITMYAVGVGKAVEDE------LREIASDPVEKHFFYSAD 617 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 63/174 (36%), Gaps = 25/174 (14%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTA 338 +I I ++D R+G ++ +V + S ++++ I + + + M A Sbjct: 45 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLA 104 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I AM A+ + + H V++TDG + + + A+ GI Sbjct: 105 IRYAMNVAFSAEEGARPNVPHVA----------VIVTDG---RPQDRVAEVAAAARESGI 151 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 I + + L ASP + F S +L F + +++ Sbjct: 152 EIYAVGVARADMTS------LRAMASPPFEDHVFLVESF-DLIHQFGLQFQDKL 198 >gi|33592721|ref|NP_880365.1| hypothetical protein BP1639 [Bordetella pertussis Tohama I] gi|33572367|emb|CAE41926.1| putative exported protein [Bordetella pertussis Tohama I] gi|332382136|gb|AEE66983.1| hypothetical protein BPTD_1619 [Bordetella pertussis CS] Length = 336 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 42/265 (15%), Positives = 76/265 (28%), Gaps = 27/265 (10%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + F +M S +P G+ + R + + P + Sbjct: 33 LRLPFFDAMATLTGKSPTRPGVQRGRAQLWLNVAVWLLLALALARPQWVEPPLTHVEPMR 92 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 +L +D S S + + + V+ + I K + R+G Sbjct: 93 DILLV---VDISQSMDSEDFRDAQGRPASRWQAVQAVVGDF---IDKRPDD----RLGLI 142 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F + L + +T TA+ DA+ + + E Sbjct: 143 VFGAGAYPQAPLTRDHAALRLLLQRTAVGMAGPN--TALGDAIGLGIRMLDHAGER---- 196 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-- 418 K ++LLTDG +T A + + TI E F Sbjct: 197 -------DKILILLTDGNDTASAVPPARAAELAAQHRVVVHTIGIGDPAASGEDRVDFDA 249 Query: 419 LSNCA--SPNSFFEANSTHELNKIF 441 L + A + FF A L +++ Sbjct: 250 LRDIARIAGGRFFRARDQASLQEVY 274 >gi|51244490|ref|YP_064374.1| hypothetical protein DP0638 [Desulfotalea psychrophila LSv54] gi|50875527|emb|CAG35367.1| conserved hypothetical membrane protein (BatA) [Desulfotalea psychrophila LSv54] Length = 328 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 63/205 (30%), Gaps = 39/205 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +V+D +A I N + +G F R + + L + Sbjct: 113 RLEVVKDVMAKFISQ---RPNDS----IGLVAFAGRPYVVCPPTLDHNWLTLRLHSLSIG 165 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI A+ T + + + I+LLTDG N + Sbjct: 166 MIED--GTAIGSAIGTGVNRLREKKS-----------PSQIIILLTDGINNAGKVPPLIA 212 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ-----------------EKARYFLSNCA--SPNSFFE 430 AKS +++ TI + LS A + +F Sbjct: 213 AEAAKSFKVKVYTIGAGTRGEAPIPITDAFGRRQLVRARVDIDDKTLSKVAQITGARYFR 272 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A T L K++ + E R ++ Sbjct: 273 ATDTESLEKVYAEINSMETTSRSMK 297 >gi|110678222|ref|YP_681229.1| hypothetical protein RD1_0875 [Roseobacter denitrificans OCh 114] gi|109454338|gb|ABG30543.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 320 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 56/154 (36%), Gaps = 22/154 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F FS+ V + R I A +T+I+D + A Sbjct: 136 RVALVVFGSEAYFAAPFSFDVEAIARQI--EGAQIGISGRATSISDGLGLA--------- 184 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VNK 409 + RM+N+ A + ++LL+DG N + A G+R+ TIA + Sbjct: 185 --LKRMENSEAASRVVILLSDGVNNAGATNPRGVAELAAQMGVRVHTIALGPKDLSTADP 242 Query: 410 TQQEKAR-YFLSNCA--SPNSFFEANSTHELNKI 440 ++ L + S F +T +L + Sbjct: 243 GERGVVDAATLRAISEISGGESFRVRTTEDLVAV 276 >gi|84385834|ref|ZP_00988864.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] gi|84379150|gb|EAP96003.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] Length = 359 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 49/205 (23%), Positives = 79/205 (38%), Gaps = 17/205 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ LA + +K D R+G F D Sbjct: 108 DLSGSMAEQDFTSKRGDKISRLDATKEVLAD-FATTRKGD------RLGLILFGDAAFVQ 160 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA---YDTIISSNEDEVHRMKNNLE 366 F+ + + +T ST + DA+ A ++ + + E Sbjct: 161 TPFTADQDVWLELLNQTDVA--MAGQSTHLGDAIGLAIKVFEQSEKQSAAVQDSSVDANE 218 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA- 423 +K +++LTDG +T E I AK++G+RI IA +T E A + A Sbjct: 219 KEKVVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQ 278 Query: 424 -SPNSFFEANSTHELNKIFRDRIGN 447 S FEA + EL K + +IG Sbjct: 279 ESGGEAFEALNRDELTKAY-AQIGE 302 >gi|149922245|ref|ZP_01910682.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] gi|149816878|gb|EDM76364.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] Length = 370 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 80/232 (34%), Gaps = 22/232 (9%) Query: 219 NGKVGIRDEKLSPYMVSCNKSL-YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 +GKV + S + S + P P +D SS K A Sbjct: 102 DGKVRVSFIVTSNLRFEISGSAAISAVRPAPGGRPTQVVIDIDGSSSMRRSDPKRERVRA 161 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + ++ ++D N G FN V G+ I + +GST Sbjct: 162 AKRFVETLSRVDKRNQF---GVIEFNTTVEERAPMGSGMKATSDAIQ-----AVDAVGST 213 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+ ++ + D + S + ++ I++LTDG++T + + N+AK+ Sbjct: 214 ALYTSLIRSIDALEGSGKTG---------YRRAILVLTDGKDTASSHGVATVINRAKAAK 264 Query: 398 IRIMTIAFSVNKTQQEKAR-YFLSNCASP--NSFFEANSTHELNKIFRDRIG 446 +RI ++ Q+ + + F + +L F D I Sbjct: 265 VRIYVVSLGGAGDQKGLGYVGPMQRLTTETGGVFTHVDRADDLVARF-DAIA 315 >gi|32394600|gb|AAM93998.1| proximal thread matrix protein 1 [Griffithsia japonica] Length = 218 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 53/148 (35%), Gaps = 24/148 (16%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + + K+ D+ A F V + + + + + G T I Sbjct: 56 VDAAKEFDDRTKDSYFSAVGFASGVKLIQAPTQSLSTFNTAVNTVSPL----NGGTNIFR 111 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 ++ Y + K + ++L+TDG Q I CN KS+GI ++ Sbjct: 112 GLRGCYQQL-----------KTKPMTDRVLILVTDGFGGQP----INYCNFIKSKGILLV 156 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFF 429 T+ + + FL NCA+ F+ Sbjct: 157 TVGIGTSIN-----QNFLKNCATSEEFY 179 >gi|331694297|ref|YP_004330536.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948986|gb|AEA22683.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 332 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 54/183 (29%), Gaps = 34/183 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A ++ VR+G F + + ++ I + Sbjct: 107 TRLEAAKAAARGFVQRQPAA------VRIGIVAFGATGLVTQQPTSDRASVVAAIDR--- 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSN----------EDEVHRMKNNLEAKKYIVLLTDGE 378 + G TA+ +QTA I+ E +VLLTDGE Sbjct: 158 --LSPQGGTALGGGLQTALGAIVGKPVVVPGSDPGGGPEPSGPDLGYHGSAAVVLLTDGE 215 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----------QQEKARYFLSNCA--SPN 426 NT + + + + A + G+++ I L A + Sbjct: 216 NTAQP-DPLQVADIASTAGVKVYPIGLGSPAGTVLQIDGFQIATRLDEPLLQQIADRTDG 274 Query: 427 SFF 429 +F Sbjct: 275 RYF 277 >gi|293361345|ref|XP_236596.5| PREDICTED: collagen type VI alpha 4 [Rattus norvegicus] Length = 2327 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 76/245 (31%), Gaps = 17/245 (6%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + +L + + G+ C Sbjct: 157 ALKREGILLYAIGVKDAAQAELREIASSPKDNFTFFVPNFSGLPGLAQKLRPELCTTLAK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGA 299 + + P+ +E D L L + + + +D +D V++G Sbjct: 217 VVQHTEQGSPACTEAFLADIVFLVDSSTSIGLQNFQKVKNFLHSIVSGLDVRSDQVQVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++D + L +++ MGST A++ + E Sbjct: 277 VQYSDNIYPAFQLK--QSSLKSVVLEQIRNLPYNMGSTNTGSALE--FIRANYLTEMSGS 332 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 R K+ + + ++L+TDGE+ + ++ ++ K G+ + + ++ Q+ L Sbjct: 333 RAKDGVP--QIVILVTDGESNDEVQD---AADQLKRDGVFVYVVGINIQDVQE------L 381 Query: 420 SNCAS 424 AS Sbjct: 382 QKIAS 386 Score = 37.3 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 56/398 (14%), Positives = 127/398 (31%), Gaps = 41/398 (10%) Query: 50 EHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 + LK + T + ++ SL++++ RA+ T + ++L + K + + Sbjct: 681 KPQLKFSLNT--FQNAAQVLTSLDQLTFRARRGRTKAGAAL-DFLRKEVFLPEKGSRSIW 737 Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 V+ I AV + S V + + L + + ++GI+S E Sbjct: 738 GVQQI----AVIIMESPSLDNVSTPASH---LRRTGVTIYAVGIQSASESKDLEKIATYP 790 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + +D R +L+ + E + G + L Sbjct: 791 PGKHTIHLESFLQLDVVRKILNKKLCPETLGKMPPVGSMPPGAEMPPGAEMPPGGKMPPL 850 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + + ++ +F+ S K +++ + VI+ Sbjct: 851 TGCTPQ-------SMQKVCTHVEKADIYFLIDGSGSISPKDFTEMKEFMKDVIKMFHIGP 903 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + VR G ++D++IS + T ++ + A Sbjct: 904 DG---VRFGVVQYSDKIISQFLLT------QYTSMEKLGTAIGNIQQGGGGTTTGEALSK 954 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + +Y++++TDG+++ + + GI I I Sbjct: 955 MALVFRNTAR-----TNVAQYLIVITDGQSSDPVAD---AAQGLRDTGINIYAIGVRDAN 1006 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 T + L A+ FF + H L I ++ + + Sbjct: 1007 TTE------LEEIANNRVFF-TDDFHFLKSIHQEVVRD 1037 >gi|293349452|ref|XP_002727145.1| PREDICTED: similar to procollagen, type VI, alpha 3 isoform 4 [Rattus norvegicus] Length = 2114 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 76/245 (31%), Gaps = 17/245 (6%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + +L + + G+ C Sbjct: 157 ALKREGILLYAIGVKDAAQAELREIASSPKDNFTFFVPNFSGLPGLAQKLRPELCTTLAK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGA 299 + + P+ +E D L L + + + +D +D V++G Sbjct: 217 VVQHTEQGSPACTEAFLADIVFLVDSSTSIGLQNFQKVKNFLHSIVSGLDVRSDQVQVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++D + L +++ MGST A++ + E Sbjct: 277 VQYSDNIYPAFQLK--QSSLKSVVLEQIRNLPYNMGSTNTGSALE--FIRANYLTEMSGS 332 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 R K+ + + ++L+TDGE+ + ++ ++ K G+ + + ++ Q+ L Sbjct: 333 RAKDGVP--QIVILVTDGESNDEVQD---AADQLKRDGVFVYVVGINIQDVQE------L 381 Query: 420 SNCAS 424 AS Sbjct: 382 QKIAS 386 Score = 37.3 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 56/398 (14%), Positives = 127/398 (31%), Gaps = 41/398 (10%) Query: 50 EHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 + LK + T + ++ SL++++ RA+ T + ++L + K + + Sbjct: 478 KPQLKFSLNT--FQNAAQVLTSLDQLTFRARRGRTKAGAAL-DFLRKEVFLPEKGSRSIW 534 Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 V+ I AV + S V + + L + + ++GI+S E Sbjct: 535 GVQQI----AVIIMESPSLDNVSTPASH---LRRTGVTIYAVGIQSASESKDLEKIATYP 587 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + +D R +L+ + E + G + L Sbjct: 588 PGKHTIHLESFLQLDVVRKILNKKLCPETLGKMPPVGSMPPGAEMPPGAEMPPGGKMPPL 647 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + + ++ +F+ S K +++ + VI+ Sbjct: 648 TGCTPQ-------SMQKVCTHVEKADIYFLIDGSGSISPKDFTEMKEFMKDVIKMFHIGP 700 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + VR G ++D++IS + T ++ + A Sbjct: 701 DG---VRFGVVQYSDKIISQFLLT------QYTSMEKLGTAIGNIQQGGGGTTTGEALSK 751 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + +Y++++TDG+++ + + GI I I Sbjct: 752 MALVFRNTAR-----TNVAQYLIVITDGQSSDPVAD---AAQGLRDTGINIYAIGVRDAN 803 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 T + L A+ FF + H L I ++ + + Sbjct: 804 TTE------LEEIANNRVFF-TDDFHFLKSIHQEVVRD 834 >gi|291452634|ref|ZP_06592024.1| von Willebrand factor type A domain-containing protein [Streptomyces albus J1074] gi|291355583|gb|EFE82485.1| von Willebrand factor type A domain-containing protein [Streptomyces albus J1074] Length = 658 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 61/181 (33%), Gaps = 21/181 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI---SDPSFSWGVHKLIRTIVK 325 + RDA+ +V+ S+ G + +D + V L R +K Sbjct: 71 TRIEAARDAVGTVVDSLPDGYP------TGLRVYGADRTSGCTDTRLARPVEPLDRDAMK 124 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T I +++ A + + ++ ++L++DGE+ + Sbjct: 125 KAVAGVEPKGDTPIGLSLRKAVADLPEP--------EPGAVGRRTVLLISDGEDNCGSPP 176 Query: 386 GIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + G +RI I F V +E+ A ++++A L + + Sbjct: 177 PCEAAEELAESGLDLRIDAIGFQVEGKAREELTCVAE--AGHGAYYDAPDAEALARQLQR 234 Query: 444 R 444 Sbjct: 235 A 235 >gi|239980776|ref|ZP_04703300.1| hypothetical protein SalbJ_15127 [Streptomyces albus J1074] Length = 628 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 61/181 (33%), Gaps = 21/181 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI---SDPSFSWGVHKLIRTIVK 325 + RDA+ +V+ S+ G + +D + V L R +K Sbjct: 41 TRIEAARDAVGTVVDSLPDGYP------TGLRVYGADRTSGCTDTRLARPVEPLDRDAMK 94 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T I +++ A + + ++ ++L++DGE+ + Sbjct: 95 KAVAGVEPKGDTPIGLSLRKAVADLPEP--------EPGAVGRRTVLLISDGEDNCGSPP 146 Query: 386 GIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + G +RI I F V +E+ A ++++A L + + Sbjct: 147 PCEAAEELAESGLDLRIDAIGFQVEGKAREELTCVAE--AGHGAYYDAPDAEALARQLQR 204 Query: 444 R 444 Sbjct: 205 A 205 >gi|254486311|ref|ZP_05099516.1| conserved hypothetical protein [Roseobacter sp. GAI101] gi|214043180|gb|EEB83818.1| conserved hypothetical protein [Roseobacter sp. GAI101] Length = 476 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 65/410 (15%), Positives = 124/410 (30%), Gaps = 101/410 (24%) Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 ++V + + V + + ++ + FL G + + ++AE V Sbjct: 99 SSVTIENGLNFRTVTVKANNEMK----TQFLGRFGFPTLDVPALSKAEERVEK------V 148 Query: 178 SIQWVIDFSRSM-LDYQRDSEGQPLNCFGQP-ADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 I V+D S SM + + + F K+ S + + P + + Sbjct: 149 EISLVLDVSGSMKNNSKLTTMKDAAKTFIDTVLRPETKNNVSLSLIPYSEQVNVGPDIFN 208 Query: 236 C-------------------------------NKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 +++ ++ ++ + D+ Sbjct: 209 ALWVDTRHDFSYCIDVPDGHFVQTQMTPGFPWDQTQHFQWNTYSIESGYQQNTLHDTVCP 268 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH------- 317 R V ++ + S+ I R G + WG Sbjct: 269 RAVYERVRPISQDGPSLKAQIDLFQP-----RAGTAIY-------MGMKWGTALLDPSFR 316 Query: 318 KLIRTIVKTFAIDEN--------------------EMGSTAINDAMQTAYDTIISSNEDE 357 + ++V ++ G + + + TAY +S+ + Sbjct: 317 ETTASLVSDSVVESTFADRPADYSDRETLKTIVLMTDGQNSNSQRISTAY---YNSSSEV 373 Query: 358 VHRMKNNLEA--KKYI-------VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 VH K N +YI T + N IC+ AK +GI I TI F VN Sbjct: 374 VHWSKWNFNYYLSQYIKEKDWHRYYYTRYTAEKGNTLMDNICSAAKDEGIVIWTIGFEVN 433 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + + CAS S F EL F I ++I + +R+T+ Sbjct: 434 ----DTGADVMKKCASSPSHFFRVEGVELTDAF-SAIASQINQ--LRLTQ 476 >gi|148689164|gb|EDL21111.1| RIKEN cDNA E330026B02 [Mus musculus] Length = 1482 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 72/187 (38%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS ++ ++++D +I +KK D + VR GA + D Sbjct: 810 VFVIDSSGSIDYQEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYL--DE 864 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ D G+T +A+ + + +H+ + ++++TD Sbjct: 865 LGTKLEVISVLQNDHPMGGNTYTAEALAFSNHMFTEARGSRLHKGVP-----QVLIVITD 919 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE + D E+ A + +GI ++ + + + L+ S + ++ + Sbjct: 920 GE-SHDAEKLNATAKALRDKGILVLAVGIAGANS-----WELLAMAGSGDKYYFVETFGG 973 Query: 437 LNKIFRD 443 L IF D Sbjct: 974 LKGIFSD 980 Score = 43.8 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 42/150 (28%), Gaps = 19/150 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D V++G F+ + I + +T A+ Sbjct: 655 ADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFS 712 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++L+TDGE + + +G+ I ++ + Sbjct: 713 PDKGARP-------NVRKFLILITDGEAQDIVRDP---ALALRKEGVIIYSVGVFGSNVT 762 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKI 440 Q L + P F + L I Sbjct: 763 Q------LEEISGKPEMVFYVENFDILQHI 786 >gi|319902109|ref|YP_004161837.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417140|gb|ADV44251.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 327 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 59/186 (31%), Gaps = 34/186 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + V + + Sbjct: 111 EAAKDVATEFINGRPNDN---IGITLFAGESFTQCPLT--VDHAVLLNLLKDMKCGFIED 165 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + A + K++ K I+LLTDG N + + + AKS Sbjct: 166 GTAIGMGLANAVTRL-----------KDSKAKSKVIILLTDGVNNKGDISPLTAAEIAKS 214 Query: 396 QGIRIMTIAF----SVNKTQQEKA------------RYFLSNCA--SPNSFFEANSTHEL 437 GIR+ TI L+ A + ++F A S +L Sbjct: 215 FGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFRATSNSKL 274 Query: 438 NKIFRD 443 +++ + Sbjct: 275 KEVYEE 280 >gi|332232509|ref|XP_003265447.1| PREDICTED: collagen alpha-6(VI) chain [Nomascus leucogenys] Length = 2264 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 73/189 (38%), Gaps = 20/189 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-WG 315 FV SS + ++++D +I +KK D + VR GA + D +G Sbjct: 812 VFVIDSSGSIDYDEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYLEDFG 868 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ D+ G+T +A+ + + + N + ++++T Sbjct: 869 TK---LEVISVLQNDQAMGGNTYTAEALGFSDHMFTEARGSRL-----NKGVPQVLIVIT 920 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANST 434 DGE + D ++ A + +GI ++ + N + L+ S + +F + Sbjct: 921 DGE-SHDADKLNATAKALRDKGILVLAVGIAGANPAE------LLAMAGSSDKYFFVETF 973 Query: 435 HELNKIFRD 443 L IF D Sbjct: 974 GGLKGIFSD 982 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 15/166 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++ D + VR+GA F+D + G + I + Sbjct: 1018 KKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQIF 1075 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T I A++ D R+ + +++LTDG Q +E + Sbjct: 1076 GNTHIGAALRKVEHYF---RPDMGSRINTGTP--QVLLVLTDG---QSQDEVAQAAEALR 1127 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +GI I ++ + + + ++ EL K+ Sbjct: 1128 HRGIDIYSVGI-----GDVDDQQLIQITGTAEKKLTVHNFDELKKV 1168 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 68/230 (29%), Gaps = 27/230 (11%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 S SQN I+D + + P D + ++ S K Sbjct: 194 SMFSQNMTHIIKDVTKYKEGAVDDIFVEACQGPSVADVVFLLDMSINGSEENFDYLK--- 250 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + S+ +D + +R+G +++ S S G++K + V + + Sbjct: 251 -----GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGINK---SEVLQHIQNLSP 302 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 A A A + + + N + VL+T + + Sbjct: 303 RTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVT---HRDSEDNVTKAAVNL 356 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + +G+ I T+ Q L AS A K F D Sbjct: 357 RREGVTIFTLGIEGASDTQ------LEKIAS----HPAEQYVSKLKTFAD 396 >gi|220921017|ref|YP_002496318.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] gi|219945623|gb|ACL56015.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] Length = 324 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 54/153 (35%), Gaps = 21/153 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D+ S S+ + + + A ST I D + A + ++ Sbjct: 140 RIGLVIFADQADVAASLSFDTASVAHALDE--AQIGLVGRSTGIGDGLGLALKRLDATPA 197 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 E K +VLL+DG N + A+ GIR+ TIA + Sbjct: 198 RE-----------KVVVLLSDGANNAGQTTPHDVAALARELGIRVHTIALGPRDLSDAEG 246 Query: 416 R------YFLSNCA--SPNSFFEANSTHELNKI 440 L + A S FF +T +L + Sbjct: 247 DPDVVDTEALRDVATTSGGRFFRVRTTDDLAAV 279 >gi|126664966|ref|ZP_01735949.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] gi|126630336|gb|EBA00951.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] Length = 341 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 62/189 (32%), Gaps = 33/189 (17%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + V+ LA I ++ D R+G F + ++ + I+ Sbjct: 112 INRLQAVKQVLAEFIDQ-RQGD------RLGLILFGSQAYVQAPLTFD--RTTVNILLQE 162 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A +TAI DA+ A + ++ ++LTDG NT Sbjct: 163 AGLGMAGNATAIGDAVGLAVKRLRERP-----------LEQRVAIVLTDGANTAGEITPD 211 Query: 388 AICNKAKSQGIRIMTIAFSVNKT-----------QQEKARYFLSNCA--SPNSFFEANST 434 A++ +R+ TI ++ L+ A + +F A + Sbjct: 212 KASELAQASAVRLYTIGIGAGADSAITGLLQRNPSRDLDEALLTRMAQQTGGQYFRARNL 271 Query: 435 HELNKIFRD 443 EL I+ Sbjct: 272 AELGGIYTS 280 >gi|85374101|ref|YP_458163.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] gi|84787184|gb|ABC63366.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] Length = 435 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 86/236 (36%), Gaps = 4/236 (1%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 +K+L +S TG+ +I AL +P ++G G +D+ +W + L+ A A + + Sbjct: 3 SIRQTAKRLRQSNTGNAMMILALGLPALVGGAGYGLDMAQWYMLKRELQYAVDQAAVAGA 62 Query: 66 VPLI--QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 L + + S+RA+ + + Y N ++ + ++++ Sbjct: 63 YSLSYNGTAGDWSARAEQEYDANRSITTGYATANDSTKGVTDYGSFTQNSVTVSATMDVS 122 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 S+ ++LS+ + +N ++F ++ G +I K ++ + + + Sbjct: 123 LPFSS--ILLSTPTTINVNSQAMFEKTDGATGCMIALKPNDTAITINGNVTINAPCGMAV 180 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + S+ + G + + K V+ + S Sbjct: 181 SSTSSISINKSGGSGSINPGWVYTGGGVNDKFQELIDKFNATAPAADQVEVNEDAS 236 >gi|189465623|ref|ZP_03014408.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] gi|189437897|gb|EDV06882.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] Length = 327 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L+ A + ++F A S +L +++ + Sbjct: 235 PVGNTVQYVNMPVEIDEKTLTQIAATTEGNYFRATSNSKLKEVYEE 280 >gi|86145196|ref|ZP_01063527.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] gi|85836773|gb|EAQ54893.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] Length = 359 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 48/205 (23%), Positives = 79/205 (38%), Gaps = 17/205 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ LA ++ +K D R+G F D Sbjct: 108 DLSGSMAEQDFTSKQGDKISRLDATKEVLADFAKT-RKGD------RLGLILFGDAAFVQ 160 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA---YDTIISSNEDEVHRMKNNLE 366 F+ + + +T ST + DA+ A ++ + + Sbjct: 161 TPFTADQDVWLELLNQTDVA--MAGQSTHLGDAIGLATKVFEQSEKQSAAVQDSSIDANV 218 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA- 423 +K +++LTDG +T E I AK++G+RI IA +T E A + A Sbjct: 219 KEKVVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQ 278 Query: 424 -SPNSFFEANSTHELNKIFRDRIGN 447 S FEA + EL K + +IG Sbjct: 279 ESGGEAFEALNRDELTKAY-AQIGE 302 >gi|255039218|ref|YP_003089839.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951974|gb|ACT96674.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 320 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 59/160 (36%), Gaps = 31/160 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F++ ++ L I D T + A++ AY+ +++S + Sbjct: 118 RIGIIIFSNDAYIHVPLTYDAAALELFIQSL-QTDLLPTNGTNVCGAIEMAYNKLMNSAD 176 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQE 413 K +VL TDGEN+ + N + GI + ++A V + QE Sbjct: 177 PTSRA--------KMMVLFTDGENSSSCTNALF--NNLRRFGIGVYSVAVGTKVGISIQE 226 Query: 414 KAR----------------YFLSNCASP--NSFFEANSTH 435 + FL A+ S++E N++ Sbjct: 227 NGKPLKDKNDKLVISKLDENFLRGIANSSRGSYYELNNSK 266 >gi|86137906|ref|ZP_01056482.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] gi|85825498|gb|EAQ45697.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] Length = 543 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 64/419 (15%), Positives = 121/419 (28%), Gaps = 40/419 (9%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD- 113 + +++ + S +S+ + F + ++ T + D Sbjct: 150 DGLEISLVLDVSGSMNSNSRLSNLKVAAKDFIDTMVANTTDGKMSISVVPYATQVSLPDD 209 Query: 114 -IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 I + T V NP + + L+ L RSM W + S Sbjct: 210 LIDQYTTVGENPYSNCINFEAAEYNSASLSTLDTLERSMHFTPWGYSNRDMRTYYSSPRL 269 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD-EKLSP 231 V D S + L F Q + K G + + Sbjct: 270 VRSP-----VCDERASREVLPLQKDATTLKNFIQNLSAGGNTSIDVGMKWGTALLDPSAR 324 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 +S + PG D S ++ + V Sbjct: 325 PAISAISTGIGASVPG------------DFSDRPAEYSDSDTIKIIVLMTDGQNTSQYYV 372 Query: 292 NDTVRMGATF--FNDRVISDPSF-SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 +D R G + +N S ++ S + + + GS A++ Y Sbjct: 373 DDDHRDGPSGVWYNSHYKSYSTYDSRYGGRYFFHYNNNWYNEPYGNGSGQSGTAVELNYA 432 Query: 349 TIISSNEDEV---------HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 + + + + + Y + + G N+ N ++C AK++GI Sbjct: 433 ELFARTSLKYIYRYIFYEWMNFYDARDDWYYGIYSSHG-NSTKNARTRSVCEAAKAKGIV 491 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + TI F L +CAS ++ + E+ F I I R +R+T+ Sbjct: 492 VYTIGFEA----PSNGVAVLRDCASSDAHYFDVDGLEIKDAFAS-IATSI--RQLRLTQ 543 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 54/378 (14%), Positives = 109/378 (28%), Gaps = 41/378 (10%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + +G T ML VGG+ VD++R L+ A++ A+ Sbjct: 24 RAFRRDDSGVMAYPTVAFFLAMLAVGGIGVDLMRMERDRTVLQYTLDRAVLAAA-----D 78 Query: 72 LEEVSSRAKNSFTFPKQK-IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 L++ A + + + EY + + D + + + Sbjct: 79 LDQTQPPAVVVQDYLNKAGLGEYYQEPIVESGLGYKRVQATIDATFEAHLLRFSNGNDLP 138 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V +S+ + ++ L + S ++ + SR + + DF +M+ Sbjct: 139 VFATSKAEESIDGLEI--------SLVLDVSGSMNSNSRLSNLKVA-----AKDFIDTMV 185 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD-EKLSPYMVSCNKSLYYMLYPGPL 249 D + Q VG + + S Sbjct: 186 ANTTDGKMSISVVPYATQVSLPDDLIDQYTTVGENPYSNCINFEAAEYNSASLSTLDTLE 245 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRD------ALASVIRSIKKIDNVNDTVRMGATFFN 303 + +R LVR A V+ K + + ++ + N Sbjct: 246 RSMHFTPWGYSNRDMRTYYSSPRLVRSPVCDERASREVLPLQKDATTLKNFIQNLSAGGN 305 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + WG L + + +G+ ++ D + Sbjct: 306 TSIDV--GMKWGTALLDPSARPAISAISTGIGA------------SVPGDFSDRPAEYSD 351 Query: 364 NLEAKKYIVLLTDGENTQ 381 + K IVL+TDG+NT Sbjct: 352 SDTI-KIIVLMTDGQNTS 368 >gi|288800165|ref|ZP_06405624.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333413|gb|EFC71892.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] Length = 323 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 64/196 (32%), Gaps = 41/196 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + A I + DN +G T F + + L+ +++ Sbjct: 100 RMEAAKKVAAEFISD-RANDN------IGLTIFAGEAFTQCPMTTDHASLLN-LLQGVRT 151 Query: 330 DENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 D G TA+ + A + K + K I+LLTDG N + Sbjct: 152 DIASRGLIADGTAVGMGLANAVSRL-----------KESKAKSKVIILLTDGSNNMGDIS 200 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNS 427 + AKS GIR+ TI N E L + A + Sbjct: 201 PLTAAQIAKSLGIRVYTIGVGTNTVAPYPVTVGGTTQYVNVPAEIDTKTLKDIAQSTDGG 260 Query: 428 FFEANSTHELNKIFRD 443 F+ A + EL +I+ D Sbjct: 261 FYRATNNAELKEIYND 276 >gi|241113476|ref|YP_002973311.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861684|gb|ACS59350.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 329 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/267 (16%), Positives = 84/267 (31%), Gaps = 29/267 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 R +EG + P R + + P + +L LD S Sbjct: 48 RPTEGSVVTRRTWPQLVCETLAWCLVVLALARPQFVEPPIEKVEPQRDILL---ALDLSQ 104 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S + + + + VR +A + + R+G F D F+ Sbjct: 105 SMDARDFPGADGKPLARVEAVRQVVADFVGK----RPGD---RIGLVAFGDAPYPLAPFT 157 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 H+L+RT++ M ++ + + +K +++ Sbjct: 158 MD-HELVRTMIA------------DTVPGMAGPRTSLGDALGLAIKMFGKTTAPEKVLIV 204 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKARYFLSNCASP--NSFF 429 LTDG +T + AKS+G+ T+ + + L A+ +F Sbjct: 205 LTDGNDTASRMPPLKAAEIAKSKGVIFHTVGIGDPAATGEDKLDTATLQKIAASTGGRYF 264 Query: 430 EANSTHELNKIFR--DRIGNEIFERVI 454 +L I+ D+I E + + Sbjct: 265 FGGDQSQLAAIYEVLDQITPEDQKNLS 291 >gi|224539999|ref|ZP_03680538.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] gi|224518389|gb|EEF87494.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] Length = 327 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + V R+K++ K I+LLTDG N + + + AKS GIR+ TI N Sbjct: 175 NAVTRLKDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L+ A + ++F A S +L +++ + Sbjct: 235 PVGNTVQYVNMPVEIDEKTLTQIAATTEGNYFRATSNSKLKEVYEE 280 >gi|260433775|ref|ZP_05787746.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] gi|260417603|gb|EEX10862.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] Length = 327 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 72/200 (36%), Gaps = 24/200 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + ++ ++ V+D L + I + D RM F R Sbjct: 99 AVDISGSMDQRDFKAADGTPKQRLEAVKDVLRAFIAA-------RDGDRMALIIFGTRAF 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + L + +T TA+ DA+ T SS D Sbjct: 152 VQAPFTEDLQSLNGFLEQTAVGMAGPN--TALGDAIGLGIRTFESSEVD----------- 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 ++ +++L+DG +T + + A +G+ I TI E L + A Sbjct: 199 QRMMIVLSDGADTSSRMTPVIAASIAADKGVVIYTIGVGDPDATGEDRVDLDALKDIANK 258 Query: 424 SPNSFFEANSTHELNKIFRD 443 + +F A+ L +++R Sbjct: 259 TQGQYFFADDEAALTEVYRQ 278 >gi|94313457|ref|YP_586666.1| hypothetical protein Rmet_4532 [Cupriavidus metallidurans CH34] gi|93357309|gb|ABF11397.1| conserved hypothetical protein, (Von Willebrand factor, type A); putative membrane protein [Cupriavidus metallidurans CH34] Length = 334 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 69/198 (34%), Gaps = 24/198 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 LD S S + + + + VR +A + + D R+G F D Sbjct: 99 ALDLSQSMDTRDFKTPSGVLEPRVDAVRQVVADFVAR-RTGD------RIGLIVFGDAPY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ H L+R ++ + M A ++ + + + Sbjct: 152 PLAPFTLD-HALVRELLA------------DMVPGMAGASTSLGDAIGLGIKMFDQSHAQ 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKARYFLSNCASP 425 +K ++LLTDG +T + AK++G+ + T+ +Q+ L + AS Sbjct: 199 EKVMILLTDGNDTASRMPPAQAADIAKTRGVVVHTVGIGDPATTGEQKVDLDALKHIAST 258 Query: 426 --NSFFEANSTHELNKIF 441 +F L I+ Sbjct: 259 TGGRYFFGADQTSLASIY 276 >gi|156616290|ref|NP_001096078.1| collagen alpha-6(VI) chain precursor [Homo sapiens] gi|189082902|sp|A6NMZ7|CO6A6_HUMAN RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2263 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 73/189 (38%), Gaps = 20/189 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWG 315 FV SS + ++++D +I +KK D + VR GA + D +G Sbjct: 811 VFVIDSSGSIDYDEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDDFG 867 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ D+ GST +A+ + + + N + ++++T Sbjct: 868 TK---LEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRL-----NKGVPQVLIVIT 919 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANST 434 DGE + D ++ A + +GI ++ + N + L+ S + +F + Sbjct: 920 DGE-SHDADKLNATAKALRDKGILVLAVGIDGANPVE------LLAMAGSSDKYFFVETF 972 Query: 435 HELNKIFRD 443 L IF D Sbjct: 973 GGLKGIFSD 981 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 15/166 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++ D + VR+GA F+D + G + I + Sbjct: 1017 KKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQIF 1074 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T I A++ D R+ + +++LTDG Q +E + Sbjct: 1075 GNTHIGAALREVEHYF---RPDMGSRINTGTP--QVLLVLTDG---QSQDEVAQAAEALR 1126 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +GI I ++ + + + ++ EL K+ Sbjct: 1127 HRGIDIYSVGI-----GDVDDQQLIQITGTAEKKLTVHNFDELKKV 1167 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 59/198 (29%), Gaps = 27/198 (13%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P D + ++ S K + S+ +D + +R+G +++ Sbjct: 225 PSMADVVFLLDMSINGSEENFDYLK--------GFLEESVSALDIKENCMRVGLVAYSNE 276 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 S S G++K + V + + A A A + + + N Sbjct: 277 TKVINSLSMGINK---SEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQ 330 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + VL+T + + + +G+ I T+ Q L AS Sbjct: 331 GVPQIAVLVT---HRDSEDNVTKAAVNLRREGVTIFTLGIEGASDTQ------LEKIAS- 380 Query: 426 NSFFEANSTHELNKIFRD 443 A K F D Sbjct: 381 ---HPAEQYVSKLKTFAD 395 >gi|116753518|ref|YP_842636.1| von Willebrand factor, type A [Methanosaeta thermophila PT] gi|116664969|gb|ABK13996.1| von Willebrand factor, type A [Methanosaeta thermophila PT] Length = 795 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 69/202 (34%), Gaps = 35/202 (17%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 P + +DSS L + A + + + R+G +N I Sbjct: 59 PCASPVDVVLSIDSSGSMTTSDPGDLRKSAAKEFVTGLDLSMD-----RVGVVSWNTSAI 113 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S + + I T G+T ++ +++A D + + Sbjct: 114 S-WPLTNNTKDIESAIDST-----GADGNTCLDTGLKSAIDLL------------SECSG 155 Query: 368 KKYIVLLTDGENTQDNEEGI-----AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K IVLLTDG +T + ++A+S+GI + TI + L+ Sbjct: 156 SKVIVLLTDGISTDGGHYTPPGVPGSPVDEARSKGILVFTIGL-----GPDADARNLTEI 210 Query: 423 ASP--NSFFEANSTHELNKIFR 442 A F+ A + L I++ Sbjct: 211 AHSTGGEFYSAPDANALAGIYK 232 >gi|283778313|ref|YP_003369068.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436766|gb|ADB15208.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 591 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 88/270 (32%), Gaps = 25/270 (9%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G+ + TA+LM VMLG+ VDV + L+++ A + + L++ + + Sbjct: 22 RRGNIIVFTAVLMVVMLGMIAFAVDVGYMYTMQTQLQRSVDAAALAGAGSLVEGTDIAQA 81 Query: 78 RAKN--SFTFPKQKI----EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 +A + EE + + ++ D EV + + + Sbjct: 82 KATEYLVRNPVGSSMTFVNEEEVPAKIAQFVAEHGDDFEVEAGEWNASTRSFETTNTLPS 141 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 LS + P + F + +G S+ I+ + A I V+DFS SM D Sbjct: 142 TLSVSMEYPTMP-TFFGKILGKDSFSIRASSVA--------MYQPRDIMVVLDFSGSMND 192 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 FG+ V+S Q D Y + + Sbjct: 193 ------DSTFEAFGKLGRSWVESNLQQCWA----DIGNPTYGSLTFEPKWANCKGAVPTD 242 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + +V+ + + + + + Sbjct: 243 GSKPQIYVEYRNTSVYVTSTLNLENVVLQF 272 Score = 43.8 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 80/256 (31%), Gaps = 30/256 (11%) Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 LN ++ L+ G + + G I V S + L + + G+ Sbjct: 262 TLNLENVVLQFSGGTRQTFSGLSAKTGTFQGSSTNSGKQITKVWVKSGNNLSGEGTNYGE 321 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS-----------LYYMLYPGP 248 P + + VK N + Y+ C +S Y L Sbjct: 322 PFDFSSSNMNNMVKKAFGLNSVSYPYNGSWDAYIDYCEQSSNSNKNAGYRYKYGYLNLMN 381 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND---R 305 + + + ++D+LA + I +++ + R+G +N Sbjct: 382 YWLESRRSYAQTPVLWKTHAQPVRALKDSLAIFMDFITEVEVQD---RVGLAVYNAPNGE 438 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + V ++ + A +E T I + A +H ++ Sbjct: 439 GMVEVPLTLEVEQVATIANQRQAGHYHEY--TNIGGGLNAA----------RLHLDQHGR 486 Query: 366 EAK-KYIVLLTDGENT 380 K IVL+TDG+ Sbjct: 487 PNAFKMIVLITDGQAN 502 >gi|229495742|ref|ZP_04389470.1| BatA protein [Porphyromonas endodontalis ATCC 35406] gi|229317316|gb|EEN83221.1| BatA protein [Porphyromonas endodontalis ATCC 35406] Length = 325 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 55/165 (33%), Gaps = 31/165 (18%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F + + L+ + + + TAI + TA + + Sbjct: 127 IGLVMFAGESFTQCPLTTDHATLLNRLSEVEIGYLED--GTAIGLGIATACNRL------ 178 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ---- 412 K + K IVLLTDG N + + A+S GIRI T+A Sbjct: 179 -----KESHAKSKIIVLLTDGTNNAGSIAPSMAASLAESLGIRIYTVAVGTRGEAPYPHA 233 Query: 413 ------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E L A + S+F A LN+I+ + Sbjct: 234 TAFGTVIDNVKVEIDEASLKEIAQTTGGSYFRATDNESLNQIYDE 278 >gi|329896848|ref|ZP_08271743.1| BatA [gamma proteobacterium IMCC3088] gi|328921553|gb|EGG28934.1| BatA [gamma proteobacterium IMCC3088] Length = 328 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 61/178 (34%), Gaps = 24/178 (13%) Query: 276 DALASVIRSIKKIDNV-NDTV------RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 DA + V V R+G F F+ + + ++ Sbjct: 113 DATGKQTDRLSAAKQVLKQFVAGREGDRLGLIVFGSAAYLQAPFTDDRETWLALLDESIV 172 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TA+ DA+ + H + + + +++LTDG +T + Sbjct: 173 NMAGPS--TALGDAIGLS----------IAH-FRESKTKNRVLIVLTDGNDTGSKVPPLD 219 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFR 442 AK++G+ I T+A +T E+A L + A + F A L + ++ Sbjct: 220 AAQVAKAEGVTIYTVAVGDPETVGEEALDLEVLDSIAQTTGGVSFNAADLKALQETYQ 277 >gi|149202124|ref|ZP_01879097.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] gi|149144222|gb|EDM32253.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] Length = 584 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 57/404 (14%), Positives = 123/404 (30%), Gaps = 73/404 (18%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 Y+++ K G ++ + + L +GG+ +D++R +L+ A++ + + Sbjct: 10 YAERFAKDEEGTVTVLAFAIFVMFLVMGGIGIDMMRQEMARASLQATLDRAVLAGATAVN 69 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + IE+Y + + +E DI + +N K Sbjct: 70 NATARA-------------VIEDYFAK---SGQSDYLAAQEAGDI----DIRLNSSK--- 106 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 +++R L+ + +R G+ + + AE + I +D S SM Sbjct: 107 ---VTARATQTLD--TYLMRLAGVDTLTSAGNSTAEVTIPK------LEIAMALDVSGSM 155 Query: 190 LDYQRDSEGQPLNCFGQP-ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + D+ F D T + + + S + Y Sbjct: 156 IGARIDALKPAAIEFVDSILDSTEPNDAVISVVPFSWGVTPSKEIYEALTVNETHKYSSC 215 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L+ + S + + R+ + + + D +N + Sbjct: 216 LELNDSHFTDTTIDPNTAYNQLIYTSREGVT--FGDLTTT-PLGDF----LDTYNQTCYT 268 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA---------------------- 346 F+ + +T + GST+ ++ ++ A Sbjct: 269 QDYFNILPYATTKTALHDKINGLQAGGSTSNDEGVKWAAALLDPAFQPVVTSLQQPIQVP 328 Query: 347 --------YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 Y + + D + N E K IVL+ DG N Sbjct: 329 QDDGTILTYSLVEPALSDMP-AVFNESETLKVIVLMGDGANDNS 371 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 18/48 (37%), Positives = 24/48 (50%), Gaps = 3/48 (6%) Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTH 435 IC +AKS+GI I TIAF Q + CAS N + A + + Sbjct: 520 ICREAKSEGIVIYTIAF--EMGSQPTGADKIKKCASSVNHHYNATTVN 565 >gi|260426945|ref|ZP_05780924.1| von Willebrand factor type A [Citreicella sp. SE45] gi|260421437|gb|EEX14688.1| von Willebrand factor type A [Citreicella sp. SE45] Length = 334 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 66/200 (33%), Gaps = 24/200 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + ++ ++ VRD + + D RM F Sbjct: 99 AIDISGSMDSRDFTAPDGTRKQRLAGVRDVVRGFVE-------GRDGDRMALIVFGSAAY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + ++ + +T TA+ D++ A T SS D Sbjct: 152 LQAPLTADLDAILALLDRTQVGMAGPH--TALGDSIGLAIRTFESSEID----------- 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 ++ ++LL+DG +T + + A + + I T+ E L A Sbjct: 199 QRLLILLSDGSDTASRMDPVNAAEIAAGRDVEIFTVGVGDPDATGENRVDLDTLRAIADR 258 Query: 424 SPNSFFEANSTHELNKIFRD 443 + ++F A L ++ Sbjct: 259 TGGAYFFAADEAALTAVYES 278 >gi|15965798|ref|NP_386151.1| putative signal peptide protein [Sinorhizobium meliloti 1021] gi|307311332|ref|ZP_07590975.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] gi|307318865|ref|ZP_07598297.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15075067|emb|CAC46624.1| Hypothetical signal peptide protein [Sinorhizobium meliloti 1021] gi|306895586|gb|EFN26340.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306899633|gb|EFN30261.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] Length = 444 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 20/194 (10%), Positives = 66/194 (34%), Gaps = 18/194 (9%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 F + ++L++ G+F ++ A+ ++G + +D + ++ A A + + Sbjct: 4 ATFRLHLRRLVRDRDGNFAVLGAIAFVPIIGAAALAIDFAGAYFEAEKIQSALDAAALGS 63 Query: 65 SVPLIQSL------EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 + + +++ P++ + + L + + Sbjct: 64 VRAYGEGATEDDAYDAAQKFFWSNYALPQESVVDALA-------VATEPSTQALSVKFTR 116 Query: 119 AVEMNPRKSAYQVVLSS----RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 V + + + + + R L + L++ R+ G ++ ++ A+ Sbjct: 117 NVNEDTATAEFVLDHNPLFLERLPLQIRRLAVAARASGAEACILALHHTADRAFEVSGSA 176 Query: 175 HG-VSIQWVIDFSR 187 ++ V+ S Sbjct: 177 IADLTGCAVLSNSG 190 >gi|198436966|ref|XP_002122845.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H5 [Ciona intestinalis] Length = 1586 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 63/436 (14%), Positives = 135/436 (30%), Gaps = 43/436 (9%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 +++PV + + + Y ++A AI T + ++ V++R ++S F Sbjct: 742 VILPVNAFITNFSMTIDGVVYNGRIEEKARAEAIYTTARERDLTVGHVAARDQSSSVFQT 801 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 R + R+ + M S + + +S + L+ ++ Sbjct: 802 STNLAPRKRVIFQLTYQEALQRKRGIYQYGVSFRMLQPVSMFSITVSISESVPLSTVN-- 859 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 ++G+++ + + VS ++ ++ FG Sbjct: 860 --ALGLETEQTSVPGPVPLQGITTVRNSPVSAVI--------TYTPTSNQQHLISPFGLN 909 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + ++ + ++ +S + L P F+ S Sbjct: 910 GKFVI--------EYDVFRDRTTEMVID--QSYFAHFITSNLPPMSKRVVFLIDVSGSMF 959 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS------FSWGVHKLIR 321 K VR A+ +++ + + D + FN V + G I Sbjct: 960 GIKIDQVRQAMNTILHGLAETDFFS------VIAFNSSVSRWSPSGTAAVLASGTTANIN 1013 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + G T I A++ A S+ A ++VLLTDG T Sbjct: 1014 SAMNFLNTTVVTRGGTDILQAVEAAIQLFDSAA------TGGTNTASDFMVLLTDGRPTD 1067 Query: 382 DNEEGIAICNKAK--SQG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 AI + + ++G I TI F + N + F +++ Sbjct: 1068 GTVSSTAIISAIRNLNRGRFGINTIGFGTLVDMNLLRKIAAQNSGTSIQIFIDLNSYAQI 1127 Query: 439 KIFRDRIGNEIFERVI 454 F + I I Sbjct: 1128 SNFYEEISQPILSNTT 1143 >gi|119478003|ref|ZP_01618103.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] gi|119448916|gb|EAW30158.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] Length = 341 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 68/178 (38%), Gaps = 21/178 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A+ V+ + + + R+G F F+ T+++ I Sbjct: 128 KIDRL-TAVKQVLTEFSQRRDGD---RLGLIVFGSAAYLQAPFTAD-KDTWLTLLQETEI 182 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 ST+I DA+ + T S+ D + +++LTDG +T + Sbjct: 183 AM-AGASTSIGDAIGLSISTFEHSDTDN-----------RVLIVLTDGNDTGSRVPPVDA 230 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFRD 443 A ++ ++I TIA +T E A L + + ++FEA L + + D Sbjct: 231 ARVANARDVKIYTIAIGDPETIGEDAMDVDTLKQVSDITGGAYFEALDRQALERAYLD 288 >gi|297671963|ref|XP_002814093.1| PREDICTED: collagen alpha-6(VI) chain-like [Pongo abelii] Length = 2291 Score = 58.5 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 73/189 (38%), Gaps = 20/189 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWG 315 FV SS + ++++D +I +KK D + VR GA + D +G Sbjct: 839 VFVIDSSGSIDYDEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDDFG 895 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ D+ G+T +A+ + + + N + ++++T Sbjct: 896 TK---LEVISVLQNDQAMGGNTYTAEALGFSDHMFTEARGSRL-----NKGVPQVLIVIT 947 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANST 434 DGE + D ++ A + +GI ++ + N + L+ S + +F + Sbjct: 948 DGE-SHDADKLNATAKALRDKGILVLAVGIAGANPVE------LLAMAGSSDKYFFVETF 1000 Query: 435 HELNKIFRD 443 L IF D Sbjct: 1001 GGLKGIFSD 1009 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 15/166 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++ D + VR+GA F+D + G + I + Sbjct: 1045 KKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GAFIGEKEISFQIENIKQIF 1102 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T I A++ D R+ + +++LTDG Q +E + Sbjct: 1103 GNTHIGAALRKVEHYF---RPDMGSRINTGTP--QVLLVLTDG---QSQDEVAQAAEALR 1154 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +GI I ++ + + + ++ EL K+ Sbjct: 1155 HRGIDIYSVGI-----GDVDDQQLIQITGTAEKKLTVHNFDELKKV 1195 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 68/230 (29%), Gaps = 27/230 (11%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 S SQN I+D + + P D + ++ S K Sbjct: 221 SMFSQNMTHIIKDVTKYKEGAVDDIFVEACQGPSMADVVFLLDMSINGSEENFDYLK--- 277 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + S+ +D + +R+G +++ S S G++K + V + + Sbjct: 278 -----GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGINK---SEVLQHIQNLSP 329 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 A A A + + + N + VL+T + + Sbjct: 330 RTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVT---HRDSEDNVTKAAVNL 383 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + +G+ I T+ Q L AS A K F D Sbjct: 384 RREGVTIFTLGIEGASDTQ------LEKIAS----HPAEQYVSKLKTFAD 423 >gi|319956032|ref|YP_004167295.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418436|gb|ADV45546.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 306 Score = 58.5 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 57/175 (32%), Gaps = 21/175 (12%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 K +V++ + I +K D R+G F ++ L + + Sbjct: 107 KSKFDVVKEVVGDFIDK-RKND------RIGLINFASVAFVASPLTFEKDFLRKIL--QM 157 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAINDA+ Y+ + S+ K +LLTDG + Sbjct: 158 QEPGIAGKRTAINDALLQTYNILSKSDAKS-----------KIAILLTDGIDNASRISFD 206 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 I I++ TI + L+ A FF A+ L KI+ Sbjct: 207 EIRRLISDSDIKLYTIGIGSYRDFDAPYLKALAQ-AGHGRFFAASDRRSLQKIYE 260 >gi|323345325|ref|ZP_08085548.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] gi|323093439|gb|EFZ36017.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] Length = 332 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 69/190 (36%), Gaps = 37/190 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A +V ++ +G T F + + L+ +++ D G Sbjct: 111 EAAKAVAAEFIAGRPNDN---IGLTIFAGEAFTQCPMTTDHASLLN-LLQNVRTDIAARG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TA+ + A + K++ K ++LLTDG N + + Sbjct: 167 LIQDGTAVGMGLANAVSRL-----------KDSKAKSKVVILLTDGSNNMGDLSPMTSAQ 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQ------EKARYF----------LSNCA--SPNSFFEANS 433 AKS GIR+ TI NK + +Y LS+ A + +F+ A + Sbjct: 216 IAKSLGIRVYTIGVGTNKVARYPMPVTGGIQYVNIPVEIDTKTLSDIAATTDGNFYRATN 275 Query: 434 THELNKIFRD 443 EL +I+ D Sbjct: 276 NRELKQIYND 285 >gi|90406741|ref|ZP_01214934.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] gi|90312194|gb|EAS40286.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] Length = 404 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 57/451 (12%), Positives = 137/451 (30%), Gaps = 65/451 (14%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 IK+ G I+ L+P ML + + + + QA+ + I + ++ Sbjct: 5 IKAQQGSISIVFIFLLPAMLAMLALSILTAMYLLSVTRASQASDVSSIACAYSQRANVS- 63 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 L + F K NF + V + Q Sbjct: 64 -------------------LTQGFAQYYKPNF----------ISHVNAQSTFLSGQKQCK 94 Query: 135 SRYDLLLNPLSL-FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + PL L + + ++ + + + + + V+D S SM Sbjct: 95 IQIGYAFTPLLKDLLPASSQNKVHASVQIQSTSTLTVHSEIKPMDLSLVLDISGSMSGRI 154 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 L A + ++ +++N VS + + + G Sbjct: 155 G-----LLKRIINQAIQNIEQQNTKNNTQIRFSIVPFSSGVSISNAPWLAKSKGKALCVD 209 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + + + + + L I+ + ++ + + Sbjct: 210 AMSYPGNVLNTAQTVADIDTHPSKLN-----IRAKEPLS------LINDCNVYSLLLPLT 258 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED--EVHRMKNNLEAKKYI 371 + K+ + + + +GSTA T++ + + + ++L +++ I Sbjct: 259 NNLSKVRKHVDS-----LSILGSTASYQGFIWGVRTLLPNWQKAWNLQPETSSLLSQRLI 313 Query: 372 VLLTDGENTQDNEEGIAI----CNKAKSQG-IRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + TDGE+ ++ + C + + I I I F ++ + F S Sbjct: 314 LF-TDGEDDSRDQFDKLVRSGMCQRIQDDFNIDISFIGFGLS---PRRLDQFKKCIGSNG 369 Query: 427 S--FFEANSTHELNKIFRDRIGNEIFERVIR 455 ++A + +L K F + + + + Sbjct: 370 KGVVYDAKNGSDLEKFFAEALLLDTSAHLSL 400 >gi|218961690|ref|YP_001741465.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] gi|167730347|emb|CAO81259.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] Length = 270 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 26/131 (19%), Positives = 50/131 (38%), Gaps = 16/131 (12%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A SV + K + R G F++ ++ ++ ++ ++ K +E Sbjct: 49 SAAVSVAKDFVKRRPND---RFGLVAFSEYALTQVPLTFDHLAMLNSLDKLKVNEEAS-- 103 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 +TAI + A + KN+ K I+L+TDG + + + AK Sbjct: 104 ATAIGMGLAKAVARL-----------KNSTAKSKVIILITDGVSNTGEIDPLTAAGMAKE 152 Query: 396 QGIRIMTIAFS 406 GI++ I Sbjct: 153 LGIKVYPIGVG 163 >gi|88707026|ref|ZP_01104723.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] gi|88698754|gb|EAQ95876.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] Length = 330 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 73/200 (36%), Gaps = 24/200 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 LD S S + + H + +D L ++ D R+G F + Sbjct: 99 ALDLSGSMDARDFRDAEGHEQNRLTAAKDVLEGFAAQ-REGD------RLGLIVFGNAAY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + +T+++ + STA+ DA+ A +S+ Sbjct: 152 LQAPFT-DDRETWQTLLEESEVAM-AGQSTALGDAIGLAISIFQASDTTN---------- 199 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 + +++LTDG +T A + + I T+A T E+A L+ A Sbjct: 200 -RVLIVLTDGNDTGSRVPPRDAATIAAANDVTIYTVAVGDPATIGEEALDLETLNAVAET 258 Query: 424 SPNSFFEANSTHELNKIFRD 443 + + F+A T L K + + Sbjct: 259 TGGASFQALDTQALEKAYDE 278 >gi|223462569|gb|AAI50696.1| RIKEN cDNA E330026B02 gene [Mus musculus] Length = 1182 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 72/187 (38%), Gaps = 16/187 (8%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS ++ ++++D +I +KK D + VR GA + D Sbjct: 810 VFVIDSSGSIDYQEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYL--DE 864 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ D G+T +A+ + + +H+ + ++++TD Sbjct: 865 LGTKLEVISVLQNDHPMGGNTYTAEALAFSNHMFTEARGSRLHKGVP-----QVLIVITD 919 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GE + D E+ A + +GI ++ + + + L+ S + ++ + Sbjct: 920 GE-SHDAEKLNATAKALRDKGILVLAVGIAGANS-----WELLAMAGSGDKYYFVETFGG 973 Query: 437 LNKIFRD 443 L IF D Sbjct: 974 LKGIFSD 980 Score = 43.0 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 42/150 (28%), Gaps = 19/150 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D V++G F+ + I + +T A+ Sbjct: 655 ADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFS 712 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++L+TDGE + + +G+ I ++ + Sbjct: 713 PDKGARP-------NVRKFLILITDGEAQDIVRDP---ALALRKEGVIIYSVGVFGSNVT 762 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKI 440 Q L + P F + L I Sbjct: 763 Q------LEEISGKPEMVFYVENFDILQHI 786 >gi|330829762|ref|YP_004392714.1| von Willebrand factor, type A [Aeromonas veronii B565] gi|328804898|gb|AEB50097.1| von Willebrand factor, type A [Aeromonas veronii B565] Length = 347 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 75/203 (36%), Gaps = 15/203 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 LD S S S + + ++ + + ++ D R+G F D Sbjct: 104 LDLSGSMAETDFSPDPGKSLSRLDAAKE-VLKQFAATREGD------RLGLILFGDAAFL 156 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F+ + + +T ST + DA+ A + ++++ + +N+ + + Sbjct: 157 QAPFTADLETWQTLLQETDVA--MAGQSTHLGDAIGLAI-KVFNNSDRHGQQDQNSAKRE 213 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKARYFLSNCA--S 424 K ++LTDG +T A G+R+ TIA +Q L A + Sbjct: 214 KVAIILTDGNDTGSFVSPRDAARVAAVNGVRLHTIAMGDPATVGEQALDLDTLQQLATLT 273 Query: 425 PNSFFEANSTHELNKIFRDRIGN 447 F+A +L + ++ IG Sbjct: 274 GGQLFQALDEAQLTRAYQ-VIGE 295 >gi|84386025|ref|ZP_00989055.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] gi|84379341|gb|EAP96194.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] Length = 345 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 77/216 (35%), Gaps = 26/216 (12%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + KS ++ L S+SEE F D +H + + + L Sbjct: 83 VGEPIEQKKSAREIMVALDLSGSMSEEDFADKKGNKHD--RLTIAKQVLREFAAQ----- 135 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + R+G F D F+ ++ + +++++ + TA DA+ + Sbjct: 136 --REHDRLGLILFADSAYVQAPFTEDIN-VWQSLLEDVELGYAGFK-TAFGDAIGLS--- 188 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + ++ ++LLTDG++T + A G++I TIA Sbjct: 189 --------IAVFEQEQSRQRVMILLTDGDDTSSKMPPVKAAEIAAKYGVKIYTIAIGDPS 240 Query: 410 TQQEKARYF--LSN--CASPNSFFEANSTHELNKIF 441 T+ L A+ F A +L++ + Sbjct: 241 TKGRYKMDLPTLEKVSAATGGQMFHAMDRKQLDQAY 276 >gi|47218290|emb|CAG04122.1| unnamed protein product [Tetraodon nigroviridis] Length = 993 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 37/179 (20%), Positives = 58/179 (32%), Gaps = 19/179 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS--FSWGVHKLIRTIVKTFAIDEN 332 ++AL +++ ++ D R F+ R+ +R K + Sbjct: 328 KEALLTILGDLRPAD------RFNFISFSSRIRVWQPGRLVPATPSAVRDAKKFVVMLPT 381 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGIAIC 390 G T I+ A+QT + H + I+ LTDG+ T AI Sbjct: 382 SGGGTDIDGAIQTGSSLLRD------HLSGRDAGPNSVSLIIFLTDGQPTVGEVRPGAIL 435 Query: 391 NKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 A+ I TI + + R L NC E + K F D IG Sbjct: 436 GNARAAVRDKFCIFTIGMGDDVDYRLLERMALDNCGMMRRIPEEADASSMLKGFYDEIG 494 >gi|218676637|ref|YP_002395456.1| hypothetical protein VS_II0874 [Vibrio splendidus LGP32] gi|218324905|emb|CAV26683.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 355 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 48/202 (23%), Positives = 78/202 (38%), Gaps = 15/202 (7%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ LA ++ +K D R+G F D Sbjct: 108 DLSGSMAEQDFTSKQGDKISRLDATKEVLADFAKT-RKGD------RLGLILFGDAAFVQ 160 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A + + + +K Sbjct: 161 TPFTADQDVWLELLNQTDVA--MAGQSTHLGDAIGLAI-KVFEQSAAVQDSSVDANVKEK 217 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 +++LTDG +T E I AK++G+RI IA +T E A + A S Sbjct: 218 VVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQESG 277 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 FEA + EL K + +IG Sbjct: 278 GEAFEALNRDELTKAY-AQIGE 298 >gi|241113143|ref|YP_002972978.1| hypothetical protein Rleg_4788 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861351|gb|ACS59017.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 461 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 48/384 (12%), Positives = 118/384 (30%), Gaps = 72/384 (18%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + + L + G+ I+ AL + M+ G D +R ++ TA+I A Sbjct: 18 ARVFKTLRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAA 77 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + ++ + + + ++E T ++N Sbjct: 78 VKEI--DTDDAVALKQKVSDWFHAQVEN-----------------------SYTLGDINI 112 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 S +++ ++ + + +R I + + + + + SY +++ VID Sbjct: 113 DTSNHKITATASGTVP----TTLMRIANIDTVDVSVASAVKGPATSY-----LNVYIVID 163 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S SML + + Sbjct: 164 TSPSMLLAATTAGQATM-------------------------------YSGIGCQFACHT 192 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + + + S+ +++ + + DA+ V+ I D+ + +++G D Sbjct: 193 GDAHKIGNKTYNNNYEYSTAKNIKLRADVAGDAVKDVLALIDTSDSNHQRIKVGLYSLGD 252 Query: 305 RVIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + P+ S + T +T + ++ T + + + Sbjct: 253 TLTEVLAPTLSTDTARNRLTDASYGLTSATSKAATYFDVSLTTLKQKVGTGGDGTA---- 308 Query: 363 NNLEAKKYIVLLTDGENTQDNEEG 386 + K ++LLTDG +Q Sbjct: 309 -SNSPLKLVLLLTDGVQSQREWVT 331 >gi|315122852|ref|YP_004063341.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496254|gb|ADR52853.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 494 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 65/143 (45%), Gaps = 4/143 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 K F++K L S G+F +I+A+++P + + G+++ + ++++++ A++ A ++ Sbjct: 22 KIHFFNKLLFFSKKGNFAMISAIMIPSLALLLGIVLVTSNYLLHKYSVESASEEA-LSHG 80 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + LI ++ N + L +N + + + + I T + + Sbjct: 81 MSLICYQNDIER--DNLAKIILNDLIVSLKKNNFTKQEADLVAKNSK-IDITTLINDSTN 137 Query: 126 KSAYQVVLSSRYDLLLNPLSLFL 148 +Y + S Y + LN ++ Sbjct: 138 VKSYHFYIKSVYKMPLNKITKIF 160 >gi|146338996|ref|YP_001204044.1| hypothetical protein BRADO1945 [Bradyrhizobium sp. ORS278] gi|146191802|emb|CAL75807.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 432 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 64/477 (13%), Positives = 141/477 (29%), Gaps = 102/477 (21%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ +G+ +I A+ + +L G +D + L + A++ + Sbjct: 13 RFRRNDSGNIAVIFAIALLPILAFIGSAIDYSMAVRAKAKLSASIDAALLA-ATGYTAMR 71 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 S + ++ + L N ++ D V V + VV Sbjct: 72 GSSSDAKTAATNMFNGQMSAH-------KLTSNSLSIDITDSVSARTVT-----GSATVV 119 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + F+ G + + + A +Y + ++D S S Sbjct: 120 VKTS----------FMYMFGYPTMTVSASSSASASFPTY-----MDFYVLVDNSPSQGLG 164 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ L ++ +C + Y + Sbjct: 165 ATTADMTTL-----------------------QNATTDTCAFAC-----HDTYTSSSKKT 196 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM---------GATFFN 303 L + D + V + +VR A S+ + V++ RM G+ Sbjct: 197 LQTNSYYDKAKKLGVTMRIDVVRSATQSLTDTATSSQIVSNQYRMAVYSMGADCGSLGLT 256 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S S + + T D +A +I + D + Sbjct: 257 TVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGTMSAMSGVIPTQGDG-----S 311 Query: 364 NLEAKKYIVLLTDG-----------ENTQDNEE-----GIAICNKAKSQGIRI---MTIA 404 + +K++ ++DG + TQ C K++GI+I T Sbjct: 312 STNPQKWLFFVSDGVADYAYPTTCSKTTQSGGRCVEPLTTTTCTALKARGIKIAVLYTTY 371 Query: 405 FSVNKTQ---------QEKARYFLSNCASPNSFFEANSTHE----LNKIFRDRIGNE 448 ++ ++ + +CASP ++E +S+ L +F+ I + Sbjct: 372 LAITSNGYYNTWVKPWRDSIGTIMKSCASPGYYYEVDSSGSIGSALTALFQQAIASA 428 >gi|296124353|ref|YP_003632131.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296016693|gb|ADG69932.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 390 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 52/405 (12%), Positives = 113/405 (27%), Gaps = 58/405 (14%) Query: 39 MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 VDV L+ A + L ++ + ++ T K+ + Sbjct: 33 FTVDVAYMQLVRTELRAATDASAKAGMEALRRTQDTEAAIDAAIATAAANKVGGRSLTLT 92 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 + ++ R V D +V N + Y V + S ++ + I Sbjct: 93 ADQIEFGLAFRNV-----DNSVSFNAGQLPYTAVRVNSA----MTESSAAGAVPLFFGSI 143 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 + E + V I + ID S SM + + D Sbjct: 144 FGTGQFEPTRSAVSASTEVEICFAIDRSHSMCFDLTGVDWSYPPGTPRNPDPVAFPPHPT 203 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 + + ++ S+ P P ++ + S+ + K + Sbjct: 204 LSRWASLSRAMQTFV-----SITASQEPKPRVAMVTWASKITQSNYEGKLTKTN------ 252 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + V D + + L + I ++ +G+T Sbjct: 253 ------------------------SPEVFVDVPLTTNLADLNQAIKG--RSEKVMLGATN 286 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 + + A + ++ + + I+L+TDG Q + A ++GI Sbjct: 287 MAAGIDEARKILNATKSTRPYA-------HRIIILMTDGLWNQG-RNPLLAAQDAANEGI 338 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I +++ + + A ++ L F D Sbjct: 339 VIHSVSLLPRSGDITPQVS----STTGGVNYPATNSAALEAAFAD 379 >gi|302382135|ref|YP_003817958.1| von Willebrand factor A [Brevundimonas subvibrioides ATCC 15264] gi|302192763|gb|ADL00335.1| von Willebrand factor type A [Brevundimonas subvibrioides ATCC 15264] Length = 560 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 55/176 (31%), Gaps = 48/176 (27%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-- 379 T ++T + G+T + + T+ + R KK I+++TDG N Sbjct: 388 TALRTAVNNMIASGNTNVPLGTMWGWHTLSPNAPFGDGRPYGTERLKKIIIIMTDGANVM 447 Query: 380 ------TQDNEEGIAI----------------------------------CNKAKSQGIR 399 G+ C K + I Sbjct: 448 SDTTSPNDSTYNGLGYIWQNRLGIVSGNDTTRRTRMDNRFDHATAATEDMCGNMKDKDIE 507 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVI 454 + T+A V+ T A+ L CA+ + +F +S + F DRI I I Sbjct: 508 VYTVAVQVDST----AQTLLRRCATDTDHYFPVDSAAGIGAAF-DRIAGAIENLRI 558 Score = 37.3 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 59/195 (30%), Gaps = 23/195 (11%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + + + G+ ++ AL +P M+ + VD+ R S ++ A Sbjct: 9 SALKTFGTRFSDDRRGNVAMMFALALPPMMLMTLGGVDIARVSTVRMNVQDALDA----- 63 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++R++ + + L+ N + T Sbjct: 64 -------ATLAAARSQYTDNPRINAVG-------LAALQANLAPYG-DVTLDTTQTNFRL 108 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + V ++ ++ ++FL G + A S + + + VID Sbjct: 109 NTATGAVEADAKVNVRALVANIFLPPYG---QFFDDQLPANAHSEVLRSNNRIEVALVID 165 Query: 185 FSRSMLDYQRDSEGQ 199 + SM + + Sbjct: 166 NTGSMDGAKLTNTKT 180 >gi|116251678|ref|YP_767516.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115256326|emb|CAK07407.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 329 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 40/267 (14%), Positives = 85/267 (31%), Gaps = 29/267 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 R +EG + P + R + + P + +L LD S Sbjct: 48 RPTEGSVIARRTWPQILCEGAAWCLVVVALARPQFVEPPIEKVEPQRDILL---ALDLSQ 104 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S + + + + V+ +A + + R+G F D F+ Sbjct: 105 SMDTKDFPGADGKPLARVDAVKQVVADFVGR----RPGD---RIGLVAFGDAPYPLAPFT 157 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 H L+ T++ M ++ + V + +K +++ Sbjct: 158 MD-HALVETMIA------------DAVPGMAGPRTSLGDALGLAVKMFEKTTVPEKVLIV 204 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFF 429 LTDG +T + AKS+G+ + + + + L A + +F Sbjct: 205 LTDGNDTASRMPPLKAAEIAKSKGVVVHAVGIGDPLATGEDKLDTATLQKIAEKTGGRYF 264 Query: 430 EANSTHELNKIFR--DRIGNEIFERVI 454 +L I++ D+I +++ Sbjct: 265 FGGDQAQLASIYQVLDQITPGDQKKLS 291 >gi|154089854|emb|CAO81739.1| collagen type VI alpha 6 [Homo sapiens] Length = 631 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 73/189 (38%), Gaps = 20/189 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWG 315 FV SS + ++++D +I +KK D + VR GA + D +G Sbjct: 54 VFVIDSSGSIDYDEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDDFG 110 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ D+ GST +A+ + + + N + ++++T Sbjct: 111 TK---LEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRL-----NKGVPQVLIVIT 162 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANST 434 DGE + D ++ A + +GI ++ + N + L+ S + +F + Sbjct: 163 DGE-SHDADKLNATAKALRDKGILVLAVGIDGANPVE------LLAMAGSSDKYFFVETF 215 Query: 435 HELNKIFRD 443 L IF D Sbjct: 216 GGLKGIFSD 224 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 15/166 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++ D + VR+GA F+D + G + I + Sbjct: 260 KKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQIF 317 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T I A++ D R+ + +++LTDG Q +E + Sbjct: 318 GNTHIGAALREVEHYF---RPDMGSRINTGTP--QVLLVLTDG---QSQDEVAQAAEALR 369 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +GI I ++ + + + ++ EL K+ Sbjct: 370 HRGIDIYSVGI-----GDVDDQQLIQITGTAEKKLTVHNFDELKKV 410 >gi|311265878|ref|XP_003130868.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Sus scrofa] Length = 944 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 48/217 (22%), Positives = 74/217 (34%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV SS V K +DAL +++ ++ D+ N Sbjct: 277 NGYFVHYFAPRDLPPLPKNVVFVLDSSASMVGAKLRQTKDALFTILHDLRPQDHFN---- 332 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H + T K + + G T IN A+QT + Sbjct: 333 --IIGFSNRIKV-----WKDHLVSVTPNNVRDGKVYIHHMSPSGGTDINGALQTGIALLH 385 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVN 408 S H +++ IV LTDG+ T I N + + I T+ + Sbjct: 386 DS--VARHDLEDRSV--SLIVFLTDGKPTVGETHTPKILNNTREAARGRVCIFTVGIGDD 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + L NC E F D I Sbjct: 442 VDFRLLEKLSLENCGFTRHVHEDEDAGAQLIGFYDEI 478 >gi|152993979|ref|YP_001359700.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425840|dbj|BAF73343.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 307 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/149 (22%), Positives = 54/149 (36%), Gaps = 18/149 (12%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 MG F + ++ + + + T STAI DA+ A T+ Sbjct: 126 MGVVIFGTFAYTASPLTYDLEAMESMLKMTTV--GIAGESTAIGDALMQAMRTL------ 177 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 K I+LLTDG + A KAK +GI+I TI + Sbjct: 178 -----SYGEAQSKAIILLTDGYHNAGRSSPKAAVAKAKEKGIKIYTIGVGKSSDYDA--- 229 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRD 443 L A S + A S +L +++++ Sbjct: 230 ALLDTIAKESGGKSYAAASAAQLKEVYKE 258 >gi|149410544|ref|XP_001506183.1| PREDICTED: similar to protocadherin 9, partial [Ornithorhynchus anatinus] Length = 588 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 74/205 (36%), Gaps = 23/205 (11%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 D + FV SS V++ + ++++ + D R+G + V Sbjct: 3 SSCDSKRLDLIFVIDSSRSVRPHDFEKVKEFIVTILQFLDVAP---DVTRVGLIQYGSTV 59 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 ++ S + + + + + T A+Q A + S +E ++ N+ Sbjct: 60 KNEFSLK--TYGRKSEVERAVKVMKRLGTGTMTGLAIQYAVNIAFSESE-GARPLRENVP 116 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-- 424 + I+++TDG + + + KA++ GI I I L + S Sbjct: 117 --RIIMIVTDG---RPQDPVAEVAAKARNSGILIFAIGVGQVD------YNTLKSIGSKP 165 Query: 425 -PNSFFEANS---THELNKIFRDRI 445 + F + L +F++++ Sbjct: 166 HQDHVFLVANFSQIESLTSVFQNKL 190 >gi|47219516|emb|CAG09870.1| unnamed protein product [Tetraodon nigroviridis] Length = 1259 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 62/180 (34%), Gaps = 26/180 (14%) Query: 278 LASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + + + + D V++ ++ ++ + H I +V+ GS Sbjct: 414 VRAFLEVLVNSFDIGPSKVQISLVQYSRDPHTEFAL--NTHHDINAVVRAVRTFPYRGGS 471 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AM+ D I ++ + +VL+TDG + ++ ++ Sbjct: 472 TNTGKAMKYVKDKIFVASRGAR------QNVPRVMVLITDG---KSSDSFKDAATNLRNI 522 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFERV 453 + I + ++ R L A+P N FE F+ RI E+ + + Sbjct: 523 DVEIFAVGV------KDAVRSELEAIANPPADNHVFEVED----FDAFQ-RISKELTQSI 571 >gi|327541799|gb|EGF28311.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 363 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 62/167 (37%), Gaps = 19/167 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A + + V++ ++G +NDR D + ++ + + G Sbjct: 205 AAIRIFTDLLATTPVDE--QIGLASYNDRASEDVQLTENFAEVNNAMDRLR-----TGGF 257 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T+I+ MQ + ++ R + ++ ++++TDG + + E + + Sbjct: 258 TSISRGMQ--------AGQEIALRGRPPEFVERTMIVMTDGRHNRGP-EPRVVATDLAAD 308 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 G+ I TI F + + F A + +L I+R+ Sbjct: 309 GVTIHTITFGAGA-DFGRMQDVAR--IGGGRHFHATNGDQLRDIYRE 352 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 53/198 (26%), Gaps = 17/198 (8%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G ++ A++M + L V +D+ + L+ + + + Sbjct: 33 RHGAMLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRSST---------DAAANAAATTL 83 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 Q+ ++ N N D + + D V K A+ + Sbjct: 84 ADTLDRNLAIQRGQQIAQANLVNGQPLLLADGDFQFGRSDRQVN---GKYAFNAGEAPFN 140 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ----WVIDFSRSMLDYQ 193 + +N G ++ ++ V+D S SM + Sbjct: 141 GVRVNGQRTAGSLSGPVPLFFGNV-TGTSIFEPEAFATATYVERDITLVVDRSGSMAGSR 199 Query: 194 RDSEGQPLNCFGQPADRT 211 + + F T Sbjct: 200 FNDLQAAIRIFTDLLATT 217 >gi|224048603|ref|XP_002193071.1| PREDICTED: collagen, type XXI, alpha 1 [Taeniopygia guttata] Length = 945 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 78/230 (33%), Gaps = 31/230 (13%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 Y+ + + P D + I K LV + D Sbjct: 19 YISAEDGETRASCRTAPADLVFILDGSYSVGPENFEIIKSWLVNIT--------RNFDIG 70 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 +++G ++D + + G H+ +++ G+T A+Q A+D + Sbjct: 71 PKFIQVGVVQYSDYPVLEIPL--GTHESTENLIREMESIHYLGGNTRTGRAIQFAFDHLF 128 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + K++ K V+LTDG + +E + +A+ I + I + Sbjct: 129 A---------KSSRFLTKIAVVLTDG---KSQDEVKDVAAEARKNKITLFAIGVGSEIEE 176 Query: 412 QEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 E L A S F +++I ++ I ++ E + T+ Sbjct: 177 DE-----LKAIANKPSSTYVFYVEDYIAISRI-KEVIKQKLCEESVCPTR 220 >gi|149632101|ref|XP_001514410.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 2392 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 60/165 (36%), Gaps = 13/165 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 A +I +KK D + V+ GA ++D + I+ D GST Sbjct: 828 AFMIDLVKKADVGKNQVQFGALKYSDFPEVLFNL--NEFSSKSEIISFIQNDHPRGGSTY 885 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ + S +HR + ++++TDGE + D A + +GI Sbjct: 886 TAKALAHSAHLFSESLGSRMHRGVP-----QVLIVITDGE-SHDAHLLNATARALRDKGI 939 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ + + LS S + +F + L IF + Sbjct: 940 LVLAVGI-----EGANHEELLSMAGSTDRYFFVENFEGLKGIFEN 979 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 59/412 (14%), Positives = 130/412 (31%), Gaps = 46/412 (11%) Query: 51 HALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE 110 + + A + L Q +++ S + + K + ++ +NF F Sbjct: 41 NKMINALPIEASKYRIALAQYSDDLHSEFQLNTFKSKNPMLNHVKKNFA------FRGGS 94 Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 R + + Y L++ D P L + + G ++ A+A + R Sbjct: 95 PRL-----GLALQKAHKTYFSGLTNGRDPKRFPPVLVVLASGPSEDDVEAPAKA--LQRD 147 Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 K + +Q D + P F R V +S ++ K Sbjct: 148 RVKIISLGMQAASDR-------DLKAMATPQFDFLLRTIREVSMFSPNMTRIIEDVVKFE 200 Query: 231 PYMVSCNKSLYYMLYP--GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 ++ L+ +P D S+ + F+ S+ + ++ ++ +I Sbjct: 201 EGTLTIVLFLFLSNFPIEACHDASVVDIVFLVDESVNGTDENFEHLK---GFLVETIDSF 257 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D + +R+G +++ G +K I++ + G A+ Sbjct: 258 DVKENCMRIGLVMYSNETKLVSRLGTGTNK--SDILQQIDGLSPKAGRALTGAAINVTRK 315 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 I S + + + VL+T + + + +G+ + + Sbjct: 316 EIFSRGAGS----RKSQGVLQITVLIT---HRSSEDNVSEAALSLRREGVTVFAVGIEGA 368 Query: 409 KTQQEKARYFLSNCAS-PNSFF-----EANSTHELNKIFRDRIGNEIFERVI 454 Q L AS P + + +IF+ ++ NEI +V Sbjct: 369 NETQ------LDQIASYPREQYVSMVKSYSDMGAYYRIFQKKLRNEIQNKVS 414 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 48/148 (32%), Gaps = 21/148 (14%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTI 350 + V++G F+D + +T + + +G T I A+ D Sbjct: 654 ANQVQVGLVQFSDINKEGFQLN---QYDTKTKISDAIDGLSLIGRGTLIGGALTFVSDYF 710 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 S KK++VLLTDG + + + G+ I ++ ++ Sbjct: 711 SVSKGARP-------NVKKFLVLLTDG---KSQDAVKEAAVALRQDGVIIYSVGVFGSEY 760 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHEL 437 Q L + + F + L Sbjct: 761 SQ------LEEISGRSDMVFYVENFDIL 782 >gi|328951280|ref|YP_004368615.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451604|gb|AEB12505.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 320 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 65/186 (34%), Gaps = 32/186 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A I+ + V +G F+ + +L + + Sbjct: 113 SRLEAAKAAAREF---IRAMPPG---VEVGLVAFSSYATLLQPPTTDRERLEQAVDLLDL 166 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI D + A + + D +VLL+DG N + + Sbjct: 167 AHR-----TAIGDGLVAALRVLPLEDSDA--------PGGMSVVLLSDGRNNYG-IDPLE 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------LSNCA--SPNSFFEANSTHE 436 +A++QG+R+ T+ +++ A + L A + +++ A+S E Sbjct: 213 AARQAEAQGVRVYTVGVGLSENTYVFANGYYIRAGLDEETLQEIAALTGGAYYRASSADE 272 Query: 437 LNKIFR 442 L +++ Sbjct: 273 LRAVYQ 278 >gi|307353371|ref|YP_003894422.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307156604|gb|ADN35984.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 317 Score = 57.7 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 51/141 (36%), Gaps = 21/141 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A+ ++I + D G F+ + S + +++ + Sbjct: 110 RITAAKEAIGTLINQLDLKD------YAGIITFDSGASTAAYLS----PDKQRVIEKLGM 159 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 STAI D + A V K+ K ++LL+DGE+ Sbjct: 160 IAASDDSTAIGDGLALA-----------VDMSKSIPNRKSVVILLSDGESNAGYVSPETA 208 Query: 390 CNKAKSQGIRIMTIAFSVNKT 410 AK G+++ T+A ++ Sbjct: 209 AEFAKESGVQVFTVAMGSSEK 229 >gi|297560911|ref|YP_003679885.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296845359|gb|ADH67379.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 315 Score = 57.7 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 75/196 (38%), Gaps = 29/196 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + + ++ D +G F+ S + +I +I Sbjct: 107 RLEAAKKSAQGFVETLP------DRFNVGLVAFSSTATVVSSPTHDHQAVIGSIEN---- 156 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 TAI + + + ++I S +ED + IVLL+DGENT + A+ Sbjct: 157 -LQLGPGTAIGEGVFASLESISSFDEDA-----DVDPPPSAIVLLSDGENTSGRDISQAV 210 Query: 390 CNKAKSQGIRIMTIAFSVNKT---------QQEKARYFLSNCASP--NSFFEANSTHELN 438 A Q + + TIAF + + L AS F+EA S EL+ Sbjct: 211 A-MAAEQEVPVSTIAFGTGAAMIEIDGYQVPADIDKEALRGLASDTGGHFYEAESETELD 269 Query: 439 KIFRDRIGNEIFERVI 454 +++ D IG+ + ++ Sbjct: 270 EVYED-IGSSLGTELV 284 >gi|32474888|ref|NP_867882.1| hypothetical protein RB7557 [Rhodopirellula baltica SH 1] gi|32445428|emb|CAD75429.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 327 Score = 57.7 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 62/167 (37%), Gaps = 19/167 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A + + V++ ++G +NDR D + ++ + + G Sbjct: 169 AAIRIFTDLLATTPVDE--QIGLASYNDRASEDVQLTENFAEVNNAMDRLR-----TGGF 221 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T+I+ MQ + ++ R + ++ ++++TDG + + E + + Sbjct: 222 TSISRGMQ--------AGQEIALRGRPPEFVERTMIVMTDGRHNRGP-EPRVVATDLAAD 272 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 G+ I TI F + + F A + +L I+R+ Sbjct: 273 GVTIHTITFGAGA-DFGRMQDVAR--IGGGRHFHATNGDQLRDIYRE 316 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 52/192 (27%), Gaps = 17/192 (8%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 ++ A++M + L V +D+ + L+ + + + Sbjct: 3 VLIAIMMFLFLIVVAFSIDIAQMHLARTELRSST---------DAAANAAATTLADTLDR 53 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 Q+ ++ N N D + + D V K A+ + + +N Sbjct: 54 NLAIQRGQQIAQANLVNGQPLLLADGDFQFGRSDRQVN---GKYAFNAGEAPFNGVRVNG 110 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ----WVIDFSRSMLDYQRDSEGQ 199 G ++ ++ V+D S SM + + Sbjct: 111 QRTTGSLSGPVPLFFGNV-TGTSIFEPEAFATATYVERDITLVVDRSGSMAGSRFNDLQA 169 Query: 200 PLNCFGQPADRT 211 + F T Sbjct: 170 AIRIFTDLLATT 181 >gi|291399639|ref|XP_002716220.1| PREDICTED: collagen, type VI, alpha 6 [Oryctolagus cuniculus] Length = 2273 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 71/188 (37%), Gaps = 18/188 (9%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + ++++D +I +KK D VR GA + D G Sbjct: 810 VFVIDSSGSIDYDEYNIMKD---FMIGLVKKADVGKAQVRFGALKYADDPEVLFYL--GD 864 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ D+ G+T +A+ + + +H+ + ++++TD Sbjct: 865 LDTKMEVISMLQNDQPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----QVLIVITD 919 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANSTH 435 GE + D E+ + +GI ++ + N + L+ S + +F + Sbjct: 920 GE-SHDAEKLNGTTKALRDKGILVLAVGIAGANPVE------LLAMAGSSDKYFFVETFG 972 Query: 436 ELNKIFRD 443 L IF D Sbjct: 973 GLQGIFSD 980 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 67/199 (33%), Gaps = 27/199 (13%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ S +++ +ASV++ D + V++GA F+ + G Sbjct: 1001 VFLMDGSNSIDPSDFKKMKEFVASVVQDF---DVSLNRVQIGAAQFSHTYQPEFPL--GT 1055 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + G T I A+ D R+ + +++LTD Sbjct: 1056 FTDEKEVSFHIENIQQIFGYTHIGAALHQVGRYF---QPDMGSRINTGTP--QVLLVLTD 1110 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 G Q +E + + G+ I ++ + + + + ++ E Sbjct: 1111 G---QSQDEVAQAAEELRHNGVDIYSVGI-----GNVDHQQLIQITGTADKKLTVDNFDE 1162 Query: 437 LNKIFRDRIGNEIFERVIR 455 L +I +R++R Sbjct: 1163 L---------KKIKKRIVR 1172 >gi|320102588|ref|YP_004178179.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] gi|319749870|gb|ADV61630.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] Length = 688 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 38/188 (20%), Positives = 65/188 (34%), Gaps = 28/188 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 R+A S + T R+G ++D+V+ + V K+ + + Sbjct: 528 LDEAREAARSFLDQCD-----FTTTRVGLISYSDQVVLQTDLTDNVRKVEAGLAR----- 577 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T + A++ + + H KY+V+LTDG + + Sbjct: 578 LEADGTTNLAGALELGRRKLATVP--TGHV--------KYLVVLTDGYPDDPDNALLEAA 627 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 AK GI I+ I E + +L AS + EL + F I I Sbjct: 628 -HAKGSGIEIVAIGTG------EADQAYLDRIASTQAGSIFARKGELVRAFG-HIARVIA 679 Query: 451 ERVIRITK 458 E + K Sbjct: 680 EGGRSLRK 687 >gi|170743045|ref|YP_001771700.1| hypothetical protein M446_4937 [Methylobacterium sp. 4-46] gi|168197319|gb|ACA19266.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 440 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 56/159 (35%), Gaps = 25/159 (15%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + TG +I L +P+++ G V+ R L++A A + A+ L + Sbjct: 6 RRFLADRTGAVALIIGLSLPLLVAGSGAAVEYARIHKRRAELQKAVDVAALGAAGELSVA 65 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 +VS + + L + T + R T+V + ++ Sbjct: 66 GSDVSV----------EAMARRLAFDSARATDPGITRVSAAVVGRGTSVTVAINETV--- 112 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 SLF R + + S I A AE+ + Sbjct: 113 ------------QSLFGRLLTLPSMEIGASATAESSGST 139 >gi|170739681|ref|YP_001768336.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193955|gb|ACA15902.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 329 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 55/158 (34%), Gaps = 22/158 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D+ S S+ ++ ++ I ST I D + A + ++ Sbjct: 145 RIGLVIFADQADVAASLSFDTAAVV-HALEEAQIGLVGR-STGIGDGLGLALKRLDAA-- 200 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +K ++LL+DG N + A+ GIR+ TIA Sbjct: 201 ---------TAREKVVILLSDGANNAGQTTPHDVAGLARDLGIRVHTIALGPRDLSDADG 251 Query: 416 R------YFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 L + + S FF +T +L D I Sbjct: 252 DPDVVDTEALRDVSATSGGRFFRVRTTDDL-AAVADAI 288 >gi|307942638|ref|ZP_07657986.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] gi|307774277|gb|EFO33490.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] Length = 403 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 60/431 (13%), Positives = 137/431 (31%), Gaps = 98/431 (22%) Query: 38 GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRN 97 + +D+ + A + A + + + +K E R Sbjct: 36 SVGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVAD------GMSKNQARKRAETDARK 89 Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 F K+ + ++ V+ + V +S +++ S +MG + Sbjct: 90 FLTARTKSLDGTTEKFSIKINIVDREAKVVKANVNISGKHE------SYMTHAMGFDNID 143 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 +E+ + +++D S SM + + Q + Sbjct: 144 YTADSESTISFG----QGKYEFIFLVDVSPSMGIGASNRDRQIMQ--------------- 184 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 +G + P+ S +++ + +V+DA Sbjct: 185 --RAIGCQFACHEPWYSSVSRAKSAGARL-----------------------RIDVVKDA 219 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGS 336 L S++ +++ V+ +R G F++ + + G+ K R K E G Sbjct: 220 LKSLVTQLEEATEVD--LRTGLYSFSNYLHIQTGLNKGISKFKREANKIAIHREYLRGGG 277 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-------- 388 T + + S + +K + K++I++++DG N + G Sbjct: 278 TNFHG--------VFSDFNGVLRSLKPKADVKQHIIIISDGVNHLNLRSGTNRHLWNQTP 329 Query: 389 ------------ICNKAKSQGIRIMTIAFSVNKTQQEKARYF------LSNCASP-NSFF 429 C++ K + T+ + + ++A Y + CA+ + F+ Sbjct: 330 NWRPYNYSFNPRWCDEFKKG--EVRTVHTML--VEPDRAHYVRASTSSMRACATSADFFY 385 Query: 430 EANSTHELNKI 440 ANS E++K Sbjct: 386 SANSAAEIDKA 396 >gi|197105075|ref|YP_002130452.1| hypothetical protein PHZ_c1612 [Phenylobacterium zucineum HLK1] gi|196478495|gb|ACG78023.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 521 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 70/518 (13%), Positives = 141/518 (27%), Gaps = 85/518 (16%) Query: 13 KLIKSCTGHFFIITAL-LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G+ I A ++P+ +G G +D+ R + + L+ A A + A+ S Sbjct: 14 RRLADDGGNVAITVAFAMVPLAIGTLG-AIDLARGASAKVQLQDALDAAALGAARSSANS 72 Query: 72 LEEVSSRAK-----------------NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 + + + + +SFTF + DI Sbjct: 73 PDTLQAAGERLLRQNLALGGDFELVSSSFTFGPDNKVLARAQVRVEPYVAGLAGVNNMDI 132 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 T V + +VL + + N L++ A + K Sbjct: 133 AAATEVVRAGMQLEIALVLDNTGSMNQNNKLYHLKTAAKAFVTAMETAAEGNTVPNSIKI 192 Query: 175 H--GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S +D S + + + + + + + + Sbjct: 193 SLVPFSHTVRVD-SDAYRNAAWIDQNGSSPINNEIFPTATGTQWANRFTLFSQLGTSWRG 251 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 V ++ Y + P + + + + S + N Sbjct: 252 CVESRQAPYDIQDTPPTTGATLFTPYF--APDEPDYPAEWYGTKFANSYVDDRTSSTNWR 309 Query: 293 DTVRMG-ATFF-NDRVISDPSFSWGVHKLIRTI--------VKTFAIDENEMGSTAINDA 342 VR G T + N + + L I ++ D GST I Sbjct: 310 --VRQGNLTKYVNTKGLGTSKGPNAGCGLRPIIRLTTDFDGLRDAVDDLVADGSTNIPMG 367 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---------------------- 380 + + T+ + KK +VL+TDGENT Sbjct: 368 LVWGWHTLAPMAPFPDGVPYLTEKHKKIVVLMTDGENTILYKDTPNGSDYSGVGHARQGR 427 Query: 381 --------------------QDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYF 418 ++ + +C K ++ I I I + Sbjct: 428 VLDPAGRPITESSSQRERTAALDDRLLKLCANMKAPAKDIEIYAI----RVEVSSGSSSV 483 Query: 419 LSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L CAS + +++ + ++ F+ G + R Sbjct: 484 LQTCASSADHYYDVQNAADMTMAFQSIAGQIAALHLSR 521 >gi|152993581|ref|YP_001359302.1| von Willebrand factor A [Sulfurovum sp. NBC37-1] gi|151425442|dbj|BAF72945.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 305 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 66/181 (36%), Gaps = 21/181 (11%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 K +V++ +A I+ +K D R+G F D ++ K T Sbjct: 102 PKDPYKNKFDVVKEVVADFIKK-RKND------RIGMVTFADVAFIASPLTF--EKDFLT 152 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + TAINDA+ AY+ + S K I+LLTDG + Sbjct: 153 NITEMQKLGMAGKRTAINDALVQAYNLMSKSKAKS-----------KIIILLTDGRDNMS 201 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + + + +++ TI + + L+ A + A S L+KI+ Sbjct: 202 KIPLSDVKHMIEKRDVKLYTIGIGGPRDYDAQYLKTLAK-AGKGQAYAARSAAMLSKIYD 260 Query: 443 D 443 + Sbjct: 261 E 261 >gi|118443040|ref|YP_877686.1| von Willebrand factor type A domain-containing protein [Clostridium novyi NT] gi|118133496|gb|ABK60540.1| von Willebrand factor type A domain protein [Clostridium novyi NT] Length = 708 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 72/213 (33%), Gaps = 27/213 (12%) Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQP-ADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 I ++D S SM+D + + + + Q ++ K Y + + G + S Sbjct: 84 IVLLVDTSGSMIDRKVEGKYRINYSVDQVFIEKEWKWYRRVSLENGQKYIIHELLYTSER 143 Query: 238 ----KSLYYMLYPGPLDPSLSEEHFVDSSS---LRHVIKKKHLVRDALASVIRSIKKIDN 290 Y Y + ++ S + + K ++ A + + + N Sbjct: 144 DIIINGQIYTEYIEIDNKRYYLKYDYSDGSWYAIAYSESKIDELQKAAKNFVNKFETKSN 203 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 +G + ++ S + + + I + + G+T I D ++ A + Sbjct: 204 TK----IGLVSYGNKGEVVHSLTNELDR----INSSIDYGLSVYGATNIGDGIRKANGLL 255 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 N A KYIVL+TDG T Sbjct: 256 N-----------NGSNADKYIVLMTDGIPTAAT 277 >gi|301784735|ref|XP_002927783.1| PREDICTED: collagen alpha-6(VI) chain-like [Ailuropoda melanoleuca] Length = 2267 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 62/166 (37%), Gaps = 15/166 (9%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ +KK D + VR GA + D G ++ D+ G+T Sbjct: 829 DFMVDLVKKADVGKNQVRFGALKYADDPEVLFYL--GDLGSKWEVISVLQKDQPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + + + + ++++TDGE + D ++ A + +GI Sbjct: 887 TAEALGFSDHMFTEARGSRLQKGVP-----QVLIVITDGE-SHDADKLNATAKALRDKGI 940 Query: 399 RIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ + N + L+ S + +F + L IF D Sbjct: 941 LVLAVGIAGANPVE------LLAMAGSSDKYFFVETFGGLKGIFSD 980 Score = 43.8 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 62/172 (36%), Gaps = 19/172 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++ LASV++ D + VR+GA F+ + G + I + Sbjct: 1018 MKEFLASVVQDF---DVSVNRVRIGAAQFSHTYRPEFPL--GTFVGKKEISFQIENIQQI 1072 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T I A++ D R+ + +++LTDG Q +E Sbjct: 1073 FGYTHIGAALRQVGHYF---RPDMGSRINAGTP--QVLLVLTDG---QSQDEVARAAEDL 1124 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + +G+ I ++ + + + ++ EL K+ + RI Sbjct: 1125 RHKGVDIYSVGI-----GDVDDQQLIQITGTAGKKLTVHNFDELTKV-KKRI 1170 >gi|281345782|gb|EFB21366.1| hypothetical protein PANDA_017603 [Ailuropoda melanoleuca] Length = 2245 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 62/166 (37%), Gaps = 15/166 (9%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ +KK D + VR GA + D G ++ D+ G+T Sbjct: 809 DFMVDLVKKADVGKNQVRFGALKYADDPEVLFYL--GDLGSKWEVISVLQKDQPMGGNTY 866 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + + + + ++++TDGE + D ++ A + +GI Sbjct: 867 TAEALGFSDHMFTEARGSRLQKGVP-----QVLIVITDGE-SHDADKLNATAKALRDKGI 920 Query: 399 RIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ + N + L+ S + +F + L IF D Sbjct: 921 LVLAVGIAGANPVE------LLAMAGSSDKYFFVETFGGLKGIFSD 960 Score = 43.8 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 62/172 (36%), Gaps = 19/172 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++ LASV++ D + VR+GA F+ + G + I + Sbjct: 998 MKEFLASVVQDF---DVSVNRVRIGAAQFSHTYRPEFPL--GTFVGKKEISFQIENIQQI 1052 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T I A++ D R+ + +++LTDG Q +E Sbjct: 1053 FGYTHIGAALRQVGHYF---RPDMGSRINAGTP--QVLLVLTDG---QSQDEVARAAEDL 1104 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + +G+ I ++ + + + ++ EL K+ + RI Sbjct: 1105 RHKGVDIYSVGI-----GDVDDQQLIQITGTAGKKLTVHNFDELTKV-KKRI 1150 >gi|75675889|ref|YP_318310.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] gi|74420759|gb|ABA04958.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] Length = 605 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 42/335 (12%), Positives = 100/335 (29%), Gaps = 32/335 (9%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + + + + I A+ + +LG G VD R + +++ A +A++ Sbjct: 8 KRIRNSISRFDRDIRANIAPIFAIALLPVLGFVGAAVDYTRANAARSSMQAAMDSAVLM- 66 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 VS A + Q+I + + R F + + D+ ++ A + Sbjct: 67 ----------VSRDAAANPAMTSQQITDAVQRYFNSL----YNDKSAFNVSVSAAYTPST 112 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 +A +++ S + + + F++ G + + + + V+D Sbjct: 113 SSAAAKILASGQGAIE----TDFMKIAGFP--QLSFGTSSTSTWGNSRMRVA----LVLD 162 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ---NGKVGIRDEKLSPYMVSCNKSLY 241 + SM D + + Q + + + +D + + + Sbjct: 163 NTGSMRDNGKMAALQRAAKDMIDSLSAFAKTADDVYISIIPFAKDVNVDKSNYNAAWINW 222 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV-NDTVRMGAT 300 P + E + ++ K + + R+G Sbjct: 223 TEWEAEP-PILIEENYPINVRYNGLTYDKWEDIGPGAPCPFDTRNNGSPRPTQNSRVGKF 281 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 F + S S G L + I + G Sbjct: 282 SF-ACMDRPGSLS-GATDLSSLYTNRYLIPSDPSG 314 Score = 57.3 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 64/168 (38%), Gaps = 16/168 (9%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + +++ + + +K + GST + + T+ ++N Sbjct: 440 YAEQLNGCLPATITPVSSQSSTLKNQIDSMSPSGSTNQAIGLAWGWQTLSTTNGPFPAPA 499 Query: 362 KNNL-EAKKYIVLLTDGENTQDNEEGIA-------------ICNKAKSQGIRIMTIAFSV 407 K+ + Y+VLL+DG NT++ G +C K K G I T+ V Sbjct: 500 KDKAYVYQDYLVLLSDGLNTRNRWSGNGSDHSPEVDVRQALLCQKVKDSGTVIFTV--QV 557 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 N ++ L +CAS +F S ++ F++ + R+ + Sbjct: 558 NVGNRDPLSQVLQDCASNGNFQMITSANQTADAFQNILTQISQLRIAK 605 >gi|254472518|ref|ZP_05085918.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958801|gb|EEA94001.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 479 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 78/503 (15%), Positives = 148/503 (29%), Gaps = 86/503 (17%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + +F KK +S G ++ A LM +++ GM +D A+ QA A Sbjct: 6 MGYFKQFSDQFKKFNRSEDGVVAVLVAFLMVLLIVFAGMAIDFGLGFNTRRAVNQALDAA 65 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ + L + + S I++Y N +N++ + V Sbjct: 66 VLAVANKLSTT--------ELSSNTVDSLIDQYFEENLKNSV-------GGDVVHTKPVV 110 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 +P+ ++ P+ L S + + + E V + Sbjct: 111 TYDPKGDTVAATATATVKTSFLPVLKLLNSESGDFGELTVTSSSTARFPKTKVEVAVVVD 170 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY---SSQNGKVGIRDEKLSPYMVSCN 237 S S+ + S L+ + ++S S VG++ +K + Sbjct: 171 VTGSMSGSIGSLKTASRDM-LDTLLPDDNTRLQSRVRISYVPYNVGVKLDKTLARKATFE 229 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSS-----------------------SLRHVIKKKHLV 274 KS Y ++ D + S E+ L + K Sbjct: 230 KSQYGCVHARVRDLAYSGENHDYEDEDDDERVDYIGTNYSWCPNAQMVPLTNDRTKIESS 289 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +AL + G + G + Sbjct: 290 INALR------------ASSATAGQIGIAWGWYTLSPEWRGFWPTESKPDFYDNNGVRKY 337 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHR--MKNNLEAKKYIVLLTDGENTQDNEEGIA---- 388 + + AY S D H+ +KN + + + G+ D+ + IA Sbjct: 338 AVLMTDGSFN-AYYAADYSKADAEHKKLIKNKSDVQNSQDPMDSGKLDADDHKKIASKVK 396 Query: 389 -------------------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSF 428 +C K + I I T+ F + K + + CAS +F Sbjct: 397 WEYDYSSSLSGVPFKTASNLCKNMKKEDIVIYTVFFGSDY----KGKKIMEECASNSETF 452 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 + A + L + F I N+I Sbjct: 453 YHATNQSALIQAF-SSIANDIKS 474 >gi|148676058|gb|EDL08005.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Mus musculus] Length = 918 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 48/219 (21%), Positives = 74/219 (33%), Gaps = 27/219 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV S V K R+AL +++ ++ D R Sbjct: 243 NGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAKLQQTREALVTILNDLRPQD------R 296 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H L T K + + G T IN A+Q A + Sbjct: 297 FNIIGFSNRIKM-----WKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLN 351 Query: 352 S--SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFS 406 + + D R + I+ LTDG+ T + I + K I I T+ Sbjct: 352 NYVAQNDIEDRSVS------LIIFLTDGKPTFGETNTLKILSNTKEATRGQICIFTVGIG 405 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + L NC E + F D I Sbjct: 406 DDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEI 444 >gi|27369644|ref|NP_766059.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Mus musculus] gi|81873944|sp|Q8BJD1|ITIH5_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|26352482|dbj|BAC39871.1| unnamed protein product [Mus musculus] gi|37589944|gb|AAH43314.2| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|38328214|gb|AAH62196.1| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|74145221|dbj|BAE22250.1| unnamed protein product [Mus musculus] gi|122889674|emb|CAM13913.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|123858038|emb|CAM26660.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|148676059|gb|EDL08006.1| inter-alpha (globulin) inhibitor H5, isoform CRA_b [Mus musculus] Length = 952 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 48/219 (21%), Positives = 74/219 (33%), Gaps = 27/219 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV S V K R+AL +++ ++ D R Sbjct: 277 NGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAKLQQTREALVTILNDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H L T K + + G T IN A+Q A + Sbjct: 331 FNIIGFSNRIKM-----WKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLN 385 Query: 352 S--SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFS 406 + + D R + I+ LTDG+ T + I + K I I T+ Sbjct: 386 NYVAQNDIEDRSVS------LIIFLTDGKPTFGETNTLKILSNTKEATRGQICIFTVGIG 439 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + L NC E + F D I Sbjct: 440 DDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEI 478 >gi|74183702|dbj|BAE24467.1| unnamed protein product [Mus musculus] Length = 952 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 48/219 (21%), Positives = 74/219 (33%), Gaps = 27/219 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV S V K R+AL +++ ++ D R Sbjct: 277 NGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAKLQQTREALVTILNDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H L T K + + G T IN A+Q A + Sbjct: 331 FNIIGFSNRIKM-----WKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLN 385 Query: 352 S--SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFS 406 + + D R + I+ LTDG+ T + I + K I I T+ Sbjct: 386 NYVAQNDIEDRSVS------LIIFLTDGKPTFGETNTLKILSNTKEATRGQICIFTVGIG 439 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + L NC E + F D I Sbjct: 440 DDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEI 478 >gi|300871001|ref|YP_003785873.1| aerotolerance-like membrane protein [Brachyspira pilosicoli 95/1000] gi|300688701|gb|ADK31372.1| aerotolerance-related membrane protein [Brachyspira pilosicoli 95/1000] Length = 328 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 65/194 (33%), Gaps = 41/194 (21%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + +A I+ + ++ F R ++ L + I Sbjct: 106 TRLEASKKTMADFIKK-------RNFDKISLVAFALRASVLSPSTFDYTSLEKEIGN--- 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 I +E GST+I + TA D + S +D K I+LLTDGEN + Sbjct: 156 IKIDEEGSTSIGLGIATAVDMLRSVKDDAE----------KVIILLTDGENNSGEIDPKL 205 Query: 389 ICNKAKSQGIRIMTIAFS--------VNKTQQEKARYFLSN-----------CAS--PNS 427 A + I+I TI V T + + AS Sbjct: 206 ASEIASNFNIKIYTIGIGDAAGSHAWVTYTDPNYGKRRIRADFTLNEKALIEIASITGGK 265 Query: 428 FFEANSTHELNKIF 441 +F A ++ L+ ++ Sbjct: 266 YFNAKTSSALDNVY 279 >gi|90420796|ref|ZP_01228702.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] gi|90335087|gb|EAS48848.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] Length = 320 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 65/181 (35%), Gaps = 28/181 (15%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ +R+ + D R+G F + ++ V + R I + Sbjct: 114 VSRLDAVKAVARDFVRA-RAGD------RVGLVIFAETAYFAAPLTFDVEAVGRLIDQ-- 164 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A STAI+D + A + S + +VLL+DG N + Sbjct: 165 ATIGISGRSTAISDGLGLALKRLARS-----------DARSRVVVLLSDGVNNAGAVQPR 213 Query: 388 AICNKAKSQGIRIMTIAFSV------NKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 + A+ GIR+ TIA K++ L A S F +T +L + Sbjct: 214 DAGSLAERLGIRVHTIALGPADLETDPKSRDAVDTATLRAIAETSGGETFRVRTTDDLRQ 273 Query: 440 I 440 + Sbjct: 274 V 274 >gi|322436225|ref|YP_004218437.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321163952|gb|ADW69657.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 304 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 79/216 (36%), Gaps = 28/216 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 S+ S + + +K + + +++R + D F+D Sbjct: 71 KESATPLSIVMAIDGSESVVTNDRLEKEAGKKFVRALLREQDEFD---------LMDFSD 121 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 V SF+ ++ + + D +TA+ DA+ A + +N R Sbjct: 122 TVREVVSFTNDKKRIENGLNELRKGD-----ATAVYDAVYLASQRLGETNAGGGRR---- 172 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQEKARYFLSN 421 + +VL+TDG+NT +A+ G+ + + + + + L Sbjct: 173 ----RVLVLITDGDNTVHGVGYDQAVEQAQRAGVMVYALIVVPIEADAGRNTGGEHALIQ 228 Query: 422 CASP--NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 A+ +++ N +L K++ ++ +++ + + Sbjct: 229 MATDTGGNYYYVNDPRDLAKVY-AKVSDDLRTQYVL 263 >gi|319786316|ref|YP_004145791.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317464828|gb|ADV26560.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 340 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 68/192 (35%), Gaps = 34/192 (17%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + + +A ++ + D R+G F R + + + + Sbjct: 118 QPVDRLTAAKAVIADFLQR-RSGD------RVGLLVFGQRAYMLTPLTLDLSAVREQLRD 170 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T + TA+ DA+ A + + E ++ +VLLTDG NT + Sbjct: 171 T--VAGLAGRETALGDAIGLAVKRLRTQPE-----------GQRVLVLLTDGVNTTGVLQ 217 Query: 386 GIAICNKAKSQGIRIMTIAFS------------VNKTQQEKARYFLSNCA--SPNSFFEA 431 + A ++ +R+ TIAF V E L A + FF A Sbjct: 218 PLKAAELAAAEQVRVYTIAFGGDGGGFSLFGVQVPVQGDEVDEATLRKVAEITGGRFFRA 277 Query: 432 NSTHELNKIFRD 443 + ++L I+ + Sbjct: 278 HDANQLAGIYAE 289 >gi|224967060|ref|NP_038620.2| matrilin-4 precursor [Mus musculus] Length = 624 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 58/164 (35%), Gaps = 16/164 (9%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D + R+G ++ +V S G + + T Sbjct: 59 LVGLLRSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIRAVVPLAQGTMTG 116 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + + +V++TDG + + + +A+++GI I Sbjct: 117 LAIQYAMNVAFSEAEGAR---PSEERVPRVLVIVTDG---RPQDRVAEVAAQARARGIEI 170 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 + Q L ASP + + +L + F Sbjct: 171 YAVGV------QRADVGSLRTMASPPLDQHVFLVESFDLIQEFG 208 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 56/153 (36%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 405 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 459 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S +E + R ++ + +V TDG + ++ Sbjct: 460 YMERGTMTGLALRHMVE--HSFSEAQGARPRDLNVPRVGLVF-TDG---RSQDDISVWAA 513 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 514 RAKEEGIVMYAVGVGKAVEEE------LREIAS 540 >gi|14548116|sp|O89029|MATN4_MOUSE RecName: Full=Matrilin-4; Short=MAT-4; Flags: Precursor gi|3766288|emb|CAA06889.1| matrilin-4 precursor [Mus musculus] gi|22477196|gb|AAH36558.1| Matrilin 4 [Mus musculus] gi|148674433|gb|EDL06380.1| matrilin 4, isoform CRA_a [Mus musculus] gi|148674434|gb|EDL06381.1| matrilin 4, isoform CRA_a [Mus musculus] Length = 624 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 58/164 (35%), Gaps = 16/164 (9%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D + R+G ++ +V S G + + T Sbjct: 59 LVGLLRSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIRAVVPLAQGTMTG 116 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + + +V++TDG + + + +A+++GI I Sbjct: 117 LAIQYAMNVAFSEAEGAR---PSEERVPRVLVIVTDG---RPQDRVAEVAAQARARGIEI 170 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 + Q L ASP + + +L + F Sbjct: 171 YAVGV------QRADVGSLRTMASPPLDQHVFLVESFDLIQEFG 208 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 56/153 (36%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 405 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 459 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S +E + R ++ + +V TDG + ++ Sbjct: 460 YMERGTMTGLALRHMVE--HSFSEAQGARPRDLNVPRVGLVF-TDG---RSQDDISVWAA 513 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 514 RAKEEGIVMYAVGVGKAVEEE------LREIAS 540 >gi|73990557|ref|XP_853279.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1634 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 61/166 (36%), Gaps = 15/166 (9%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ +KK D + VR GA + D ++ D+ G+T Sbjct: 828 DFMVDLVKKADVGKNQVRFGALKYADDPEVLFYL--DDLSTKWEVISVLQKDQPMGGNTY 885 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D ++ + +GI Sbjct: 886 TAEALGFSDHMFTEARGSRLHKGVP-----QVLIVITDGE-SHDADKLNDTAKALRDKGI 939 Query: 399 RIMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ + N + L+ S + +F + L IF D Sbjct: 940 LVLAVGIAGANPVE------LLAMAGSSDKYFFVETFGGLKGIFSD 979 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 60/171 (35%), Gaps = 16/171 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++ D + VR+GA F+ + G + I + Sbjct: 1015 KKMKEFLASVVQDFDVSVNRVRIGAAQFSHTYRPEFPL--GTFIGKKEISFQIENIQQIF 1072 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I A++ D D R+ + +++LTDG Q +E + + Sbjct: 1073 GYTHIGAALREVGDYF---RPDMGSRINAGTP--QVLLVLTDG---QSQDEVAQAAEELR 1124 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 +GI I ++ + + + + ++ EL K+ + RI Sbjct: 1125 HKGIDIYSVGI-----GDVDDQQLIQITGTADKKLTVHNFDELTKV-KKRI 1169 >gi|59712029|ref|YP_204805.1| von Willebrand factor type A domain-containing protein [Vibrio fischeri ES114] gi|59480130|gb|AAW85917.1| von Willebrand factor type A domain protein [Vibrio fischeri ES114] Length = 356 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 43/184 (23%), Positives = 75/184 (40%), Gaps = 14/184 (7%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + ++ LA +++ +K D R+G F D F+ + + +T Sbjct: 129 ISRLEATKEVLADFVKT-RKGD------RLGLILFGDAAFVQTPFTADQSVWLELLNQTD 181 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 ST + DA+ A S+ED+ +N +K ++LTDG +T E I Sbjct: 182 VA--MAGQSTHLGDAIGLAIKVFEQSSEDKASAEENAKPREKVAIVLTDGNDTGSYVEPI 239 Query: 388 AICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 A ++ +RI IA + +Q ++ A S F+A + EL + + D Sbjct: 240 DAAKVAAAKDVRIHMIAMGDPRTVGEQALDMNIINRVAKESGGKAFQAINRDELEQAY-D 298 Query: 444 RIGN 447 IG Sbjct: 299 EIGE 302 >gi|282877523|ref|ZP_06286341.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300347|gb|EFA92698.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 332 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 66/189 (34%), Gaps = 35/189 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDEN 332 +A +V ++ +G T F + + + L+R + A Sbjct: 111 EAAKNVAAEFISGRPNDN---IGLTIFAGESFTQCPMTTDHASLLNLLRNVRTDIAARGL 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 TA+ + A + K++ K ++L+TDG N + + Sbjct: 168 ISDGTAVGMGLANAVSRL-----------KDSKAKSKVVILITDGSNNMGDISPMTSAQI 216 Query: 393 AKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFEANST 434 A+S GIR+ TI NK E LS+ A + +F+ A + Sbjct: 217 AQSLGIRVYTIGVGTNKVAPYPMNVGGTTQYVNIPVEIDSKTLSDIAAVTEGNFYRATNN 276 Query: 435 HELNKIFRD 443 EL +I+ D Sbjct: 277 KELKQIYND 285 >gi|159901412|ref|YP_001547659.1| hypothetical protein Haur_4901 [Herpetosiphon aurantiacus ATCC 23779] gi|159894451|gb|ABX07531.1| conserved hypothetical membrane protein [Herpetosiphon aurantiacus ATCC 23779] Length = 330 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 71/214 (33%), Gaps = 46/214 (21%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + ++ +A ++ R+G F+ + + L + + + Sbjct: 110 RITVAKEVIAEFVK-------GRKDDRIGLVVFSGHAFTQVPLTLDYDFLQNLLGQVQTV 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI A+ + + + +N+ K ++LLTDG N + + E Sbjct: 163 RRPD--GTAIGLALAHSVNGL-----------RNSTTKSKVVILLTDGSNNRGDIEPAQA 209 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARY------------------FLSNCA--SPNSFF 429 A++ +R+ TI + L + A + FF Sbjct: 210 AEIARALDVRVYTILVGKPGNGEYPVHDPWRDETYLIPAPTAEDEVALRDIAEQTGGIFF 269 Query: 430 EANSTHELNKIFRDRIGNE-----IFERVIRITK 458 A L ++ D I E+++R T+ Sbjct: 270 RAGDEQGLRDVY-DTIDKMERSQVASEKLVRYTE 302 >gi|73990553|ref|XP_853265.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1798 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 62/157 (39%), Gaps = 15/157 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D +D VR+GA ++D S +I + + G T A++ Sbjct: 830 VKKADVDSDRVRVGALKYSDYPEVLFYLSGNKSAVIEHLRRRRYT----SGHTYTARALE 885 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E + + K+ ++++TDG + D + +K +++GI I + Sbjct: 886 HA-NIMF----TEEYGSRIQQNVKQMLIIITDGV-SHDRDNLSDTASKLRNKGINIYAVG 939 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 Q + + ++ F ++ L I+ Sbjct: 940 V-----GQANQLELETMAGNKSNTFHVDNFSNLKDIY 971 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 N+ ++ + + ++ F+ SS ++ + +++ I+ D+ Sbjct: 598 KNEVVHRICTEKGCEDMKADIMFLVDSSGSIGHDNFGKMKTFMKNLLAKIQIGP---DST 654 Query: 296 RMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 ++G F+D + + + ++ I + I T A+ + Sbjct: 655 QIGVVQFSDINQEEFQLNKYFTQNETSDAIDRMSLI----NRGTLTGSALTFVGQYFTPT 710 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + KK+++L+TDGE + + +G+ I ++ Q Sbjct: 711 KGAR-------TKVKKFLILITDGEAQDPVRDP---AKALRDKGVVIFSVGVYGANRTQ- 759 Query: 414 KARYFLSNCASPNS-FFEANSTHEL 437 L + +S F+ + +L Sbjct: 760 -----LEEISGDSSLVFQVENFDDL 779 >gi|256419476|ref|YP_003120129.1| hypothetical protein Cpin_0430 [Chitinophaga pinensis DSM 2588] gi|256034384|gb|ACU57928.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588] Length = 336 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 61/181 (33%), Gaps = 34/181 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ + + L I + + + TAI + T+ + + Sbjct: 136 RIGLVIFSGESFTQCPITTDHGVLKNQIAQVKSGMLQD--GTAIGMGLATSVERL----- 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + + K I+LLTDG N + + AK+ IR+ TI Sbjct: 189 ------RTSKAKSKVIILLTDGVNNTGLIDPLTALEIAKAFKIRVYTIGVGTIGKAPFPM 242 Query: 413 --------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + + + +F A S EL I+ + +++ + + I Sbjct: 243 TMPDGSIQMQMQDVQLDEPLMKKISVETGGKYFRATSNKELENIYGEI--DKLEKTKVEI 300 Query: 457 T 457 T Sbjct: 301 T 301 >gi|116249091|ref|YP_764932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253741|emb|CAK12134.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 465 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 54/383 (14%), Positives = 125/383 (32%), Gaps = 72/383 (18%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L + G+ I+ AL + M+ G D +R ++ TA+I A Sbjct: 23 RVFKAFRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAA- 81 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 K T ++E + F ++ ++ T +++ Sbjct: 82 -------------VKEIDTDDTDALKEKVADWFHAQVENSY-----------TLGDIDID 117 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 S +++ ++ + + ++ I + + + + + SY +++ VID Sbjct: 118 TSNHKITATASGTVP----TTLMKIANIDTVDVSVASAVKGPATSY-----LNVYIVIDT 168 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML T S+ +G + + + K+ Y Y Sbjct: 169 SPSML-----------------LAATTAGQSAMYSGIGCQFACHTGDAHTVGKTKYANNY 211 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + S+ + + + + DA+ V+ I D+ + +++G D Sbjct: 212 --------------EYSAAKTIKLRADVAGDAVRDVLDMIDDSDSNHQRIKVGLYSLGDT 257 Query: 306 VIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + P+ S + +T + ++ T + + + Sbjct: 258 LTEVLTPTLSTDTARNRLADASYGLTSATSKAATYFDVSLATLKQKVGTGG-----DGTS 312 Query: 364 NLEAKKYIVLLTDGENTQDNEEG 386 + K ++LLTDG +Q Sbjct: 313 SGSPLKLVLLLTDGVQSQREWVT 335 >gi|126649837|ref|ZP_01722073.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] gi|126593556|gb|EAZ87501.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] Length = 865 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 59/168 (35%), Gaps = 31/168 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVHKLIRTIVK 325 K L ++A A + ++ D +G F+DR +I + + TI+ Sbjct: 422 SKLELAKEAAARSVEMLRDEDT------LGFIAFDDRPWEIIETGPL-NNKEEAVDTILS 474 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T I ++ AY+ + H I+LLTDG++ N E Sbjct: 475 -----VTPGGGTEIYGSLAKAYENLADIKLQRKH-----------IILLTDGQSQPGNYE 518 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + K GI + T+A + LS S F+ Sbjct: 519 DLI--EQGKDNGITLSTVAIGQDAD--ANLLEALSEMGS-GRFYNVID 561 >gi|51597679|ref|YP_071870.1| membrane protein. [Yersinia pseudotuberculosis IP 32953] gi|51590961|emb|CAH22619.1| Putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 518 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 76/502 (15%), Positives = 137/502 (27%), Gaps = 99/502 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 IK+ G + L+PV +G+ + + + L A + A + S Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALST------- 69 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA-YQVV 132 + + + +++ E R Y Sbjct: 70 --------ENNYRNDRASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNAS 121 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L Y L L L+ +L+ +W + A S + + + +V DFS SM Sbjct: 122 LQMNYQLAL--LNSYLKQTPSPTWDVNENGAARKYLSSIAE--PIDVVFVTDFSGSMDLP 177 Query: 193 QRDSEGQPLNCFGQPADRT-VKSYSSQNGKVGIRDEKLSPYMVSCNK--------SLYYM 243 D E VK + GI P+ + S Y Sbjct: 178 FGDIERNNRITKLDELKAIFVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCH 237 Query: 244 LYPGPLDPSLSEEHF-------------VDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 P + + +D+ S + L D +++ I+K Sbjct: 238 FPYSPKKIDRNGHYLQRYTASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHR 297 Query: 291 VND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 N+ R A F D+ S + I I+ + I+ M Sbjct: 298 DNEIPTKTRDQAKNFLDKAYKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDIL 357 Query: 348 DTIISSNEDEVHRM-----------------------------------KNNLEAKKYIV 372 D E + + K ++ Sbjct: 358 DPFFCLKETNAKSLNFDPNSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMI 417 Query: 373 LLTDGENT-------QDNEEGI----------AICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +L+DG++ D + GI +C K K GI+++ I Sbjct: 418 ILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNI 475 Query: 416 RYFLSNCASPNSFFEANSTHEL 437 + +C +F+ A + HEL Sbjct: 476 IDWEKDCVGTGNFYLAKNAHEL 497 >gi|320160918|ref|YP_004174142.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] gi|319994771|dbj|BAJ63542.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] Length = 486 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 64/179 (35%), Gaps = 28/179 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +V+ + ++++ +K D + F+DR + ++ I Sbjct: 133 RMDMVKSSALNLLKQFRKQD------LISVVAFSDRAEVVIPPT----RVPDLAKDDHRI 182 Query: 330 D-ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I +Q + + S + + ++LLTDG D+E I Sbjct: 183 SMLQVGGGTEIYQGLQLGIEQLRSIDP----------RFMRQLILLTDGHTYGDDEACIE 232 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 + +A GI+I T+ E L A S + S +LNK F ++ Sbjct: 233 LAEEAAQDGIQINTMGI-----GHEWNDELLDKIATISGANSIFVTSPKDLNKFFEQKL 286 >gi|197334600|ref|YP_002156233.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197316090|gb|ACH65537.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 356 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 45/190 (23%), Positives = 77/190 (40%), Gaps = 14/190 (7%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S I + ++ LA +++ +K D R+G F D F+ + Sbjct: 123 SGTVKKISRLEATKEVLADFVKT-RKGD------RLGLILFGDAAFVQTPFTADQSVWLE 175 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + +T ST + DA+ A S+ED+ +N +K ++LTDG +T Sbjct: 176 LLNQTDVA--MAGQSTHLGDAIGLAIKVFEQSSEDKASAEENAKPREKVAIVLTDGNDTG 233 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHEL 437 E I A ++G+RI IA + +Q ++ A S F+A + EL Sbjct: 234 SYVEPIDAAKVAAAKGVRIHMIAMGDPRTVGEQALDMNIINRVAKESGGKAFQAINRDEL 293 Query: 438 NKIFRDRIGN 447 + + D IG Sbjct: 294 EQAY-DEIGE 302 >gi|257387423|ref|YP_003177196.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257169730|gb|ACV47489.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 788 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 48/277 (17%), Positives = 81/277 (29%), Gaps = 38/277 (13%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV---GIRDE 227 Y S+ +D DY A + + G V G Sbjct: 296 YDVTTAASVPSALD------DYAAVVVQDTSASDVGNATALQEFVVNGGGLVVAGGDNAY 349 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + Y S S+ + + VD SS + A+A + Sbjct: 350 ENGGYETSPIGSMLPVQVGNATGGESNIVVLVDVSSSAESGLSI---QKAVA-----LDV 401 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 +D + D R+G FN R G+T I +Q A Sbjct: 402 LDQLGDENRVGVVAFNHNAYRVSELR--TLGQNRAETAEKIRQLESGGATDIAVGLQGA- 458 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 D ++ E I+LL+DG++ A+ N+ +G R++++ Sbjct: 459 DELLGDREGT-------------IILLSDGQDRLGP--PAAVANQLGREGTRVVSVGVGK 503 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + S S+F A+ T L +F Sbjct: 504 RVGVPTMRQIAGE---SGGSYFAADETERLRLLFGGS 537 >gi|254513911|ref|ZP_05125972.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] gi|219676154|gb|EED32519.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] Length = 330 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 56/152 (36%), Gaps = 17/152 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + F+ + + + ++ TA+ DA+ A ++ ++E Sbjct: 140 RLGLIVFGNAAYLQAPFTDDIATWLTLLEESEVAMAGPS--TALGDAIGLAI-SLFQASE 196 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +V+LTDG +T + + A + + I T+A T E+A Sbjct: 197 TR----------NRVLVVLTDGNDTGSRVPPVDAASIAAANDVTIYTVAVGDPSTIGEEA 246 Query: 416 RYF--LSNCASP--NSFFEANSTHELNKIFRD 443 L AS + F A T L + Sbjct: 247 LDLETLDAIASSTRGASFLALDTLALKDAYEQ 278 >gi|114571146|ref|YP_757826.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] gi|114341608|gb|ABI66888.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] Length = 500 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 57/172 (33%), Gaps = 39/172 (22%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R ++ D G+T I + + + K +V+LTDGEN Sbjct: 332 QRNVIDDAIEDMIASGTTNIPIGISWGVRVLSPGMPFTEGVSYDEEGTIKAMVVLTDGEN 391 Query: 380 -----------------------------------TQDNEEGIAICNKAKSQGIRIMTIA 404 N+ A C AKS GIR+ TI Sbjct: 392 YLDGRNNPNYSHYSGYGYMRDGRLGIQTSSDSTIRNALNDRTEAACEYAKSLGIRVYTIT 451 Query: 405 FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 F VN + R + +CA+ P +F++ S L F G+ R+ R Sbjct: 452 FQVNSSS---TRDMMRDCATHPTLYFDSPSDDALRSAFEMIAGDLTNLRLSR 500 >gi|171913221|ref|ZP_02928691.1| hypothetical protein VspiD_18615 [Verrucomicrobium spinosum DSM 4136] Length = 868 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 63/199 (31%), Gaps = 29/199 (14%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P + S V S +K + + A + + + D+ +G F+ Sbjct: 404 APDEEEKQSSALALVIDRSGSMSGEKLEMAKSAAIATAEVLTRNDS------IGVYAFDS 457 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + V G T ++ A A + + + H Sbjct: 458 EAHVVVPMTRLTS---SSAVAGQIAGLTSGGGTNLHPAFTEARNALQRTKAKIKH----- 509 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++LTDG+ + E A+ ++ +++G+ I T+A L AS Sbjct: 510 ------MIILTDGQTSGQGYE--ALASQCRAEGVTISTVAI-----GDGAHVGLLQAIAS 556 Query: 425 --PNSFFEANSTHELNKIF 441 + + +IF Sbjct: 557 LGGGKSYTTLDAANIVRIF 575 >gi|219683166|ref|YP_002469549.1| FctX [Bifidobacterium animalis subsp. lactis AD011] gi|219620816|gb|ACL28973.1| FctX [Bifidobacterium animalis subsp. lactis AD011] Length = 879 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 45/345 (13%), Positives = 103/345 (29%), Gaps = 66/345 (19%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK- 213 ++ + ++ + + + I V+D S SM + + T Sbjct: 87 TYTLSMDVTGKSDESTEQQVVPLDIALVLDVSGSMNELSGKLVYNEVELLSMNPISTYYV 146 Query: 214 ----SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD-SSSLRHVI 268 SY + + Y + Y PS S V S + Sbjct: 147 EKDGSYQAVRCSAISWGRCTTWQDQDSAGQKYTVTYNWIGGPSASVSPDVQFYKSKQSEE 206 Query: 269 KKKHLVRDALASVIRSI----KKIDNVNDTVRMGATFFN-------------------DR 305 + ++DA+ + + ++I++ V++ + + Sbjct: 207 TRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNY 266 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + S +W L + + G+T + +Q A + S Sbjct: 267 SQTVHSLAWTPEDLQKE--QAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA-------- 316 Query: 366 EAKKYIVLLTDGENTQDN-------EEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKAR 416 +K V +DG T + I + K+ ++++I + + + A Sbjct: 317 --QKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTDNAN 374 Query: 417 YFLSNCAS---------------PNSFFEANSTH-ELNKIFRDRI 445 F++ +S +++ A S +L IF++ I Sbjct: 375 KFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEII 419 >gi|52425826|ref|YP_088963.1| hypothetical protein MS1771 [Mannheimia succiniciproducens MBEL55E] gi|52307878|gb|AAU38378.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 541 Score = 57.3 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 78/525 (14%), Positives = 159/525 (30%), Gaps = 92/525 (17%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + K+ ++ G + +I +L ++G+ + VD + Q + A + Sbjct: 6 YFIKLKQFFQNEDGAYAVIMGILSFFLIGLVALTVDGSGMLLDKARFSQGIEQAGLA--- 62 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNF--TDREVRDIVRDTAVEMNP 124 + ++ + ++ K++ + +E L + + + V +VR Sbjct: 63 LMAENNDFRTTNQKHADVLRQTVTKEELEGFSDTFSAQKYKRNQELVSGLVRHYYYPSTY 122 Query: 125 RKSAYQVVLSSRYDLLLNPLSL----------------FLRSMGIKSWLIQTKAEAETVS 168 K ++ S +YD N L F R + + AET + Sbjct: 123 FKDNLKI--SDKYDYQCNNLQGPNGEQLKSIACEISGKFERPSWLYLGKNNGLSFAETTT 180 Query: 169 RSYHKE---------HGVSIQWVIDFSRSML-------------DYQRDSEGQPLNCFGQ 206 + +K + + V D S SM D R+ + Sbjct: 181 INANKIYIQKNLDEIIPIDLMLVADLSGSMNSSVSGTKYGTAKIDILREVVSAIAKELLE 240 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL------------- 253 + SQ ++G + Y P Sbjct: 241 QNNTEEGKVISQYNRIGFTSFAFGAQQQNNTAQCYLPYEIKPSITIRNNNYYGGYYNTTM 300 Query: 254 ---SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN------- 303 +V S+ R+ DA ++I+ I++ + +F+ Sbjct: 301 QYSELLSYVGSNQQRYSYATLAQYFDAFVDYDKTIESINSFDGKDLSSLMYFSKNSWCLG 360 Query: 304 -DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE-VHRM 361 + ++ W +V F +G+T + + + ++++N DE Sbjct: 361 SANTRINSTYIWAGKNESADLVSRF-NRVPALGATLSSSGLLIGANLLMNTNPDENAQPS 419 Query: 362 KNNLEAKKYIVLLTDGE---NTQDNEEGIAI-------CNKAKSQ----GIRIM-----T 402 K ++ I++L+DGE N + I C K KS+ + Sbjct: 420 KLGANTQRIILVLSDGEDQINNASSSLNITSTLINQGMCEKIKSKLNSLQDKTYLEQPTR 479 Query: 403 IAFSVNKTQQEKARYFL-SNCASPNSFFEANSTHELNKIFRDRIG 446 I F + C ++ AN+ EL + FR IG Sbjct: 480 IGFVAFGYGPSGTQKAAWEKCVG-KYYYVANNKEELLESFRKIIG 523 >gi|149021014|gb|EDL78621.1| rCG55860, isoform CRA_b [Rattus norvegicus] gi|149021015|gb|EDL78622.1| rCG55860, isoform CRA_b [Rattus norvegicus] Length = 919 Score = 57.3 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 71/414 (17%), Positives = 134/414 (32%), Gaps = 49/414 (11%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + +I + + + + E+ ++ E+ NFT + D+ + + Sbjct: 52 AEFSVKSTIISRYAFTTVSCRMLNRASEDQDAEFQMQIPESAFITNFTMEKGTDMFKASL 111 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 V + K+A+ + +LL L + + ++ + + E + + Sbjct: 112 VIPSKDKAAFFLSYE---ELLQRRLGKYEHIISVRPQQLVGRLTVEVNILERSGITSLEV 168 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS--------YSSQNGKVGIRDEKLSP 231 + SR + + + P + T ++ + GI + + Sbjct: 169 -LPLHNSRKKGSGKAEGDEGPPPSTIINHNETFAKVVFKPTVVQQAKIAQNGILGDFIIQ 227 Query: 232 YMVS----------CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 Y V + + P L P FV S V K R+AL ++ Sbjct: 228 YDVKREQNIGDIQVLDGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAKLQQTREALVTI 287 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGS 336 + ++ D R F++R+ W H L T K + + G Sbjct: 288 LNDLRPQD------RFNIIGFSNRIKM-----WKDHLLPVTPDNIRNGKIYMYHLSPTGG 336 Query: 337 TAINDAMQTAYDTIIS--SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 T IN A+QTA + + + D R + I+ LTDG+ T + I + K Sbjct: 337 TDINGALQTAIKLLNNYVAQNDIEDRSVS------LIIFLTDGKPTFGETNTLRILSNTK 390 Query: 395 SQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I I T+ + + + L NC E F D I Sbjct: 391 EATGGQICIFTVGIGNDVDFRLLEKLSLENCGLTRRVHEEEKAGAQLIGFYDEI 444 >gi|60477748|gb|AAH90753.1| Matn4 protein [Danio rerio] Length = 261 Score = 57.3 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 54/165 (32%), Gaps = 20/165 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + D R+G ++ V ++ S ++ I K E Sbjct: 46 ELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSCVRTEFPLS--MYHSKDEIKKAVMNVE 100 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + +V TDG + ++ Sbjct: 101 YMEKGTMTGLALKHMVENSFSEAEGARPAEKN--IPRVGLVF-TDG---RSQDDIQEWAK 154 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 KAK GI + + + L AS FF + Sbjct: 155 KAKEAGITMYAVGVGKAVEDE------LREIASDPVEKHFFYSAD 193 >gi|238762416|ref|ZP_04623387.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] gi|238699401|gb|EEP92147.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] Length = 459 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 63/428 (14%), Positives = 142/428 (33%), Gaps = 52/428 (12%) Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + L ++E+ + ++ R + + E ++ Sbjct: 49 LQKKAKLSDAIEQATLALTVENDGIPNAAQQTKNRELVLSYANAYLPSEGFS-DPIINID 107 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV-SRSYHKEHGVSIQ 180 N Y ++ Y + S S+ IQT E + +++ + Sbjct: 108 DNTNYLGYNAAVTMTYPVEFLGRSPLTNSIS----NIQTTDNGEAIKNKTIEVSEPTDVV 163 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADR-------TVKSYSSQNGKVGIRDEKLSPYM 233 +V D+S SML D T+K+ S+ N I + Sbjct: 164 FVADYSGSMLLSFSDDVSIKNGERINALRSAFRILHNTIKNNSNVNTIGFIPFGSGTKRK 223 Query: 234 VSCNKSL--YYMLYPGPLDPSLSEEHFV-DSSSLRHVIKKKHLV------RDALASVIRS 284 VS N Y L P + ++ ++ + ++ ++ + + S+ + Sbjct: 224 VSENGENKEYCHLPFSPKIYKPNGDYLSENAEATKNAWTFLDVIGDHIDYKKTIMSITEN 283 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ ID ++ + + + S+ + + I+ +G T I+ + Sbjct: 284 VQPIDIPMRDIK------HKEICLSGTNSYSLEREQFDYSIENIIEMAPLGGTLISSGIL 337 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-------------ICN 391 +A + I D H KK +++L+DG ++ ++ +C Sbjct: 338 SA-NNIFKETADNGH--------KKLMIILSDGMDSYNSTMLPNKGFFISKTLIDEGMCE 388 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFL--SNCASPNSFFEANSTHELNKIFRDRIGNEI 449 GI++ IA + + A ++ C ++++EA++ HEL + + Sbjct: 389 MIIKNGIQMAFIAIAYSPENNVNAPEYINWKQCVGEDNYYEAHNAHELELELQQAVSVSA 448 Query: 450 FERVIRIT 457 V R T Sbjct: 449 TSEVGRNT 456 >gi|310817054|ref|YP_003965018.1| hypothetical protein EIO_2641 [Ketogulonicigenium vulgare Y25] gi|308755789|gb|ADO43718.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 733 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 63/193 (32%), Gaps = 18/193 (9%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S P S S++ + + + A + + +++ ++ + +G Sbjct: 253 SPLSARVPREAPEVTMVFVLDRSGSMQQAVGDSNRLGVAKNATLSALELLNPQSQ---IG 309 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ + S + + +D G TAI + AY + S Sbjct: 310 VIVFDTEETTVVPLS--TLDIPAAQIALDRVDT--GGGTAIYPGLVAAYRELQRS----- 360 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K+I+++TDG + + EGI A + + I + E Sbjct: 361 ------ESPAKHIIVMTDGLSQPGDWEGILRQITADGTTVSAVAIGVGADTGAAENIARL 414 Query: 419 LSNCASPNSFFEA 431 + A + FEA Sbjct: 415 GNGVAHISRDFEA 427 >gi|254440702|ref|ZP_05054195.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] gi|198250780|gb|EDY75095.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] Length = 590 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 69/402 (17%), Positives = 136/402 (33%), Gaps = 82/402 (20%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++ K G I + ++M ++L GGM VD++R+ L+ + A + A+ Sbjct: 19 FQRFRKDEDGALIIFSLMMMVMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA----- 73 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 L++V + A + + + ++ + + Y+ Sbjct: 74 DLDQVMAPADVVRDYMDKAGMLHFLQ----------------------GDPIVDQGINYR 111 Query: 131 VVLSSRYDLLLNPLSLFLRSM-GIKSWLIQTKAEAETVSRSYHKEHGVS---IQWVIDFS 186 +V ++ P+ LF + + S + TVS S E VS I V+D S Sbjct: 112 IVTANASA----PMPLFFYDLPKVFSSPFTPGMSSLTVSGSSTAEERVSDVEISLVLDVS 167 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM R + +P + N G+ + PY ++ P Sbjct: 168 SSMNSNNRMTNLRPAAREFVTTVLANNT----NAPQGLITISMIPYS--------AVVNP 215 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 G + S + AL D+V+ G ++ + Sbjct: 216 GTDIAPHLNINRTHEYSTCPMFDDTEFTTTALNLG----ASYDHVSHFSYGG----SNDM 267 Query: 307 ISDPSFSWGVHKLIRTI---------VKTFAIDENEMGSTAINDAMQTAY---------- 347 +P+++W + I + T + + G+TAI+ ++ Sbjct: 268 PINPNYTWCFAGDLNAIKPHTTNEADLHTAINNLHAYGNTAIDMGVKWGVALLDSSTQSL 327 Query: 348 -DTIISSNEDEVHRMKNNLE-------AKKYIVLLTDGENTQ 381 ++ ++ V + N K +VL+TDG+NTQ Sbjct: 328 ISSLAGASGTGVPAIANGRPELHTQADVLKVLVLMTDGQNTQ 369 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 55/446 (12%), Positives = 121/446 (27%), Gaps = 55/446 (12%) Query: 37 GGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIR 96 +++DV + + A + L + P + Sbjct: 160 ISLVLDVSSSMNSNNRMTNLRPAAREFVTTVLANNTNAP-QGLITISMIPYSAVVNP-GT 217 Query: 97 NFENNLKKNFT-DREVRDIVRDTAVEMNPR--KSAYQVVLSSRY----DLLLNPLSL--F 147 + +L N T + + DT ++Y V Y D+ +NP F Sbjct: 218 DIAPHLNINRTHEYSTCPMFDDTEFTTTALNLGASYDHVSHFSYGGSNDMPINPNYTWCF 277 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + +A+ T + H +I + + ++LD S L Sbjct: 278 AGDLNAIKPHTTNEADLHTAINNLHAYGNTAIDMGVKWGVALLDSSTQSLISSLAGASGT 337 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + + L N + ++ P S D + Sbjct: 338 GVPAIANGRPELHTQADVLKVLVLMTDGQNTQQWDLVEPYKSGMSPVWFDLDDVNQPLWD 397 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTV--RMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + ++ + + G + D G + + + Sbjct: 398 V---DFNKTSVQYQGEATNSRWDDWFYWNGYSGTLRYRDY-------PNGFNSRLAYVNA 447 Query: 326 TFAIDENEMGST----AINDAMQTAYDTIISSNEDEVHRMK------------------N 363 + T ++ ++ + + + Sbjct: 448 SPVDASGPGEGTRYVDNGDELYHASWQQLFADRSYFLINNHYFLDAYYAGAWSWNEYWGT 507 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + IV ++ N IC A++QGI I T+AF + L +CA Sbjct: 508 DNSIDHLIV-----NGSEANTRLSNICAAARAQGIVIYTVAFEA----PSGGQTALQDCA 558 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEI 449 S +S + +++ F I ++I Sbjct: 559 SSSSHYFDVDGTDISGAF-SAIASDI 583 >gi|254512360|ref|ZP_05124427.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] gi|221536071|gb|EEE39059.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] Length = 668 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 65/434 (14%), Positives = 124/434 (28%), Gaps = 71/434 (16%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + ++ S G I+T L+ ++ V G VD++R+ L+ A A++ A+ Sbjct: 22 HLRRFAVSTDGSMTILTLFLIMIVFTVAGFAVDLMRYDRERVRLQYALDRAVLAAADLDQ 81 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM------- 122 + V S I + + + E L + +D + A Sbjct: 82 ELCPRVVVNDYISKEGFDPGIIDEIKVDPETCLNTDSSDSDGDGTDSSDASGSDSDPSDT 141 Query: 123 -------------------NPRKSAYQVVLSSR----YDLLLNPLSLFLRSMGIKSWLIQ 159 + V L + LN + F++ G+ + Sbjct: 142 ASSGTESGSDGTSSGGDTAGTSTTTNAVELQGKRKVEASAQLNIETHFMKWSGVDTINST 201 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + AE + V I V+D S SM + + + F + S Sbjct: 202 AVSAAEESIGN------VEISLVLDVSGSMEGAKLTNLQKAAKDFVKEMLEKSADDSLSI 255 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 + ++ P + +K F I + Sbjct: 256 SIIPYSEQVGVPDYM-MDKINTTGGNKVANCIEFQPADFTAIPFTAFSIGAPSEATNPPP 314 Query: 280 SVIRSIKKIDNVNDTVRMG---------ATFFNDRVISDPSFSWGVHKLIRTIV------ 324 SV +S+ + ND R G ++ R +V Sbjct: 315 SVPQSLHFTNRSNDFRRGGNRDHRSTNDVVSRFSPWDANFPCREDTPTDRREMVVIQNDL 374 Query: 325 ---KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY----------- 370 + GST+IN ++ + S + + + N+ K Sbjct: 375 DTLNKQINNLVAAGSTSINIGLKWGLALLDESIQPLIKTVANDTNVPKIFEDRPRPTNTT 434 Query: 371 -----IVLLTDGEN 379 +VL+TDG+N Sbjct: 435 DTLKVVVLMTDGKN 448 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 45/80 (56%), Gaps = 9/80 (11%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELN 438 ++ NE+ +++C KA+ + + I +IAF + + L +CA P ++EA T ++ Sbjct: 597 SKKNEQVVSLCGKAEEKEVLIFSIAFEAPSS----VKQMLKDCAVKPARYYEATGT-QIE 651 Query: 439 KIFRDRIGNEIFERVIRITK 458 ++F D I I + +R+T+ Sbjct: 652 RVF-DSISTSI--QNLRLTQ 668 >gi|289547502|ref|NP_001166098.1| integrin, alpha 11a [Danio rerio] Length = 1190 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 55/164 (33%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + ++V+S+ + + I + T + A Sbjct: 203 IQVGVLQYGEKVVSEFQL-NDFRSVEDVVKAARKIGQRGGEETNTALGINVARSEAFKQG 261 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 AKK ++++TDGE + D+ + + +++ GI IA + Sbjct: 262 --------GRRGAKKVMIVITDGE-SHDSADLQQVIEESEKDGITRYAIAVLGYYNRRGI 312 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G IF Sbjct: 313 NPEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGERIFS 355 >gi|55793317|gb|AAV65699.1| integrin alpha 11 subunit [Danio rerio] Length = 1168 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 55/164 (33%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + ++V+S+ + + I + T + A Sbjct: 181 IQVGVLQYGEKVVSEFQL-NDFRSVEDVVKAARKIGQRGGEETNTALGINVARSEAFKQG 239 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 AKK ++++TDGE + D+ + + +++ GI IA + Sbjct: 240 --------GRRGAKKVMIVITDGE-SHDSADLQQVIEESEKDGITRYAIAVLGYYNRRGI 290 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G IF Sbjct: 291 NPEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGERIFS 333 >gi|223939936|ref|ZP_03631804.1| von Willebrand factor type A [bacterium Ellin514] gi|223891427|gb|EEF57920.1| von Willebrand factor type A [bacterium Ellin514] Length = 346 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/132 (21%), Positives = 50/132 (37%), Gaps = 14/132 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++V+ + R+G F+ + L+ + + E+G T Sbjct: 120 ASAVLEDFVNKRPND---RIGLIVFSGVPYLASPLTLNHDWLVENLHRLHIGIIRELG-T 175 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 AI DA A + S + + + I+LLTDG+N Q E + A + G Sbjct: 176 AIGDATAAATKRLQMSKDSK----------SRIIILLTDGDNNQGEIEPVPAAQLAAAIG 225 Query: 398 IRIMTIAFSVNK 409 +I TI + + Sbjct: 226 AKIYTIGLGIEE 237 >gi|163761157|ref|ZP_02168234.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] gi|162281708|gb|EDQ32002.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] Length = 444 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 60/467 (12%), Positives = 129/467 (27%), Gaps = 84/467 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++S G+F ++ L+M ++ V G+ VD + + ++ A+ +I+ Sbjct: 8 QYLRSRDGNFGLLAGLVMVALVWVAGLAVDFSNALRVKTTAQDIVDATVLRATRDIIEEG 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + ++ ++ + ++ +E++ V Sbjct: 68 KTLAEAELSARKYFDAELA----------------------FSSGVGLEVSTFTLTQGVD 105 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + L+++G + + A A V I D + SM Sbjct: 106 GIVKLGVSGKTSTSLLKAVGREEIPVSVDAAAHVGGG------SVEIAIAFDVTNSMGFG 159 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 E + + + +G + + + V+ +L PG Sbjct: 160 TTWGEAT------SVIASALNALKANSGSMALTFIPFTD-RVNVGMGRANLLNPGDQTAV 212 Query: 253 LSEEHFVDSSSLRHVIKKK----HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 K K + + D+ K DN S Sbjct: 213 KKGGWGGCVDVRATKKKNKGETEYFMPDSAPEKGDRFTKFDNGTPAAHKSGYKLACNPQS 272 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND-AMQTAYDTII-------------SSN 354 + V + + K G T D + + + Sbjct: 273 IIGPTSNVSDVTSQLGKLT------KGGTGRFDLGFAWLWYALSPNWKGFWSGGAPADNG 326 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGE----------------NTQDNE---EGIAICNKAKS 395 + + +K VL TDG NT + +AIC + Sbjct: 327 VNLADYPTASTNTRKIAVLATDGLTNAYVYEYGKTNLAGWNTGSKDHFENVVAICKSMAA 386 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKI 440 Q I + + + +KA + CAS +++ S L Sbjct: 387 QKIEVHVM----HVNGNDKAEPYFRECASATGGGYYKVASKQTLVDA 429 >gi|289177626|gb|ADC84872.1| Collagen adhesion protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 905 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 45/345 (13%), Positives = 103/345 (29%), Gaps = 66/345 (19%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK- 213 ++ + ++ + + + I V+D S SM + + T Sbjct: 113 TYTLSMDVTGKSDESTEQQVVPLDIALVLDVSGSMNELSGKLVYNEVELLSMNPISTYYV 172 Query: 214 ----SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD-SSSLRHVI 268 SY + + Y + Y PS S V S + Sbjct: 173 EKDGSYQAVRCSAISWGRCTTWQDQDSAGQKYTVTYNWIGGPSASVSPDVQFYKSKQSEE 232 Query: 269 KKKHLVRDALASVIRSI----KKIDNVNDTVRMGATFFN-------------------DR 305 + ++DA+ + + ++I++ V++ + + Sbjct: 233 TRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNY 292 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + S +W L + + G+T + +Q A + S Sbjct: 293 SQTVHSLAWTPEDLQKE--QAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA-------- 342 Query: 366 EAKKYIVLLTDGENTQDN-------EEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKAR 416 +K V +DG T + I + K+ ++++I + + + A Sbjct: 343 --QKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTDNAN 400 Query: 417 YFLSNCAS---------------PNSFFEANSTH-ELNKIFRDRI 445 F++ +S +++ A S +L IF++ I Sbjct: 401 KFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEII 445 >gi|329928982|ref|ZP_08282792.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937234|gb|EGG33661.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 899 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 51/353 (14%), Positives = 101/353 (28%), Gaps = 40/353 (11%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 R L KN R + E + +A+ + P L + S Sbjct: 251 RYAVKGLAKNPGLHRYRAEIFMDGDEASANNAAFDFT-----RVEGPPNVLIVEGTPGTS 305 Query: 156 WLIQTKAEAETVSRSYHKEHGVSI----QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 I E+ + +S+ V D D G+ + Q Sbjct: 306 GNITAALESGMIGTEVIPPELLSLEAAKYAVYDSIIFNNVSGSDVGGKQMELIEQAVRSF 365 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 + G+ + + L PSL +D S K Sbjct: 366 GIGFMMAGGEDSFGMGGYFKTPIEKALPVSMELEGKREIPSLGLILVIDRSG-SMDGNKI 424 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVHKLIRTIVKTFA 328 L +++ + ++ D V G F+D+ V+ + V + Sbjct: 425 ELAKESAMRTVELMRAKDTV------GVVAFDDQPWWVVPPQKLG------DKEEVLSSI 472 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I A+ +A + ++ + H I+L+TDG+ + N Sbjct: 473 QSIPSAGGTNIYPAVSSALEEMLKIDAQRRH-----------IILMTDGQ-SAMNSGYQD 520 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + I + ++A ++ + A+ ++ L +F Sbjct: 521 LTDTMVENKITMSSVAVGMDA-DTNLLQSLAD--AAKGRYYFVEDETTLPAVF 570 >gi|299534564|ref|ZP_07047896.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] gi|298729937|gb|EFI70480.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] Length = 864 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 59/168 (35%), Gaps = 31/168 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVHKLIRTIVK 325 K L ++A A + ++ D +G F+DR +I S + Sbjct: 422 SKLELAKEAAARSVEMLRDEDT------LGFIAFDDRPWEIIETGPLS------SKEEAV 469 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T I ++ AY+ + H I+LLTDG++ N E Sbjct: 470 DTILSVTPGGGTEIYSSLAKAYENLADLKLQRKH-----------IILLTDGQSQAGNYE 518 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + K GI + T+A + LS+ S F++ Sbjct: 519 DLIT--EGKEDGITLSTVAIGQDAD--ANLLEALSDMGS-GRFYDVID 561 >gi|149042955|gb|EDL96529.1| matrilin 4 (predicted), isoform CRA_a [Rattus norvegicus] Length = 637 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 57/164 (34%), Gaps = 16/164 (9%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + +D + R+G ++ +V S G + + T Sbjct: 72 LVGLLHSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSNREDMERAIRAVVPLAQGTMTG 129 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + + +V++TDG + + + +A+++GI I Sbjct: 130 LAIQYAMNVAFSEAEGAR---PSEERVPRVLVIVTDG---RPQDRVAEVAAQARARGIEI 183 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 + Q L ASP + + +L + F Sbjct: 184 YAVGV------QRADVGSLRAMASPPLDQHVFLVESFDLIQEFG 221 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 63/180 (35%), Gaps = 23/180 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 418 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 472 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S +E + R ++ + +V TDG + ++ Sbjct: 473 YMERGTMTGLALRHMVE--HSFSEVQGARPRDLNVPRVGLVF-TDG---RSQDDISVWAA 526 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA------NSTHELNKIFRDRI 445 +AK +GI + + ++ L AS S N+ L + + I Sbjct: 527 RAKEEGIVMYAVGVGKAVEEE------LREIASEPSELHVSYSPDFNTMTHLLENLKGSI 580 >gi|157818269|ref|NP_001100009.1| matrilin-4 [Rattus norvegicus] gi|149042956|gb|EDL96530.1| matrilin 4 (predicted), isoform CRA_b [Rattus norvegicus] Length = 624 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 57/164 (34%), Gaps = 16/164 (9%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + +D + R+G ++ +V S G + + T Sbjct: 59 LVGLLHSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSNREDMERAIRAVVPLAQGTMTG 116 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + + +V++TDG + + + +A+++GI I Sbjct: 117 LAIQYAMNVAFSEAEGAR---PSEERVPRVLVIVTDG---RPQDRVAEVAAQARARGIEI 170 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 + Q L ASP + + +L + F Sbjct: 171 YAVGV------QRADVGSLRAMASPPLDQHVFLVESFDLIQEFG 208 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 63/180 (35%), Gaps = 23/180 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 405 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 459 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S +E + R ++ + +V TDG + ++ Sbjct: 460 YMERGTMTGLALRHMVE--HSFSEVQGARPRDLNVPRVGLVF-TDG---RSQDDISVWAA 513 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA------NSTHELNKIFRDRI 445 +AK +GI + + ++ L AS S N+ L + + I Sbjct: 514 RAKEEGIVMYAVGVGKAVEEE------LREIASEPSELHVSYSPDFNTMTHLLENLKGSI 567 >gi|149050644|gb|EDM02817.1| similar to vitrin (predicted) [Rattus norvegicus] Length = 427 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 50/394 (12%), Positives = 121/394 (30%), Gaps = 47/394 (11%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + QA V ++Q + +++ Q ++ + + + N Sbjct: 67 DVVQALDIGPGGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQRGGLSNVGRAIS 126 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + R A V + + G + I+ + Sbjct: 127 FVTKNFFSKANGNRGGAPNVAV--------------VLVDGWPTDKIEEVSR-------V 165 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E G+++ +V + + Q E + + + + L Sbjct: 166 ARESGINVFFVTVEGAAEREKQHVVEPNFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVK 225 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ + S +D SS + +++ +A++ + + D Sbjct: 226 RVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTSNFRTVLQF-VANLSKEFEISDTD 280 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+GA + F + + ++ G T+ A+Q A + + Sbjct: 281 ---TRIGAVQYT--YEQRLEFGFDKYNSKADVLSAIRRVGYWSGGTSTGAAIQYALEQLF 335 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ +K ++L+TDG + ++ A +G+ I + Sbjct: 336 KKSKPNK---------RKVMILITDG---RSYDDVRIPAMAAYQKGVITYAIGIA--WAA 381 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q++ ++ A +SFF + L K F RI Sbjct: 382 QDELEVIATHPARDHSFF-VDEFDNLYK-FVPRI 413 >gi|257063307|ref|YP_003142979.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] gi|256790960|gb|ACV21630.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] Length = 1514 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 84/220 (38%), Gaps = 32/220 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDT--VRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K ++++++A ++ + + R F++ + +++ ++ + + Sbjct: 747 TKSEVLQNSIARFAATLGYLSPGSQIAMTRFSVDTFSNAECALLNWTNDTGEVTAAMNQE 806 Query: 327 FAIDENEMGSTAIN-DAMQ----------TAYDTIISSNEDEVHRMKNNLEA-------K 368 + E G D ++ Y I S E+ + Sbjct: 807 YGNPLAEGGRANQTLDGLRVYNYGITGSTHTYRGIESYIENMTNGASGGYVPNAPQGNNS 866 Query: 369 KYIVLLTDGENTQDN-EEGIAICNKAKSQGIRIMTI---AFSVNKTQQEKARYFLSNCAS 424 +Y+++ TDG++ N ++ + + K+ G I+T+ + + E + FL AS Sbjct: 867 RYLIIFTDGKDNSGNLQKSMDDTDALKNNGYTIITVLMQSAGMTSEDVEHSTTFLKRLAS 926 Query: 425 PN-----SFFEA--NSTHELNKIFRDRIGNEIFERVIRIT 457 N F+ A N L K+F+D I +EI + + T Sbjct: 927 SNASGEKYFYTAMYNDPEGLVKVFQD-IAHEIAKPLQGYT 965 >gi|119505575|ref|ZP_01627647.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] gi|119458684|gb|EAW39787.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] Length = 316 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 39/171 (22%), Positives = 66/171 (38%), Gaps = 20/171 (11%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ +V+R + N R+G F F+ H+ ++ I Sbjct: 110 TAVKTVLRQLA---NERAGDRLGLIVFGSSAYLQSPFTED-HRTWLLLLNETRIRM-AGP 164 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 STA+ DA+ A + K+ + ++LLTDG +T + A + Sbjct: 165 STALGDAVGLA-----------IKLFKDAETEHRVLLLLTDGNDTGSLVPPVDAARVAAT 213 Query: 396 QGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFR 442 + IRI IA E+A L+ A + FEA S+ +L +F+ Sbjct: 214 EDIRIYPIAVGDPTAVGEEAIDLDTLARMAEVTGGQAFEALSSEDLIAVFK 264 >gi|291087243|ref|ZP_06571866.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291076088|gb|EFE13452.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 2012 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 81/220 (36%), Gaps = 14/220 (6%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + Y + +D SLS ++ +D + ++ + +D + + I D + + Sbjct: 1471 TKYPVDLVFVIDKSLSMDYDIDGNEIKWWDDETESRKDIVNDALEEI-IPDLCSQQYDIQ 1529 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + S W + + ++ I ST + A+ A D + + + Sbjct: 1530 IAGYQFSGSSTRVLDW--SREEQQVLSGLKIARTSS-STEPSQALADALDMLKTGSP--A 1584 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 HR ++N+ KKY++ +TDGE T+ + G I TI S + + Sbjct: 1585 HRNQSNV--KKYLIFMTDGEPTEPEDWSYNAVRNHAVPGASIYTIGVSSDASTN-LMEGI 1641 Query: 419 LSNCASPNSF----FEANSTHELNKIFRDRIGNEIFERVI 454 S S + F+ S + F +I +EI Sbjct: 1642 RSTALSNGMYAPATFKGTSAQLIRDAFT-QIKDEIISTST 1680 Score = 37.7 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 60/205 (29%), Gaps = 29/205 (14%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND--RVISDPS 311 S + ++ +V AL + D G F+D ++ Sbjct: 1063 KSGSMDQSFGSGNSDARREVVNSALELFFNQLSDGDYNIQF---GGYKFSDSGERVNFND 1119 Query: 312 FSWGVH----KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 W + GST + +++A + + ++ Sbjct: 1120 QGWETEYWETDTSNALSHLKLTSRETDGSTYPSQTLRSAISALEN--------VELGENG 1171 Query: 368 KKYIVLLTDGENTQDNEEGIAI----CNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 K+Y++ LTDGE Q++ C A G I N +S Sbjct: 1172 KRYLIFLTDGEPGQNSYSFSEKEAENCYSAIKNLDSGTTFYAIQV-ANSDSHGFMESMVS 1230 Query: 421 NCASPNSF----FEANSTHELNKIF 441 N S + F NS ELN F Sbjct: 1231 NANSVDGVTAQKFVGNSADELNAAF 1255 >gi|167534461|ref|XP_001748906.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772586|gb|EDQ86236.1| predicted protein [Monosiga brevicollis MX1] Length = 2847 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 44/264 (16%), Positives = 84/264 (31%), Gaps = 24/264 (9%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + M G A K G G + K SP L + Sbjct: 754 RTGRMGAKGMPGTPGSPGADGNNAPDGAKGADGDRGFPGPQGPKGSP----GAPGLPGVP 809 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + F+ S + L R + ++ ++ DN VR+G ++ Sbjct: 810 GGAVCVGREIDVFFLLDGSGSIDGRDFELQRSFVRDLVSNLMSGDND---VRVGVAEYSS 866 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + I + + G+TA ++ A D I S+ Sbjct: 867 ---TYTQIVFPFSSSQSAIDSSLSSMIQTAGATATGTSLGEAADDIGSTARSSA------ 917 Query: 365 LEAKKYIVLLTDGENTQDNEEGIA-ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + ++L+TDGE + +E+ I + ++ G+ I I + ++ E L Sbjct: 918 ---ARVLILMTDGETSDGDEQNIDPSVDALRALGVSITAIGVGNSASESE----LLQIAG 970 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 S + F + +L+ IG Sbjct: 971 SSDHVFNNIAFVDLSSFINQIIGQ 994 >gi|183602734|ref|ZP_02964097.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] gi|183217972|gb|EDT88620.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] Length = 839 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 45/345 (13%), Positives = 103/345 (29%), Gaps = 66/345 (19%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK- 213 ++ + ++ + + + I V+D S SM + + T Sbjct: 47 TYTLSMDVTGKSDESTEQQVVPLDIALVLDVSGSMNELSGKLVYNEVELLSMNPISTYYV 106 Query: 214 ----SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD-SSSLRHVI 268 SY + + Y + Y PS S V S + Sbjct: 107 EKDGSYQAVRCSAISWGRCTTWQDQDSAGQKYTVTYNWIGGPSASVSPDVQFYKSKQSEE 166 Query: 269 KKKHLVRDALASVIRSI----KKIDNVNDTVRMGATFFN-------------------DR 305 + ++DA+ + + ++I++ V++ + + Sbjct: 167 TRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNY 226 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + S +W L + + G+T + +Q A + S Sbjct: 227 SQTVHSLAWTPEDLQKE--QAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA-------- 276 Query: 366 EAKKYIVLLTDGENTQDN-------EEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKAR 416 +K V +DG T + I + K+ ++++I + + + A Sbjct: 277 --QKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTDNAN 334 Query: 417 YFLSNCAS---------------PNSFFEANSTH-ELNKIFRDRI 445 F++ +S +++ A S +L IF++ I Sbjct: 335 KFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEII 379 >gi|325678004|ref|ZP_08157643.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324110284|gb|EGC04461.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 812 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 65/175 (37%), Gaps = 33/175 (18%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + A+ + I S+ D + F D FS +L + F Sbjct: 265 KLAIDNFIDSMTLTD------KTALISFEDEAKLVSEFSDNKEELKGLVNPYF------G 312 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKA 393 G T + +++ A + + + ++I+LL+DG+ N N + + Sbjct: 313 GGTNVRASVEMAIEQLNTVQ----------HWYTRHIILLSDGDVNININLANNTVDDLI 362 Query: 394 KS---QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 K I+I TI Q+ L +CA + +F A + +L+ I++D Sbjct: 363 KKAVDNNIKIHTIGLGSGADNQK-----LKDCAEYTGGQYFTAETAEKLDAIYKD 412 >gi|332558842|ref|ZP_08413164.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] gi|332276554|gb|EGJ21869.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] Length = 566 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 T NE IC+ A++QGI + ++AF + + L CAS + A ++ Sbjct: 495 TVKNERTRQICDAARAQGITVYSVAFEA----EAGGQALLQYCASTTGHYYATVGPQIRT 550 Query: 440 IFRDRIGNEIFERVIRITK 458 +F I + I + +R+T+ Sbjct: 551 VFHS-IASHITQ--LRLTQ 566 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 64/401 (15%), Positives = 117/401 (29%), Gaps = 82/401 (20%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + ++ +S G I ++ +ML +GG+ VDV+R+ + L+ A++ A+ Sbjct: 11 YALRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS-- 68 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 P + + +Y+ + D +V + + Sbjct: 69 -----------LTQSRSPAEVVRDYVAK-------AGLEDYLDEPVVNANTLNVRS---- 106 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 V ++ Y + ++F++ + I + AE + V I V+D S S Sbjct: 107 --VTATAAYSMP----TVFMKLLDIDRLEAPAVSTAEERVSN------VEISLVLDMSNS 154 Query: 189 MLDYQR-----------------DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 M+ D N A S G+V + L+ Sbjct: 155 MVTDGTNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLAT 214 Query: 232 YMVSCNKSLYYML--------------YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 Y ++ Y PL S + E F SSS + A Sbjct: 215 YPNVSHRQPYSSCVEFAASDFTTTALANGAPLTGSGNSELFSSSSSTQAPTYYWCPEETA 274 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + + D + D WGV L + + A + Sbjct: 275 AGNPTVTPFSHDPEALKAAIDRLSGEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVN 334 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 AY + K +VL+TDG+ Sbjct: 335 GAFAGRPLAYQS---------------GNVMKVVVLMTDGQ 360 >gi|269961128|ref|ZP_06175496.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269834079|gb|EEZ88170.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 362 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 50/202 (24%), Positives = 77/202 (38%), Gaps = 17/202 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ L+ ++ +K D R+G F D Sbjct: 113 DLSGSMAEQDFTSKTGEKISRLDAAKEVLSDFAKT-RKGD------RLGLILFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A I + E R +K Sbjct: 166 TPFTPDQKVWLELLNQTDVA--MAGQSTHLGDAIGLA---IKVFEQSEKSRTDVEESKEK 220 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 ++LTDG +T E I AK++ +RI IA +T E A + A S Sbjct: 221 VAIVLTDGNDTGSFVEPIDAAKVAKAKDVRIHVIAMGDPQTVGETALDMNTIKRIAKESG 280 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 FEA + EL K + D IG Sbjct: 281 GEAFEALNRDELAKAY-DEIGK 301 >gi|225010241|ref|ZP_03700713.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005720|gb|EEG43670.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 330 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 60/194 (30%), Gaps = 40/194 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ ++ I+ + R+G + + + ++ + + Sbjct: 110 RLVALKEVASNFIKD----RPND---RIGLVVYAGESYTKTPITSDKRLVLEALKEIKYG 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI + T+ + + K++ K I+LLTDG N E Sbjct: 163 ALTD--GTAIGMGLATSVNRL-----------KDSKALSKVIILLTDGVNNAGFIEPATA 209 Query: 390 CNKAKSQGIRIMTIAFSVN-------KTQQEKA-----------RYFLSNCA--SPNSFF 429 A I+ TI N + + L A + +F Sbjct: 210 AELAVEYDIKTYTIGLGTNGNALSPIAYNPDGSFRYGMAEVQIDEALLEQIATLTGGKYF 269 Query: 430 EANSTHELNKIFRD 443 A +L I+ + Sbjct: 270 RATDNKKLEAIYNE 283 >gi|159044810|ref|YP_001533604.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] gi|157912570|gb|ABV94003.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] Length = 553 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 50/393 (12%), Positives = 116/393 (29%), Gaps = 75/393 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G + + +M+ + G+ +D++R+ L+ + A++ A+ L+ Sbjct: 24 FCRDEAGVLTGFSLYIFILMMMIAGLTIDLMRYEAVRTRLQATSDRAVLAAA-----DLD 78 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + ++ + F + D V+++ + +V Sbjct: 79 QTTNAKAVVEDY------------FAKAGMSQYLD----------GVQVSKGLNFKEVEA 116 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + F+ GI++ +++AE ++ + + V+D S SM + Sbjct: 117 QVSATIP----TWFMNMSGIETLDAFARSKAEERIQN------IEVSLVLDISGSMG-WD 165 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + + G+ + PY + ++ Sbjct: 166 GKLANMRTAADQFVRTMMAGNDNVAADGTGLTSVSIIPY--------HAVVNVPDELLDE 217 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + S + I K +D + R + Sbjct: 218 YAVSTQQTVSNCVRFTATDFQSIS----IDRTKTLDRLAHFDRNNSNLHTFNGDRLIGRP 273 Query: 314 W---GVHKLIRTIVKTF------AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 W G + I + + G+TA + M+ A + ++ V M + Sbjct: 274 WCQVGTYGAILPWSTSVTDLTNKVAELGASGNTATDIGMKWAAALLDPGTQNIVDDMIDG 333 Query: 365 LEAK----------------KYIVLLTDGENTQ 381 + K +VL+TDGENT Sbjct: 334 GHLEADLAGRPVLYSDPETIKVVVLMTDGENTS 366 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 5/78 (6%) Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 N+ + IC++ K+Q + I TI F ++ + + CAS + + E++ Sbjct: 481 NSTADGYTEQICDQLKAQDVVIFTIGFEA----PQRGQDLMRYCASSSGHYFDVEGVEIS 536 Query: 439 KIFRDRIGNEIFERVIRI 456 + F I N I + + + Sbjct: 537 EAF-SSIANTIQQLRLSL 553 >gi|156742544|ref|YP_001432673.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233872|gb|ABU58655.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 562 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 43/301 (14%), Positives = 93/301 (30%), Gaps = 35/301 (11%) Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 AY L + ++++ + + A V+ + + +++++D + Sbjct: 265 AYNRDRYPSVSLPV--VAIYPKEGTFWTDHPYAILNAPWVTDEQREAANIFLRYLLDRPQ 322 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + +P N + E P ++S +S++ Sbjct: 323 --QELALRYGYRPSNTDVAVGAPITPENGVDPQQPQTLLEVPRPDVLSAIRSIWEQNK-- 378 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + +D S + + AL + ++ D G T F+D+ Sbjct: 379 ---KRVDVMAVLDVSGSMEDEGRLEQAKAALRIFVEQLQDDDGF------GLTIFSDQAT 429 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S RT V G T + D + AY + ++ Sbjct: 430 VLTPIS--PIGSRRTEVLNRIAGLTPRGGTRLLDTVVEAYQELTATP--------PGQRI 479 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQ----GIRIMTIAFSVNKTQQEKARYFLSNCA 423 + +V+LTDG + + + + + I++ TIAF + + L A Sbjct: 480 -RAVVVLTDGLDNRSQRSAEDVLDLLRQDREGYSIKVFTIAFGGDAD-----VHLLKEIA 533 Query: 424 S 424 S Sbjct: 534 S 534 >gi|162448865|ref|YP_001611232.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] gi|161159447|emb|CAN90752.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] Length = 656 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 78/204 (38%), Gaps = 23/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P + + VD+S K L + +L + ++K D V G+ Sbjct: 292 PVKERTPVHLVYLVDTSGSMQSPDKIELAKKSLKMLTDTLKPGDTVALCTYAGSV---RE 348 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V++ + I+ A D GSTA++ + AY ++ + H + Sbjct: 349 VLAPTGI-----ESKGKILAALA-DLTAGGSTAMSSGIDLAY-SLAERTLVKGHVNR--- 398 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 +++L+DG+ ++E + +A+ +GI + T+ F + + Sbjct: 399 -----VIVLSDGDANVGPTSHDEILKTIKRARDKGITLSTVGFGQGNYKDLMMEQLANQ- 452 Query: 423 ASPNSFFEANSTHELNKIFRDRIG 446 ++ +S + ++F +++G Sbjct: 453 -GDGNYAYIDSEAQARRVFSEQVG 475 >gi|293342867|ref|XP_001069890.2| PREDICTED: rCG55860-like [Rattus norvegicus] gi|149021013|gb|EDL78620.1| rCG55860, isoform CRA_a [Rattus norvegicus] Length = 953 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 48/219 (21%), Positives = 74/219 (33%), Gaps = 27/219 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P L P FV S V K R+AL +++ ++ D R Sbjct: 277 DGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAKLQQTREALVTILNDLRPQD------R 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAIDENEMGSTAINDAMQTAYDTII 351 F++R+ W H L T K + + G T IN A+QTA + Sbjct: 331 FNIIGFSNRIKM-----WKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQTAIKLLN 385 Query: 352 S--SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFS 406 + + D R + I+ LTDG+ T + I + K I I T+ Sbjct: 386 NYVAQNDIEDRSVS------LIIFLTDGKPTFGETNTLRILSNTKEATGGQICIFTVGIG 439 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + L NC E F D I Sbjct: 440 NDVDFRLLEKLSLENCGLTRRVHEEEKAGAQLIGFYDEI 478 >gi|268579981|ref|XP_002644973.1| Hypothetical protein CBG10938 [Caenorhabditis briggsae] Length = 548 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 51/318 (16%), Positives = 104/318 (32%), Gaps = 19/318 (5%) Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 +Y ++ S L P + + ++ I ++ + E T ++S +S V+ Sbjct: 244 TDGYSYDIIESGARLLREVP-NSVVYAVTIGEIYLRKELELITGNKSNVMIGSMSYGTVV 302 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 ++ R + + N S + + + +E + + + Sbjct: 303 KRIKNCEARARAQQLRDENPVELVHPGEFLSDAFSHRQSVQTNENIKKDEPAKDSVTEPT 362 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 D DSS + L + ++++ + R+ F Sbjct: 363 DKLPVNDCQYDVGIIFDSSGSLEKNFQTQL------QIANKLQQMPIRPNLTRVAIVQFA 416 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + +K + K G+T N A++ +S D Sbjct: 417 GKSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTFTNQALKRMALLFEASKRDNCKM--- 473 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K +V TDG + +D EGI K QGI + T+ S +K Sbjct: 474 -----KLLVF-TDGYSAEDTAEGI---EALKRQGITVYTVGISTDKNAGLNVSELKGMAT 524 Query: 424 SPNSFFEANSTHELNKIF 441 SP+ +F+++ L K F Sbjct: 525 SPSHYFDSSDFDNLLKHF 542 >gi|325279871|ref|YP_004252413.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311680|gb|ADY32233.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 341 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 58/168 (34%), Gaps = 23/168 (13%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIK--KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 L V +S + IK + + A++ ++ + ++G F Sbjct: 86 KKKGVELMIALDVSNSMMAQDIKPSRLEKAKMAISRMVEKLSND-------KIGLIVFAG 138 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + + + D + TAI A+ A + E Sbjct: 139 DAYVQLPITTDYSSA-KLFLSNISTDIVPVQGTAIGSAIDLAARSFTPETETS------- 190 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K I+++TDGEN QD+ +A +A +GI I TI + + Sbjct: 191 ----KAIIVITDGENHQDDA--VAAAKQAHEKGIVIHTIGMGLEQGAP 232 >gi|296445280|ref|ZP_06887239.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] gi|296257235|gb|EFH04303.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] Length = 575 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 5/85 (5%) Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYFLSNCASP-NSFFEANS 433 D +E + C AK+ G+ I TI FSV + + + +CA+ N +F A Sbjct: 494 DTSRNMQDELTLEACTNAKTAGVEIYTIGFSVPVDPIDAQGLKMMQDCATDANHYFAATD 553 Query: 434 THELNKIFRDRIGNEIFERVIRITK 458 LN F IG+ + + +R++K Sbjct: 554 VDSLNAAFAS-IGSGVGK--LRLSK 575 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 50/420 (11%), Positives = 117/420 (27%), Gaps = 58/420 (13%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G+ +I L ++ + G VD R + L+QA +A + + ++ + Sbjct: 19 CDRGNVAVIFGLSFIPLVLMLGAGVDYGRAVSTKSNLQQATDSAALAVAKTIVATTTN-Q 77 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKN-------------FTDREVRDIVRDTAVEMN 123 + + + + + + + T ++ I T Sbjct: 78 QAQSQAQVYLLTNVRNAVAVVTKAEISADRLTLCLDSTAQIPTTIMKIAHIETITTKATT 137 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS-YHKEHGVSIQWV 182 ++ + + L+L+ +S G KS + + A + + Y K V + V Sbjct: 138 CAQTPGGMNGTYEIALVLDNSGSMSKSAGGKSKIAALRDAATSFVNNIYSKTTDVKMSIV 197 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + + S + L+ + + +G + K + Sbjct: 198 PFSAGVRVLDPSVSSNRTLSWIDVNGNNSQHWLVFGDGSLVAATAKAAAKTAGFTSRFDI 257 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI------KKIDNVNDTVR 296 L+ S + + + ++ D N+++ Sbjct: 258 FTKLKSLNSSWDWGGCFEGPKYPLNVSDTAVDTSNAETLFVPFLAPDEPSTKDKYNNSLY 317 Query: 297 M------------GATFFNDRVISDPSFSWGVHK------------------------LI 320 G + ++ + +G K Sbjct: 318 TNNYLAETGGSCSGTVTGDWKL-LTRACKYGKPKKDGSGAGPNSSCPTSSSQTVLQLTAT 376 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++ + T E G T +++ + TI + R + K IV +TDG N Sbjct: 377 QSTITTKISGLTENGYTNLHEGFMWGWRTISPTGPFAAGRAYATKDNHKIIVFMTDGFNN 436 >gi|306823858|ref|ZP_07457232.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309802423|ref|ZP_07696530.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] gi|304552856|gb|EFM40769.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221023|gb|EFO77328.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] Length = 1136 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 38/267 (14%), Positives = 81/267 (30%), Gaps = 54/267 (20%) Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 K + V+ S + P+D +L + + K ++ A+ + Sbjct: 573 KKNADNTYTVNVDVTGAASSSTITTTQPVDFTLVLDVSGSMRENMGSVTKLQALQSAVNN 632 Query: 281 VIRSIKKIDNVNDT----VRMGATFF--------NDRVISDPSFSWGVHKLIRTIV---- 324 + KI+ + VR+G F ++ D ++ ++++ + Sbjct: 633 FLDEAAKINKGAQSGSEPVRVGLVKFAGNATKKIGNKTYQDKWNTYNYSQIVKKLTADTD 692 Query: 325 --KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 K G+T + Q A+ + + EAKK ++ TDG+ T + Sbjct: 693 GLKNEVNKLTAGGATRADYGFQHAFTVMSEAR----------TEAKKVVIFFTDGKPTSE 742 Query: 383 NE-------EGIAICNKAKSQGIRIMTIAFS---VNKTQQEKARYFLSNCASP------- 425 + + + K G + +I + F+ +S Sbjct: 743 KTFDGKVANDAVEYAKQLKDSGAIVYSIGVFDGANPASTATSENKFMHAVSSNYPNAANY 802 Query: 426 ---------NSFFEANSTHELNKIFRD 443 + A LN IF + Sbjct: 803 EDLSEGSNAGYYKTATDASGLNSIFEE 829 >gi|77463970|ref|YP_353474.1| hypothetical protein RSP_0399 [Rhodobacter sphaeroides 2.4.1] gi|77388388|gb|ABA79573.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 566 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 T NE IC+ A++QGI + ++AF + + L CAS + A ++ Sbjct: 495 TVKNERTRQICDAARAQGITVYSVAFEA----EAGGQALLQYCASTTGHYYATVGPQIRT 550 Query: 440 IFRDRIGNEIFERVIRITK 458 +F I + I + +R+T+ Sbjct: 551 VFHS-IASHITQ--LRLTQ 566 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 66/398 (16%), Positives = 117/398 (29%), Gaps = 82/398 (20%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ +S G I ++ +ML +GG+ VDV+R+ + L+ A++ A+ Sbjct: 14 RRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS----- 68 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 P + + +Y+ K D +V + + V Sbjct: 69 --------LTQSRSPAEVVRDYV-------TKAGLADYLDEPVVNANTLNVRS------V 107 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 ++ Y + ++F++ + I + AE + V I V+D S SM+ Sbjct: 108 TATAAYSMP----TVFMKLLDIDRLEAPAVSTAEERVSN------VEISLVLDMSNSMVT 157 Query: 192 YQR-----------------DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 D N A S G+V + LS Y Sbjct: 158 DGTNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLSTYPN 217 Query: 235 SCNKSLYYML--------------YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 ++ Y PL S + E F SSS + A + Sbjct: 218 VSHRQPYSSCVEFAASDFTTTALANGAPLTGSGNSELFSSSSSTQAPTYYWCPEETAAGN 277 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + D + + D WGV L + + A + Sbjct: 278 PTVTPFSHDPEALKLAIDRLSGEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGAF 337 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 AY + K +VL+TDG+ Sbjct: 338 AGRPLAYQS---------------GNVMKVVVLMTDGQ 360 >gi|145597778|ref|YP_001161854.1| hypothetical protein YPDSF_0468 [Yersinia pestis Pestoides F] gi|145209474|gb|ABP38881.1| membrane protein [Yersinia pestis Pestoides F] Length = 513 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 76/502 (15%), Positives = 137/502 (27%), Gaps = 99/502 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 IK+ G + L+PV +G+ + + + L A + A + S Sbjct: 12 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALST------- 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA-YQVV 132 + + + +++ E R Y Sbjct: 65 --------ENNYRNDRASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNAS 116 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L Y L L L+ +L+ +W + A S + + + +V DFS SM Sbjct: 117 LQMNYQLAL--LNSYLKQTPSPTWDVNENGAARKYLSSIAE--PIDVVFVTDFSGSMDLP 172 Query: 193 QRDSEGQPLNCFGQPADRT-VKSYSSQNGKVGIRDEKLSPYMVSCNK--------SLYYM 243 D E VK + GI P+ + S Y Sbjct: 173 FGDIERNNRITKLDELKAIFVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCH 232 Query: 244 LYPGPLDPSLSEEHF-------------VDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 P + + +D+ S + L D +++ I+K Sbjct: 233 FPYSPKKIDGNGHYLQRYTASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHR 292 Query: 291 VND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 N+ R A F D+ S + I I+ + I+ M Sbjct: 293 DNEIPTKTRDQAKNFLDKAYKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDIL 352 Query: 348 DTIISSNEDEVHRM-----------------------------------KNNLEAKKYIV 372 D E + + K ++ Sbjct: 353 DPFFCLKETNAKSLNFDPNSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMI 412 Query: 373 LLTDGENT-------QDNEEGI----------AICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +L+DG++ D + GI +C K K GI+++ I Sbjct: 413 ILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNI 470 Query: 416 RYFLSNCASPNSFFEANSTHEL 437 + +C +F+ A + HEL Sbjct: 471 IDWEKDCVGTGNFYLAKNAHEL 492 >gi|126462813|ref|YP_001043927.1| hypothetical protein Rsph17029_2052 [Rhodobacter sphaeroides ATCC 17029] gi|126104477|gb|ABN77155.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029] Length = 566 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 39/81 (48%), Gaps = 7/81 (8%) Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + T NE IC+ A++QGI + ++AF + + L CAS + A ++ Sbjct: 493 DPTVKNERTRQICDAARAQGITVYSVAFEA----EAGGQALLQYCASTTGHYYATVGPQI 548 Query: 438 NKIFRDRIGNEIFERVIRITK 458 +F I + I + +R+T+ Sbjct: 549 RTVFHS-IASHITQ--LRLTQ 566 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 55/384 (14%), Positives = 126/384 (32%), Gaps = 54/384 (14%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ +S G I ++ +ML +GG+ VDV+R+ + L+ A++ A+ Sbjct: 14 RRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS----- 68 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 P + + +Y+ + D +V + + V Sbjct: 69 --------LTQSRSPAEVVRDYVAK-------AGLEDYLDEPVVNANTLNVRS------V 107 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 ++ Y + ++F++ + I + AE + V I V+D S SM+ Sbjct: 108 TATAAYSMP----TVFMKLLDIDRLEAPAVSTAEERVSN------VEISLVLDMSNSMVT 157 Query: 192 YQRDSEGQPLNCFGQPADRT-VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + + N D + + +G G +S + + L + Sbjct: 158 DGTNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLATYPN 217 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 S + + L A + + ++ + + + + ++ + Sbjct: 218 VSHRQPYSSCVEFAASDFTTTALANGATLTGSGN-SELFSSSSSTQTPTYYWCPEETAAG 276 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK-- 368 + + +K + GSTAI+ M+ + S + V + + + Sbjct: 277 NPTVTPFSHDPEALKAAIDRLSGEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGA 336 Query: 369 --------------KYIVLLTDGE 378 K +VL+TDG+ Sbjct: 337 FAGRPLAYQSGNVMKVVVLMTDGQ 360 >gi|330719552|gb|EGG98147.1| BatA [gamma proteobacterium IMCC2047] Length = 166 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 25/101 (24%), Positives = 39/101 (38%), Gaps = 15/101 (14%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 V R++ + ++LLTDG NT E + A+ Q I+I TI N+ Sbjct: 1 MAVKRLRERPAESRLLILLTDGANTAGEIEPLKAAELAQQQQIKIHTIGVGANEMLVPGL 60 Query: 411 --------QQEKARYFLSNCASP--NSFFEANSTHELNKIF 441 + L AS +F A T +L +I+ Sbjct: 61 FSSRRVNPSADLDEDTLKKIASQTGGQYFRAQDTEQLQQIY 101 >gi|221639828|ref|YP_002526090.1| hypothetical protein RSKD131_1729 [Rhodobacter sphaeroides KD131] gi|221160609|gb|ACM01589.1| Hypothetical Protein RSKD131_1729 [Rhodobacter sphaeroides KD131] Length = 566 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 39/81 (48%), Gaps = 7/81 (8%) Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + T NE IC+ A++QGI + ++AF + + L CAS + A ++ Sbjct: 493 DPTVKNERTRQICDAARAQGITVYSVAFEA----EAGGQALLQYCASTTGHYYATVGPQI 548 Query: 438 NKIFRDRIGNEIFERVIRITK 458 +F I + I + +R+T+ Sbjct: 549 RTVFHS-IASHITQ--LRLTQ 566 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 57/384 (14%), Positives = 127/384 (33%), Gaps = 54/384 (14%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ +S G I ++ +ML +GG+ VDV+R+ + L+ A++ A+ Sbjct: 14 RRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS----- 68 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 P + +E+Y+ K D +V + + V Sbjct: 69 --------LTQSRSPAEVVEDYV-------TKAGLEDYLDEPVVNANTLNVRS------V 107 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 ++ Y + ++F++ + I + AE + V I V+D S SM+ Sbjct: 108 TATAAYSMP----TVFMKLLDIDRLEAPAVSTAEERVSN------VEISLVLDMSNSMVT 157 Query: 192 YQRDSEGQPLNCFGQPADRT-VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + + N D + + +G G +S + + L + Sbjct: 158 DGTNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISISIVPYTGQVNAGADLLATYPN 217 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 S + + L A + + ++ + + + + ++ + Sbjct: 218 VSHRQPYSSCVEFAASDFTTTALANGATLTGSGN-SELFSSSSSTQTPTYYWCPEETAAG 276 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK-- 368 + + +K + GSTAI+ M+ + S + V + + + Sbjct: 277 NPTVTPFSHDPEALKAAIDRLSGEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGA 336 Query: 369 --------------KYIVLLTDGE 378 K +VL+TDG+ Sbjct: 337 FAGRPLAYQSGNVMKVVVLMTDGQ 360 >gi|332828718|gb|EGK01410.1| hypothetical protein HMPREF9455_02243 [Dysgonomonas gadei ATCC BAA-286] Length = 330 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 59/193 (30%), Gaps = 38/193 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A I N R+G F + + HK++ ++ Sbjct: 111 TRLEAAKKVAAEFI-------NDRPNDRIGLVIFGGESFTQCPLTTD-HKVLLNLLTEVK 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TAI + + + + K++ + ++LLTDG N + Sbjct: 163 FGMIEDG-TAIGLGLANSVNRL-----------KDSKSKSRVVILLTDGSNNAGQIAPLT 210 Query: 389 ICNKAKSQGIRIMTIAFS------VNKTQQEKARYF----------LSNCA--SPNSFFE 430 A S IR+ TI + L+ A + +F Sbjct: 211 AAELAASYDIRVYTIGIGSRGTSTARIMTPYGLQTMQVSGDFDERTLTEIAAITKGQYFR 270 Query: 431 ANSTHELNKIFRD 443 A L+ I+ + Sbjct: 271 ATDNTSLSAIYDE 283 >gi|293605449|ref|ZP_06687831.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] gi|292816177|gb|EFF75276.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] Length = 343 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 42/265 (15%), Positives = 80/265 (30%), Gaps = 27/265 (10%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + F +M+ S +P + + R + + P +V Sbjct: 33 LRLPFFDTMVQLTGKSPARPGVRVARGQLWLNVAVWLLLVLALARPQWVEPPLVHTEPVR 92 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 +L +D S S + + + + + V+ +A I + D+ R+G Sbjct: 93 DILL---AVDISQSMDSVDFNDAQGQPLSRWDAVKAVVADFIAQ--RADD-----RLGLI 142 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F + ++ A +TA+ DA+ + ++ E Sbjct: 143 VFGTGAYPQAPLTRD--HASLKLLLDEAAVGMAGPNTAVGDAIGLGIRMLDAAEEQ---- 196 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-- 418 K ++LLTDG +T A + + TI E F Sbjct: 197 -------DKVLILLTDGNDTGSAVPPARAAALAAQHHVTVHTIGIGDPAATGEDRVDFDI 249 Query: 419 LSNCASP--NSFFEANSTHELNKIF 441 L A FF A L +++ Sbjct: 250 LREVARTAGGQFFPARDLATLREVY 274 >gi|153833319|ref|ZP_01985986.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148870455|gb|EDL69376.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 363 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 50/202 (24%), Positives = 77/202 (38%), Gaps = 17/202 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + ++ L+ ++ +K D R+G F D Sbjct: 113 DLSGSMAEQDFTSKNGEKISRLDAAKEVLSDFAKT-RKGD------RLGLILFGDAAFVQ 165 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T ST + DA+ A I + E R +K Sbjct: 166 TPFTPDQKVWLELLNQTDVA--MAGQSTHLGDAIGLA---IKVFEQSEKSRTDVEESKEK 220 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SP 425 ++LTDG +T E I AK++ +RI IA +T E A + A S Sbjct: 221 VAIVLTDGNDTGSFVEPIDAAKVAKAKDVRIHVIAMGDPQTVGETALDMNTIKRIAKESG 280 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 FEA + EL K + D IG Sbjct: 281 GEAFEALNRDELAKAY-DEIGK 301 >gi|331089974|ref|ZP_08338865.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] gi|330403112|gb|EGG82675.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] Length = 4107 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 54/375 (14%), Positives = 105/375 (28%), Gaps = 77/375 (20%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 D+ L S G +L+ A + T + + I V+D S SM + Sbjct: 84 DIQLEGGSRPSVKKGDSDFLVGLSALSSTSNTVKTVSRPLDIVLVVDTSGSMENSPHMGT 143 Query: 198 GQPLNCFGQ--------PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY---YMLYP 246 + + Y+ GK+ +K+ + + L Sbjct: 144 RPESERYRYEEIYAGNLSETEDQEYYTKDGGKITSEGQKILFWWEFTHWELNGQTVEPKI 203 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDNVNDTVRMGATFF 302 D + + F + L I K +++A+ + + ID++ R+ F Sbjct: 204 SAEDSNPNHIQFYERRDLGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKF 263 Query: 303 ---------NDRVISDPSFSWGVHKLIRTIVK------TFAIDENEMGSTAINDAMQTAY 347 ND + ++ + S V +L K + G+T + + A Sbjct: 264 ASDESDNIGNDFINNNYNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQ 323 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-------EGIAICNKAKSQGIRI 400 A+K +V TDG+ T +++ I + K I Sbjct: 324 RAFQLGG--------TREGAQKVVVFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALI 375 Query: 401 MTIAFSVNKTQQEKARY--------------------------------FLSNCASPNSF 428 +I + + + + + Sbjct: 376 YSIGVFRDANPNDTNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYY 435 Query: 429 FEANSTHELNKIFRD 443 A ELN IF + Sbjct: 436 KAATDADELNNIFNE 450 >gi|317502373|ref|ZP_07960539.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] gi|316896246|gb|EFV18351.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] Length = 4107 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 54/375 (14%), Positives = 105/375 (28%), Gaps = 77/375 (20%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 D+ L S G +L+ A + T + + I V+D S SM + Sbjct: 84 DIQLEGGSRPSVKKGDSDFLVGLSALSSTSNTVKTVSRPLDIVLVVDTSGSMENSPHMGT 143 Query: 198 GQPLNCFGQ--------PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY---YMLYP 246 + + Y+ GK+ +K+ + + L Sbjct: 144 RPESERYRYEEIYAGNLSETEDQEYYTKDGGKITSEGQKILFWWEFTHWELNGQTVEPKI 203 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDNVNDTVRMGATFF 302 D + + F + L I K +++A+ + + ID++ R+ F Sbjct: 204 SAEDSNPNHIQFYERRDLGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKF 263 Query: 303 ---------NDRVISDPSFSWGVHKLIRTIVK------TFAIDENEMGSTAINDAMQTAY 347 ND + ++ + S V +L K + G+T + + A Sbjct: 264 ASDESDNIGNDFINNNYNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQ 323 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-------EGIAICNKAKSQGIRI 400 A+K +V TDG+ T +++ I + K I Sbjct: 324 RAFQLGG--------TREGAQKVVVFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALI 375 Query: 401 MTIAFSVNKTQQEKARY--------------------------------FLSNCASPNSF 428 +I + + + + + Sbjct: 376 YSIGVFRDANPNDTNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYY 435 Query: 429 FEANSTHELNKIFRD 443 A ELN IF + Sbjct: 436 KAATDADELNNIFNE 450 >gi|291229807|ref|XP_002734862.1| PREDICTED: polydom-like [Saccoglossus kowalevskii] Length = 1730 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 60/183 (32%), Gaps = 20/183 (10%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 S+ F+ S + + S++ I + R+ +++ Sbjct: 26 SKEYPQGKSDLVFLLDRSASVGSANFEAEKGFVESLLGQF-SISPAS--TRVDVVSYSED 82 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V+ + + G T N A+Q A + + S +D Sbjct: 83 VVRHIDYIREPKNKCHFSQDIRHVTYRNSGKTNTNGALQEARNIFVGSRQD--------- 133 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 K +VLL+DG++ + + + G+ I TIA + + L++ A+ Sbjct: 134 -VHKVVVLLSDGQSNTGGDPT-TTAEELRQDGVEIFTIAIGLFNKDE------LNSIATS 185 Query: 426 NSF 428 + Sbjct: 186 DHH 188 >gi|153815168|ref|ZP_01967836.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] gi|145847427|gb|EDK24345.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] Length = 4109 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 54/375 (14%), Positives = 105/375 (28%), Gaps = 77/375 (20%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 D+ L S G +L+ A + T + + I V+D S SM + Sbjct: 86 DIQLEGGSRPSVKKGDSDFLVGLSALSSTSNTVKTVSRPLDIVLVVDTSGSMENSPHMGT 145 Query: 198 GQPLNCFGQ--------PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY---YMLYP 246 + + Y+ GK+ +K+ + + L Sbjct: 146 RPESERYRYEEIYAGNLSETEDQEYYTKDGGKITSEGQKILFWWEFTHWELNGQTVEPKI 205 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDNVNDTVRMGATFF 302 D + + F + L I K +++A+ + + ID++ R+ F Sbjct: 206 SAEDSNPNHIQFYERRDLGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKF 265 Query: 303 ---------NDRVISDPSFSWGVHKLIRTIVK------TFAIDENEMGSTAINDAMQTAY 347 ND + ++ + S V +L K + G+T + + A Sbjct: 266 ASDESDNIGNDFINNNYNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQ 325 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-------EGIAICNKAKSQGIRI 400 A+K +V TDG+ T +++ I + K I Sbjct: 326 RAFQLGG--------TREGAQKVVVFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALI 377 Query: 401 MTIAFSVNKTQQEKARY--------------------------------FLSNCASPNSF 428 +I + + + + + Sbjct: 378 YSIGVFRDANPNDTNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYY 437 Query: 429 FEANSTHELNKIFRD 443 A ELN IF + Sbjct: 438 KAATDADELNNIFNE 452 >gi|47230696|emb|CAF99889.1| unnamed protein product [Tetraodon nigroviridis] Length = 1031 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 56/163 (34%), Gaps = 18/163 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G + ++V+ + S + + + +I++ T + A Sbjct: 154 QVGVLQYGEKVVHEFKLS-DYKSVEEVVKRARSINQRGGEETNTALGINVACSQAF---- 208 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---- 411 H + KK ++++TDGE + D+ + + + GI IA + Sbjct: 209 --KHGGRRGA--KKVMIVITDGE-SHDSADLQQVIKDCEKDGITRYAIAVLGYYNRRGIN 263 Query: 412 QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G IF Sbjct: 264 PEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGERIFS 305 >gi|119599629|gb|EAW79223.1| hCG1743181 [Homo sapiens] Length = 1211 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 71/188 (37%), Gaps = 18/188 (9%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWG 315 FV SS + ++++D +I +KK D + VR GA + D +G Sbjct: 789 VFVIDSSGSIDYDEYNIMKD---FMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDDFG 845 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ D+ GST +A+ + + + N + ++++T Sbjct: 846 TK---LEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRL-----NKGVPQVLIVIT 897 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DGE + D ++ A + +GI ++ + L+ S + +F + Sbjct: 898 DGE-SHDADKLNATAKALRDKGILVLAVGI-----DGANPVELLAMAGSSDKYFFVETFG 951 Query: 436 ELNKIFRD 443 L IF D Sbjct: 952 GLKGIFSD 959 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 15/166 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++ D + VR+GA F+D + G + I + Sbjct: 1012 KKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQIF 1069 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T I A++ D R+ + +++LTDG Q +E + Sbjct: 1070 GNTHIGAALREVEHYF---RPDMGSRINTGTP--QVLLVLTDG---QSQDEVAQAAEALR 1121 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +GI I ++ + + + ++ EL K+ Sbjct: 1122 HRGIDIYSVGI-----GDVDDQQLIQITGTAEKKLTVHNFDELKKV 1162 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 55/179 (30%), Gaps = 23/179 (12%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P D + ++ S K + S+ +D + +R+G +++ Sbjct: 178 PSMADVVFLLDMSINGSEENFDYLK--------GFLEESVSALDIKENCMRVGLVAYSNE 229 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 S S G++K + V + + A A A + + + N Sbjct: 230 TKVINSLSMGINK---SEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQ 283 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 + VL+T + + + +G+ I T+ Q L AS Sbjct: 284 GVPQIAVLVT---HRDSEDNVTKAAVNLRREGVTIFTLGIEGTSDTQ------LEKIAS 333 >gi|22127367|ref|NP_670790.1| hypothetical protein y3493 [Yersinia pestis KIM 10] gi|45442761|ref|NP_994300.1| hypothetical protein YP_2999 [Yersinia pestis biovar Microtus str. 91001] gi|108809099|ref|YP_653015.1| hypothetical protein YPA_3108 [Yersinia pestis Antiqua] gi|108810706|ref|YP_646473.1| hypothetical protein YPN_0541 [Yersinia pestis Nepal516] gi|150260286|ref|ZP_01917014.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162419964|ref|YP_001604884.1| hypothetical protein YpAngola_A0266 [Yersinia pestis Angola] gi|165939877|ref|ZP_02228416.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166009017|ref|ZP_02229915.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211928|ref|ZP_02237963.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167466384|ref|ZP_02331088.1| hypothetical protein YpesF_00480 [Yersinia pestis FV-1] gi|218927875|ref|YP_002345750.1| hypothetical protein YPO0684 [Yersinia pestis CO92] gi|229837366|ref|ZP_04457529.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|229840578|ref|ZP_04460737.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842872|ref|ZP_04463024.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229900904|ref|ZP_04516028.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|294502750|ref|YP_003566812.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|21960452|gb|AAM87041.1|AE013952_8 hypothetical [Yersinia pestis KIM 10] gi|45437627|gb|AAS63177.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|108774354|gb|ABG16873.1| membrane protein [Yersinia pestis Nepal516] gi|108781012|gb|ABG15070.1| putative membrane protein [Yersinia pestis Antiqua] gi|115346486|emb|CAL19360.1| putative membrane protein [Yersinia pestis CO92] gi|149289694|gb|EDM39771.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162352779|gb|ABX86727.1| conserved hypothetical protein [Yersinia pestis Angola] gi|165912188|gb|EDR30826.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|165992356|gb|EDR44657.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206674|gb|EDR51154.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|229682243|gb|EEO78335.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|229690139|gb|EEO82196.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229696944|gb|EEO86991.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705489|gb|EEO91499.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|262364727|gb|ACY61284.1| hypothetical protein YPD8_0594 [Yersinia pestis D182038] gi|294353209|gb|ADE63550.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|320016753|gb|ADW00325.1| putative fimbrial anchor [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 518 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 76/502 (15%), Positives = 137/502 (27%), Gaps = 99/502 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 IK+ G + L+PV +G+ + + + L A + A + S Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALST------- 69 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA-YQVV 132 + + + +++ E R Y Sbjct: 70 --------ENNYRNDRASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNAS 121 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L Y L L L+ +L+ +W + A S + + + +V DFS SM Sbjct: 122 LQMNYQLAL--LNSYLKQTPSPTWDVNENGAARKYLSSIAE--PIDVVFVTDFSGSMDLP 177 Query: 193 QRDSEGQPLNCFGQPADRT-VKSYSSQNGKVGIRDEKLSPYMVSCNK--------SLYYM 243 D E VK + GI P+ + S Y Sbjct: 178 FGDIERNNRITKLDELKAIFVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCH 237 Query: 244 LYPGPLDPSLSEEHF-------------VDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 P + + +D+ S + L D +++ I+K Sbjct: 238 FPYSPKKIDGNGHYLQRYTASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHR 297 Query: 291 VND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 N+ R A F D+ S + I I+ + I+ M Sbjct: 298 DNEIPTKTRDQAKNFLDKAYKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDIL 357 Query: 348 DTIISSNEDEVHRM-----------------------------------KNNLEAKKYIV 372 D E + + K ++ Sbjct: 358 DPFFCLKETNAKSLNFDPNSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMI 417 Query: 373 LLTDGENT-------QDNEEGI----------AICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +L+DG++ D + GI +C K K GI+++ I Sbjct: 418 ILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNI 475 Query: 416 RYFLSNCASPNSFFEANSTHEL 437 + +C +F+ A + HEL Sbjct: 476 IDWEKDCVGTGNFYLAKNAHEL 497 >gi|163745746|ref|ZP_02153106.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] gi|161382564|gb|EDQ06973.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] Length = 554 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 60/384 (15%), Positives = 114/384 (29%), Gaps = 60/384 (15%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 K G ++ ++ + GG+ VD+ + A++ A+ Sbjct: 10 FRKDEGGGMLVLMLIVFFGITIFGGLAVDLANHERTRTTFQTHLDNAVLAAAS------- 62 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + L + ++ +T E + V Sbjct: 63 ----------------LSQDLDAEEVVRSYLTSAGLDPSEVEIETREE---KIGGILVGR 103 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + L + F R I + + +EA + I V+D S SM D Sbjct: 104 TVEASLPAGLNTYFFRFFDIDTLGMTISSEATERVED------IEISLVLDVSGSMGDIT 157 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN-KSLYYMLYPGPLDPS 252 D G ++ + A V++ S + G + PY N S Y + S Sbjct: 158 SDRSGIKMDLLKRAAGDFVETILSD-AEEGRVSISIVPYSTKVNPGSALLGQYTVSQEHS 216 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 S D+ H+ + I + + + R A + R S + Sbjct: 217 YSHCVDFDADDFTHLRIDTATELQRTGHFL-----IGSESTSNRT-AGQWVCRFDSGFAV 270 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT-----------------AYDTIISSNE 355 + +K +GST+I+ + A + + + Sbjct: 271 T--PLSSSVAELKAQIAALTPLGSTSIDMGAKWGLALLDPSAQTPIAAMIASGQVNRAFQ 328 Query: 356 DEVHRMKNNLEAKKYIVLLTDGEN 379 H + + K +VL+TDGEN Sbjct: 329 GRPH-VYGADNSMKVLVLMTDGEN 351 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 36/81 (44%), Gaps = 6/81 (7%) Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 ++ + + IC A + G+ I +I V+ T + L +CAS S + E+ Sbjct: 480 DSVEKDRRLRQICGVANAAGVVIYSIGMDVDNT---NSLNLLKDCASSESHYFDVEGLEI 536 Query: 438 NKIFRDRIGNEIFERVIRITK 458 F D I I ++R+TK Sbjct: 537 QTAF-DMIAASIS--MLRLTK 554 >gi|99081991|ref|YP_614145.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] gi|99038271|gb|ABF64883.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] Length = 582 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 36/79 (45%), Gaps = 7/79 (8%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 T + + IC AK++G+ + TI F + + L CAS S + E++ Sbjct: 511 TTKDARTLDICEAAKAKGVVVFTIGFEA----PSRGQEVLQACASSASHYYDVDGLEISD 566 Query: 440 IFRDRIGNEIFERVIRITK 458 F I + I R +R+T+ Sbjct: 567 AFAS-IASAI--RQLRLTE 582 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 56/395 (14%), Positives = 121/395 (30%), Gaps = 61/395 (15%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + +G +++ +M +GG+ +D+VR L+ A++ A+ Sbjct: 25 RQFRRDESGVLAKPMIMIVVLMFMIGGLGMDMVRLERDRTKLQYTLDRAVLAAA------ 78 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ P+ + +Y+ ++ EV + ++ +A + Sbjct: 79 --DLDQPLD-----PEAVVLDYMSKS-GLGDYTTVVVPEVSPTAKRVKASVDTNFTASWM 130 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 D + NP + L + + + + A + V I V+D S SM Sbjct: 131 NNVFYDDYIRNPDTYQLEPI---TLPLLASSTAVESIGN------VEISLVLDVSGSMRS 181 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 R + A + N + G + PY + L + Sbjct: 182 NDRLVN-------LKRAAKEFVQTMDDNTEDGKMSISIVPYSTQVSMPE-AFLDELNVSS 233 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR-----MGATFFNDRV 306 H ++ S + ++ ++ + R A ++ Sbjct: 234 EHDYSHCINFSGSDFNNAGISTTQAYERTMHFTVWNSGDYRSRTRLVRQPTCAAHSDNPE 293 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI--------------IS 352 + S V +L I +T+I+ M+ + + Sbjct: 294 RTALLLSDNVTQLQNYID-----AFVPSENTSIDLGMKWGSALLDPSVQPVIASLADDAN 348 Query: 353 SNEDEVHRMKNNLEAK------KYIVLLTDGENTQ 381 N+ R N K IV++TDG+NT Sbjct: 349 PNQSIASRFANRPVPYTDTETLKVIVMMTDGQNTS 383 >gi|149180101|ref|ZP_01858606.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] gi|148852293|gb|EDL66438.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] Length = 931 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 69/198 (34%), Gaps = 35/198 (17%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 L PSL +D S K L ++A +++ D +G F Sbjct: 398 DLKGKKELPSLGMVIVLDRSGSMAGY-KIQLAKEAAIRSAELLREKDT------LGFIAF 450 Query: 303 NDR--VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 +DR I D K+I I G T I +++ AY+ + Sbjct: 451 DDRPWQIIDTEPIKDKEKVIEKI-----NGLTSGGGTNIFPSLELAYEQLT--------- 496 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 +K+I+LLTDG+ + + + + + K I + T+A E + L Sbjct: 497 --PLELQRKHIILLTDGQ-SATSPDYLTTIQEGKENNITLSTVAIG------EGSDSVLL 547 Query: 421 NCASP---NSFFEANSTH 435 S F++ N + Sbjct: 548 EELSDEGGGRFYDVNDSS 565 >gi|47208180|emb|CAF89812.1| unnamed protein product [Tetraodon nigroviridis] Length = 1636 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 62/174 (35%), Gaps = 19/174 (10%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ + +++ + N VR+G + D + + ++++ K Sbjct: 478 MKKFILEFLQTFRVGPN---HVRIGVVKYADSPTLEFDLH--TYTDVKSLEKAITNIHQV 532 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A+ + H++K Y+V++TDG +T ++ +K Sbjct: 533 GGGTETGKALDFMRPQFDRAVTTRGHKVKE------YLVVITDGNSTDKVKDP---ADKL 583 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++QG+ + I + L P F N+ L I D I + Sbjct: 584 RAQGVVVYAIGVK-----DAVEKELLEISGEPQRTFYVNNFDALKPIKDDIITD 632 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 70/236 (29%), Gaps = 27/236 (11%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 + Y + + + + + S + S +L D +DSS + Sbjct: 607 SGEPQRTFYVNNFDALKPIKDDIITDICSTDGSDLSLLSTVCKDVPGDLIFLIDSSGSIY 666 Query: 267 ---VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 K K ++ + +K + D V +G ++ + + Sbjct: 667 PEDYQKMKDFMKSLV-------QKSNIGKDQVHVGVLQYSTEQKLVFPLI--QYYTKDQL 717 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 K + G T +A+ + N + K+ +V++TDGE Sbjct: 718 SKAIDDMQQIGGGTHTGEAIAVVSKYFDAQNG-------GRPDLKQRLVVVTDGE---SQ 767 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 ++ +++G+ + +I T Q L + + L Sbjct: 768 DDVKLPAEALRAKGVIVYSIGVVAANTSQ-----LLEISGDADRMYAERDFDALKD 818 >gi|258405287|ref|YP_003198029.1| hypothetical protein Dret_1163 [Desulfohalobium retbaense DSM 5692] gi|257797514|gb|ACV68451.1| Protein of unknown function DUF2134, membrane [Desulfohalobium retbaense DSM 5692] Length = 323 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 57/151 (37%), Gaps = 11/151 (7%) Query: 14 LIKSCTGHFFIITAL-LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +++ G IITAL ++ +L G+ +D+ R + ++ L+ A + ++ L Q+ Sbjct: 10 ILRDECGSVAIITALFVLFSLLATAGIAIDIGRQATAKNELQNTLDAAALAGAIELGQNG 69 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD---------REVRDIVRDTAVE-M 122 LI + N+T+ V+ +V + ++ Sbjct: 70 PANVKSEAKEAAENNSIDNNGLILGDNDIKVGNWTEPNFFSKTPYNSVKIMVNNHSINSF 129 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 +Q +S+ ++ PLS + I Sbjct: 130 FASALNFQQTVSAEATAVIGPLSGKRHLIPI 160 >gi|329888464|ref|ZP_08267062.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] gi|328847020|gb|EGF96582.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] Length = 650 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 58/175 (33%), Gaps = 49/175 (28%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK--------KYIVLLTD 376 K+ + GSTA + A+ + + D + K +L+TD Sbjct: 478 KSKIDQMSAGGSTAGQIGIAWAWYAL---SPDFASLFSGEGQPGAYAPSDTLKVAILMTD 534 Query: 377 GE------------------------------NTQDNEEGIAICNKAKSQGIRIMTIAFS 406 GE N + +A+C +++G+ + T+ F Sbjct: 535 GEFNTPFRDGVIALDAGTGSGGLDSHIDLNSSNGDPFAQSVALCQAMQAKGVVVYTVGFD 594 Query: 407 VNKTQQEKA-----RYFLSNCASPN--SFFEANSTHELNKIFRDRIGNEIFERVI 454 + + + CA+ FF+A+ +L + FR IG +I I Sbjct: 595 LGSATGREGVVDTALDVMRECATNEQTHFFQADDGTDLKEAFR-AIGRDITRLRI 648 Score = 36.9 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 68/239 (28%), Gaps = 27/239 (11%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ G+ ++ A+ +PV+L + +D+ QA++ +L Sbjct: 19 RFLRQTQGNVAMMFAMALPVLLMITLGAIDI----------HQASKVKAQLQDALDAAAL 68 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 S + K + ++ D + A V Sbjct: 69 AAARSTFTDDVNINKVGLAALKAN------MPSYFGEASGDTASFVLLNNRVTGEATVNV 122 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +++L P L + + +E + S + E +D + SM + Sbjct: 123 KVLVANVVLPPYGKLLDDF------LPVSSRSEVLRASRNVEVA----MALDITGSMDNC 172 Query: 193 QRDS-EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 R+ L A V L PY N + GPLD Sbjct: 173 SRNCPPTSKLEDLQAAAKELVDIVVQDQQTPFYSKVALIPYAAGVNVGSSAISARGPLD 231 >gi|126730251|ref|ZP_01746062.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] gi|126708984|gb|EBA08039.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] Length = 614 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 54/385 (14%), Positives = 111/385 (28%), Gaps = 66/385 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +G + VM+ GG+ +D++ ++ A++ A+ Sbjct: 25 RFWADTSGSMSYVALAGSLVMMVFGGIGIDMMHAELKRSQVQNTLDRAVLAAAN------ 78 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ P+ +E+Y L+ D + D + V S Sbjct: 79 -------LSNTRDPQTVVEDYFR---AMKLEDTLGDVQTGDSLGAKRVRAEGNGS----- 123 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +N S FL +G+ + A AE + + I V+D S SM Sbjct: 124 --------IN--SHFLGLIGVDQLDVYGAATAENAT------APLEISLVLDVSGSMQGQ 167 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + F S + + ++ + + Sbjct: 168 KIRDLKEAAKAFVDAVLGEGGDNSRVTVSLIPYN-ATVNLGDDLSERFNLDRWQNYSSCA 226 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + E +S S+ + + + D + + + + Sbjct: 227 IFESSDYNSLSIDP------------NAGLEQLAHFDPYDYSGNSPDLTAPWCAEGN-NL 273 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT--------AYDTIISSNEDEVHRMKNN 364 + H + G+TAI+ M+ A I D + Sbjct: 274 AIVPHSSDADYLSDVIDSFEAQGNTAIDLGMKWGLALLDPAARPVIGDMQADGLVPSSAR 333 Query: 365 LEAK-------KYIVLLTDGENTQD 382 K++V++TDGENTQ+ Sbjct: 334 YRPSDYGTQTMKFVVVMTDGENTQE 358 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 42/292 (14%), Positives = 86/292 (29%), Gaps = 28/292 (9%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS-----CNKSLYY 242 S Y+ G F + + LS V Y Sbjct: 330 SSARYRPSDYGTQTMKFVVVMTDGENTQEYNLKPWMLNPNALSDVWVDDHGTPGKGDDRY 389 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL-VRDALASVIRSIKKIDNVNDTVRMGATF 301 + ++ + +S + + V A ++ + +D ++T + + Sbjct: 390 SIRVKDNYGDSNDVFYWPHASRNNYRNGPYSWVTRTAAQMVNGVAVVDGDSETTKAKCSS 449 Query: 302 F---NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + + + T+ + ++ + + + + Sbjct: 450 YKGAGHNAGQETLIENVLGMDYGTLDLDGDGIAGANDDCSNYPPVRLTWQELFGNVKTTY 509 Query: 359 H------------RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + R N Y T + +Q N IC KAK Q + I TI Sbjct: 510 YANAWYWQAYMDGRASYNDYYNAYYSWETTVDASQANTNLATICAKAKQQDVTIFTIGVE 569 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + + NCAS S + S+++L FR I + + E +R+T+ Sbjct: 570 A----PQAGLNAMRNCASSASHYYNVSSNQLVDTFRS-ISDVVVE--LRLTE 614 >gi|109009638|ref|XP_001105446.1| PREDICTED: epithelial chloride channel protein-like [Macaca mulatta] Length = 829 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 59/163 (36%), Gaps = 23/163 (14%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ + + + T + + G T+I ++ + I SN+ Sbjct: 348 GMVTFDSSAEIQNNLT-KIIDENTYQKITANLPQKPSGGTSICGGLKAGFQAISQSNQST 406 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 I+LLTDGE+ Q + C + K G I TIA + ++ Sbjct: 407 SGSE---------IILLTDGEDNQMSS-----CFEEVKQSGAIIHTIALGPSA---DREL 449 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNE---IFERVIRI 456 LSN F+ + L F RI + I ++ +++ Sbjct: 450 ETLSNMTRGRRFYAHKDINGLIDAF-SRISSRSGNISQQAVQL 491 >gi|296474801|gb|DAA16916.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] Length = 916 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 54/160 (33%), Gaps = 10/160 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S + +K R+AL ++ + D + Sbjct: 257 NGYFVHYFAPDSLSTIPKNVIFVIDKSGSMMGRKIKQTREALIKILDDLSPHDQFD---- 312 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ + K++A G T INDAM A + +N++ Sbjct: 313 --LISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQGGTNINDAMLMAVQLLEKANQE 370 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 E+ + I+LLTDG+ T + I + Sbjct: 371 ELLPEGSITL----IILLTDGDPTVGETNPLNIQKNVRKA 406 >gi|261251589|ref|ZP_05944163.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] gi|260938462|gb|EEX94450.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] Length = 396 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 61/425 (14%), Positives = 129/425 (30%), Gaps = 75/425 (17%) Query: 27 ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFP 86 A+L+P+++ +V + ++ A A + Sbjct: 20 AMLIPMIIAAASTIVIGYQVQLSNRGMQ-ATDAASLAC---------------------- 56 Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 ++ + + + D I + + ++ +S Y L SL Sbjct: 57 ------EFSGEYDGTMAQGYLDYYRPKIDKVSG-QIGTHSGC---NVSLSYSLSTIFTSL 106 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 L + T E V+ E + + V+D S SM D + Sbjct: 107 TLSDASF--VVSSTANEKAYVTEDVASE-PLELILVLDISGSMASDLDDLKAILKRGLAS 163 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 ++ + S + KV I + P L+E + Sbjct: 164 LKEQQNNALSKDHIKVSIVP--------------FSDGVSVNNAPWLNETGTFCVEGITE 209 Query: 267 VIKKKHLVRDALASVI----RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 K I +K + A + ++++ Sbjct: 210 SGGKFSAAHTVANLDITHDQTPVKTFQPDKWLMDCSAMS------VTLPLTADLNQVTNA 263 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNED--EVHRMKNNLEAKKYIVLLTDGENT 380 + G TA + + + + EV +N + ++ +VL+TDG + Sbjct: 264 VDSLR-----TEGGTASYQGLIWGLRQLTPNWQKAWEVGPNRNFDKVERKLVLMTDGADY 318 Query: 381 QDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHE 436 + + + +C++AK G+ + + F V + E+ + CA + F A++T E Sbjct: 319 GSHFDELINAGLCDRAKDYGVALNFVGFGVYGARLEQ----FTRCAGDANGVFSASNTQE 374 Query: 437 LNKIF 441 L+ F Sbjct: 375 LDSYF 379 >gi|197336748|ref|YP_002158568.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197314000|gb|ACH63449.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 350 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 74/198 (37%), Gaps = 16/198 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S +S I + V+ L ++ +K D R+G F D Sbjct: 108 DLSGSMAEKDFTSIDGIKISRLDAVKKVLNDFAKT-RKGD------RLGLILFGDAAFVQ 160 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + + +T E ST + DA+ TI E++ + + KK Sbjct: 161 TPFTADHEVWLDLLNQTRV--EMAGKSTHLGDAIGL---TIKRFEENDNSQPLSTTSRKK 215 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNC--ASP 425 ++LTDG +T + AK +GIRI IA +Q ++ AS Sbjct: 216 VAIILTDGNDTDSYVPPMDAAKVAKVKGIRIHMIAIGDPQTVGEQALDMDTINTIADASG 275 Query: 426 NSFFEANSTHELNKIFRD 443 F+A + EL + + Sbjct: 276 GQAFQALNQDELINAYAE 293 >gi|282863310|ref|ZP_06272369.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282561645|gb|EFB67188.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 624 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 59/177 (33%), Gaps = 27/177 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS---DPSFSWGVHKLIRTIVK 325 + R A+ +V+ ++ G + D V L R VK Sbjct: 48 TRMESARRAVGAVVDALPDGYP------TGLRVYGADRPQGCADTRLVRPVRPLDRAAVK 101 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T I +++ A + + + + + IVL++DGE+T Sbjct: 102 SAVAGVRPTGDTPIGLSLRKAAEDLPAPRDGAAR--------TRTIVLVSDGEDTCGTPP 153 Query: 386 GIAICNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 C A G+RI T+ F V +E+ A +++A L Sbjct: 154 P---CEVAARLAGQGAGLRIDTVGFQVKGAAREQLECVAE--AGNGRYYDAPDADAL 205 >gi|296120496|ref|YP_003628274.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296012836|gb|ADG66075.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 396 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 61/448 (13%), Positives = 119/448 (26%), Gaps = 77/448 (17%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G I+ A +M +L + G + + L+ A A +A + L+++ Sbjct: 16 HPRRGAIAILAAFVMVALLALAGFFLSLSYVELTRAELRAATDAAARSAVIRLVETQSTT 75 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 S RA + R + D DI + + ++ + + Sbjct: 76 SGRAAA---------RDIASRFEVGGKALSLND---NDIQFGRSTRQSNGSYSFAINGTP 123 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE-HGVSIQW------VIDFSRS 188 + + G ++ + Y E + V+++ V+D S S Sbjct: 124 TNAARVFGRKTKTSAAG----PVELPFGGFVGAPEYSTELNAVAMRLDYDIVIVLDRSGS 179 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M G + + + Sbjct: 180 M---------------------------------GWDLSGVEFEYPEAVRQRPLVENYFS 206 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + S+S+ + + + A V G F Sbjct: 207 PPDPTGSRWAILSASVNDFLTILNQRQVAAR-----------VGLVTYAGDYTFGKYSSV 255 Query: 309 DPSFSWGVHKLIRTIVKTFAI--DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + TI +G T I + A + +S++ R+K Sbjct: 256 KLTVESDLTSTFSTITSKLTAIGQVPLIGGTDIGAGITAAQTMLTTSSQA---RLKTGQP 312 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I++ +DG Q E + I ++ F + + A Sbjct: 313 ---IIIVFSDGMFNQGTEPVSLAASAYSQSSTIIHSVTFGATAQGRATMNSV-TATAGKG 368 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI 454 AN+ EL + FR I N I V Sbjct: 369 LSLHANTAAELAESFRS-IANAIPIVVT 395 >gi|75832116|ref|NP_001015590.2| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] gi|122140331|sp|Q3T052|ITIH4_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; Flags: Precursor gi|74267794|gb|AAI02562.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 54/160 (33%), Gaps = 10/160 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S + +K R+AL ++ + D + Sbjct: 257 NGYFVHYFAPDSLSTIPKNVIFVIDKSGSMMGRKIKQTREALIKILDDLSPHDQFD---- 312 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ + K++A G T INDAM A + +N++ Sbjct: 313 --LISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQGGTNINDAMLMAVQLLEKANQE 370 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 E+ + I+LLTDG+ T + I + Sbjct: 371 ELLPEGSITL----IILLTDGDPTVGETNPLNIQKNVRKA 406 >gi|332982109|ref|YP_004463550.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] gi|332699787|gb|AEE96728.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] Length = 948 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 69/446 (15%), Positives = 136/446 (30%), Gaps = 57/446 (12%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+++ G+ A+ + L G+ VD V + + +Q Q + L + Sbjct: 152 KRIVLMTDGNQNTGDAVKRAMALAQQGVRVDAV---FLDSMPEQEVQITSLDIPSELYEG 208 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 S A +S + Y R E R I +D A + ++ Sbjct: 209 QSYDISVAIDSTVNASAVLRLYADRQLIGQQDVQIQKGENRFIFKDKADTSGIKTYEAEL 268 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV-------------------SRSYH 172 L N + ++ G+ + I E E + Sbjct: 269 QTDKDGVLQNNQMDAYVNIKGVPTVGIVEGQEGEGREIIKILEAADIKTTLFTPHTLPSD 328 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 E Q +I S D + + + S + +G Sbjct: 329 LEELRKFQALILCDVSFDDIGEERMPAIDSFVKVLGRGMLVSGGDNSYMLGGYMGTQLEK 388 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 M+ + L L L + + ++ I K + ++A +++ D+V Sbjct: 389 MLPVDMDLSKKADIPSLGLVLVIDKSGSMTDGQYGITKLEMAKEAAIRSTEALRPTDSV- 447 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE---NEMGSTAINDAMQTAYDT 349 G F+D SW V + G T + A+ AY Sbjct: 448 -----GVICFDDAA------SWVVGMRQADDLAEIQDSIGTIRPGGGTNMYPALDLAYKA 496 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + ++ H I++LTDG++ + +GIA ++ GI + ++A ++ Sbjct: 497 LEEADTKLKH-----------IIVLTDGQSATGDFDGIA--HRMAEDGITLSSVAVGMDA 543 Query: 410 TQQEKARYFLSNCAS--PNSFFEANS 433 + LS A ++ + Sbjct: 544 D-----KNLLSRLAEIGNGRYYYTDE 564 >gi|300113557|ref|YP_003760132.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] gi|299539494|gb|ADJ27811.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] Length = 345 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 78/207 (37%), Gaps = 28/207 (13%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 ++ F + + ++ + + V++ L ++ ++ D R+G F D FS Sbjct: 105 KQKDFTNQAGVQ--VDRLSAVKEVLGEFLQR-REGD------RVGLVVFGDAAYLQAPFS 155 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + R + + TA DA+ + + S +E K I+ Sbjct: 156 TDLQLSRRLLDECEVGMAGPR--TAFGDAIGLGVN-LFSESEA----------PAKTIIA 202 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFF 429 LTDG +T+ + A + IRI T+A + + + L A + S+F Sbjct: 203 LTDGNDTKSQVPSVEAARVAARREIRIHTVAIGDPTTAGEDKLDQQALREVAAETGGSYF 262 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRI 456 A L I+ +EI R I++ Sbjct: 263 FAADRASLAGIYDQL--DEIETRKIKM 287 >gi|329664002|ref|NP_001193105.1| calcium-activated chloride channel regulator 1 [Bos taurus] Length = 909 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 40/280 (14%), Positives = 88/280 (31%), Gaps = 39/280 (13%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 +D + + Q D V+ + +N +E+ + Sbjct: 228 HKDQREKASIMYSQSIDTVVEFCTEKNHNREAPNEQNKKCNHRSTWEVIQDSEDFKTTTP 287 Query: 253 LSEEHFVDSSSLRHVIKKKHLV------RDALASVIRSIKKI------DNVNDTVRMGAT 300 ++ + + SL + ++ + + + ++ + + V +G Sbjct: 288 MTTQPPSPTFSLLQIGQRIVCLVLDKSGSMTIGNRLKRLNQAGKLFLLQTVEQGSWVGMV 347 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ ++ T ++ G T+I +++A+ I + Sbjct: 348 TFDSAAYVQSELV-QINSATERDTLTKSLPTTASGGTSICSGLRSAFTVIKKKYPTDGAE 406 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ----EKA 415 I+LLTDGE+ I+ C ++ K G I T+A + Q+ K Sbjct: 407 ----------IILLTDGEDN-----TISACFDEVKQSGAIIHTVALGPSAAQELEQMSKM 451 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 L AS + + + L F GN+ + Sbjct: 452 TGGLQTYASD----QVQN-NGLVDAFAALSSGNKAVSQRS 486 >gi|56477526|ref|YP_159115.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] gi|56313569|emb|CAI08214.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] Length = 441 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 46/344 (13%), Positives = 92/344 (26%), Gaps = 29/344 (8%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 S G IITAL + V++G G+ +D + L+ A + AS L S Sbjct: 10 SQRGVVAIITALSLVVLVGFAGLALDGGHLYLTKTELQNGADACALAASYELTGSPISPE 69 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT-AVEMNPRKSAYQVVLSS 135 + FT + + N + ++ + A P A Sbjct: 70 N-----FTRAENAGKTVGTENRVDFQGGAIAAADIDVTFSTSLAGSWLPAGGATGNSKYV 124 Query: 136 RYDLLLNPLSL-FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 R + N ++ F++ MG + A A + + S S + Sbjct: 125 RCTITRNGIAPWFMQVMGFGDQTVSAIATATLAP--SQNNCAIPMGLCTHPSSSAPHFGY 182 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN---------KSLYYMLY 245 F + T+++ + V +P + L Sbjct: 183 VKGDWYSMNFKESGGGTMENLTGDFRWVDFDPSTTTPNCSGKGAQELSCLFEGAGQCNLP 242 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI----------RSIKKIDNVNDTV 295 P + + + +K + A + +K + Sbjct: 243 PNGPSTCPTSGNSTPTPGCVGDAGQKTSIGQAFNTRFGICQGSACTDGELKSAPPDFTGL 302 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTA 338 T ++ + S G + + +E+ T Sbjct: 303 AYNTTTWSLGRDAYKGSSGGTPNFVAARSSHLQVQSSELPSGTN 346 >gi|255570576|ref|XP_002526245.1| protein binding protein, putative [Ricinus communis] gi|223534439|gb|EEF36142.1| protein binding protein, putative [Ricinus communis] Length = 540 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 77/254 (30%), Gaps = 29/254 (11%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M D + + P S N + +E M+ Sbjct: 40 MSSNDDDEKIVTRSRPTPPIVPARVKLRSINNDMAPLEESKLKVMLELTGGDSSSYGRPG 99 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 LD + S K ++ A+ +I+ + D R+ F+ Sbjct: 100 LDLVAVLDV-----SRSMEGDKMEKMKTAMLFIIKKLGPTD------RLSIVTFSGGANR 148 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 K N G+T I +QTA + + + + Sbjct: 149 LCPLRQTTGKSQEEFENLI-NGLNADGATNITAGLQTALKVLKGRSFNGERVVG------ 201 Query: 369 KYIVLLTDGENTQDNEEG-IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L++DGE ++ +++ N + I T F +N + K +++ + + Sbjct: 202 --IMLMSDGEQNAGSDATGVSVGN------VPIHTFGFGIN--HEPKGLKAIAHNSIGGT 251 Query: 428 FFEANSTHELNKIF 441 F + + L K F Sbjct: 252 FSDVQNIDSLTKAF 265 >gi|149200157|ref|ZP_01877181.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] gi|149136798|gb|EDM25227.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] Length = 348 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 76/231 (32%), Gaps = 41/231 (17%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + S + S Y P D +++ ++ + + + + H + ++A I +K D Sbjct: 90 VFSLDISGSMSSYDQPEDLAVNRRVIAEAINNKELHPRLHYAKKSIADFIDK-RKSD--- 145 Query: 293 DTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G F S + + ++ I + D N T I A+ Sbjct: 146 ---RLGLVVFGAEAYSVCPPTNDHEYLQNRLKEISTEYLGDYNR--QTNITAAIS----- 195 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENT-QDNEEGIAICNKAKSQGIRIMTIAFS-- 406 + R++ + KK I+L+TDG +T N A I TI Sbjct: 196 ------GGLARLRKSKAPKKIIILVTDGSHTANSNLTPRMAAKAAAKSDAVIYTIGVGNE 249 Query: 407 -------------VNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 +N + + L A + +F ++ + + Sbjct: 250 VAWNVENFFGSSRLNASNSDFDEELLKEIAEKTGGLYFSVREAEQMKDVLK 300 >gi|167759260|ref|ZP_02431387.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] gi|167663134|gb|EDS07264.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] Length = 800 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 59/374 (15%), Positives = 105/374 (28%), Gaps = 40/374 (10%) Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG-IKSW 156 F N D + V V + + D +LN G K+ Sbjct: 38 FAENQSNKIPDTVKAEEVSAETVPGETSGTE---SSTDGTDEILNTEGSPENGSGNGKTA 94 Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 ++AE G++ Q V + + Sbjct: 95 DTGAVSDAEVTPEKEAATEGITEQDVPEAENVGAGNPVAVHLSRGAASEAAPEHQKYIKK 154 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + + Y K + +L S++ + D + V Sbjct: 155 NAENDYTLTLNVKGMYDSETTKPMIDVLLIVDKSGSMNWKMDTDKVGKPSRMDVLKQVVT 214 Query: 277 ALASVIRSI---KKIDNVNDTV-RMGATFFNDRVISDPSF--SWGVHKLIRTIVKTFAID 330 + SI +ID V G+ F D+ D W + V + Sbjct: 215 GTGGLTDSIFGNTQIDAQMAVVTYSGSNDFLDQRYDDAEIIQEW---TKQKDTVNNAVNN 271 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---------- 380 G T ++T + S E+ KK+++ L+DG+ T Sbjct: 272 IQAKGGTNCEAGLRTGATALEGSRENA----------KKFVIFLSDGDATFYYGDDGYTK 321 Query: 381 ------QDNEEGIAICNKAKSQGIR-IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 AI K G+ TI + + + + ++ AS F+ AN+ Sbjct: 322 GPGSGSSPTAREKAIAQVQKITGLEGFYTIGMTSSSSSEFLTNLANNSKASEKRFYPANN 381 Query: 434 THELNKIFRDRIGN 447 T L K F++ +G Sbjct: 382 TEALEKAFQEIVGE 395 >gi|27379052|ref|NP_770581.1| hypothetical protein blr3941 [Bradyrhizobium japonicum USDA 110] gi|27352202|dbj|BAC49206.1| blr3941 [Bradyrhizobium japonicum USDA 110] Length = 472 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 65/498 (13%), Positives = 138/498 (27%), Gaps = 91/498 (18%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + + II AL+M + + GM +D L AA A I A P + + Sbjct: 11 RFARDRKANVAIIFALMMVPTIFLLGMALDYTLALRKREQLNAAADAAAIAAVRPAMLTQ 70 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + K N T V+ A + Sbjct: 71 SDTTVVKATAEAVF--------------AAKANLPGLSAVPTPTVTIVD---SGLARTIT 113 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +S N F +G ++W + A + + ++ ++D S SM Sbjct: 114 VSYTAQSTNN----FPGVLGKQTWQV-----AGSATARASSAPNMNFYLLMDDSPSMGIG 164 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ L + PA Y S G + + + + + Sbjct: 165 ATTTDISNLIKYTAPA------YQSAGGSQNCGFACHETNIAHDGGTKDNLAIARQRNIT 218 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISD 309 L + V S+ + + + + ++ + N + N Sbjct: 219 LRID-LVTSAVNQLLNSWSNCPQSGVSGGVMQCMSALNNTTYKAALYTFDLGLNALATLT 277 Query: 310 PSFSWGVH--------KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 S G + V + + G T I A+++ D + + + Sbjct: 278 TPTSAGTQVSNIALMPVAYQNCVVVTTNCKTDNG-TDIAGALKSLNDVMPTPG---LGSN 333 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGI-------------------AICNKAKSQGIRI-- 400 + ++ + L+TDG + IC K++GI+I Sbjct: 334 ASGDTPQEVVFLVTDGVEDKIVSGASTCPNASLASNNRCQQPLDTTICTTIKNRGIKIAV 393 Query: 401 -MT-----------------IAFSVNKTQQEKARYF----LSNCASPNSFFEANSTHELN 438 T +++ + L +CASP + + +++ Sbjct: 394 LYTEYLQLKTPNIPVTNSWYMSWVDPYNEPTSLTGTIAQKLQSCASPGFYASVQTGGDIS 453 Query: 439 KIFRDRIGNEIFERVIRI 456 + + Sbjct: 454 DALTNLFIKVASSTASLV 471 >gi|218671335|ref|ZP_03521005.1| hypothetical protein RetlG_06538 [Rhizobium etli GR56] Length = 49 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 2/49 (4%) Query: 410 TQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 E + L CAS + +F+A +L F+ IG + +V R+T Sbjct: 1 MAPEGGQALLQYCASDASHYFQAEKMEDLFAAFK-AIGAKASTQVTRLT 48 >gi|118355467|ref|XP_001010993.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292760|gb|EAR90748.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2033 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 75/246 (30%), Gaps = 28/246 (11%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 K + N K +S + + P+L +D+S Sbjct: 1558 DVKTLQKHFQFSNSKDQTIPIMISVKTLEQTSDMEIESNLLEGRPNLDLICVIDNSGSMS 1617 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K V++ + +I + D R+ FN V+K + ++ Sbjct: 1618 G-QKIENVKNTILQLIDMLNDND------RLSIITFNSHAQQLCGL-RKVNKDNKENLQK 1669 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I +QTA+ + S + I LL+DG++ + Sbjct: 1670 ITKSIYANGGTNITSGLQTAFSILQSRKQRNSVSS---------IFLLSDGQDNNSDSRI 1720 Query: 387 IAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKI 440 + + + I + F + + A SF+ ++++ Sbjct: 1721 RNLLQTTYQQLQEECFTIHSFGFGNDHDGP-----LMQRIAQIKDGSFYYVERNDQVDEF 1775 Query: 441 FRDRIG 446 F D +G Sbjct: 1776 FIDALG 1781 >gi|293360567|ref|XP_216941.5| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 900 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/247 (17%), Positives = 82/247 (33%), Gaps = 23/247 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S Q+G I + Y+ C++ Sbjct: 552 EGFRLAEDGKRCRRKDVCKSIQHGCEHICVNNGNSYLCRCSEGFVLAEDGRHCKRCTEGP 611 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ + R+G ++ +V ++ + Sbjct: 612 IDLVFVIDGSKSLGEENFEIVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTLK 668 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K A + + A++ Y+ + E R + + IV Sbjct: 669 -GFSSA-KDMKKAVAHMKYMGKGSMTGLALKHMYERSFTQVEGA--RPLSTRVPRAAIVF 724 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ KAK+ GI + + ++ L AS F Sbjct: 725 -TDG---RAQDDVSEWARKAKANGITMYAVGIGKAIEEE------LQEIASEPIDKHLFY 774 Query: 431 ANSTHEL 437 A + Sbjct: 775 AEDFSTM 781 >gi|293348660|ref|XP_001058523.2| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 922 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/247 (17%), Positives = 82/247 (33%), Gaps = 23/247 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S Q+G I + Y+ C++ Sbjct: 580 EGFRLAEDGKRCRRKDVCKSIQHGCEHICVNNGNSYLCRCSEGFVLAEDGRHCKRCTEGP 639 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ + R+G ++ +V ++ + Sbjct: 640 IDLVFVIDGSKSLGEENFEIVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTLK 696 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K A + + A++ Y+ + E R + + IV Sbjct: 697 -GFSSA-KDMKKAVAHMKYMGKGSMTGLALKHMYERSFTQVEGA--RPLSTRVPRAAIVF 752 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ KAK+ GI + + ++ L AS F Sbjct: 753 -TDG---RAQDDVSEWARKAKANGITMYAVGIGKAIEEE------LQEIASEPIDKHLFY 802 Query: 431 ANSTHEL 437 A + Sbjct: 803 AEDFSTM 809 >gi|149066552|gb|EDM16425.1| matrilin 2 (predicted) [Rattus norvegicus] Length = 898 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/247 (17%), Positives = 82/247 (33%), Gaps = 23/247 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S Q+G I + Y+ C++ Sbjct: 552 EGFRLAEDGKRCRRKDVCKSIQHGCEHICVNNGNSYLCRCSEGFVLAEDGRHCKRCTEGP 611 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ + R+G ++ +V ++ + Sbjct: 612 IDLVFVIDGSKSLGEENFEIVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTLK 668 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K A + + A++ Y+ + E R + + IV Sbjct: 669 -GFSSA-KDMKKAVAHMKYMGKGSMTGLALKHMYERSFTQVEGA--RPLSTRVPRAAIVF 724 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ KAK+ GI + + ++ L AS F Sbjct: 725 -TDG---RAQDDVSEWARKAKANGITMYAVGIGKAIEEE------LQEIASEPIDKHLFY 774 Query: 431 ANSTHEL 437 A + Sbjct: 775 AEDFSTM 781 >gi|291228410|ref|XP_002734180.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 945 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/257 (15%), Positives = 82/257 (31%), Gaps = 31/257 (12%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG-----KVGIRDEKLSPYMVSCNKSLY-- 241 + D + N S+S+ + GI + + C + Sbjct: 113 IADNADPNPRFLSNEVIDVMKENYNSFSNLKWQYFGSEEGIFTIYPASLIDDCANYDHRF 172 Query: 242 --YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +P S S+ ++ K L+ A+ + I + ++ + G Sbjct: 173 RPWYVEAATPEPKNVVIVIDTSGSMANLHSGKSLINIAIDAAITVLDTMNPNDKV---GV 229 Query: 300 TFFNDRVISDP----------SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 F+D + P + I+ + K F + G T A A++ Sbjct: 230 IAFSDELKLPPKIGDASCYANELALATTINIQNL-KQFVLSLVARGGTHYGKAFDAAFNL 288 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFS 406 + S + + +E + I+ LTDGE D I N+ + I+T Sbjct: 289 LKESYTLDADNERGKIERDQVIIFLTDGEPLDDKTSIMRKIRSNNEEMENKVTILTFGLG 348 Query: 407 VNKTQQEKARYFLSNCA 423 ++ FL + A Sbjct: 349 LDS-----GINFLEDIA 360 >gi|126341670|ref|XP_001379945.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2439 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/133 (23%), Positives = 50/133 (37%), Gaps = 15/133 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D VR+G FND + +++ G T A+ Sbjct: 51 RDQVRVGLAQFNDNIYKAFLL--NQFPRKSDVLEQILSLPYRTGGTRTGSALNFLRTEFF 108 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + E R K+N+ + ++L+TDGE N+E +K K QG+ I + +V Q Sbjct: 109 T--ESAGSRAKDNVP--QIVILVTDGE---SNDEVAEAASKLKGQGVSIYVVGINVQDVQ 161 Query: 412 QEKARYFLSNCAS 424 + L AS Sbjct: 162 E------LKTIAS 168 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 52/167 (31%), Gaps = 19/167 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I D VR G ++D + H + + G T Sbjct: 646 MIELISTFRVGADHVRFGVVQYSDSPTVEFDIR--QHSSVAQLKSAITKIWQTGGGTRTG 703 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 +A+ D+ ++++++TDG Q ++ + + + I I Sbjct: 704 EALTFMKRLFSEVARDK---------VLRFLIVITDG---QSQDQVAQAAEELRQENITI 751 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 I T + L S N F N L I ++ I + Sbjct: 752 YAIGVKSAVT-----KELLEISGSQNRMFFVNDFDSLKPIQQEVIQD 793 >gi|296227520|ref|XP_002759384.1| PREDICTED: matrilin-2 [Callithrix jacchus] Length = 973 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 91/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ + Sbjct: 594 EGFWLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRQCERCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRVAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTDKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|13529371|gb|AAH05429.1| Matn2 protein [Mus musculus] Length = 956 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 92/268 (34%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+Q+G + + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKNVCKSTQHGCEHMCVNNGNSYLCRCSEGFVLAEDGKHCKRCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + V+ + +I S+ + R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFETVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTL- 709 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K A + + A++ ++ + E R + + IV Sbjct: 710 RGFSSA-KEMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPPSTQVPRVAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPIDKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ ++ I + + R Sbjct: 817 AEDFSTMGEISEKLKEGICEALEDSGGR 844 >gi|90420284|ref|ZP_01228192.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90335618|gb|EAS49368.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 593 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 62/182 (34%), Gaps = 24/182 (13%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + K+ + +G+ ++ L +P++ G VD+ + L+ + A + A Sbjct: 11 FRKRFWTAKSGNVAVVFGLTLPILACCMGAAVDISGIYASKRNLQHSVDIAALAAGREYS 70 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + ++ F + + TD I + Sbjct: 71 NNQQDSHLSKVAEGYFFENAGADARAN----------TDFSYDGIFN------EDGSTVL 114 Query: 130 QVVLSSRYDLLLNPLSLFLRS--MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 QV + R+ + L F+ + + +++ + +++ + + + V+D S Sbjct: 115 QVSAARRHPTIFGDLLSFVTAGELDWRAFPLAARSQ------IVVQNQSIELVMVLDNSG 168 Query: 188 SM 189 SM Sbjct: 169 SM 170 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 59/200 (29%), Gaps = 69/200 (34%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + + + G T I + + + + + R + + K +VL+TDG+N Sbjct: 395 QATINAAINAMDADGETNIPEGIAWGWRLLSAREPFTQGRANDAKDNLKVLVLMTDGDNN 454 Query: 381 QDNE---------------------------------------------------EGIAI 389 ++ + AI Sbjct: 455 YGSDENDYNESGYGTFGYASTYDAYGNHSWGRIFDDTSTTSKRANRSSFVSAMNEKVAAI 514 Query: 390 CNKAKSQG--------IRIMTIAFSVNKTQQEKARYFLSNCASPN-------SFFEANST 434 C K G I I TIAF +N + + CAS +++A S+ Sbjct: 515 CQNIKDDGRKATGEDGIVIFTIAFDLNDGS--SVKKLMEQCASYGITDPTKKLYYDAKSS 572 Query: 435 HELNKIFRDRIGNEIFERVI 454 +L F D I ++ I Sbjct: 573 SDLMAAF-DSITEQVSSLRI 591 >gi|256376278|ref|YP_003099938.1| hypothetical protein Amir_2147 [Actinosynnema mirum DSM 43827] gi|255920581|gb|ACU36092.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 321 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 42/192 (21%), Positives = 68/192 (35%), Gaps = 26/192 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A S + N +G F + + R+ V Sbjct: 109 TRLEAAQVAAKSFAEGLTPGIN------LGLISFAGSATVLVAPT-----TDRSAVSQGI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 STA DA+ A I S + + + +VL+TDG+ T + Sbjct: 158 DGLKLAQSTATGDAIVAALSAIDSFGKVV--GGADGPPPAR-VVLMTDGKETVGTRKATD 214 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQE---KARYF------LSNCA--SPNSFFEANSTHEL 437 AK GI I TI+F + + KA+ + A S FF+A S EL Sbjct: 215 AAGDAKEAGIPISTISFGTERGSVDINGKAQEVPVDDESMKEIAKISGGEFFKAASAEEL 274 Query: 438 NKIFRDRIGNEI 449 +++ D +G +I Sbjct: 275 RRVY-DTLGEQI 285 >gi|11125762|gb|AAC51260.2| matrilin-2 precursor [Homo sapiens] Length = 956 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 42/267 (15%), Positives = 89/267 (33%), Gaps = 26/267 (9%) Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS--- 254 G L G+ R S+ +G I + Y+ C++ Sbjct: 595 GFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPI 654 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 655 DLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRN 711 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + K A + + A++ ++ + E R + + IV Sbjct: 712 --FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPFSTRVPRAAIVF- 766 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEA 431 TDG + ++ +KAK+ GI + + ++ L AS F A Sbjct: 767 TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFYA 817 Query: 432 NS---THELNKIFRDRIGNEIFERVIR 455 E+++ + I + + R Sbjct: 818 EDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|294139879|ref|YP_003555857.1| hypothetical protein SVI_1108 [Shewanella violacea DSS12] gi|293326348|dbj|BAJ01079.1| hypothetical protein [Shewanella violacea DSS12] Length = 405 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 57/445 (12%), Positives = 139/445 (31%), Gaps = 66/445 (14%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G ++ + +P +L + + + + + QA+ A + Sbjct: 12 GDISLMFVICLPFILTMIAVSILLAMYLLTVTRAGQASDAASLACG-------------- 57 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 Q+ ++ L+ + + F + +V ++ Y+ Sbjct: 58 ------YSQRADQDLLVGILDYYRPGFVVHDGEALVSIDGKNRCSIEATYR--------- 102 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW--VIDFSRSMLDYQRDSE 197 NP + L ++ + ++ S + + V+D S SM Q Sbjct: 103 -FNPTMMALLPESART-HVSLSSDTGATSHLVINSTPLPMDLALVLDISSSM-SAQLPQL 159 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 +N + + +VG L P+ + + + Sbjct: 160 KLIINGALEEIRQQ------DPNEVGGVRFSLVPFETGVGVLNAPWMPKSAAKVTCVDGL 213 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 S+ + L A I+S+ ++ A + + + ++ Sbjct: 214 SYGQHSVDYARTVDDLAEPAANLNIKSVFASQWLD------ACSMDATI---LPLTQDLN 264 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR--MKNNLEAKKYIVLLT 375 +VK G+T+ + T++ ++E +++ ++ +VL T Sbjct: 265 -----LVKQRVDALVTSGTTSSYQGLIWGVRTLLPQWQEEWQIPPVESPALIQR-LVLFT 318 Query: 376 DGENTQ---DNEEGIAICNKAKSQG-IRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFE 430 DG + D+ +C + + I + I F V+ ++ CA ++ Sbjct: 319 DGADQGFHLDDLIEQGLCRVIQDKHHIEMSFIGFGVS----DRRLQQFRECAGDKGKVYD 374 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A +T EL FR+ + + ++ Sbjct: 375 AQNTQELEAFFREALQTDTKASLVL 399 >gi|294055226|ref|YP_003548884.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614559|gb|ADE54714.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 330 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 63/178 (35%), Gaps = 24/178 (13%) Query: 276 DALASVIRSIKKIDNV-NDTV------RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 DA I + V N+ V R+G F + F+ + + ++ Sbjct: 113 DASGEQIDRLSAAKGVLNEFVAGREGDRLGLIVFGNAAYLQAPFTDDHETWLALLDESIV 172 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TA+ D++ A H + + + +++LTDG +T + Sbjct: 173 NMAGPS--TALGDSIGLA----------IAH-FRQSKTENRVLIVLTDGNDTGSRVPPLD 219 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFR 442 AK +G+ I T+A T E+A L A + F A+ L + ++ Sbjct: 220 AAEVAKVEGVTIYTVAVGDPTTVGEEALDMETLETVARLTGGDSFVASDLVALRETYQ 277 >gi|311253578|ref|XP_001926459.2| PREDICTED: matrilin-2 [Sus scrofa] Length = 707 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 42/272 (15%), Positives = 90/272 (33%), Gaps = 28/272 (10%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S R EG L G+ R S+ +G + + + Y+ C++ Sbjct: 327 SGDESYTCRCLEGFRLAEDGKHCRRKDVCKSTHHGCEHMCVNRGNSYICKCSEGFILAED 386 Query: 246 PGPLDPSLS---EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + FV S + +V+ + +I S+ I R+G + Sbjct: 387 GRRCKRCTEGPVDLVFVIDGSKSLGEENFEIVKQFVTGIIDSL-TISPKA--ARVGLLQY 443 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + +V ++ + + + K A + + A++ ++ + E R Sbjct: 444 STQVRTEFTLRN--FGSAKDMKKAVASMKYMGKGSMTGLALKHMFERSFTQIEGA--RPL 499 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + + IV TDG + ++ +KAK+ GI + + ++ L Sbjct: 500 SARVPRVAIVF-TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEI 549 Query: 423 AS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 AS F A + I +++ + Sbjct: 550 ASEPTDKHLFYAEDFSTM-----GEISDKLQK 576 >gi|242097658|emb|CAY86115.1| truncated collagen type VI alpha 4 precursor [Homo sapiens] Length = 385 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 70/221 (31%), Gaps = 11/221 (4%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + +L + + S G+ + C+ Sbjct: 156 ALRKAGILLYAIGVRDAVWAELREIASSPQENFTSFVPNFSGLSNLAQKLRQELCDTLAK 215 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + P+ E D L + + I +D +D VR+G Sbjct: 216 AAPHVDHVSPACREAALADIVFLVDSSTSIGPQNFQKVKNFLYSVILGLDISSDRVRVGL 275 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +ND + H L I++ G T A++ + E+ Sbjct: 276 AQYNDNIYPAFQL--NQHPLKSMILEQIQNLPYRTGGTNTGSALE--FIRTNYLTEESGS 331 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 R K+ + + ++L+TDGE N+E + ++ K G+ + Sbjct: 332 RAKDRVP--QIVILVTDGE---SNDEVQEVADRLKEDGVVV 367 >gi|161408065|dbj|BAF94136.1| Dual Intracellular Von Willebrand factor domain A [Homo sapiens] Length = 276 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 70/221 (31%), Gaps = 11/221 (4%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + +L + + S G+ + C+ Sbjct: 47 ALRKAGILLYAIGVRDAVWAELREIASSPQENFTSFVPNFSGLSNLAQKLRQELCDTLAK 106 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + P+ E D L + + I +D +D VR+G Sbjct: 107 AAPHVDHVSPACREAALADIVFLVDSSTSIGPQNFQKVKNFLYSVILGLDISSDRVRVGL 166 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +ND + H L I++ G T A++ + E+ Sbjct: 167 AQYNDNIYPAFQL--NQHPLKSMILEQIQNLPYRTGGTNTGSALE--FIRTNYLTEESGS 222 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 R K+ + + ++L+TDGE N+E + ++ K G+ + Sbjct: 223 RAKDRVP--QIVILVTDGE---SNDEVQEVADRLKEDGVVV 258 >gi|291409921|ref|XP_002721255.1| PREDICTED: matrilin 4 [Oryctolagus cuniculus] Length = 346 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 56/167 (33%), Gaps = 18/167 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S G + + T Sbjct: 57 RFLVGLVRSLDVGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMERALRTLVPLAQGTM 114 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + V++TDG + + + +A+++GI Sbjct: 115 TGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 I + Q L ASP F S +L + F Sbjct: 169 EIYAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 208 >gi|297565073|ref|YP_003684045.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296849522|gb|ADH62537.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 308 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 72/194 (37%), Gaps = 27/194 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A S + + V++G F + + + +I I Sbjct: 106 SRLDAAKAAARSFVERMPAG------VKVGLVSFAAGAVLESGLTADHQGVIERIDLLER 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +TAI + + +++ + H++ ++LL+DG N + Sbjct: 160 RA-----NTAIGEGL---LESLKAFPTGANHQVAVPAT----VILLSDGRN-RIGIAPQE 206 Query: 389 ICNKAKSQGIRIMTIAFS-----VNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 +AK +G+R+ TI + L A + +F A+S L +I+ Sbjct: 207 AAQEAKRRGVRVYTIGVGSDDPNASVDWAGFDEAELRGIAEVTGGRYFAADSADRLQEIY 266 Query: 442 RDRIGNEIFERVIR 455 R+ +G++I ++ R Sbjct: 267 RE-LGSQIGWKLER 279 >gi|170591769|ref|XP_001900642.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591794|gb|EDP30397.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 381 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 67/171 (39%), Gaps = 12/171 (7%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +R A+ ++R+I D VR+ ++ + F+ ++++ + Sbjct: 209 EQIRFAVERIVRNINVHP---DAVRLALITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLK 265 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + G+T+ + A+ AY + + D + ++ + KK I++ TDG + Sbjct: 266 SIKGTTSTHIALHQAYKLLTDT--DNENGVREGV--KKMIIIFTDG---HSQRSPQDMAL 318 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + K +G+ I I + E LS + + F + L ++ Sbjct: 319 RLKDKGVEIFAITLTPAPYADEG--ELLSITQNTDHIFTPVNLKVLITTYK 367 >gi|299135165|ref|ZP_07028356.1| conserved hypothetical protein [Afipia sp. 1NLS2] gi|298590142|gb|EFI50346.1| conserved hypothetical protein [Afipia sp. 1NLS2] Length = 601 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 80/247 (32%), Gaps = 30/247 (12%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ +++ G+ II A++ ++ + G VD R AL+ A +A + Sbjct: 7 TRLTRLARRFQTDARGNVAIIFAIVSIPLVALVGAAVDYTRAVSDRTALQSALDSAAL-- 64 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + ++ + R + ++L +T + I TA Sbjct: 65 -------MISKDAATMSASQITTR------ARQYVDSL---YTATDAP-IQNFTATYTPN 107 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 S ++LS+ + + F+R +G + + T S V+D Sbjct: 108 SGSGASILLSANGTMP----TYFMRVLGSNFNTLPVATSSTTKWGSTRMRVA----LVLD 159 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ---NGKVGIRDEKLSPYMVSCNKSLY 241 + SM + + Q + + + +D +S VS + + Sbjct: 160 NTGSMAQNGKMAALQSAATDMITKLSAFNTTTGDVYISIVPFAKDVNVSTSNVSASWLNW 219 Query: 242 YMLYPGP 248 P Sbjct: 220 TEWMAEP 226 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 63/150 (42%), Gaps = 16/150 (10%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENT 380 + +K+ G+T + + T+ ++N+ K+ N K YIVLL+DG NT Sbjct: 452 STLKSQINAMTPSGNTNQAVGLFWGWQTLNTTNDPFKAPAKDPNWVYKDYIVLLSDGLNT 511 Query: 381 QDNEEG-------------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 Q+ I + A++ G +I + VN + ++ L +CA+P + Sbjct: 512 QNRWTQTVSDIDARQELLCKNIKDPAQNGGNQITVFSIQVNISSKDPTSKVLQDCATPGA 571 Query: 428 -FFE-ANSTHELNKIFRDRIGNEIFERVIR 455 +F+ + + F + + R+ + Sbjct: 572 GYFQMITQSSQTADAFNNVLATIAKLRISQ 601 >gi|146307722|ref|YP_001188187.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145575923|gb|ABP85455.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 566 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 73/207 (35%), Gaps = 23/207 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S H + +V+ L ++ ++ D V+ GA +V+ D Sbjct: 195 PPANLVFLVDVSGSMHRREGLPMVQGTLKLLVDQLRPQDRVSLVTYAGA----TQVVLDS 250 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + K+ I GSTA +Q AY H ++ + Sbjct: 251 TPGSDKAKIRAAID-----QLTAGGSTAGESGIQLAYQQ------ASKHLIEGGINR--- 296 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ D + + + G+ + T+ F V+ + A + Sbjct: 297 ILLATDGDFNVGISDFDSLKQLAADKRKTGVSLTTLGFGVDNYNERLMEQLAD--AGNGN 354 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVI 454 + ++ E K+ D++ + + Sbjct: 355 YAYIDNLREARKVLVDQLASTLATVAS 381 >gi|62531155|gb|AAH92555.1| LOC594926 protein [Xenopus (Silurana) tropicalis] Length = 895 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 45/264 (17%), Positives = 86/264 (32%), Gaps = 36/264 (13%) Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 D + NC D N + + ++ Y V + P L Sbjct: 223 DQQRSCSNCSTTQLDGDFTVTYDVNRETPGNIQVVNGYFV-------HFFAPSKLKEVPK 275 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 F+ S+ + K ++AL ++ +K+ D+ N F+ V W Sbjct: 276 NIIFIIDRSISMIGLKMQQTKEALLKILDDVKEHDHFN------FVIFDWGVEI-----W 324 Query: 315 GVHKLIRTIVKT-----FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + T + + G T INDA+ +A + D+ H ++ K+ Sbjct: 325 EQSLVKATPENLNRAKAYVRNLYPKGWTNINDALLSAISLL-----DQAHDARS--VPKR 377 Query: 370 ---YIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCA 423 I+ +TDG+ + I A++ + ++ F V + L N Sbjct: 378 SASLIIFMTDGQPSTGERNLDKIQENARNAIRGKYSLYSLGFGVGVDYPFLEKLSLENSG 437 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 +E + + F D + N Sbjct: 438 VARRIYEESDAALQMEGFYDEVAN 461 >gi|297527229|ref|YP_003669253.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256145|gb|ADI32354.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 333 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 47/298 (15%), Positives = 92/298 (30%), Gaps = 49/298 (16%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT---VKSYSSQNGKVG---------I 224 +S+ ++ + R + + S PL + R +K Y + + Sbjct: 18 LSMYVLLRWGRRRVLGRIYSFNHPLTRYVSSYIRKQHDIKWYLNIALAIASIVLIMFSLA 77 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + P V ++L + P + S +I + V+ Sbjct: 78 LPYTIIPRYVKTTQTLEAKISLQRKPPVVIVLDTSGSMKGDKIITAINAVKKF------- 130 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ID D V +G FND V + L + + + G T + ++ Sbjct: 131 ---IDQTIDYVLIGLITFNDHVRIAIPPTSDQELLYKKL-----GEIKAFGGTIYSKPLE 182 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN----EEGIAICNKAKSQGIRI 400 AYD ++ E + I+ +TDG + E + C I I Sbjct: 183 IAYDWLVPFAEFNL---------SPTIIFVTDGLPYSQDAPLYREVVYKC---ARYNITI 230 Query: 401 MTIAFSVNKTQQEK---ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I + A+ L A + F+ T+ L +F +++ + + Sbjct: 231 YPIFIETPGMSIYETMMAQQRLREIANITKGQFYNVKQTNSLINLF-EKLAEKTVSKA 287 >gi|218506166|ref|ZP_03504044.1| hypothetical protein RetlB5_00485 [Rhizobium etli Brasil 5] Length = 205 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 67/198 (33%), Gaps = 33/198 (16%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L + TG+ II AL + ML G D +R ++ A+I + Sbjct: 5 RAFAALRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIA-A 63 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V I + + + + ++E T E++ Sbjct: 64 VKQINNTGDTDALKLKVTDWFHAQVEN-----------------------SYTLGEIDID 100 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + ++ + + F++ I + + + + + SY +++ VID Sbjct: 101 TTNHNITATASGTVP----TTFMKIANIDTVPVSVASAVKGPATSY-----LNVYIVIDT 151 Query: 186 SRSMLDYQRDSEGQPLNC 203 S SML + + Sbjct: 152 SPSMLLAATTAGQATMYS 169 >gi|323135758|ref|ZP_08070841.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398849|gb|EFY01368.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 588 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 3/63 (4%) Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYFLSNCASP-NSFFEANSTHELNKIFRD 443 + C AK+ G+ + TI FS + + + L +CA+ + +F + ++LN F Sbjct: 517 TLQACTNAKNAGVEVFTIGFSTSTDPIDAQGLELLKSCATNVDHYFAVENANQLNAAF-S 575 Query: 444 RIG 446 IG Sbjct: 576 SIG 578 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 58/188 (30%), Gaps = 30/188 (15%) Query: 12 KKLIKSCTGHFFIITAL-LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K +G+ ++ L L+PVM + G D R++ AL+QA A++T + L Sbjct: 14 KSFGADESGNVGMVFGLGLVPVMFML-GATADYTRYATTRSALRQATDVAVLTVASKLTA 72 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + ++A+ + T + K + Sbjct: 73 TTTDAQAKAQAQVIL---------------------NAQPRMSTASITTASIATTKQTFC 111 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 + F++ + S + A+ + + V+D S SML Sbjct: 112 ATSEVTIQ------NSFMQMARVTSLTPSVTSCADLAWG-ANPNATYEVALVVDNSGSML 164 Query: 191 DYQRDSEG 198 Sbjct: 165 SSDGSVTK 172 >gi|158260465|dbj|BAF82410.1| unnamed protein product [Homo sapiens] Length = 937 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|119612172|gb|EAW91766.1| matrilin 2, isoform CRA_b [Homo sapiens] gi|119612174|gb|EAW91768.1| matrilin 2, isoform CRA_b [Homo sapiens] Length = 922 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|119612170|gb|EAW91764.1| matrilin 2, isoform CRA_a [Homo sapiens] gi|119612171|gb|EAW91765.1| matrilin 2, isoform CRA_a [Homo sapiens] Length = 941 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|62548862|ref|NP_085072.2| matrilin-2 isoform b precursor [Homo sapiens] Length = 937 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|62548860|ref|NP_002371.3| matrilin-2 isoform a precursor [Homo sapiens] Length = 956 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|37182908|gb|AAQ89254.1| MATN2 [Homo sapiens] Length = 915 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 553 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 612 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 613 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 669 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 670 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 725 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 726 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 775 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 776 AEDFSTMDEISEKLKKGICEALEDSDGR 803 >gi|32425428|gb|AAH16394.1| MATN2 protein [Homo sapiens] Length = 715 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 353 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 412 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 413 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 469 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 470 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 525 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 526 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 575 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 576 AEDFSTMDEISEKLKKGICEALEDSDGR 603 >gi|14714613|gb|AAH10444.1| Matrilin 2 [Homo sapiens] gi|261858984|dbj|BAI46014.1| matrilin 2 [synthetic construct] Length = 937 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|62298084|sp|O00339|MATN2_HUMAN RecName: Full=Matrilin-2; Flags: Precursor Length = 956 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|52545626|emb|CAB70853.2| hypothetical protein [Homo sapiens] Length = 672 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 310 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 369 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 370 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 426 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 427 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 482 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 483 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 532 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 533 AEDFSTMDEISEKLKKGICEALEDSDGR 560 >gi|21739491|emb|CAD38787.1| hypothetical protein [Homo sapiens] Length = 1016 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 654 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 713 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 714 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 770 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 771 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 826 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 827 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 876 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 877 AEDFSTMDEISEKLKKGICEALEDSDGR 904 >gi|14042702|dbj|BAB55358.1| unnamed protein product [Homo sapiens] Length = 537 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 175 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 234 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 235 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 291 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 292 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 347 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 348 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 397 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 398 AEDFSTMDEISEKLKKGICEALEDSDGR 425 >gi|11360063|pir||T46488 hypothetical protein DKFZp434J065.1 - human (fragment) Length = 741 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 379 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 438 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 439 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLR 495 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 496 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 551 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 552 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 601 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 602 AEDFSTMDEISEKLKKGICEALEDSDGR 629 >gi|329893975|ref|ZP_08269994.1| BatA [gamma proteobacterium IMCC3088] gi|328923374|gb|EGG30692.1| BatA [gamma proteobacterium IMCC3088] Length = 323 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 72/204 (35%), Gaps = 29/204 (14%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + + V+ + S++ + ++ R+G F ++ + +R Sbjct: 110 SGQAMALSRLDAVKQGVMSLLDA-------SEGNRIGLIAFGEQSFVMSDLT-AYGDTVR 161 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 +V ST + D + A + + + +VL+TDG +T Sbjct: 162 YMVSQLETG-FAGDSTRLGDGVGYAVSLLADVDSERA-----------IVVLITDGNDTG 209 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCA--SPNSFFEANSTHEL 437 + + AK+ +++ A + + ++ L A + +FF + Sbjct: 210 SDLPPVEAARLAKALDVKLYVAAVGADVSTDREPIDEALLRRLAERTGGAFFRIAQVIDF 269 Query: 438 NKIFRDRIG-----NEIFERVIRI 456 + +++ G + ERV RI Sbjct: 270 DAMWQTLEGLEPKQERVSERVKRI 293 >gi|326918160|ref|XP_003205359.1| PREDICTED: hypothetical protein LOC100539194 [Meleagris gallopavo] Length = 1584 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/187 (20%), Positives = 75/187 (40%), Gaps = 20/187 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ +S + VR +++++ + + D R+G ++DR ++ G Sbjct: 40 VFILDASSSVGKEDFEKVRQWVSNLVETFEIGP---DKTRVGVVRYSDRPTTEFDL--GK 94 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +K I + G+T DA++ Y S +++ R+ ++ KK +LLTD Sbjct: 95 YKTREEIKEAARKIRYYGGNTNTGDALR--YINTYSFSKEAGGRL-SDRTVKKVAILLTD 151 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G + + N A+ GIRI + E + L AS F + Sbjct: 152 GRSQDYVLDP---ANAARQAGIRIFAVGVG------EALKEELDEIASEPKSAHVFHVSD 202 Query: 434 THELNKI 440 + ++KI Sbjct: 203 YNAIDKI 209 >gi|315649632|ref|ZP_07902717.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275105|gb|EFU38480.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 421 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 44/251 (17%), Positives = 99/251 (39%), Gaps = 30/251 (11%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + Y+ + K+ I ++ +++ Y L PG P +D+S Sbjct: 73 QRYTGTSWKLLIPSTLVALFLLGMLFEWVYQLNPGSAKPVKDVVLVIDNSGSMKDTDPNQ 132 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 A ++I + + DN R+ F+ F+ ++ I+ + Sbjct: 133 DRYTAAKNLINRMDR-DN-----RVSVIVFDHATTLLQPFTRVKNQEIKDEIMAEIDGLA 186 Query: 333 EM-GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK--YIVLLTDGENTQDNEEGIAI 389 G T I+ A+ ED + ++ + +A + +++L+DG + D++ +A Sbjct: 187 TTDGGTDISLAL-----------EDTMSHIQESQDAGRSAMVIMLSDGFSETDHDRVLA- 234 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 K Q I + TI S+ L A + +++ + +L+ +F+ +I + Sbjct: 235 --DYKQQQIAVNTIGLSLVYKD---GANLLQTIAAETGGQYYDVQNAADLSFVFQ-KIYD 288 Query: 448 EIFERVIRITK 458 ++ +R +TK Sbjct: 289 DVGDR-SLLTK 298 >gi|332706285|ref|ZP_08426352.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] gi|332354933|gb|EGJ34406.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] Length = 413 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 62/177 (35%), Gaps = 33/177 (18%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV---ISDPSFSWGVHKLIRTIVKTF 327 V+ A +I ++ D R+ F+ R + + + ++ R I + Sbjct: 59 LETVKQAAVGLIERLQPDD------RLSIVAFDHRAKVLVRNQPMG-NLDQIKRKINR-- 109 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 G TAI++ ++ +I + +D V + + LLTDGEN N E Sbjct: 110 ---LGADGGTAIDEGLKLGVKELIKAKQDTVSQ----------VFLLTDGENEHGNNESC 156 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 I + A + I ++ F N + L A + S + F Sbjct: 157 IKLAELAAENNLTINSLGFGANWN-----QDILEKIADIATGSLSYIEEPEQALSEF 208 >gi|315266493|gb|ADT93346.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 627 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 58/423 (13%), Positives = 128/423 (30%), Gaps = 45/423 (10%) Query: 42 DVVRWSYYEHALKQAAQTAII-------TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 D + + + A+ TA + TA +E ++ + + Sbjct: 14 DPTLYLQRGNGIPSASNTAALLLVAVSLTACSGKGAEVEHRQAKQQAEQRHQVASQRQAE 73 Query: 95 IRNFENNLKKN---FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 +R+ V ++ PR A + ++++ + + + Sbjct: 74 MRDAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEI 133 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + ++ I + R +E + + + LN F Sbjct: 134 PVSTFSIDVDTGSYATLRRMLRE------------GHLPEKGTVRVEEMLNYFAYDYPLP 181 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 K+ + + + + M+ L P + + +D S + K Sbjct: 182 AKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSMASVDKL 241 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 L++ AL + + D V+ V GA G + Sbjct: 242 PLLQTALKLLTAQLSAQDKVSIVVYAGAAGVVLD---------GASGNDTQTLNYALEQL 292 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIA 388 + GST + AY + H + N + ++L TDG+ D ++ IA Sbjct: 293 SAGGSTNGGQGITQAYQL------AKKHFIPNGINR---VILATDGDFNVGVTDFDDLIA 343 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + K K GI + T+ F + + ++ ++ +E K+ D + + Sbjct: 344 LIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADK--GNGNYAYIDTLNEARKVLVDELSST 401 Query: 449 IFE 451 +F Sbjct: 402 LFT 404 >gi|59857769|gb|AAX08719.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 53/160 (33%), Gaps = 10/160 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S + +K R+AL ++ + D + Sbjct: 257 NGYFVHYFAPDSLSTIPKNVIFVIDKSGSMMGRKIKQTREALIKILDDLSPHDQFD---- 312 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ + K++A G T INDAM A + +N++ Sbjct: 313 --LISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQGGTNINDAMLMAVQLLEKANQE 370 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 E+ + I+LLTDG+ T I + Sbjct: 371 ELLPEGSITL----IILLTDGDPTVGETNPSNIQKNVRKA 406 >gi|297683362|ref|XP_002819353.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-2-like [Pongo abelii] Length = 935 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 592 EGFQLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFLLAEDGRRCKKCTEGP 651 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 652 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLR 708 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 709 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 764 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 765 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTTKHLFY 814 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 815 AEDFSTMDEISEKLKKGICEALEDSDGR 842 >gi|291295702|ref|YP_003507100.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470661|gb|ADD28080.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 318 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 63/191 (32%), Gaps = 33/191 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + R AL + IR + + +R+ F + +L+ + Sbjct: 105 SRFEAARAALRTFIRELP------EGLRLALVTFARDAHLVVPLTTDRGRLLEAVD---- 154 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEG 386 TAI DA+ + + +E + + ++ I+LLTDG + + Sbjct: 155 -FLQLNLGTAIGDAILESIQALPPLSERAE-----DPDPRRLATIILLTDGR-SLGGVDP 207 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKA------------RYFLSNCA--SPNSFFEAN 432 + +A Q IR+ TI + L A +F + Sbjct: 208 VVAAQEAARQQIRVHTIGIGRTTSGPVPGLPEVYAQAALFDEETLKEVARVGDGQYFYVD 267 Query: 433 STHELNKIFRD 443 S +L + +RD Sbjct: 268 SAEKLKEAYRD 278 >gi|260841558|ref|XP_002613979.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] gi|229299369|gb|EEN69988.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] Length = 1796 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/294 (12%), Positives = 91/294 (30%), Gaps = 24/294 (8%) Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + + + + A + +++ I +++D S S+ + +E + F Sbjct: 19 LQVSARNLAQVASTFQNQVNKYQDSRADIVFMLDSSGSVGENNFQTEITFVENFLSQLTI 78 Query: 211 TVKSY----------SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 + ++ + + + ++ Y D + + Sbjct: 79 SPQASRVAIVSFDDAARTHIDYIKNPKNKCSFLRELKTVKYTGGSTNAEDAFRLAQELLR 138 Query: 261 SSSLRHVIKKKHLVRDA-LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + + + + + I R+ F++ + + K Sbjct: 139 PKSAFANNQPVKQTEITFVENFLSQL-TISPQAS--RVAVVSFDNHARTHIDYI-NSPKN 194 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + ++ + GST D + A + + + N K+ +V LTDG Sbjct: 195 KCSFLRELKAVKYTGGSTNAEDGFRLAQELLRPQS-----AFTNYQPVKQVVVYLTDG-- 247 Query: 380 TQDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 + +++ + N KS I +I + E ++C+SP A Sbjct: 248 -KPDKDPVGRANNLKSVYNAEIYSIGVDPYSKRYETDGVTSADCSSPGCGANAE 300 >gi|315122409|ref|YP_004062898.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495811|gb|ADR52410.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 411 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 63/167 (37%), Gaps = 12/167 (7%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + F+ +L G F I++A+L+ L + ++ D + L+ + A Sbjct: 1 MFYYRNFLSVFSRLNHCTNGSFLIVSAVLLSSFLTIMDIMRDYTDMIRVRNMLQSSIDYA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + L K K++I +L N++ L T+ +++ IV + V Sbjct: 61 LHNNPNELSVGTI------KQREMLIKKRIGYFLDSNYKGTL---LTEEQIKLIVNQSTV 111 Query: 121 EMNPRK---SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + R + + + ++ L L L + K + I + + Sbjct: 112 SITERSFYPQQFHINIELHKNIQLKSLILHMAMNPKKDFNISQRKSS 158 >gi|261408991|ref|YP_003245232.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285454|gb|ACX67425.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 1007 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 59/175 (33%), Gaps = 30/175 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVHKLIRTIVKT 326 K L +++ + ++ D V G F+D+ V+ + V + Sbjct: 423 KIELAKESAMRTVELMRAKDTV------GVVAFDDQPWWVVPPQKLG------DKEEVLS 470 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A+ +A + ++ + H I+L+TDG+ + N Sbjct: 471 SIQSIPSAGGTNIYPAVSSALEEMLKIDAQRRH-----------IILMTDGQ-SAMNSGY 518 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + I + ++A ++ + A+ ++ L +F Sbjct: 519 QDLTDTMVENKITMSSVAVGMDA-DTNLLQSLAD--AAKGRYYFVEDETTLPAVF 570 >gi|327269503|ref|XP_003219533.1| PREDICTED: collagen alpha-1(XXII) chain-like [Anolis carolinensis] Length = 1601 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 78/208 (37%), Gaps = 21/208 (10%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 C G + +D+SS + VR +A+++ + + D Sbjct: 20 CCGEETQAQRAGCKNVHYDLVFILDTSSSVG-KEDFEKVRQWVANLVDTFEIGP---DKT 75 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G ++DR ++ G ++ I + + G+T DA++ Y S +E Sbjct: 76 RVGVVRYSDRPTTEFDL--GRYQTREQIKEAAKNIKYYGGNTNTGDALR--YINTYSFSE 131 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + R ++ KK +LLTDG + + + A GIRI + E Sbjct: 132 EAGGRPTDSAI-KKVAILLTDG---RSQDHVLDPATAAHKAGIRIFAVGVG------EAL 181 Query: 416 RYFLSNCAS---PNSFFEANSTHELNKI 440 + L AS F + + ++KI Sbjct: 182 KEELDEIASEPKSAHVFHVSDYNAIDKI 209 >gi|307943460|ref|ZP_07658804.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307773090|gb|EFO32307.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 320 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 10/87 (11%) Query: 375 TDGENTQDNEEG-------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 T G +C++AK++GI I T+ F + + LS CA+ S Sbjct: 235 TRGNMNSPANSKHNSVAYMKTMCDQAKAKGIIIYTVGFQIRRNTLPDLS--LSYCATSPS 292 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVI 454 + + +L+ F+ I + I I Sbjct: 293 HYYFVESSDLSAAFK-AIASSIKSLRI 318 >gi|296446920|ref|ZP_06888856.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296255595|gb|EFH02686.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 486 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 72/488 (14%), Positives = 141/488 (28%), Gaps = 104/488 (21%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G+ II AL +L G VD S + L +A + A+ P + +++ Sbjct: 19 GNVAIIFALAAIPLLIAAGGAVDFAIASRVQTQLYAICDSATLAATTPAMMQQTTATAKT 78 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + F + N N T V D + V+ +Y + Sbjct: 79 VATSMFA----AQVAQINRLTYNSANLT-VTVNDDTSASPVKTRTVTVSYLAQVG----- 128 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML--------- 190 N F + + KA + + + V+D S SM Sbjct: 129 --NAFGSFYHV---PTSIFTVKASSTASTARN-----IDFYLVLDNSPSMELPATTAGLA 178 Query: 191 ----------DYQRDSEGQPLNCFGQPADRTVKSYS-----------SQNGKVGIRDEKL 229 ++ P N P T+ SY+ + R Sbjct: 179 SMTAATGCVFACHENTYSDPENTVQYPGYGTIDSYTYAKNAGIALRIDNVREAAKRLAST 238 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 S M+S N + Y + + + ++S + ++ ++ + + Sbjct: 239 SQAMMSANGATYRLAAYAFNYDTTQLQALTSTTSAN-----VSAISTSINAMTPPLME-- 291 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N+ + GA + +W L KT + M T I + Sbjct: 292 -KNNYLPTGA----SYTYPTSASTWTTVTLGSDPTKTNYNVRDAM--TDIEMTLTKVNAA 344 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDG---------------ENTQDNEEGIAI----- 389 + + + + ++ ++L+TDG ++ N G Sbjct: 345 MPNPGNGT---TASGDKPQEVVMLVTDGMVDGSFYTNTSCTNYASSYSNSYGTFYRCLRP 401 Query: 390 -----CNKAKSQGIRI------------MTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 C K++GIRI L +CAS + +FE + Sbjct: 402 LDTTLCTTIKNRGIRIAVLNLIYYPTPGYGFYDGAVAPFISTVSPALKSCASTDLYFEVD 461 Query: 433 STHELNKI 440 + ++++ Sbjct: 462 TGSDISEA 469 >gi|77456411|ref|YP_345916.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77380414|gb|ABA71927.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 563 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 57/427 (13%), Positives = 126/427 (29%), Gaps = 68/427 (15%) Query: 28 LLMPVMLGVGGMLVDVV-RWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFP 86 L V+L V G V + + ++++ +V ++ + S+ +F Sbjct: 11 LAASVLLVVAGCGVSSSPETTVAPPPAQTELKSSVQPEAVMADSAMAKRSALTAPIASFA 70 Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 E + + + ++ + I T ++ + + LLN Sbjct: 71 PMPAGESYPQGYRDEQREQYAKLADNPIHSVTETPVSTFSADVDTGAYANVRRLLN---- 126 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 Q + E R + + + S + P N + Sbjct: 127 ------------QGRLPPEGAVRLEEMVNYFPYDYALPSDGSPFGVTTELAASPWNPHTR 174 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 +K+ ++ + VD S Sbjct: 175 LLRIGIKASDRAVAELAPAN----------------------------LVFLVDVSGSMD 206 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + LV+ L ++ +++ D V+ V G + V+ + ++T Sbjct: 207 RREGLPLVKSTLKLLVDQLREQDRVSLVVYAGE---SSVVLEP------TSGREKAKIRT 257 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE---NTQDN 383 GSTA ++ AY + + I+L TDG+ T D Sbjct: 258 AIERLTAGGSTAGASGIELAYQ--------MAQQAFIPKGINR-ILLATDGDFNVGTSDF 308 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + + + GI + T+ F V+ + A ++ ++ E K+ D Sbjct: 309 DSLKQMAVDKRKTGISLTTLGFGVDNYNEHLMEQLAD--AGDGNYAYIDNLREARKVLVD 366 Query: 444 RIGNEIF 450 ++G+ + Sbjct: 367 QLGSTLA 373 >gi|194224466|ref|XP_001500626.2| PREDICTED: matrilin 4 [Equus caballus] Length = 542 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 56/164 (34%), Gaps = 16/164 (9%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D + R+G ++ +V S S + + T Sbjct: 59 LVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLS--AFSRREDMERAIRALVPLAQGTMTG 116 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 117 LAIQYAMNVAFSVAEGAR---PPEARVPRVAVIVTDG---RPQDRVAEVAAQARARGIEI 170 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 + Q L ASP + + +L + F Sbjct: 171 YAVGV------QRADVGSLRAMASPPLDEHVFLVDSFDLIQEFG 208 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 323 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 377 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 378 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 431 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 432 RAKEEGIVMYAVGVGKAVEEE------LRKIAS 458 >gi|262196446|ref|YP_003267655.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079793|gb|ACY15762.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 903 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 58/174 (33%), Gaps = 30/174 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +++ + + D + F+++ + +++ + T Sbjct: 474 KIEAAKESARATAEVLSPSD------LITVVAFDNQPTTIVRLQRASNRMR---IATDIA 524 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I A++ AY+ + +N H +++L+DG+ D + Sbjct: 525 RLQAGGGTNIYPALREAYEILQGANAKVKH-----------VIVLSDGQAPYDGIADL-- 571 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIF 441 C + +S I + + + R L+ + + L +IF Sbjct: 572 CQEMRSARITVSAVGIG------DADRNLLNLITDNGDGRLYMTDDLAALPRIF 619 >gi|332830871|ref|XP_003311907.1| PREDICTED: matrilin-2 isoform 2 [Pan troglodytes] Length = 915 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 553 EGFQLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 612 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 613 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLR 669 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 670 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 725 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 726 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 775 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 776 AEDFSTMDEISEKLKKGICEALEDSDGR 803 >gi|332830869|ref|XP_528309.3| PREDICTED: matrilin-2 isoform 3 [Pan troglodytes] Length = 937 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFQLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|332830867|ref|XP_003311906.1| PREDICTED: matrilin-2 isoform 1 [Pan troglodytes] Length = 956 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFQLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|327269537|ref|XP_003219550.1| PREDICTED: matrilin-2-like [Anolis carolinensis] Length = 809 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 70/201 (34%), Gaps = 27/201 (13%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-WG 315 FV S + +V++ + ++ S+ I R+G ++ +V ++ + + Sbjct: 579 VFVIDGSKSLGVNNFEIVKEFVLGILDSL-TISPKA--ARIGLLQYSTQVRTEFTLKQFS 635 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ V + + A++ + + E H + + +V T Sbjct: 636 TATDMKKAVSQM---KYMGKGSMTGLALKQMTERSFTEAEGARHL--SAKVPRVCVVF-T 689 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSF-FEAN 432 DG + +E KAK +GI + I ++ L AS P F A Sbjct: 690 DG---RAQDEVSEWAAKAKQRGITMYAIGIGKAIEEE------LREIASDPPEKHLFYAE 740 Query: 433 STHELNKIFRDRIGNEIFERV 453 + I ++ +R+ Sbjct: 741 DFSAM-----GEITEKLQKRM 756 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 67/203 (33%), Gaps = 23/203 (11%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + F+ SS V++ + ++++ + D R+G + V + Sbjct: 13 CSDKRLDLVFIIDSSRSVRPYDFEKVKEFILNILQFLDIRP---DVTRVGLIQYGSTVKN 69 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + S + + + T A+Q A + S E + Sbjct: 70 EFSLK--TFARKQDMERAVRRMMYLSTGTMTGLAIQFAVNIAFSETEGARPL---SQNVP 124 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---P 425 + I+++TDG + + I KA++ GI I I L + S Sbjct: 125 RVIMIVTDG---RPQDPVAEIAAKARNSGILIFAIGVGRVDM------NTLKSIGSEPYE 175 Query: 426 NSFFEANS---THELNKIFRDRI 445 + F + L +F++++ Sbjct: 176 DHVFLVANFSQIETLTSVFQNKL 198 >gi|300776751|ref|ZP_07086609.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] gi|300502261|gb|EFK33401.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] Length = 330 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 69/219 (31%), Gaps = 41/219 (18%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 +D LS + + + + ++D ++K N R+G + Sbjct: 85 TKGVDIMLSIDVSLSMLAKDLNPDRITALKDIAVKF---VQKRPND----RIGVVAYAAE 137 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + +I I + TAI + + A + ++ S Sbjct: 138 AFTKVPVTSDHQVVIDEIKNLNSAGLEP--GTAIGEGLSVAVNHLVKSKAKS-------- 187 Query: 366 EAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-----KTQQEKARYF- 418 K ++L+TDG N Q+ AK+ I++ I N T Q+ Sbjct: 188 ---KVVILMTDGVSNIQNAIPPQVAAELAKNNNIKVYAIGIGTNGYALMPTSQDIFGDLV 244 Query: 419 ------------LSNCA--SPNSFFEANSTHELNKIFRD 443 L A + +F A S L +++ + Sbjct: 245 FTETEVTIDENTLREIAQTTGGKYFRATSNSSLEEVYDE 283 >gi|254477542|ref|ZP_05090928.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214031785|gb|EEB72620.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 523 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 N AIC+ AK++GI + TI F L +CAS ++ + E++ F Sbjct: 455 NARTKAICDAAKARGIVVYTIGFEA----PSGGVSVLKDCASSDAHYFDVQGLEISDAFA 510 Query: 443 DRIGNEIFERVIRITK 458 I I R +R+T+ Sbjct: 511 S-IATSI--RQLRLTQ 523 >gi|297581617|ref|ZP_06943539.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] gi|297534024|gb|EFH72863.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] Length = 467 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 48/109 (44%), Gaps = 18/109 (16%) Query: 336 STAINDAMQTAYDTII-------SSNEDEVHRMKNNLEAKKYIVLLTDG----ENTQDNE 384 +T + + A+ + + E+ R + +K ++L TDG + + + Sbjct: 345 NTNNAEGVMWAWRLLSPHWRGYWDKGKSELPRDYQHPNNRKVMLLFTDGNHLVDVAKRDR 404 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + +A+C + K QGI I++I F + +CAS ++ A++ Sbjct: 405 KQVALCREMKKQGIEIISIDF-------NNRSQVMKSCASAGQYYIADN 446 >gi|159040640|ref|YP_001539892.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] gi|157919475|gb|ABW00902.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] Length = 474 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 49/342 (14%), Positives = 103/342 (30%), Gaps = 38/342 (11%) Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-VVLSSRYDLLL 141 + N + + D+ + L+ ++ + Sbjct: 120 SQLSPNNDSKDGGTGQLINQELGTGNESSDDVANVIYDVFYGSVGTMNFINLAQLLNMFV 179 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF------SRSMLDYQRD 195 NP++ + K + A + + G + +D + L R Sbjct: 180 NPMAHITEKV--KVLRKLARYLASYGLLPHQGKGGSRVFKALDNVAREPTIGNALRVSRF 237 Query: 196 SEGQPLNCFGQP------ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 +E + D + + +R L+ M + + + + Sbjct: 238 TEHSNYPTYITGVREYRIGDPAYRIDLDKTSMNMVRKTFLNKPMSTRDIVVREYADVKLM 297 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D L + K + ++A+ IR + + ++ R+ FN R Sbjct: 298 DIVLCLDTSGSMKEFSGAYMKMDIAKEAIVKYIRYLSRTND-----RLSMVLFNFRAD-- 350 Query: 310 PSFSWGVHKLIRTIVKTFAID--ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 WG H + + I + + G T I +A++ A + SN Sbjct: 351 --ILWGPHSVKKYINEMEEMSRYIYPGGGTNIANALEKARIILSKSN-----------YP 397 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 K+I+ +TDG T + I K + G+ + T+A N Sbjct: 398 NKHIICITDGR-TVNASSCIKEAVKLRRMGVTLSTVAVGDNS 438 >gi|311030436|ref|ZP_07708526.1| hypothetical protein Bm3-1_07816 [Bacillus sp. m3-13] Length = 921 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 59/173 (34%), Gaps = 37/173 (21%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L ++A A + +K+ D G F+ +W V + K I Sbjct: 422 KFDLAKEAAARSVELLKEEDTF------GFIAFDTE-------AWTVVETEPIKNKDEVI 468 Query: 330 ----DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T I A+ AY + NE ++ R K+I+LLTDG++ E Sbjct: 469 ETIRSTALGGGTDIFPALNQAYQQL---NEMDLKR--------KHIILLTDGQSNDGPYE 517 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHE 436 I + + + + T+A + L A F+E Sbjct: 518 --EIIEEGLTNNVTLSTVAIGGDADT-----SLLEELAEIGTGRFYEVYEASA 563 >gi|88601902|ref|YP_502080.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] gi|88187364|gb|ABD40361.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] Length = 316 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 61/189 (32%), Gaps = 35/189 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + +++S+ D G F S S ++IR + Sbjct: 108 TRLESSKRSAEILLKSLDPKD------YAGIITFESGATSAAYLSPDKDRVIRKLQAIEP 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G+TAI D + D S KK ++LL+DG N Sbjct: 162 KE----GATAIGDGLALGIDMAESMP-----------NRKKVVILLSDGVNNAGVIHPEQ 206 Query: 389 ICNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCASP--NSFFEANST 434 A+ +GI++ TI N L A+ +F++ Sbjct: 207 AAGFAREKGIQVFTIGMGSDSPVVLGYDWFGNPQYATLDEAMLQQIAASTNGQYFKSVDD 266 Query: 435 HELNKIFRD 443 L++I+ + Sbjct: 267 RTLSEIYSN 275 >gi|328953619|ref|YP_004370953.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] gi|328453943|gb|AEB09772.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] Length = 333 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 81/267 (30%), Gaps = 22/267 (8%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL++ +G I AL + ++ G + D+ + L++ A + + L+ Sbjct: 11 KLVRDASGAVAISFALTLAILCGFVALAFDIGHLVMVKAELQRTADAGALAGATGLVPYT 70 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIR---NFENNLKKNFTDREVR---------DIVRDTAV 120 V+S + Q + LI N +N TD V + + Sbjct: 71 GPVTS--QTPDWLQGQSKAQTLINNTANLADNQVFTTTDSAVTYGYWLLNPPEGYVQSLP 128 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH----G 176 + P SAY + R L N F +G+ S + +Y ++ Sbjct: 129 TVRPTTSAYLPQPAIRVTLSRNVDLYFAPLLGVSSPKTVNATATAILPETYRTKNTPPIA 188 Query: 177 VSIQWVIDFSRSM----LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 V+ V D +D Q + G S S + + Y Sbjct: 189 VARDIVYDIIGDSVVINVDEQTITPRSNAGSAGWFNLSGENSAPSVRINEALTSPTSAIY 248 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFV 259 ++ K+ L L V Sbjct: 249 LLPGTKATLLGLVTAGETIVLPIVDDV 275 >gi|160894031|ref|ZP_02074810.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] gi|156864409|gb|EDO57840.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] Length = 1391 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 56/144 (38%), Gaps = 25/144 (17%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +++ + S + +K ++ G T I+ + A D + + Sbjct: 588 VSYDNEAYLEQSLT-----SRSGTLKNSIAAISDGGGTNISAGLNLALDNLEAEKGSRA- 641 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 ++L++DG++ E+ A ++A GI + T+ F E ++ Sbjct: 642 -----------VILMSDGQDGGSEEDMQAATDRAAKLGISVYTVGFG------ECDDAYM 684 Query: 420 SNCA--SPNSFFEANSTHELNKIF 441 A + F +A+++ EL+ I+ Sbjct: 685 QAIAEVTGGKFVKASASTELSDIY 708 >gi|291388325|ref|XP_002710627.1| PREDICTED: matrilin 3-like [Oryctolagus cuniculus] Length = 1109 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 43/250 (17%), Positives = 85/250 (34%), Gaps = 23/250 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+Q+G + Y+ C++ Sbjct: 747 EGFRLAEDGRRCRRKDICKSTQHGCEHACVNQGDSYVCKCSEGFVLAEDGRRCKRCTEGP 806 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S +V+ L ++I S+ + R+G ++ +V ++ + Sbjct: 807 LDLVFVIDGSKSLGEDNFEIVKQFLTAIIDSL-AVSPKA--ARVGLLQYSTQVRAEFTLR 863 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 864 --SFSTAKDMKKAVAHMKYMGKGSMTGQALKHMFERSFTQLEGA--RPLSARVPRVAIVF 919 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ NKAK+ GI + + ++ L AS F Sbjct: 920 -TDG---RAQDDVSEWANKAKANGITMYAVGVGKAIEEE------LQEIASEPTDKHLFY 969 Query: 431 ANSTHELNKI 440 A +N+I Sbjct: 970 AEDFSTMNEI 979 >gi|148676906|gb|EDL08853.1| matrilin 2, isoform CRA_b [Mus musculus] Length = 941 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 42/268 (15%), Positives = 91/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+Q+G + + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKNVCKSTQHGCEHMCVNNGNSYLCRCSEGFVLAEDGKHCKRCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + V+ + +I S+ + R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFETVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTL- 709 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K + + A++ ++ + E R + + IV Sbjct: 710 RGFSSA-KEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPPSTQVPRVAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPIDKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ ++ I + + R Sbjct: 817 AEDFSTMGEISEKLKEGICEALEDSGGR 844 >gi|120407045|ref|NP_058042.2| matrilin-2 [Mus musculus] Length = 937 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 42/268 (15%), Positives = 91/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+Q+G + + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKNVCKSTQHGCEHMCVNNGNSYLCRCSEGFVLAEDGKHCKRCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + V+ + +I S+ + R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFETVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTL- 709 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K + + A++ ++ + E R + + IV Sbjct: 710 RGFSSA-KEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPPSTQVPRVAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPIDKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ ++ I + + R Sbjct: 817 AEDFSTMGEISEKLKEGICEALEDSGGR 844 >gi|20136122|gb|AAM11539.1| matrilin-2 [Mus musculus] Length = 956 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 42/268 (15%), Positives = 91/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+Q+G + + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKNVCKSTQHGCEHMCVNNGNSYLCRCSEGFVLAEDGKHCKRCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + V+ + +I S+ + R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFETVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTL- 709 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K + + A++ ++ + E R + + IV Sbjct: 710 RGFSSA-KEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPPSTQVPRVAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPIDKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ ++ I + + R Sbjct: 817 AEDFSTMGEISEKLKEGICEALEDSGGR 844 >gi|62185620|gb|AAH92298.1| Matrilin 2 [Mus musculus] Length = 937 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 42/268 (15%), Positives = 91/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+Q+G + + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKNVCKSTQHGCEHMCVNNGNSYLCRCSEGFVLAEDGKHCKRCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + V+ + +I S+ + R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFETVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTL- 709 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K + + A++ ++ + E R + + IV Sbjct: 710 RGFSSA-KEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPPSTQVPRVAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPIDKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ ++ I + + R Sbjct: 817 AEDFSTMGEISEKLKEGICEALEDSGGR 844 >gi|74202868|dbj|BAE37504.1| unnamed protein product [Mus musculus] Length = 928 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 42/268 (15%), Positives = 91/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+Q+G + + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKNVCKSTQHGCEHMCVNNGNSYLCRCSEGFVLAEDGKHCKRCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + V+ + +I S+ + R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFETVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTL- 709 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K + + A++ ++ + E R + + IV Sbjct: 710 RGFSSA-KEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPPSTQVPRVAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPIDKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ ++ I + + R Sbjct: 817 AEDFSTMGEISEKLKEGICEALEDSGGR 844 >gi|7387906|sp|O08746|MATN2_MOUSE RecName: Full=Matrilin-2; Flags: Precursor gi|2072792|gb|AAC53163.1| matrilin-2 precursor [Mus musculus] Length = 956 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 42/268 (15%), Positives = 91/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+Q+G + + Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKNVCKSTQHGCEHMCVNNGNSYLCRCSEGFVLAEDGKHCKRCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + V+ + +I S+ + R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFETVKHFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTL- 709 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + K + + A++ ++ + E R + + IV Sbjct: 710 RGFSSA-KEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPPSTQVPRVAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPIDKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ ++ I + + R Sbjct: 817 AEDFSTMGEISEKLKEGICEALEDSGGR 844 >gi|160874259|ref|YP_001553575.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160859781|gb|ABX48315.1| von Willebrand factor type A [Shewanella baltica OS195] Length = 642 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 58/423 (13%), Positives = 128/423 (30%), Gaps = 45/423 (10%) Query: 42 DVVRWSYYEHALKQAAQTAII-------TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 D + + + A+ TA + TA +E ++ + + Sbjct: 29 DPTLYLQRGNGIPSASNTAALLLVAVSLTACSGKGAEVEHRQAKQQAEQRHQVASQRQAE 88 Query: 95 IRNFENNLKKN---FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 +R+ V ++ PR A + ++++ + + + Sbjct: 89 MRDAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEI 148 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + ++ I + R +E + + + LN F Sbjct: 149 PVSTFSIDVDTGSYATLRRMLRE------------GHLPEKGTVRVEEMLNYFAYDYPLP 196 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 K+ + + + + M+ L P + + +D S + K Sbjct: 197 AKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSMASVDKL 256 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 L++ AL + + D V+ V GA G + Sbjct: 257 PLLQTALKLLTAQLSAQDKVSIVVYAGAAGVVLD---------GASGNDTQTLNYALEQL 307 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIA 388 + GST + AY + H + N + ++L TDG+ D ++ IA Sbjct: 308 SAGGSTNGGQGITQAYQL------AKKHFIPNGINR---VILATDGDFNVGVTDFDDLIA 358 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + K K GI + T+ F + + ++ ++ +E K+ D + + Sbjct: 359 LIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADK--GNGNYAYIDTLNEARKVLVDELSST 416 Query: 449 IFE 451 +F Sbjct: 417 LFT 419 >gi|298372684|ref|ZP_06982674.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275588|gb|EFI17139.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] Length = 326 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 58/187 (31%), Gaps = 35/187 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ G F + + L+ + + Sbjct: 109 EAAKDVGIEFILSRPNDNF---GLVVFAGESFTQCPITSNHASLVNLFKQVDFGIIQD-- 163 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA + I K+ K I+LLTDG N + I+ A+S Sbjct: 164 GTAIGLGLATAINRI-----------KDAEGKSKVIILLTDGTNNTGDIAPISAAQIAQS 212 Query: 396 QGIRIMTIAFS-VNKTQQEKARYF----------------LSNCASP--NSFFEANSTHE 436 GIR+ TI + F L AS +F A + Sbjct: 213 YGIRVYTIGVGTQGIAEVPMLDQFGNIHYTEAEVVIDETTLQQIASTTGGKYFRATNVSS 272 Query: 437 LNKIFRD 443 L +I+ + Sbjct: 273 LKQIYSE 279 >gi|172039857|ref|YP_001799571.1| hypothetical protein cur_0177 [Corynebacterium urealyticum DSM 7109] gi|171851161|emb|CAQ04137.1| hypothetical protein cu0177 [Corynebacterium urealyticum DSM 7109] Length = 675 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 65/191 (34%), Gaps = 43/191 (22%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV------------ISDPSFSWGV 316 + ++A + RS+ ++ +G + +V G Sbjct: 94 TRLDAAKEASKNFSRSV------SEESELGFMVYGTKVGNSPEEREAGCKDVTTLLPVGK 147 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + N G T + A++ A + + E + IVL++D Sbjct: 148 GNAGK--ISGEVDKVNASGHTPMGPALKQAAKELPNEGE-------------RSIVLVSD 192 Query: 377 GENTQDNEEGIAICNKAK---SQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 GE+T +C+ AK QGI I T+ F V+ +++ + A + +A Sbjct: 193 GEDT---CAPPPVCDVAKDLHKQGIDLTINTVGFLVDPAARKELQCIAE--AGGGEYLDA 247 Query: 432 NSTHELNKIFR 442 L + + Sbjct: 248 QDAESLAESMK 258 >gi|296208411|ref|XP_002751094.1| PREDICTED: epithelial chloride channel protein-like [Callithrix jacchus] Length = 904 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 48/262 (18%), Positives = 87/262 (33%), Gaps = 27/262 (10%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S+ ++ + Q KS N + D + M N Sbjct: 238 MFMQNLESVAEFCTEETHNTEAPNLQNKMCNHKSTW--NIIMHSEDFQHLSPMTEINSPP 295 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + + S S+ + + + A +I+ I+K V G Sbjct: 296 HPTFSLLKSKQRVVCLVLDKSGSMAAEDRLFRMNQAAELYLIQIIEKGSLV------GMV 349 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ + + + T + + G T+I + ++ + I SN+ + Sbjct: 350 TFDSYAQIQNNLI-KITEDNTYQKITANLPQEASGGTSICNGLKAGFQAISQSNQSTLGS 408 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 I+LLTDGE+ Q I++C + + G I TIA + EK L Sbjct: 409 E---------IILLTDGEDNQ-----ISLCFEEVRQSGAIIHTIALGPSA---EKELETL 451 Query: 420 SNCASPNSFFEANSTHELNKIF 441 SN + F+ N + L F Sbjct: 452 SNMTRGHRFYAHNDINGLIDAF 473 >gi|47212423|emb|CAF93579.1| unnamed protein product [Tetraodon nigroviridis] Length = 688 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 68/174 (39%), Gaps = 21/174 (12%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I + +D ++ R+G ++ +V S+ S H + ++VK T Sbjct: 28 RFMIDILNTLDIGLNSTRVGVVQYSSQVRSEFSLR--SHASLDSMVKAIQEMVPLAQGTM 85 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ + ++ E + ++ N +V++TDG + + + +A+ +G+ Sbjct: 86 TGLAIRYTMNVAFTAAEGDRPKVPN------VVVIVTDG---RPQDRVAEVAAEARERGM 136 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 I + + L ASP + F S +L F + +++ Sbjct: 137 EIYAVGVARADMTS------LRAMASPPFEDHVFLVESF-DLIHQFGLQFQDKL 183 >gi|32475925|ref|NP_868919.1| hypothetical protein RB9502 [Rhodopirellula baltica SH 1] gi|32446468|emb|CAD76304.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 368 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 78/211 (36%), Gaps = 28/211 (13%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L S+++E F + + + + + V++ L + +K D R+G F D Sbjct: 123 LSGSMAQEDFKNDAGKK--VSRLDAVKEVLDGFLAK-RKGD------RVGLVVFGDAAYL 173 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F+ + L + ++ + TA DA+ + E Sbjct: 174 QAPFTTDLQ-LSQELLGECEVGM-AGPRTAFGDAIGLGVNLFDEDTERA----------- 220 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCASP- 425 K I+ LTDG +T+ + A + I+I T+A + + L + AS Sbjct: 221 KTIIALTDGNDTKSKVPPVEAARVATQRDIKIYTVAIGDPTTVGEDKLDEQSLKDVASET 280 Query: 426 -NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 +F A L I+ + ++I + I+ Sbjct: 281 GGKYFFAADREHLAGIYDEL--DKIETQTIQ 309 >gi|68536401|ref|YP_251106.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|260577533|ref|ZP_05845473.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|68264000|emb|CAI37488.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|258604337|gb|EEW17574.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 663 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 59/190 (31%), Gaps = 37/190 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR------------VISDPSFSWGV 316 + + A ID + D+ + + S G Sbjct: 86 TRMDAAKKAAN------DTIDTLADSAQTAVIAYGSEESNAPDNRDKGCQDITTLASLGN 139 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +K K G T I +A++ A + + SS K+ I+L++D Sbjct: 140 NKPEDLEDK--INGLEPKGYTPIGNAIKKAAEELGSSG-------------KRNIILVSD 184 Query: 377 GENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 G +T + GI I T+ F V+ Q++ S ++ A+ T Sbjct: 185 GIDTCAPPPVCDVAEDIAGDGIDLAIHTVGFKVDDKAQKELECISE--VSGGTYTSADDT 242 Query: 435 HELNKIFRDR 444 L + D Sbjct: 243 EALTEALTDA 252 >gi|163848731|ref|YP_001636775.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|163670020|gb|ABY36386.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] Length = 845 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 64/188 (34%), Gaps = 26/188 (13%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S S I K + ++A + +++ D R+G F+ I F + Sbjct: 406 SMSATFGISKFDMAKEAAILSLTTLQPGD------RVGVLAFDTETIWTVPFRTVGEGVS 459 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++ + G T I A+ + + H VLLTDG + Sbjct: 460 LVELQDQIATMSLGGGTNIERALSVGLPALANEPYSTRHA-----------VLLTDGRSY 508 Query: 381 QDNE-EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHEL 437 +N + A++ I + TIA + L+ AS ++ +L Sbjct: 509 SNNYPRYQQLVETARAAQITLSTIAIGSDSDT-----ELLNQLASWGNGRYYFVADATDL 563 Query: 438 NKI-FRDR 444 +I F++ Sbjct: 564 PRITFQES 571 >gi|329922584|ref|ZP_08278159.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942128|gb|EGG38410.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 421 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/250 (16%), Positives = 95/250 (38%), Gaps = 28/250 (11%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + Y+ + K+ I ++ +++ Y L PG P +D+S + Sbjct: 73 QRYTGTSWKLLIPSTLVALFLLGMLFEWVYQLNPGSAKPVKDIVLVIDNSGSMNETDPNQ 132 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW-GVHKLIRTIVKTFAIDE 331 A ++I + + DN R+ F+ F+ + I+ Sbjct: 133 DRYTAAKNLINRMDR-DN-----RVSVIMFDHATTLLQPFTRVNNQETKDEIIAEIDGLA 186 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL-TDGENTQDNEEGIAIC 390 G T I+ A++ + H ++ + +V++ +DG + D++ +A Sbjct: 187 TTDGGTDISLALE----------DTMSHIQESRDAGRSAMVIMLSDGFSETDHDRVLA-- 234 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + K Q I + TI S+ L A + +++ +L+ +F+ +I ++ Sbjct: 235 -EYKQQQIAVNTIGLSLVN---PDGAQLLQTIAAETGGQYYDVQHAEDLSFVFQ-KIYDD 289 Query: 449 IFERVIRITK 458 + +R +TK Sbjct: 290 VGDR-SLLTK 298 >gi|156404155|ref|XP_001640273.1| predicted protein [Nematostella vectensis] gi|156227406|gb|EDO48210.1| predicted protein [Nematostella vectensis] Length = 1128 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 52/396 (13%), Positives = 123/396 (31%), Gaps = 30/396 (7%) Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNF--------TDREVRDIVRDTA 119 ++ +++ S+ + + +K+ E I + + + D + I+ D A Sbjct: 15 MLYLIKDTSTTLQTAPGELAKKLAELAINGLGTSEMQGYYDKLTFKSLDLDGNSILNDLA 74 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV-S 178 + +V ++ + + ++ ++ + + + + Sbjct: 75 TRFANKLQT-KVTIARKIKDAVEVSYAKSATVTSRTECCKADTRWLKYDSRFRTKVNLDE 133 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 + +I + S Q + T+ + + + + SC+ Sbjct: 134 MCVIISGAASSNPKQLQDNVLQTMKQNIENNPTLTWQYFGSEEGLYTNYPMIRDSSSCSS 193 Query: 239 --SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV- 295 Y Y P + V S + + ++A +V+ ++ D V Sbjct: 194 YDPRYRPWYVEAASPQPKDVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFLAF 253 Query: 296 -----RMGATFFNDRVISDPSFSWGVHKLIR-TIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+ T + + S + I+K F E G T A A+D Sbjct: 254 ESGVRRVKVTSGDAKDEKCFESSLAKASPVNIDILKKFLDGEYASGGTMYAIAFNAAFDI 313 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAF- 405 + D+ ++ KN + I+ +TDG D + N+ S I+T Sbjct: 314 L-----DKYYKEKNTTR-RPVILFMTDGAPNDDPGTILNTVKTRNQGLSTKADILTFGMG 367 Query: 406 -SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 ++ + + + FE + T L + Sbjct: 368 GGISPAGVDLLQSLAEQTLDGGARFEVSLTTALRDV 403 >gi|242076422|ref|XP_002448147.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] gi|241939330|gb|EES12475.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] Length = 697 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 38/275 (13%), Positives = 73/275 (26%), Gaps = 30/275 (10%) Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + + + S + +S + I + + SL Sbjct: 193 LQHIEAFENLNSGSNKTAEISSYPESQAVSQSSCLDGFDILIHVKAPTSSSNDVTGSLVN 252 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + V S K L++ A+ VI+ ++ D R+ F Sbjct: 253 ESSMRSSRRVPIDLVTVLDVSGSMAGTKLALLKQAMGFVIQHLRPSD------RLSVIAF 306 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + H + ++ G T I DA++ A I N Sbjct: 307 SSTARRLFPLQRMSHHGRQQALQAI-SSLGAGGGTNIADALKKAVKVIEDRNYKNSVCS- 364 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-----------GIRIMTIAFSVNKTQ 411 I+LL+DG++T + + + + + T F + Sbjct: 365 --------IILLSDGQDTFNISSNFQGTSAGRRSLVPPSILNELHMVPLHTFGFGADH-D 415 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + AS +F + F IG Sbjct: 416 SDTLHSISE--ASGGTFSFIEDEGVMQDAFAQCIG 448 >gi|78776855|ref|YP_393170.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497395|gb|ABB43935.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 309 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 36/182 (19%), Positives = 69/182 (37%), Gaps = 29/182 (15%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + +V++ ++ I +K DN MG F ++ V+ ++ I+ Sbjct: 106 QHLSRFDVVKEIVSDFISQ-RKNDN------MGLVVFGAYSFIASPLTYDVN-ILNKILS 157 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 I M Y + +S + +K + K +LLTDG +T + Sbjct: 158 QLQI------------GMAGKYTALNTSLAQGANLLKQSKSKTKIAILLTDGYST-PQVD 204 Query: 386 GIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I + K +GI++ I + +A L A S F A+S EL ++ Sbjct: 205 TITLDIALDMIKKEGIKVYPIGIGMPHEYNTEA---LLKIANESGGVAFGASSAAELQEV 261 Query: 441 FR 442 ++ Sbjct: 262 YK 263 >gi|15602708|ref|NP_245780.1| hypothetical protein PM0843 [Pasteurella multocida subsp. multocida str. Pm70] gi|12721152|gb|AAK02927.1| TadG [Pasteurella multocida subsp. multocida str. Pm70] Length = 588 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 48/135 (35%), Gaps = 18/135 (13%) Query: 329 IDENEMGSTAINDAMQTAYDTIISSNED-EVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T+ + + + ++ N+ + K ++ I++L+DGE+ + Sbjct: 440 NIVRPSGWTSASSGLLVGANIMMDENKSPDAKPSKLGTNIQRVILVLSDGEDNWPTYSTL 499 Query: 388 AI------CNKAKSQGIRIMT---------IAFSVNKTQQEKARYFL-SNCASPNSFFEA 431 C+K + Q ++ IAF + C + ++ A Sbjct: 500 TTLLNNGMCDKIREQLGKLQDPNLRELPGRIAFVAFGYSPPANQVAAWKKCVG-DQYYTA 558 Query: 432 NSTHELNKIFRDRIG 446 S EL + F+ IG Sbjct: 559 YSKEELLESFKQIIG 573 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 81/252 (32%), Gaps = 31/252 (12%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV----- 66 +K ++ G + ++TALL +L + G VD + L Q A + Sbjct: 15 RKFYENELGVYTVMTALLAFPLLVLIGFTVDGTGVVLDKARLAQGMDQAALALVAENNDY 74 Query: 67 ----PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRN--FENNLKKNFTDREVRDIVRDTAV 120 +V S + + + + RN + K + E + D + Sbjct: 75 RENKKHGDVNRQVVSPQDKAKFGGNEFMAKQEKRNQELIQGIAKLYLRSENANASSDAPI 134 Query: 121 EMN------------PRKSAY--------QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT 160 ++ P + Y +V L + + +K I+ Sbjct: 135 TIDKPFHYSCEELDLPTGNEYARRKPIVCEVQGGVNRKFWLPVSESLVSADKLKQDRIRM 194 Query: 161 KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 +++ + V + V DFS SM + +D G+ L R V S S+ Sbjct: 195 ESDTSYAIKEKGIVIPVELMLVSDFSGSMNSHLQDKNGRSLGKTKITILREVVSEISKIL 254 Query: 221 KVGIRDEKLSPY 232 E +SP+ Sbjct: 255 LPEDVSEGVSPF 266 >gi|332238409|ref|XP_003268390.1| PREDICTED: matrilin-2 isoform 3 [Nomascus leucogenys] Length = 915 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 553 EGFRLAEDGKHCRRKDVCKSTHHGCEHICVNDGNSYICKCSEGFVLAEDGRQCKKCTEGP 612 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 613 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLR 669 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 670 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 725 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 726 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 775 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 776 AEDFSTMDEISEKLKKGICEALEDSDGR 803 >gi|332238407|ref|XP_003268389.1| PREDICTED: matrilin-2 isoform 2 [Nomascus leucogenys] Length = 937 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFRLAEDGKHCRRKDVCKSTHHGCEHICVNDGNSYICKCSEGFVLAEDGRQCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|332238405|ref|XP_003268388.1| PREDICTED: matrilin-2 isoform 1 [Nomascus leucogenys] Length = 956 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFRLAEDGKHCRRKDVCKSTHHGCEHICVNDGNSYICKCSEGFVLAEDGRQCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTNKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|149721558|ref|XP_001490961.1| PREDICTED: matrilin 2 isoform 1 [Equus caballus] Length = 956 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 88/258 (34%), Gaps = 26/258 (10%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + S Y+ C++ Sbjct: 594 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNRGSSYICKCSEGFILAEDGRRCKRCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLVFVIDGSKSLGEENFEIVKQFVTGIIDSL-AISPKA--ARVGLLQYSTQVRTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FGSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPLSARVPRVAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPIDKHLFY 816 Query: 431 ANS---THELNKIFRDRI 445 A E+N+ + I Sbjct: 817 AEDFSTMGEINEKLKKGI 834 >gi|149721562|ref|XP_001490991.1| PREDICTED: matrilin 2 isoform 2 [Equus caballus] Length = 915 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 88/258 (34%), Gaps = 26/258 (10%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + S Y+ C++ Sbjct: 553 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNRGSSYICKCSEGFILAEDGRRCKRCTEGP 612 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 613 IDLVFVIDGSKSLGEENFEIVKQFVTGIIDSL-AISPKA--ARVGLLQYSTQVRTEFTLR 669 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 670 N--FGSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPLSARVPRVAIVF 725 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 726 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPIDKHLFY 775 Query: 431 ANS---THELNKIFRDRI 445 A E+N+ + I Sbjct: 776 AEDFSTMGEINEKLKKGI 793 >gi|293347920|ref|XP_001064219.2| PREDICTED: vitrin-like [Rattus norvegicus] Length = 648 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 50/394 (12%), Positives = 121/394 (30%), Gaps = 47/394 (11%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + QA V ++Q + +++ Q ++ + + + N Sbjct: 288 DVVQALDIGPGGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQRGGLSNVGRAIS 347 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + R A V + + G + I+ + Sbjct: 348 FVTKNFFSKANGNRGGAPNVAV--------------VLVDGWPTDKIEEVSR-------V 386 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E G+++ +V + + Q E + + + + L Sbjct: 387 ARESGINVFFVTVEGAAEREKQHVVEPNFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVK 446 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ + S +D SS + +++ +A++ + + D Sbjct: 447 RVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTSNFRTVLQF-VANLSKEFEISDTD 501 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+GA + F + + ++ G T+ A+Q A + + Sbjct: 502 ---TRIGAVQYT--YEQRLEFGFDKYNSKADVLSAIRRVGYWSGGTSTGAAIQYALEQLF 556 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ +K ++L+TDG + ++ A +G+ I + Sbjct: 557 KKSKPNK---------RKVMILITDG---RSYDDVRIPAMAAYQKGVITYAIGIA--WAA 602 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q++ ++ A +SFF + L K F RI Sbjct: 603 QDELEVIATHPARDHSFF-VDEFDNLYK-FVPRI 634 >gi|294055316|ref|YP_003548974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614649|gb|ADE54804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 730 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 77/210 (36%), Gaps = 25/210 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + +D S K L+++AL + R + D V V GA+ Sbjct: 359 PWEERPASNLVFLLDVSGSMSQPNKLPLLKEALMLLTRRLDSRDRVAIVVYAGASG---- 414 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 ++ + + + + GST ++ AY H +++ Sbjct: 415 LVLPSTTANNTATIEHALT-----QLQAGGSTNAGAGIELAYQV------AREHFIEDGN 463 Query: 366 EAKKYIVLLTDGENT--QDNEEGIA--ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 ++L TDG+ Q N +A + ++AK G+ + + F + + + Sbjct: 464 NR---VILCTDGDFNVGQTNRGDLAQIVADQAK-DGVSLTVLGFGMGNYKDNMLEELSNK 519 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ +S E K+F + + IF+ Sbjct: 520 --GKGTYAYVDSEAEARKVFLQDLASNIFK 547 >gi|264678234|ref|YP_003278141.1| hypothetical protein CtCNB1_2099 [Comamonas testosteroni CNB-2] gi|262208747|gb|ACY32845.1| putative membrane protein [Comamonas testosteroni CNB-2] Length = 408 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 53/169 (31%), Gaps = 4/169 (2%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G F I AL M +LG G+ +D+ R + L+ A + + A+ L + ++ Sbjct: 12 GAFLITFALFMLFLLGFMGIALDLGRLFIVKTELQTAMDSCALAAARELNGQSDAITRAQ 71 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN--PRKSAYQVVLSSRY 137 N+ K T R T R + Q +SS Sbjct: 72 NAGMAAGNSNNANLQSANWNGQGKLPATGISFRKQDYVTPTSDGKLARYAECQYSMSSIK 131 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 LL + F +W EA V+ + I + + Sbjct: 132 LWLLQAMGAFTGDSA--TWPNTGTVEARAVATRAPSQSACPIPVQLKKA 178 >gi|212276002|ref|NP_001130333.1| hypothetical protein LOC100191428 [Zea mays] gi|194688870|gb|ACF78519.1| unknown [Zea mays] Length = 704 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 38/275 (13%), Positives = 72/275 (26%), Gaps = 30/275 (10%) Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + + D S +S + I + + SL Sbjct: 189 LQRMEAFDDLNFGSSKTAEISSYPEFQAVPQSTCLDGFDILIHVKAPTSSSDDATGSLVN 248 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + V S K L++ A+ VI+ ++ D R+ F Sbjct: 249 GSSLRLSRRVPIDIVTVLDVSGSMAGTKMALLKQAMGFVIQHLRPSD------RLSVIAF 302 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + H + ++ G T I DA++ A I + Sbjct: 303 SSTARRLFPLQRMSHHGRQQALQAI-NSLGAGGGTNIADALKKAVKVIADRSYKNSVCS- 360 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-----------GIRIMTIAFSVNKTQ 411 I+LL+DG++T + + + + + T F + Sbjct: 361 --------IILLSDGQDTYNISSNFQGTSAGRRSLVPSANPNELHMVPLHTFGFGADH-D 411 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + AS +F + F IG Sbjct: 412 SDTLHSISE--ASGGTFSFIEDEGVMQDAFAQCIG 444 >gi|33152377|ref|NP_873730.1| tight adherence protein G [Haemophilus ducreyi 35000HP] gi|21326716|gb|AAL92476.1| TadG [Haemophilus ducreyi] gi|33148600|gb|AAP96119.1| tight adherence protein G [Haemophilus ducreyi 35000HP] Length = 562 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 49/421 (11%), Positives = 133/421 (31%), Gaps = 38/421 (9%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M+ I +K+ I++ +G + I ALL ++ + + ++V + L A + A Sbjct: 1 MIMRKYVITQTKRFIQNQSGVYIIFGALLTLPIVALLFVSLEVAGIIQDKARLNDALEQA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT-- 118 +++ + + N+ + + + ++ + K F + I +T Sbjct: 61 VLSLTAENNSGRKSYDYALTNAEKANGKYLADSEAGKRDSQIVKTFVKLYLPQIDENTMK 120 Query: 119 -----AVEMN--PRKSAYQVVLSSRYDL-----LLNPLSLFLRSMGIKSWLIQTK--AEA 164 + N K+ Q SS + +N SLF ++G + + + Sbjct: 121 FEPICTTQNNAITPKNGKQYAYSSSHVTCTVTGSINHRSLFPMTVGKSKIIPEQVSLSSG 180 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSM---LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 + + + + V D S SM ++ + + + Sbjct: 181 SMAQKINNVNLPLDLMVVADLSGSMDYNINNHKVYSNTEASKLTLLKQVLEELTDKYLLS 240 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL----VRDA 277 + + + + L ++ H + S + + + Sbjct: 241 EEANPNNRISMIPFAMGAQHPIRNSCVLPFEWNQSHIGYNDSQKVSPNEIEYNLRNLPIR 300 Query: 278 LASVIRS--IKKIDNVNDTVRMGATF--FNDRVISDPSFSWGVHKLIRTIVKTF------ 327 ++ + +D ++G F ++ G K + + Sbjct: 301 SRTIFTHNLVYLLDTKKTLEKIGTRFNNYDVEYQKSAICLEGSDKFQQQWYEKNQKINFI 360 Query: 328 --AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G+T + + A + +++ + ++ I++L+DG ++ ++ Sbjct: 361 NEVKRLKAAGATLASSGLIVAVNNMLNEP---ARSDVLKQQTRRTILILSDGSDSIGDDS 417 Query: 386 G 386 G Sbjct: 418 G 418 >gi|150019021|ref|YP_001311275.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] gi|149905486|gb|ABR36319.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] Length = 962 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 59/186 (31%), Gaps = 12/186 (6%) Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 + F AD K + ++ N +++ + + P Sbjct: 29 SSVFRVKADIASKPQFTVTIDSYTPKNPKLGEEITINGTIHPQPFKISIPPKEIVLVLDS 88 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S S+ K ++ A I + + N+ + F+ + + Sbjct: 89 SGSMADNY-KLTNLKKAATDFITKMSTVKNLK----IAIVDFDTQATIINKLTDVSSSTN 143 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 T +K + G T + ++ A + +S+E KN I+ ++DGE T Sbjct: 144 VTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSEQNPLASKN-------IIFMSDGEPT 196 Query: 381 QDNEEG 386 N + Sbjct: 197 YYNWQT 202 >gi|288942396|ref|YP_003444636.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897768|gb|ADC63604.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 346 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 55/168 (32%), Gaps = 30/168 (17%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + H + + + +TA+ DA+ + E Sbjct: 144 RVGLILFGSQAFILSPLTLDRHAARQLLDG--VVPSIAGPATALGDAIALGVSKLRERPE 201 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF-----SVNKT 410 + ++++ DG+N + A++ G RI I S+ Sbjct: 202 -----------GSRVMIVIADGDNNAGSFAPKEAARLARATGTRIYVIGVGSKQPSIPIL 250 Query: 411 QQEKARY---------FLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 ++ RY L A + +F A T L +I RIG Sbjct: 251 EEGSVRYRDDLTMDEGTLQEIADLTGGGYFRATDTRALEEI-SSRIGQ 297 >gi|55729600|emb|CAH91529.1| hypothetical protein [Pongo abelii] Length = 955 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 90/268 (33%), Gaps = 26/268 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 594 EGFQLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFLLAEDGRRCKKCTEGP 653 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ I R+G ++ +V ++ + Sbjct: 654 IDLAFVIDGSKGLGEENFEVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLR 710 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 711 N--FNSAKDMKKAVAHMKYMGKGSMTGLALRHMFERSFTQGEGA--RPLSTRVPRAAIVF 766 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KAK+ GI + + ++ L AS F Sbjct: 767 -TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------LQEIASEPTTKHLFY 816 Query: 431 ANS---THELNKIFRDRIGNEIFERVIR 455 A E+++ + I + + R Sbjct: 817 AEDFSTMDEISEKLKKGICEALEDSDGR 844 >gi|47218988|emb|CAG02026.1| unnamed protein product [Tetraodon nigroviridis] Length = 1039 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 44/263 (16%), Positives = 71/263 (26%), Gaps = 46/263 (17%) Query: 181 WVIDFSRSM---------------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNG----- 220 +VID S SM L + G F ++ K Sbjct: 337 FVIDMSGSMSGTKMQQEAHRAARSLQKRSTDGGTARISFSPTIEQQRKCPDCPGTLIDGD 396 Query: 221 ---KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 K + E + N + P L FV S K R+A Sbjct: 397 FIIKYDVNRENDLGDIQIANGYFVHFFAPKDLPRLPKNVVFVIDMSGSMSGTKMQQTREA 456 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK-----TFAIDEN 332 + ++ + D + G F+ R+ W T + Sbjct: 457 MLKILEDL---DPEDHF---GIILFDHRIQF-----WNTSLSKATKENIDEAMVYVKAIQ 505 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T IN + A D + ED + I+LLTDG+ I Sbjct: 506 SYGGTDINAPVLKAVDML---KEDRKAKRLPEKSID-MIILLTDGDPNSGESRIPVIQEN 561 Query: 393 AKSQ---GIRIMTIAFSVNKTQQ 412 K+ + + ++ F + Sbjct: 562 VKAAIGGQMSLFSLGFGNDVKYP 584 >gi|91773457|ref|YP_566149.1| von Willebrand factor, type A [Methanococcoides burtonii DSM 6242] gi|91712472|gb|ABE52399.1| hypothetical protein with von Willebrand factor type A domain and Invasin domain [Methanococcoides burtonii DSM 6242] Length = 892 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 53/260 (20%), Positives = 91/260 (35%), Gaps = 24/260 (9%) Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 ++ V S N I E P S S+ + G +DP +D Sbjct: 559 ISASIDAEPYVVTSGDIVNITTVITVEGELPVSRSAATSMLILDRSGSMDPDYYAGTALD 618 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKI-----DNVNDTVRMGATFFN-DRVISDPSFSW 314 + + +A + K +N+ +G F+ + S Sbjct: 619 IVLVLDRSGSMKFLGNAPEQPLTDAKSAAKIFMENLLSNTEVGVVSFSSTSTVDRQPVSL 678 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + ++ G TAI DAM A + +I+ D KK +++L Sbjct: 679 NISG-NKDLLHNAIDSMVADGGTAIGDAMADANNLLINGRPDA----------KKIMIVL 727 Query: 375 TDGENTQDNE-EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEA 431 TDG T ++ +G + A IRI +I ++ E L AS S++ A Sbjct: 728 TDGVATAGSDRDGSDAISTANLNNIRIYSIGLGSSEYIDEP---MLKRIASETGGSYYNA 784 Query: 432 NSTHELNKIFRDRIGNEIFE 451 S EL ++ + I EI + Sbjct: 785 PSGSELQTVY-NTISKEISD 803 >gi|310814568|ref|YP_003962532.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] gi|308753303|gb|ADO41232.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] Length = 1160 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 50/144 (34%), Gaps = 21/144 (14%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + + + A++ + +D T +G F G I Sbjct: 767 DVTRLDVAKQAVS---AAANLLDP--QTGSLGVVMFGSEAEVALPL--GPLPDAAGIAAA 819 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I +Q A+ + +S+ D H IV++TDG + + + G Sbjct: 820 LG-HLQPGGGTNIYPGLQLAFQALRASDADARH-----------IVVMTDGMSDEADFPG 867 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT 410 + +++GI + ++A Sbjct: 868 LLAA--IRAEGITVSSVAIGSTSE 889 >gi|324508820|gb|ADY43721.1| C-type lectin protein 160 [Ascaris suum] Length = 534 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 82/244 (33%), Gaps = 20/244 (8%) Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 F + + + + V S DP+ V Sbjct: 135 FSSTGSPQHVNTAVTKRRWTSTSYTTTQRPVPTTPSQVLKERQCACDPAKLYLDIVVVVD 194 Query: 264 LRHVIKKKHLVRDA--LASVIR--SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + K L+ A LA+V + ++ +V VR+G F+++ + + Sbjct: 195 SSLSMTKDGLIEVAADLATVFQWMNVSSGTDVGQFVRVGLVTFSNQAFVNGNLD-DFTSY 253 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + F + I A+Q+A D + SS A+ I+L T Sbjct: 254 NSLVKRLFQMPYLGGSELNIESALQSASDILQSSR----------YYARTAILLYTSAYG 303 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS---THE 436 + AI N+ K G +I+T+AF + LS+ ASP F + T E Sbjct: 304 EGGFTDPKAIANQIKESGTKIITVAFRQQPEGSLVEK--LSHLASPGFSFASRQSIITDE 361 Query: 437 LNKI 440 + + Sbjct: 362 ILRA 365 >gi|73669697|ref|YP_305712.1| BatA [Methanosarcina barkeri str. Fusaro] gi|72396859|gb|AAZ71132.1| BatA [Methanosarcina barkeri str. Fusaro] Length = 317 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 66/219 (30%), Gaps = 37/219 (16%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIK--KKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + L ++ V S + + + +I S+K D G Sbjct: 79 HIPLEQTKEGVNVVLVMDVSGSMQAQDYTPSRLEAAKSSAEILINSLKSKD------YAG 132 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + S +++ + GSTAI D + + Sbjct: 133 IVTFESGATTAAYLS----PYKEKVIEKLRNVAPKEGSTAIGDGLSLG-----------I 177 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV----------- 407 + KK I+LL+DG N AK+ I++ TI Sbjct: 178 DMASSIPNKKKVIILLSDGVNNAGYISPDEAIQYAKANNIQVYTIGMGSNGNVLLGYDWF 237 Query: 408 -NKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRD 443 N E L A+ +F++ L++I+++ Sbjct: 238 GNPQYAELDEATLQAIANDTGGKYFKSIDDKTLDEIYKN 276 >gi|126727880|ref|ZP_01743708.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] gi|126702821|gb|EBA01926.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] Length = 576 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 57/397 (14%), Positives = 119/397 (29%), Gaps = 74/397 (18%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 K +K K F I ++ +M+ G+ +D +R ++Q TA+++A Sbjct: 25 RKVFSNWRKSEKGSMTAFGI---FIVAIMVTSAGLSIDFMRQERTRVQMQQNLDTAVLSA 81 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + L E + ++ + D+ + +V ++ Sbjct: 82 ASLLQTLGAEA--------------------------VVTDYMSKANIDVDYNLSVNVSE 115 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + V ++ L +LFL + I S I + AE + + I V+D Sbjct: 116 GINFRAVDATATATLE----TLFLGLLNIDSLGITVTSGAEERIPN------LEISLVLD 165 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S SM R + + + S + + ++P ++S+ + Sbjct: 166 VSGSMGSNSRLTNLKTAATQFVSTIISGGSGGTVAMSIIPFSSSVTP-----SQSVIDAI 220 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL------ASVIRSIKKIDNVNDTVRMG 298 + F D + + A+ + D+ N R Sbjct: 221 TMEDNHDYSTCIEFADDDFSSSSLDLDSTYKRAVFTSRYSDTGSGDFDDADDFNQDWR-- 278 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE--- 355 + + + T++ GSTA + M+ + + Sbjct: 279 ----SCYMDEYFELL--AYSDDETVLYNKIQGLLAQGSTAGHTGMKWGTSLLDPEFQAVT 332 Query: 356 ---------DEVHR----MKNNLEAKKYIVLLTDGEN 379 D H ++ K IV ++DG N Sbjct: 333 NSMIAAGVVDAAHAGMPVAYSDTNTMKIIVFMSDGNN 369 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 32/72 (44%), Gaps = 5/72 (6%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 +Q + A C AK +GI I TIAF + + L+NCA+ ++ + + Sbjct: 505 SQSDTLMSANCTAAKDRGITIFTIAFEAPSNAETQ----LNNCATSDNHYYDAQGTSITS 560 Query: 440 IFRDRIGNEIFE 451 +F I I + Sbjct: 561 VF-SSIATTIQK 571 >gi|161086980|ref|NP_631887.2| chloride channel calcium activated 4 [Mus musculus] Length = 1044 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 65/175 (37%), Gaps = 24/175 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + L+ A + I + +++ +G F+ + + + + Sbjct: 325 ITRLTLMNQAAELYLIQIIEKESL-----VGLVTFDSTATIQTNLIR-IINDSSYLAIST 378 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + + G T+I + ++ ++ I SS++ IVLLTDGE+ + I Sbjct: 379 KLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE---------IVLLTDGEDNR-----I 424 Query: 388 AIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + C + K G I TIA + + LS+ F+ + L F Sbjct: 425 SSCFQEVKHSGAIIHTIALGPSA---ARELETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|212720733|ref|NP_001132911.1| collagen, type XXII, alpha 1 [Gallus gallus] Length = 1599 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 39/187 (20%), Positives = 75/187 (40%), Gaps = 20/187 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ +S + VR +++++ + + D R+G ++DR ++ G Sbjct: 40 VFILDASSSVGKEDFEKVRQWVSNLVETFEIGP---DKTRVGVVRYSDRPTTEFDL--GK 94 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +K I + G+T DA++ Y S +++ R+ ++ KK +LLTD Sbjct: 95 YKTCEEIKEAARKIRYYGGNTNTGDALR--YINTYSFSKEAGGRL-SDRTVKKVAILLTD 151 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G + + N A+ GIRI + E + L AS F + Sbjct: 152 GRSQDYVLDP---ANAARQAGIRIFAVGVG------EALKEELDEIASEPKSAHVFHVSD 202 Query: 434 THELNKI 440 + ++KI Sbjct: 203 YNAIDKI 209 >gi|169234588|ref|NP_001038425.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Danio rerio] gi|169154233|emb|CAH68946.2| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 1 (CACNA2D1) [Danio rerio] Length = 1069 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/230 (18%), Positives = 82/230 (35%), Gaps = 28/230 (12%) Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP-----LDPSLSEEHFVDSSS 263 D ++ + R SP++ S NK Y + P VD+S Sbjct: 201 DPSLHWQVFGSATGLARYFPASPWVDSKNKIDLYDVRRRPWYIQGAASPKDMLILVDASG 260 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR-- 321 + L+R +++ ++ ++ +D V + FN+ S F V +R Sbjct: 261 SVSGLT-LKLIRTSVSEMLETLS----DDDYVNI--VSFNNSAKSVACFENLVQANVRNK 313 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 +K G+T + A++ + S N + K I+L TDG Sbjct: 314 KTLKEAVQKITANGTTDYKIGFKEAFNQLASMNVSRANCN-------KIIMLFTDG---- 362 Query: 382 DNEEGIAICNKAKSQG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 ++ I ++ S +RI T + + + +Y C + ++E Sbjct: 363 GEDKASEIFDEYNSDKRVRIFTFSVGQHNYDKAPIQYM--ACHNKGYYYE 410 >gi|309790845|ref|ZP_07685389.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308227132|gb|EFO80816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 885 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 69/207 (33%), Gaps = 30/207 (14%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSS----LRHVIKKKHLVRDALASVIRSIKKID 289 V +Y + P L+ +D S ++K + ++AL IR + D Sbjct: 391 VEAALPVYMDVRDREQRPDLALVFVIDRSGSMAEPAGNVQKLDIAKEALVQAIRMLYGED 450 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G F+ + + + GV + V G T I + Sbjct: 451 ------RVGIVTFDSQAYTTMPITQGVGEE---EVLQAIASVTADGGTNIGAGLSAGQRM 501 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + H ++LLTDG +++ +A+ ++QGI + +A + Sbjct: 502 LTGVEAKIKH-----------MILLTDGWGEGNDQ--LAVVEAMRAQGITLSVVAAGSDT 548 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHE 436 ++ K A ++ A Sbjct: 549 AEELKTL----ATAGGGRYYAAAIMQA 571 >gi|329850248|ref|ZP_08265093.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328840563|gb|EGF90134.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 575 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 62/456 (13%), Positives = 133/456 (29%), Gaps = 31/456 (6%) Query: 26 TALLMPVMLGVGGM-LVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 TA + +LG+ G+ VD+ + + A ++ + + + + ++ Sbjct: 128 TAHVPTYLLGLVGIDTVDIDAVAKSGVSTSTAEVAFVLDNTGSMSSNNKMTYLKSSLDAV 187 Query: 85 FP---KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + Y + D A + + Y S + Sbjct: 188 LASMLDSTGKNYAKTKVALVPFDTQVSLSNVAGMVDYAGDFSTVTPTYSCSGYSSAQCQV 247 Query: 142 NPLSLFLRSMGIKSWLIQTKA---EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + G + L + + +Y S + RS + + Sbjct: 248 ISENASAMCNGNATCLSNNRNYTRSWTSNGNTYFGVFATSYYQSSNTYRSYGNTYYYTYI 307 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + T+ S+ NG + + ++ + Y P + Sbjct: 308 AWRQVVYRVNSSTLTLNSTNNGGDYYTYQAYYNQPNNYSRYYGAVTYSTPTAGGYNSTST 367 Query: 259 VDSSSLRHVIKKKHLV----RDALASVIRSIKKID--NVNDTVRMGATFF-NDRVISDPS 311 + L+ + VI + D + AT + + ++ Sbjct: 368 TIIKDNTTITANSDLLGVGTSNWTGCVIDRTQSYDVTSDAPVAGTPATLYPAAKCATNTL 427 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 +T+A G+T + +Q + + + + KY+ Sbjct: 428 LPVMALTQDIAAARTYAARMAPAGNTNVTIGVQWGMEVLSPTAPFSEGGAFTDKAVLKYM 487 Query: 372 VLLTDGENTQDNEEGIA---------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 ++LTDG NTQ+ C AK+ GI + T+ + L NC Sbjct: 488 IVLTDGINTQNRWTTNNSQINARLALACTNAKNLGITVFTVRVEQGDSTT------LQNC 541 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 AS +++ S + +I I R +R+T+ Sbjct: 542 ASQTAYYYNLSNADQLPATMSKIMKSI--RKVRLTQ 575 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 45/294 (15%), Positives = 95/294 (32%), Gaps = 24/294 (8%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 TK Y +K G+ +I A + ++ G +D L+ A I A Sbjct: 2 TKVSRYLRKFSADIRGNVTMIVAFSVIPIVAAVGGGLDFANIQAARAKLQDAVDAGAIAA 61 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ + + + A F I++ L+ +F + D + + ++ Sbjct: 62 TIDPTATPTQTTREAVAKKAFC-GNIKQ------SGGLQNSFCNTTTLDTLGTASATLST 114 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 S + ++ Y + + L +GI + I A++ + + + +V+D Sbjct: 115 ATSNNIMTVT--YSATAHVPTYLLGLVGIDTVDIDAVAKSGVSTSTAE------VAFVLD 166 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE-KLSPYMVSCNKSLYYM 243 + SM S + D + S GK + + L P+ + S Sbjct: 167 NTGSM------SSNNKMTYLKSSLDAVLASMLDSTGKNYAKTKVALVPFDTQVSLSNVAG 220 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 + D S + S + +A A + + N + R Sbjct: 221 MVDYAGDFSTVTPTYSCSGYSSAQCQVIS--ENASAMCNGNATCLSNNRNYTRS 272 >gi|126303712|ref|XP_001380869.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 628 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 53/416 (12%), Positives = 124/416 (29%), Gaps = 49/416 (11%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSLEEVSSRAKNSFTFPKQK 89 L G + R+ + LK +Q I + PL +Q ++ S + Sbjct: 246 FLIDGSWSIGKRRFRIQKQFLKDISQALDIGPAGPLLGIVQFGDDPSMEFNLKTHANSRD 305 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 ++ + + + N + R A P + Sbjct: 306 LKAAIEKIPQKGGLSNVGRALSFVTKNFFSNANGNRGGA--------------PNVAIVM 351 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 G + ++ + +E G++I ++ + Q E ++ + Sbjct: 352 VDGWPTDKVEEASR-------LARESGINIFFITIEGAVENEKQNVIEPNFVDKAVCRRN 404 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + L + N+ L + ++ FV S Sbjct: 405 GFYSLNVLSWFSLHKIVQPLVKRVCDTNR-----LACSKTCLNSADIGFVIDGSSSVGTG 459 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +A++ + + D R+GA + F + + + I+ Sbjct: 460 NFRTLLQFVANLSKEFEISDTD---TRIGAVQYT--YEQRLEFGFDKYSTKQDILNAIKR 514 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+ A+ A + + ++ +K ++L+TDG + ++ Sbjct: 515 VNYWSGGTSTGAAINYALEHLFKKSKPNK---------RKLMILITDG---RSYDDVRIP 562 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A G+ +I + Q++ ++ +SFF + +L K I Sbjct: 563 AMAAHQNGVITYSIGIA--WAAQDELEVIATHPTKDHSFF-VDEFDDLYKSVPKII 615 >gi|317055486|ref|YP_004103953.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315447755|gb|ADU21319.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1311 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 74/191 (38%), Gaps = 32/191 (16%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +DSS K+L +DA + + ID F+ + + + Sbjct: 649 IDSSGSMTWNDPKNLRKDAAKEFVDKLSSIDEAA------IIDFDSSSKINRNLTSN--- 699 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 RT++ + D + G T++ + + + SN+ KK ++LLTDG+ Sbjct: 700 --RTLLYSAIDDIDSSGGTSLTAGVSKGLEALSKSND------------KKIMILLTDGK 745 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHE 436 D ++ +A + G+ I TI N + + L++ A+ ++ A + Sbjct: 746 GPYD----KSLTTQAINAGVTIYTIGLGTNN---DIDQPLLNSIATETGGKYYHAKKDID 798 Query: 437 LNKIFRDRIGN 447 + F + G+ Sbjct: 799 IQGSFDNVSGD 809 >gi|148680072|gb|EDL12019.1| mCG141954, isoform CRA_b [Mus musculus] Length = 972 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 65/175 (37%), Gaps = 24/175 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + L+ A + I + +++ +G F+ + + + + Sbjct: 325 ITRLTLMNQAAELYLIQIIEKESL-----VGLVTFDSTATIQTNLIR-IINDSSYLAIST 378 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + + G T+I + ++ ++ I SS++ IVLLTDGE+ + I Sbjct: 379 KLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE---------IVLLTDGEDNR-----I 424 Query: 388 AIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + C + K G I TIA + + LS+ F+ + L F Sbjct: 425 SSCFQEVKHSGAIIHTIALGPSA---ARELETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|308081588|ref|NP_001183957.1| vitrin isoform 2 [Mus musculus] gi|26342052|dbj|BAC34688.1| unnamed protein product [Mus musculus] Length = 628 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 48/389 (12%), Positives = 117/389 (30%), Gaps = 46/389 (11%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + QA V ++Q + +++ Q ++ + + + N Sbjct: 268 DVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQRGGLSNVGRAIS 327 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + R A V + + G + ++ + Sbjct: 328 FVTKTFFSKANGNRGGAPNVAV--------------VMVDGWPTDKVEEVSR-------V 366 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E G+++ ++ + D Q E + + + + L Sbjct: 367 ARESGINVFFITVEGAAERDIQHVVEPGFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVK 426 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ L + ++ FV S V +A++ + + D Sbjct: 427 RVCDTDR-----LACSKTCLNSADIGFVIDGSSSMGTSNFRTVLQFVANLSKEFEISDTD 481 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+GA + F + + I+ G T+ A+Q A + + Sbjct: 482 ---TRVGAVQYT--YEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF 536 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ +K ++++TDG + ++ A +G+ I + Sbjct: 537 KKSKPNK---------RKVMIIITDG---RSYDDVRIPAMAAYQKGVITYAIGIA--WAA 582 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKI 440 Q++ ++ A +SFF + L KI Sbjct: 583 QDELEVMATHPAKDHSFF-VDDFDNLYKI 610 >gi|303248312|ref|ZP_07334574.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302490337|gb|EFL50249.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 452 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 41/368 (11%), Positives = 104/368 (28%), Gaps = 54/368 (14%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 ++ A + ++ G+ VD+ R + L+ A A + S+ L + Sbjct: 1 MVVAATLVGLMAAVGVAVDLGRVYVAHNKLQNAVDAAALAGSLQL-----------PDDP 49 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 K+ + + N N + + T + + V + D+ L+ Sbjct: 50 DVDNGKVSQAVTTNLAANDPE----------AKATDISSGGATRSVCVTAEADVDMTLSK 99 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 + + + + + + V+D + SM + + Sbjct: 100 VVGL--------------DATTVTAEACAGYNDIELVMVLDATGSMRGTPIANVKEAAAN 145 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 S ++ K+G+ + + + G + + ++ Sbjct: 146 LVDLIMP--DSGANTRSKIGLVPFQGKVRIDGNDPVTAERDPDGVGAGCRNADGTLNDGK 203 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 L+ + S+ G + + DR S S + I Sbjct: 204 LKTEYSDT----RSRNSIF---------YGYTISGVSTYYDRTCSGMSPIRALSSDKEAI 250 Query: 324 VKTFA--IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T I++ ++ + + + + +K +++LTDG+ Sbjct: 251 LDNIGAINAGAVTSGTLISEGIKWGHKVLSPKAPYTEGN--TDKKVRKIMIVLTDGDTED 308 Query: 382 DNEEGIAI 389 G Sbjct: 309 GRCGGRYA 316 >gi|34783791|gb|AAH56811.1| Zgc:112265 protein [Danio rerio] Length = 927 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 66/474 (13%), Positives = 142/474 (29%), Gaps = 68/474 (14%) Query: 22 FFIITAL--LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 F ++ + + ++ G L+ + I + + S A Sbjct: 1 FSVMVMMDRVAFRLILFGAFLISATSAPVEKKQNVDIYSFYINSTVSSRYATTIITSRVA 60 Query: 80 KNSFTFPKQKIEEYLIRNF-------------ENNLKKNFTDREVRD-----------IV 115 + + E + +N + + K + + + ++ Sbjct: 61 NKLSEPQEIQFEVKIPKNAFISKFRMIIEGKTYDGVVKKKEEAQQQYNKAVSRGESAGLI 120 Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET----VSRSY 171 + + K++ V +S+ L L R +G LI + + Sbjct: 121 KSVGRTLEDFKTSVTVAANSKVTFELTYEELLKRRLGKYELLINAQPMQPVADFKIDVHI 180 Query: 172 HKEHGVSIQWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + G+S V + + + + R ++ + + G+ + + Sbjct: 181 QENPGISFLEVKGDLNTGDLASAVKTTRADKDAWVTFYPTRDQQTKCTNCAENGLNGDLI 240 Query: 230 SPYMVSCNKSL--------YYMLYPGPLDPSLSEEH--FVDSSSLRHVIKKKHLVRDALA 279 Y V+ Y++ Y P D ++ F+ S +K R AL Sbjct: 241 ITYDVNRGNPKGEVQISNGYFVHYFAPSDVPHIPKNVVFIIDRSGSMHGRKIRQTRSALL 300 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++++ + + D+ G F+ + K R ++F + G+T I Sbjct: 301 TILKDLDEDDHF------GLITFDAEIDFWRRELLQATKANRENAESFVKRIQDRGATNI 354 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKK----YIVLLTDGENTQDNEEGIAICNKAKS 395 NDA+ D I N +K ++LLTDG+ T I K Sbjct: 355 NDAVLAGVDMI-------------NRNPRKGTASILILLTDGDPTAGETNIEKIMANVKE 401 Query: 396 Q---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + F + + L N A +E + + F D + Sbjct: 402 AIGSKFPLYCLGFGYDVNFDFLTKMSLENNAVARRIYEDSDADIQLQGFYDEVA 455 >gi|311274909|ref|XP_003134506.1| PREDICTED: matrilin-4-like [Sus scrofa] Length = 721 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 56/167 (33%), Gaps = 18/167 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D + R+G ++ +V S G + T Sbjct: 197 LVGLLRSLDVGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMEGAIRALVPLAQGTMTG 254 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 255 LAIQYAMNVAFSVAEGAR---PPEARVPRVAVIVTDG---RPQDRVAEVAAQARARGIEI 308 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDR 444 + Q L ASP F S +L + F + Sbjct: 309 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFGQQ 348 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 502 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 556 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 557 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDVSVWAA 610 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 611 RAKEEGIVMYAVGVGKAVEEE------LREIAS 637 >gi|153871328|ref|ZP_02000529.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152072210|gb|EDN69475.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 280 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 33/155 (21%), Positives = 54/155 (34%), Gaps = 25/155 (16%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F + + L + I + GST + + + TAY + Sbjct: 127 IGLIEFGSKAKIISGLTQNAKHLYKAINR-----LKTNGSTNMTEGLTTAYLKL------ 175 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 KN + ++I+LLTDG + I + + GI ++TI +K Sbjct: 176 -----KNVDDP-RFIILLTDGLPNHP-KNTQQIAQEICADGIELITIG----TGDADKTY 224 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 C NSFF + F RI + E Sbjct: 225 LQSLACYDQNSFFA--KAGTMVSTF-SRIAQVLTE 256 >gi|68448495|ref|NP_001020335.1| inter-alpha (globulin) inhibitor H4 [Danio rerio] gi|67677852|gb|AAH96879.1| Zgc:112265 [Danio rerio] Length = 915 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 57/360 (15%), Positives = 115/360 (31%), Gaps = 42/360 (11%) Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET--- 166 E +++ + K++ V +S+ L L R +G LI + Sbjct: 109 ESAGLIKSVGRTLEDFKTSVTVAANSKVTFELTYEELLKRRLGKYELLINAQPMQPVADF 168 Query: 167 -VSRSYHKEHGVSIQWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + + G+S V + + + + R ++ + + G Sbjct: 169 KIDVHIQENPGISFLEVKGDLNTGDLASAVKTTRADKDAWVTFYPTRDQQTKCTNCAENG 228 Query: 224 IRDEKLSPYMVSCNKSL--------YYMLYPGPLDPSLSEEH--FVDSSSLRHVIKKKHL 273 + + + Y V+ Y++ Y P D ++ F+ S ++ Sbjct: 229 LNGDLIITYDVNRGNPKGEVQISNGYFVHYFAPSDVPRIPKNVVFIIDRSGSMHGRRIRQ 288 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 R AL ++++ + + D+ G F+ + K R ++F + Sbjct: 289 TRSALLTILKDLDEDDHF------GLITFDAEIDFWKRELLQATKANRENAESFVKRIQD 342 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK----YIVLLTDGENTQDNEEGIAI 389 G+T INDA+ D I N +K ++LLTDG+ T I Sbjct: 343 RGATNINDAVLAGVDMI-------------NRNPRKGTASILILLTDGDPTAGETNIEKI 389 Query: 390 CNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K + + F + + L N A +E + + F D + Sbjct: 390 MANVKEAIGSKFPLYCLGFGYDVNFDFLTKMSLENNAVARRIYEDSDADIQLQGFYDEVA 449 >gi|325964121|ref|YP_004242027.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] gi|323470208|gb|ADX73893.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] Length = 345 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 84/253 (33%), Gaps = 22/253 (8%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G +I A+LM +LG + VD+ L+ A + I + + + Sbjct: 11 EGGAISVIVAILMVALLGFVAIAVDIGVIYSERAQLQNGADASAIAVAQKCAR--DATGV 68 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + RN + + K D+ + T +++ SA +V + Sbjct: 69 DCSTTSALASG----LANRNALDGMSK----VHTIDLDKTTR-KVSVTTSAKEVGGAD-- 117 Query: 138 DLLLNPLSLFLR-SMGIKSWLIQTKAEA---ETVSRSYHKEHGVSIQWVIDFS-RSMLDY 192 N +SLF ++GI + + +A A ++ SI V D S+ Sbjct: 118 ----NSVSLFFADALGIPTKEVGARASAVWGSPMAGRTAFPLAFSICQVKDNIGGSLQLL 173 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 Q + ++C P+ V+ + + VS S PG + Sbjct: 174 QEHGKNSNVDCNYGPSGAAVEGGFGWLVQDLGKCGGTIDLAVSEGGSDPGNNAPGNCGTT 233 Query: 253 LSEEHFVDSSSLR 265 L + ++ + Sbjct: 234 LQKWADDINAGRK 246 >gi|319784280|ref|YP_004143756.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170168|gb|ADV13706.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 643 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 68/216 (31%), Gaps = 19/216 (8%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 +T + M ++G M VD S + A+ A A + L + + Sbjct: 1 MTVVAMVPLMGALAMAVDFTEMSREKQAVSNALDAANFATARRLTEGATD---------- 50 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 ++ Y + F NL K + T ++ Y P Sbjct: 51 ---DQLRAYALDFFNANLNK---INPANTTLTVTLPSNTTGGGLLKMTARLDYKPYFYP- 103 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 +F + +G + ++ ++ + + V+D S SM S ++ Sbjct: 104 -VFGQLVGKSETDANQRISF-NITSEVRLKNTLEVALVLDNSGSMTKTGTGSGQTRIDLL 161 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 A + V + + Q + D + +V S+ Sbjct: 162 KTAAKQLVDTLAQQAAMIKQVDRPVQFGLVPFAASV 197 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 32/95 (33%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 ++ + T +K G T + + M + + S R Sbjct: 418 YSCSTNPITPLTDVSVTTGLTAIKAAIDLMKPDGGTNVPEGMAWGWRVVSSGEPFTQGRP 477 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + K +++LTDG NT ++ + A S+ Sbjct: 478 ETERGNDKVVIVLTDGANTYYTPSSLSHSDPADSK 512 >gi|332298719|ref|YP_004440641.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181822|gb|AEE17510.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 333 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 52/205 (25%), Gaps = 40/205 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A+ ++ G + + A Sbjct: 112 SRLEAAKQAVRVIV-------PEAGGTAFGLVALASEAALMVPPTLDREAFFARLNSLQA 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +AI + TA + + ++ KK IVL+TDGEN + Sbjct: 165 GEL--GDGSAIGMGVSTA-----------AYHLISSAAPKKSIVLITDGENNAGSVHPGT 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKT------QQEKARYF------------LSNCA--SPNSF 428 A GI + + + + + L A + + Sbjct: 212 AAQLAFENGITLYVLGVGTRGSVPLEYVDPATGKTYSGYLDSRFDESPLQEIALTAGGRY 271 Query: 429 FEANSTHELNKIFRDRIGNEIFERV 453 F S EL G E + Sbjct: 272 FGVESMGELTAAVSAVTGREQTVQS 296 >gi|307720884|ref|YP_003892024.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978977|gb|ADN09012.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 304 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 56/150 (37%), Gaps = 21/150 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F S+ + + IV + STA+ D + ++ + Sbjct: 129 RIGLVVFGTTAGIASPLSFD-KEAQKNIVGNINVGVLGK-STALIDGIVSSIQLL----- 181 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 KN+ K I+LL+DGE++ AK I+I TI + + Sbjct: 182 ------KNSKSKSKIIILLSDGEDSASKIPLEFALKLAKKYNIKIYTITIDKSYS----- 230 Query: 416 RYFLSNCASPN--SFFEANSTHELNKIFRD 443 + A+ N FE + +L K+++ Sbjct: 231 -DMMKVIANKNGAKNFEVQNKEDLVKVYKS 259 >gi|289640775|ref|ZP_06472946.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] gi|289509351|gb|EFD30279.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] Length = 319 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 61/193 (31%), Gaps = 28/193 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +D + I + + +G F+ + R V++ Sbjct: 108 RLAAAKDGADAFIDQL---PPRIN---LGLVSFSGSAALLVPPT-----TDRQSVRSGIH 156 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 STAI + + I ++ E IVLL+DGE TQ Sbjct: 157 GLQLGPSTAIGEGIFAGLQAITTAGEQLA--ADGGTPPPAAIVLLSDGE-TQRGRPNAQA 213 Query: 390 CNKAKSQGIRIMTIAFS-VNKTQQEKARYF--------LSNC--ASPNSFFEANSTHELN 438 A+ G+ + TIA+ + + + L A+ S+ A S EL Sbjct: 214 AQAARDAGVPVDTIAYGTADGSLDVGGQEIPVPVNEDALREIARATDGSYHRAASGDELR 273 Query: 439 KIF---RDRIGNE 448 ++ IG Sbjct: 274 SVYENLGSSIGYR 286 >gi|307249749|ref|ZP_07531728.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306858257|gb|EFM90334.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 530 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 65/516 (12%), Positives = 152/516 (29%), Gaps = 84/516 (16%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 72 LEEV------SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD--REVRDIVRDTAVEMN 123 ++ SS +N ++ + + +K E ++ + + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNEEKMHLIPTCKTKTD 129 Query: 124 PRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 K + V + + A + + + Sbjct: 130 TNKKGHTSSSEVTCTVSGTIEHKSWFPLKVGSVEVIPHEVNVASKSKAFKKNTFNIPIDL 189 Query: 180 QWVIDFSRSM---LDYQRDSEGQPLNCFG--------QPADRTVKSYSSQNGKVGIRDEK 228 V D S SM L + + G P++ G + ++ N ++ + Sbjct: 190 MVVADLSGSMNYDLSNKNEIVGSPMSKLGILQDVLSELAEKTLLSEEANHNNRIYVTPFA 249 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV--IRSIK 286 L + S N ++ Y + L S+S + + + + + ++ Sbjct: 250 LGAEISSSNCAIPYSWDMNKNNQELENAKNTLSNSKNSQYYRAEFINNLVYKLNTRDTLT 309 Query: 287 KIDNVNDT-VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 I D ++ F + + + W H +K + G+T + + Sbjct: 310 NIGGQKDYKLKYSKGAFCLKNMRTQNKGWYSHNNKSDFIK-YVKALKADGATLASSGLLV 368 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE--------------------------- 378 A + +I S + K+ I++L+DG Sbjct: 369 AANNMIKSGSRTKEL---GEQTKRVILVLSDGNDEIVKGEISGIPFLNYTRITENLIYGK 425 Query: 379 -----------------------NTQDNEEGIAICNKAKSQGIRIM----TIAFSVNKTQ 411 NTQ E +CN + + ++ T V Sbjct: 426 QEVFLSQKQKISLSHSTIDTYLTNTQPKEVTNGMCNVIRDKLDKLNNDKNTKIVFVEFGY 485 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 +AR +C +++ AN L F+ IG Sbjct: 486 ASRARKAWEHCVGNGNYYSANDKESLLNSFKQAIGE 521 >gi|307595413|ref|YP_003901730.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] gi|307550614|gb|ADN50679.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] Length = 495 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 70/190 (36%), Gaps = 23/190 (12%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R + + + + +D L + L + + K + +DA+A I+ Sbjct: 293 RKSMMHKLFTNKDIVVKEYANVKTIDIVLCLDVSGSMRELSNGMPKIEIAKDAVAQYIQF 352 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI--VKTFAIDENEMGSTAINDA 342 + K ++ R+ FN R WG+H++ R + G T + +A Sbjct: 353 LSKTND-----RLAMVLFNFRAD----VLWGLHQVRRYWQQMNYMLKYVYAGGGTNLANA 403 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 ++ + + + S + H ++ +TDG T ++ I + + G I T Sbjct: 404 LERSREVLTRSRSNSKH-----------VICVTDGR-TVNSSMCIKEAVRLRRSGTTIST 451 Query: 403 IAFSVNKTQQ 412 IA N + Sbjct: 452 IAIGENSDDE 461 >gi|15822539|gb|AAG23712.1| calcium-activated chloride channel CLCA4 [Mus musculus] gi|148680071|gb|EDL12018.1| mCG141954, isoform CRA_a [Mus musculus] gi|162317876|gb|AAI56643.1| Chloride channel calcium activated 4 [synthetic construct] Length = 909 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 65/175 (37%), Gaps = 24/175 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + L+ A + I + +++ +G F+ + + + + Sbjct: 325 ITRLTLMNQAAELYLIQIIEKESL-----VGLVTFDSTATIQTNLIR-IINDSSYLAIST 378 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + + G T+I + ++ ++ I SS++ IVLLTDGE+ + I Sbjct: 379 KLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE---------IVLLTDGEDNR-----I 424 Query: 388 AIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + C + K G I TIA + + LS+ F+ + L F Sbjct: 425 SSCFQEVKHSGAIIHTIALGPSA---ARELETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|150397140|ref|YP_001327607.1| putative signal peptide protein [Sinorhizobium medicae WSM419] gi|150028655|gb|ABR60772.1| conserved hypothetical signal peptide protein [Sinorhizobium medicae WSM419] Length = 126 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 12/106 (11%), Positives = 41/106 (38%), Gaps = 6/106 (5%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 F + ++L++ G+F ++ A+ ++G + +D + ++ A A + + Sbjct: 4 ATFCLHMRRLMRDREGNFAVLGAIAFIPIIGAAALAIDFAGAYFEAEKIQSALDAAALGS 63 Query: 65 SVPLIQSL------EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK 104 + + +++ P++ + + L E + + Sbjct: 64 VRAYGEGATEEEAYDAAQKFFWSNYALPQESVVDALSAATEASTQA 109 >gi|313675311|ref|YP_004053307.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312942009|gb|ADR21199.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 322 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 55/134 (41%), Gaps = 9/134 (6%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 +K I + ++ R+G F+ ++ + ++T ST A+ Sbjct: 106 ELKNIVDAFNSDRIGLIIFSSEAFVQCPLTYD-QNALNLFIETLNTGLVPGSSTDFGSAL 164 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A++ + S ++ + K I+L++DGE+ D+ EG +K GIR+ ++ Sbjct: 165 NMAHEKLTSEA------APSSQQKSKIIILISDGEDFGDDTEG--AVSKINDSGIRLFSL 216 Query: 404 AFSVNKTQQEKARY 417 + + + R Sbjct: 217 GVGTEQGSKIRTRR 230 >gi|222616426|gb|EEE52558.1| hypothetical protein OsJ_34813 [Oryza sativa Japonica Group] Length = 517 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 72/193 (37%), Gaps = 26/193 (13%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 +H+ + L++ A+ +I+ ++ D R+ FN V+S+ + + + Sbjct: 85 KHMSSRLDLLKIAMKYIIKLVRDAD------RLAIVSFNHAVVSEYGLTRNSADSRKKLE 138 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN----T 380 G+T A++ A + + ++E + +I+LL+DG + + Sbjct: 139 NL-VDKLKASGNTDFRPALKKAVEILDGRGKEEKKKRVG------FILLLSDGVDQFQYS 191 Query: 381 QDNEEGIAICNK-------AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + N E +A A + + T FS + + +S + F + Sbjct: 192 RINWEKVAKSTDVDHSEVGAMLRKYAVHTFGFSASHDPVPLRQ--ISALSYGLYSFVCKN 249 Query: 434 THELNKIFRDRIG 446 + + F +G Sbjct: 250 LDNITEAFARCLG 262 >gi|149414665|ref|XP_001516049.1| PREDICTED: similar to integrin alpha 11 subunit [Ornithorhynchus anatinus] Length = 1194 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 63/164 (38%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T +TAY + + Sbjct: 205 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTET----RTAYGIEFARS 257 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + KK ++++TDGE + D+ + + + ++ I +A + Sbjct: 258 EAFQKGGRKGA--KKVMIVITDGE-SHDSPDLEQVIDASEKDNITRYAVAVLGYYNRRGI 314 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 315 NPEAFLSEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 357 >gi|73960091|ref|XP_537088.2| PREDICTED: similar to chloride channel calcium activated 4 [Canis familiaris] Length = 905 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 81/256 (31%), Gaps = 26/256 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPA-DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S+ + N ++ S S+ + + D + + M + + Sbjct: 243 SLDSVTEFCTAETHNTEAPNLQNKMCNSRSTWDIIMNSEDFQNASPMTGTDPPPHPTFSL 302 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + S S+ + + + A +I+ I+K G F Sbjct: 303 LKSKQRVVCLVLDKSGSMSSEDRLFQMNQAAELFLIQIIEKGS------LTGMVTFESSA 356 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + G T+I ++ + II SN++ Sbjct: 357 TIQNYLTEITDH-NAYEKILANLPQAAGGGTSICSGLRAGFQAIIHSNQNTSGSE----- 410 Query: 367 AKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 IVLLTDGE ++ I++C + K G I TIA + K LSN Sbjct: 411 ----IVLLTDGE-----DDNISLCFEEVKKSGSVIHTIALGPSA---AKELEILSNMTGG 458 Query: 426 NSFFEANSTHELNKIF 441 + F+ + L F Sbjct: 459 HRFYANKDINGLIDAF 474 >gi|84688081|ref|ZP_01015939.1| hypothetical protein 1099457000215_RB2654_05415 [Maritimibacter alkaliphilus HTCC2654] gi|84663909|gb|EAQ10415.1| hypothetical protein RB2654_05415 [Rhodobacterales bacterium HTCC2654] Length = 595 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 69/408 (16%), Positives = 123/408 (30%), Gaps = 55/408 (13%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ G F I + +M GG+ VD +R+ + ++ AI+ A+ L+ Sbjct: 21 FLRDERGSFIIFGIAVFMLMCLAGGIAVDTMRYETHRVHVQGTLDRAILAAAS-----LD 75 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + + ++I + ++ +N T+ EV D V T + SA Sbjct: 76 QDLDPEEVVLDYFTKAGLGHVISQDDIDVFENQTNGEVADDVAVTTRRVEASVSALMPTT 135 Query: 134 SSR----YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ----WVIDF 185 R YDL L ++ + I + + I F Sbjct: 136 FLRLAHMYDLGLYTEGGAEEALSLS--EISLVLDVSGSMGNSSSSGYSKIYELRRAAKRF 193 Query: 186 SRSML----DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 ML D + + R S + S + Y Sbjct: 194 VNVMLCNPADADETEDCTLTEGDISINIVPYAEQVLLPSNLLQRFNHTSEHTESRCITFY 253 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN-------VNDT 294 + P+ S + FV + + + N ND Sbjct: 254 EDEFDTVAVPTFSLDTFVTNGRPLPALYGDPIQLTGYFDPSGGTNSTPNPGSNSPCYNDY 313 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT--------- 345 G+T ND + L I D G+T+I+ M+ Sbjct: 314 --SGST--NDYWREIYPMGFSAEALRDEID-----DLGASGNTSIDLGMKWGAALLDPAA 364 Query: 346 --AYDTIISSNEDEVHRMKNNLEA-------KKYIVLLTDGENTQDNE 384 A ++++NE + +K IVL+TDGENT + Sbjct: 365 QPAISDLVAANEVNE--AFDGRPFEYTQRGIEKVIVLMTDGENTSQDY 410 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 38/83 (45%), Gaps = 9/83 (10%) Query: 373 LLTDGEN----TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 L D + ++ N+ IC AK+ G+ + TI F V+ +Q + +CAS ++ Sbjct: 513 FLADAHDYFNYSEKNDNLDEICTAAKNAGMVVFTIGFEVSGSQ----HDIMRSCASAPAY 568 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 + +++ F I EI + Sbjct: 569 YFDVDGLDISAAF-AAIAREISK 590 >gi|297473020|ref|XP_002686329.1| PREDICTED: chloride channel accessory 1 [Bos taurus] gi|296489230|gb|DAA31343.1| chloride channel accessory 1 [Bos taurus] Length = 911 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 63/191 (32%), Gaps = 33/191 (17%) Query: 276 DALASVIRSIKKI------DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + ++ + + V +G F+ ++ T ++ Sbjct: 319 MTIGNRLKRLNQAGKLFLLQTVEQGSWVGMVTFDSAAYVQSELV-QINSATERDTLTKSL 377 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+I +++A+ I + I+LLTDGE+ I+ Sbjct: 378 PTTASGGTSICSGLRSAFTVIKKKYPTDGAE----------IILLTDGEDN-----TISA 422 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQ----EKARYFLSNCASPNSFFEANSTHELNKIFRD- 443 C ++ K G I T+A + Q+ K L AS + + + L F Sbjct: 423 CFDEVKQSGAIIHTVALGPSAAQELEQMSKMTGGLQTYASD----QVQN-NGLVDAFAAL 477 Query: 444 RIGNEIFERVI 454 GN+ + Sbjct: 478 SSGNKAVSQRS 488 >gi|114562255|ref|YP_749768.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114333548|gb|ABI70930.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 612 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 50/380 (13%), Positives = 121/380 (31%), Gaps = 40/380 (10%) Query: 48 YYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT 107 ++ A++T + P + + + ++ + E + + Sbjct: 39 QSTSKIQSASKTDSSAQARPELSEVAGLVMASEPQADLYQGSKELHRSAKVMTSSMA--- 95 Query: 108 DREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 + IV V ++ R + + +++ ++ ++ + ++ I + + Sbjct: 96 ----QRIVSSQHVSVSDRNFSLAPTTNDKFESVVQNGNMVAGETPVSTFSIDVDTGSYST 151 Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 +R + + + R E + P + S N ++ Sbjct: 152 TRRLINQGQLPT----------KNTVRVEELVNYFSYDYPVPTNSEQPFSVNTELAPSPY 201 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 ++ + + P L S + +D S K L++ A+ + + + Sbjct: 202 NADTQLLRI-GLKGFDVAPDKLSAS-NLVLLLDVSGSMSSADKLPLLKQAMLMLSQQLSA 259 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D V+ V GA+ GV T +KT N G T + +Q AY Sbjct: 260 QDKVSIVVYAGASGVVLD---------GVAGNDFTAIKTALSQLNAQGGTNGSQGIQLAY 310 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIA 404 + H ++N ++L TDG+ + + + +GI + T+ Sbjct: 311 QL------AQKHFIENGSNR---VILATDGDFNLGMTDHQQLVDFVASRSKKGIGLSTLG 361 Query: 405 FSVNKTQQEKARYFLSNCAS 424 F + + L ++ Sbjct: 362 FGLGSGSASYNDHLLEQLSN 381 >gi|58037355|ref|NP_083089.1| vitrin isoform 1 precursor [Mus musculus] gi|114154829|sp|Q8VHI5|VITRN_MOUSE RecName: Full=Vitrin; Flags: Precursor gi|21707639|gb|AAH34120.1| Vitrin [Mus musculus] Length = 650 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 48/389 (12%), Positives = 117/389 (30%), Gaps = 46/389 (11%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + QA V ++Q + +++ Q ++ + + + N Sbjct: 290 DVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQRGGLSNVGRAIS 349 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + R A V + + G + ++ + Sbjct: 350 FVTKTFFSKANGNRGGAPNVAV--------------VMVDGWPTDKVEEVSR-------V 388 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E G+++ ++ + D Q E + + + + L Sbjct: 389 ARESGINVFFITVEGAAERDIQHVVEPGFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVK 448 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ L + ++ FV S V +A++ + + D Sbjct: 449 RVCDTDR-----LACSKTCLNSADIGFVIDGSSSMGTSNFRTVLQFVANLSKEFEISDTD 503 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+GA + F + + I+ G T+ A+Q A + + Sbjct: 504 ---TRVGAVQYT--YEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF 558 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ +K ++++TDG + ++ A +G+ I + Sbjct: 559 KKSKPNK---------RKVMIIITDG---RSYDDVRIPAMAAYQKGVITYAIGIA--WAA 604 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKI 440 Q++ ++ A +SFF + L KI Sbjct: 605 QDELEVMATHPAKDHSFF-VDDFDNLYKI 632 >gi|293361347|ref|XP_576462.3| PREDICTED: collagen type VI alpha 5 [Rattus norvegicus] Length = 1730 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 57/157 (36%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++K D D V+ GA +++ + I + + G T A+Q Sbjct: 867 VEKADVGRDRVQFGALMYSNNPEILFYL--NTYSSRSAITEHLKRPRDTRGDTYTAKALQ 924 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + ++ ++++TDGE + D ++ + + +GI I + Sbjct: 925 HA-NILF----MEEHGSRLKQNVRQLMIVITDGE-SHDRDKLNDTARELRDKGITIFAVG 978 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + ++ +L I+ Sbjct: 979 V-----GRANQDELETMAGKKENTIHVDNFDKLRDIY 1010 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 82/240 (34%), Gaps = 22/240 (9%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + K S +G + VS + + G +D ++ +F+ S K+ Sbjct: 424 SYKHLESYSGNFLKKIHNEIWTQVSTHAEQMELDKTGCVDTKEADIYFLIDGSSSIRRKE 483 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ ++SV+ N VR+G ++ R + S + + K Sbjct: 484 FEQIQIFMSSVVDMFPIGPNN---VRVGVVQYSHRNEVEFPVSQYTKGIE--LRKAVWNI 538 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T A+ I + +H + Y+++LTDG + N+ + Sbjct: 539 KQLKGGTFTGKALDFILPIIKKGKSERIHEVPC------YLIVLTDG---KSNDSVLEPA 589 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEI 449 N+ +++ I I + Q L A + L I ++ I + I Sbjct: 590 NRLRAEHITIHAVGIGEANKTQ------LQQIAGKDERVSFGQNFDSLKYI-KNEIVHRI 642 >gi|293349450|ref|XP_001073278.2| PREDICTED: collagen type VI alpha 5-like [Rattus norvegicus] Length = 2640 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 57/157 (36%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++K D D V+ GA +++ + I + + G T A+Q Sbjct: 867 VEKADVGRDRVQFGALMYSNNPEILFYL--NTYSSRSAITEHLKRPRDTRGDTYTAKALQ 924 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + ++ ++++TDGE + D ++ + + +GI I + Sbjct: 925 HA-NILF----MEEHGSRLKQNVRQLMIVITDGE-SHDRDKLNDTARELRDKGITIFAVG 978 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + ++ +L I+ Sbjct: 979 V-----GRANQDELETMAGKKENTIHVDNFDKLRDIY 1010 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 82/240 (34%), Gaps = 22/240 (9%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + K S +G + VS + + G +D ++ +F+ S K+ Sbjct: 424 SYKHLESYSGNFLKKIHNEIWTQVSTHAEQMELDKTGCVDTKEADIYFLIDGSSSIRRKE 483 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ ++SV+ N VR+G ++ R + S + + K Sbjct: 484 FEQIQIFMSSVVDMFPIGPNN---VRVGVVQYSHRNEVEFPVSQYTKGIE--LRKAVWNI 538 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T A+ I + +H + Y+++LTDG + N+ + Sbjct: 539 KQLKGGTFTGKALDFILPIIKKGKSERIHEVPC------YLIVLTDG---KSNDSVLEPA 589 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEI 449 N+ +++ I I + Q L A + L I ++ I + I Sbjct: 590 NRLRAEHITIHAVGIGEANKTQ------LQQIAGKDERVSFGQNFDSLKYI-KNEIVHRI 642 >gi|51244491|ref|YP_064375.1| hypothetical protein DP0639 [Desulfotalea psychrophila LSv54] gi|50875528|emb|CAG35368.1| hypothetical membrane protein (BatB) [Desulfotalea psychrophila LSv54] Length = 566 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 37/116 (31%), Gaps = 15/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + +K T I + A T+ S+ Sbjct: 135 RVGLIPFAGSSYLMCPLTLDYQ-AFTDSLKALDTKIIPRRGTNIAKVIALAEKTVADSSN 193 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 K +++LTDGEN Q + + + AK G+ I TI + Sbjct: 194 ------------HKILIILTDGENLQGDV--LKAADLAKKNGLTIYTIGVGTAAGE 235 >gi|297482040|ref|XP_002692521.1| PREDICTED: matrilin 4 [Bos taurus] gi|296480952|gb|DAA23067.1| matrilin 4 [Bos taurus] Length = 584 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 61/167 (36%), Gaps = 22/167 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK--LIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S + + R I + + M A Sbjct: 59 LVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLA 118 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I AM A+ + E H + V++TDG + + + +A+++GI Sbjct: 119 IQYAMNVAFSVAEGARPPEAH-------VPRVAVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 I + Q L ASP F S +L + F Sbjct: 169 EIYAVGV------QRADVGSLRAMASPPLNEHVFLVESF-DLIQEFG 208 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 364 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 418 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + + Sbjct: 419 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDNISVWAA 472 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 473 RAKEEGIVMYAVGVGKAVEEE------LREIAS 499 >gi|300796915|ref|NP_001178240.1| matrilin-4 [Bos taurus] Length = 584 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 61/167 (36%), Gaps = 22/167 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK--LIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S + + R I + + M A Sbjct: 59 LVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLA 118 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I AM A+ + E H + V++TDG + + + +A+++GI Sbjct: 119 IQYAMNVAFSVAEGARPPEAH-------VPRVAVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 I + Q L ASP F S +L + F Sbjct: 169 EIYAVGV------QRADVGSLRAMASPPLNEHVFLVESF-DLIQEFG 208 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 364 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 418 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + + Sbjct: 419 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDNISVWAA 472 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 473 RAKEEGIVMYAVGVGKAVEEE------LREIAS 499 >gi|74196449|dbj|BAE34363.1| unnamed protein product [Mus musculus] Length = 650 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 46/389 (11%), Positives = 119/389 (30%), Gaps = 46/389 (11%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + QA V ++Q + +++ Q ++ + + + N Sbjct: 290 DVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITKRGGLSNVGRAIS 349 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + R A V + + G + ++ + Sbjct: 350 FVTKTFFSKANGNRGGAPNVAV--------------VMVDGWPTDKVEEVSR-------V 388 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E G+++ ++ + + Q E + + + + L Sbjct: 389 ARESGINVFFITVEGAAEREKQHVVEPGFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVK 448 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ + S +D SS + +++ +A++ + + D Sbjct: 449 RVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTSNFRTVLQF-VANLSKEFEISDTD 503 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+GA + F + + I+ G T+ A+Q A + + Sbjct: 504 ---TRVGAVQYT--YEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF 558 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ +K ++++TDG + ++ A +G+ I + Sbjct: 559 KKSKPNK---------RKVMIIITDG---RSYDDVRIPAMAAYQKGVITYAIGIA--WAA 604 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKI 440 Q++ ++ A +SFF + L KI Sbjct: 605 QDELEVMATHPAKDHSFF-VDDFDNLYKI 632 >gi|300716700|ref|YP_003741503.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] gi|299062536|emb|CAX59653.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] Length = 325 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 72/194 (37%), Gaps = 23/194 (11%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + V+ ++ + + +K D R+G F S L+ I + Sbjct: 117 ITRLQAVQRSVRAFVAQ-RKTD------RIGLVIFASSAWPFAPISEDKQALLARINQL- 168 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A + SS + + + +LLTDG +T Sbjct: 169 -APGMIGQQTAIGDALGVAVKLLDSSLDRDA---------SRLAILLTDGNDTASQLSPA 218 Query: 388 AICNKAKSQGIRIMTIAFS-VNKTQQEKAR-YFLSNCA--SPNSFFEANSTHELNKIFRD 443 A S +++ TIAF +N + ++K L A + +A+++ + Sbjct: 219 LAAQLAASHHVQVHTIAFGDINSSGEDKVDTALLKQIAQLTGGEALQASTSGKALDSVWQ 278 Query: 444 RIGNEIFERVIRIT 457 +I + + +R T Sbjct: 279 QI-DAMTPSQVRTT 291 >gi|307720603|ref|YP_003891743.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978696|gb|ADN08731.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 310 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 63/178 (35%), Gaps = 27/178 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +V+ + I + DN MG F ++ H L R + + Sbjct: 109 SRFDVVKSIVKDFISQ-RTNDN------MGLVVFGSYSFIASPLTYDKHILSRIVSQLEV 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE--G 386 M Y + + V+ +K + K +LLTDG +T ++ Sbjct: 162 -------------GMAGKYTALYEALAQGVNLLKMSKAKSKVAILLTDGYSTAGADKIPL 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + + AK +G+++ I R L A + F A++ +L ++++ Sbjct: 209 DVVLDMAKKEGVKVYPIGIGGPDEYN---RAVLLKIAKETGGVAFGASNASQLKEVYK 263 >gi|198426775|ref|XP_002120099.1| PREDICTED: similar to SD03168p [Ciona intestinalis] Length = 1474 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 61/374 (16%), Positives = 125/374 (33%), Gaps = 21/374 (5%) Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 +E + F ++ + + +V +N LNP S + Sbjct: 1 MESTFLNVFIISIISTTSAFNIIPSELTKSVTLNASVVNSTGPTYFGTSFFLNPSSGNHQ 60 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + + S L+ + ++ E E + S + F AD Sbjct: 61 LL-VGSPLLNSGSQTEPSGGLSFCEFSFTDSVPSCKPISPSTPALVAGDAVGLSFAAQAD 119 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCN---KSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 + + + + + Y + K+ + P + + F+ S Sbjct: 120 GNITACAPLRPSYCDGELEHLGYCYAGTEFGKTWMSISRIDPFECPKVDILFLLDGSGSI 179 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V ++++ + ++ S N + V +G F+ ++ F G+ I+ Sbjct: 180 VESDFEIMKEWIENITLSFDISSNGS--VAVGLMQFSHFSLTKTEFQIGMFTTKEEIMAA 237 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G+T DA++ + S+ N+ +K IVLLTDGE T D Sbjct: 238 MKNVTIKKGNTYTADALRRSIAVFQKSSR------YNDTNTRKVIVLLTDGEAT-DTASL 290 Query: 387 IAICNKAKSQGIRIMTIAF--SVNKTQQEKARYFLSNCA-----SPNSFFEANSTHELNK 439 + + +SQGI I + V +++ A + +P+ F +T L+ Sbjct: 291 SSTADLVRSQGITITAVLITEKVLPSERSAAVAQMQLIVNGVAGNPSGVFVVGTTANLDS 350 Query: 440 IFRDRIGNEIFERV 453 + R I I + Sbjct: 351 VIR-AITQRIQSTL 363 >gi|73992503|ref|XP_543015.2| PREDICTED: similar to Matrilin-4 precursor [Canis familiaris] Length = 624 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 57/167 (34%), Gaps = 18/167 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D + R+G ++ +V S G + + T Sbjct: 59 LVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIRALVPLAQGTMTG 116 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 117 LAIQYAMNVAFSVAEGAR---PPEARVPRIAVIVTDG---RPQDRVAEVAAQARARGIEI 170 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDR 444 + Q L ASP F S + L + F + Sbjct: 171 YAVGV------QRADVGSLRAMASPPLDEHVFLVESFN-LIQEFGQQ 210 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 405 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 459 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 460 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAR 513 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 514 RAKEEGIVMYAVGVGKAVEEE------LRQIAS 540 >gi|311234271|gb|ADP87125.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 440 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 71/445 (15%), Positives = 141/445 (31%), Gaps = 88/445 (19%) Query: 41 VDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 +D L+ A A + S+ L ++++ L+R Sbjct: 54 IDSGMLYLSHSRLQAAVDAAALAGSLQL----------------PYDPQLDKGLVRGAVT 97 Query: 101 -NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 + N+ + + + T S ++F+ ++GI S + Sbjct: 98 QYMDANYPEASLNGVTPGTEER------------SVTVTATATVPTIFMNALGIGSSEVH 145 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 KA A Y+K + + +VID S SM + + S + Sbjct: 146 AKATA-----GYNK---LEVVFVIDNSGSMKGTPIQQTNSAASQLVELIMPEGMMTSVKV 197 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 G V R + P V G L+PS E + +S ++ ++ Sbjct: 198 GLVPFRGKVHLPAGVDGLPDG-CRNADGTLNPSWLHEEYFKTSYRYPSGSSLNVPKNTCT 256 Query: 280 SVIRSIKKIDNVNDTVRMGATFFN-----DRVISDPSFSWGVH--------------KLI 320 S I ++ + +T+ + N + WG H K I Sbjct: 257 S-IPRVQGLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPEAPFTEGSSAKDI 315 Query: 321 RTIVKTFAIDENEMGSTAINDAMQT---AY--DTIISSNEDEVHRMKNNLEAKKYIVLLT 375 R ++ + E G + A+ AY + + H Sbjct: 316 RKVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKL--------- 366 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFFE 430 N + KAK GI + I F + + + + AS + +++ Sbjct: 367 -------NAAMLEEARKAKEAGIEVFAIRFGDSDSVDV---SLMKSIASSKAGTNDHYYD 416 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A S ++++ +F+ +IG ++ R++R Sbjct: 417 APSAYDIDDVFK-KIGRQLGWRLLR 440 >gi|298291248|ref|YP_003693187.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296927759|gb|ADH88568.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 313 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 34/150 (22%), Positives = 53/150 (35%), Gaps = 21/150 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F DR ++ V + I A STAI+D + A ++ S+ Sbjct: 130 RIGLVIFGDRAYVAQPPTFDVGSVAHAI--EAAQIGISGRSTAISDGLGLATRRLLQSDA 187 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 K +VLL+DG +T + A S GIR+ TIA + Sbjct: 188 TS-----------KVVVLLSDGVDTSGKVQAGDAARLAASHGIRVHTIALGPEDLENQPE 236 Query: 413 -----EKARYFLSNCASPNSFFEANSTHEL 437 + A A + F + +L Sbjct: 237 SRDAVDAAALHAMAEAGGGTSFRVRNMEDL 266 >gi|169338033|ref|ZP_02621346.2| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295279|gb|EDS77412.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 1242 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 56/303 (18%), Positives = 101/303 (33%), Gaps = 62/303 (20%) Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 KE I V+D S SM NC + + V I+++ Sbjct: 87 SKEKKKEIVLVMDTSTSMECLVEPESYDIDNCVP----------TKEGHIVYIKNKSYLV 136 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 S + + Y + + + + +K+ ++ A S I+ ++K +N Sbjct: 137 NTAFLQGSRHKLFYITIGTTNYYIQG--NKCYRQSSYNEKNRLQHAKESAIKFVQKFEND 194 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + +G F+ S + ++++ +I D G+T I +++A + Sbjct: 195 KN-ISIGLVSFDTTANSQKDITSKLNEVEDSINSLKVAD---NGATNIEAGLKSAQQLLK 250 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDG----------------------ENTQDNEE---- 385 K N +A KY++L++DG +NT N Sbjct: 251 ----------KGNKDADKYVILMSDGFPTAFDYAGEKVEKNFNYHEIQDNTFINFGYYDY 300 Query: 386 -------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 I N K GI I FS EK A+ + EA +T LN Sbjct: 301 SGYAMKHSINQANSLKKDGINSFIIGFSEGANS-EKLNNIAK--AAGGEYEEAKNTDTLN 357 Query: 439 KIF 441 + Sbjct: 358 GAY 360 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 71/216 (32%), Gaps = 34/216 (15%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S++D + PL F + + + +N + I +LS N YY+ Sbjct: 655 SLIDINENLINAPLKAFKELMELLQYIFFGEN-PIEISSVQLSSGNFIINGKKYYVKDNK 713 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + + +DS V+ + K ++ + ++ + Sbjct: 714 VYEFNEKDRSRLDS------------VKKVANDFVDKFK----NDENTEIAIVRYSSKAN 757 Query: 308 SDPSFSWGV--HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 S + + I+K G T I D ++ +Y + ++D Sbjct: 758 IVLDGSNKIFLNGKDNEIIKKRINSLKADGGTNIGDGIRKSYSILDKCDKDSE------- 810 Query: 366 EAKKYIVLLTDGENT-----QDNEEGIAICNKAKSQ 396 KY++L+TDG T + + C +K Sbjct: 811 ---KYMILMTDGVPTAYTCYANTIKASNNCKYSKDN 843 >gi|160858157|emb|CAP19998.1| collagen type VI alpha 5 [Homo sapiens] Length = 609 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 53/122 (43%), Gaps = 8/122 (6%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 494 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 551 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + K+ ++++TDGE + D+++ + +++GI I + Sbjct: 552 HA-NALF----TEEHGSRIKQNVKQMLIVITDGE-SHDHDQLNDTALELRNKGITIFAVG 605 Query: 405 FS 406 Sbjct: 606 VG 607 >gi|261409463|ref|YP_003245704.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285926|gb|ACX67897.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 421 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 96/250 (38%), Gaps = 28/250 (11%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + Y+ + K+ I ++ +++ Y L PG P +D+S + Sbjct: 73 QRYTGTSWKLLIPSTLVALFLLGMLFEWAYQLNPGSAKPVKDIVLVIDNSGSMNETDPNQ 132 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 A ++I + + DN R+ F+ F+ ++ + + Sbjct: 133 DRYTAAKNLINRMDR-DN-----RVSVMVFDHATTLLQPFTRVKNQETKDEIIAEIDGLA 186 Query: 333 EM-GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL-TDGENTQDNEEGIAIC 390 G T I+ A++ + H ++ + +V++ +DG + D++ +A Sbjct: 187 TNDGGTDISLALE----------DTMSHIQESRDAGRSAMVIMLSDGFSETDHDRVLA-- 234 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + K Q I + TI S+ L A + +++ +L+ +F+ +I ++ Sbjct: 235 -EYKQQQIAVNTIGLSLVN---PDGAQLLQTIAAETGGQYYDVQHAEDLSFVFQ-KIYDD 289 Query: 449 IFERVIRITK 458 + +R +TK Sbjct: 290 VGDR-SLLTK 298 >gi|89098674|ref|ZP_01171556.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] gi|89086636|gb|EAR65755.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] Length = 920 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 63/173 (36%), Gaps = 36/173 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L ++A A + +++ D +G F+DR W + + K A Sbjct: 418 SKLELAKEAAARSVELLREKDT------LGFIAFDDR-------PWVIVETGPLEDKKDA 464 Query: 329 ID----ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 +D G T I +++ AY+ + + H I+LLTDG+ + + Sbjct: 465 VDKIGSVTPGGGTEIFTSLEKAYEELENLKLQRKH-----------IILLTDGQ-SARST 512 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 + ++ K I + T+A + R L A F++ + Sbjct: 513 DYESMIETGKENNITLSTVALGSDAD-----RNLLEELAGLGAGRFYDVTDSS 560 >gi|326931809|ref|XP_003212016.1| PREDICTED: matrilin-4-like, partial [Meleagris gallopavo] Length = 465 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 64/178 (35%), Gaps = 22/178 (12%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGS 336 +I I +D + R+G ++ +V + S + ++ R I + + M Sbjct: 55 RFMIDIIGNLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTG 114 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 AI AM A+ + + + + +++TDG + + + A++ Sbjct: 115 LAIQYAMNVAFTVQEGARP-------PHKKIPRIAIIVTDG---RPQDRVSEVAAHARNA 164 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 GI I + Q L ASP F S EL + F + +++ Sbjct: 165 GIEIYAVGI------QRADMNSLRAMASPPLEEHVFLVESF-ELIQQFGKQFQDKLCA 215 >gi|126303381|ref|XP_001379571.1| PREDICTED: similar to matrilin-4 [Monodelphis domestica] Length = 623 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 70/216 (32%), Gaps = 21/216 (9%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 +VS N L + FV SS + +R L ++IR + N Sbjct: 13 LLVSLNAELQATPAGSKCRTGPLDLVFVIDSSRSVRPFEFETMRRFLVNIIRGLDIGPNA 72 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+G ++ +V S G + + T A+Q A + Sbjct: 73 ---TRVGVIQYSSQVQSVFPL--GAFSRREDMERAIHAIVPLAQGTMTGLAIQYAMNVAF 127 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S E + + V++TDG + + + +A+++GI I + Sbjct: 128 SVAEGAR---PSQARVPRVAVIVTDG---RPQDRVTEVAAQARNRGIEIYAVGV------ 175 Query: 412 QEKARYFLSNCASP---NSFFEANSTHELNKIFRDR 444 Q L ASP F S +L + F Sbjct: 176 QRADVGSLRAMASPPLDEHVFLVESF-DLIQQFGFH 210 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 64/199 (32%), Gaps = 20/199 (10%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLS---EEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 S Y C + D + + V S + LV+ + ++ + Sbjct: 360 SSYRCICPEGRQLQADGKSCDRCRTGHVDLVLVIDGSKSVRPQNFELVKRFVNQIVDFLD 419 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + R+G ++ RV ++ G + + + E T A++ Sbjct: 420 VSP---EGTRVGLVQYSSRVRTEFPL--GRYGTADEVKQAVLAVEYMEKGTMTGLALRHL 474 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + S + R +N + +V TDG + ++ +AK +GI + + Sbjct: 475 VEHSFSEAQGARPRAQN--VPRVGLVF-TDG---RSQDDISVWAARAKEEGIIMYAVGVG 528 Query: 407 VNKTQQEKARYFLSNCASP 425 ++ L AS Sbjct: 529 KAVEEE------LREIASD 541 >gi|118100589|ref|XP_425698.2| PREDICTED: similar to matrilin-4 [Gallus gallus] Length = 564 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 64/178 (35%), Gaps = 22/178 (12%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGS 336 +I I +D + R+G ++ +V + S + ++ R I + + M Sbjct: 55 RFMIDIIGNLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTG 114 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 AI AM A+ + + + + +++TDG + + + A++ Sbjct: 115 LAIQYAMNVAFTVQEGARP-------PHKKIPRIAIIVTDG---RPQDRVSEVAAHARNA 164 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 GI I + Q L ASP F S EL + F + +++ Sbjct: 165 GIEIYAVGI------QRADMNSLRAMASPPLEEHVFLVESF-ELIQQFGKQFQDKLCA 215 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 61/180 (33%), Gaps = 23/180 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + ++ V G ++ RV ++ + I E Sbjct: 362 ELVKQFVNRIVDLL-EVSPDGTHV--GLVQYSSRVRTEFPL--NKYHSAEEIKAAVMKME 416 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S +E E R ++ + +V TDG + ++ Sbjct: 417 YMEKGTMTGLALKHMVE--HSFSELEGARPLSHNVPRIGLVF-TDG---RSQDDISEWAQ 470 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFF----EANSTHELNKIFRDRI 445 +AK GI + + ++ L AS F + + L + F+ I Sbjct: 471 RAKESGIVMFAVGVGKAVEEE------LRAIASEPVEQHFSYSADFTTMTHLVENFKLNI 524 >gi|12850399|dbj|BAB28702.1| unnamed protein product [Mus musculus] Length = 650 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 48/389 (12%), Positives = 117/389 (30%), Gaps = 46/389 (11%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + QA V ++Q + +++ Q ++ + + + N Sbjct: 290 DVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQRGGLSNVGRAIS 349 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + R A V + + G + ++ + Sbjct: 350 FVTKTFFSKANGNRGGAPNVAV--------------VMVDGWPTDKVEEVSR-------V 388 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E G+++ ++ + D Q E + + + + L Sbjct: 389 ARESGINVFFITVEGAAERDIQHVVEPGFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVK 448 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ L + ++ FV S V +A++ + + D Sbjct: 449 RVCDTDR-----LACSKTCLNSADIGFVIDGSSSMGTSNFRTVLQFVANLSKEFEISDTD 503 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+GA + F + + I+ G T+ A+Q A + + Sbjct: 504 ---TRVGAVQYT--YEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF 558 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ +K ++++TDG + ++ A +G+ I + Sbjct: 559 KKSKPNK---------RKVMIIITDG---RSYDDVRIPAMAACQKGVITYAIGIA--WAA 604 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKI 440 Q++ ++ A +SFF + L KI Sbjct: 605 QDELEVMATHPAKDHSFF-VDDFDNLYKI 632 >gi|51467747|ref|NP_001003823.1| cochlin [Danio rerio] gi|26788036|emb|CAD58748.1| novel protein similar to human coagulation factor C homolog (cochlin, COCH) [Danio rerio] Length = 553 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 43/116 (37%), Gaps = 14/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+GA F F++ H L ++ G TA DA+ A ++ Sbjct: 410 RIGAIQFT--YDQRMEFNFNDHVLKDNALRALQKIPYMSGGTATGDAINFAVRSLFKPRS 467 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++++TDG Q ++ A+ +GI + + + + Sbjct: 468 SSN---------RKFLIIITDG---QSYDDVRVPAMAAQREGITVYAVGVAWAPME 511 >gi|255557538|ref|XP_002519799.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223541038|gb|EEF42595.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 57/180 (31%), Gaps = 28/180 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ A+ VI+ + ID R+ F+ + + K Sbjct: 78 KIEKVKTAMLFVIKKLSPID------RLSVVTFSADANRLCPLRQITENSQKDLEKLI-N 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N G+T I +QT + + + I+L++DGE + Sbjct: 131 GLNADGATNITAGLQTGLKVLSDRS------LSGGRVVG--IMLMSDGEQNAGGDAA--- 179 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIG 446 + + + T F +N L A +F + T+ L+K F + Sbjct: 180 --QVPVGNVPVYTFGFGINHEP-----RVLKAIAHNSIGGTFSDVQDTNNLSKAFSQCLA 232 >gi|190336734|gb|AAI62194.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] gi|190339304|gb|AAI62181.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] Length = 553 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 43/116 (37%), Gaps = 14/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+GA F F++ H L ++ G TA DA+ A ++ Sbjct: 410 RIGAIQFT--YDQRMEFNFNDHVLKDNALRALQKIPYMSGGTATGDAINFAVRSLFKPRS 467 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +K+++++TDG Q ++ A+ +GI + + + + Sbjct: 468 SSN---------RKFLIIITDG---QSYDDVRVPAMAAQREGITVYAVGVAWAPME 511 >gi|314918209|gb|EFS82040.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] Length = 320 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R V T Sbjct: 109 TRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVVVPPT-----TDRAAVSTAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 214 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQL 273 Query: 438 NKIFRD 443 + +++ Sbjct: 274 SDVYKS 279 >gi|159896929|ref|YP_001543176.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889968|gb|ABX03048.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 579 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 47/272 (17%), Positives = 85/272 (31%), Gaps = 38/272 (13%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 ID +R S +P + VK + + N G LS N Sbjct: 305 RIDDTRHPEIDMYLSIMRPTGVVTDVPRQNVKVFENNNQIEGFSWVNLSRVQDPLN---I 361 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 ++ S+E D + A I + NV G Sbjct: 362 MLVIDTSGSMGPSKEGLTDGG--------LDAAKIAALDFIDHLPSNANV------GLIH 407 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F V D S + + + ++I + G TAI DA+ +Y + + Sbjct: 408 FGTLVTVDHSLTNDIGAVRQSISE-----LKPEGQTAIYDALAISYTQLRRAK------- 455 Query: 362 KNNLEAKKYIVLLTDGENTQD-NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 + +IVL++DG +T + +I KA I I + + + Sbjct: 456 -----GQTFIVLISDGADTASKGDNYDSIVAKATKANIPTYIIGLTSPEFDGQLLEDLQR 510 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + + ++ S +L F + E+ + Sbjct: 511 D--TKAMIYQTPSKEQL-GGFYTEVAQEVSGQ 539 >gi|284163331|ref|YP_003401610.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284012986|gb|ADB58937.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 1446 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/255 (12%), Positives = 70/255 (27%), Gaps = 30/255 (11%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D ++ A V +S + E + ++ + + Sbjct: 470 SDEPWTPVETEIDRDAGVATAEVDHFSFFSVFRIEEWEDETSDTITLDGNETDGEIGNGS 529 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 ++ FV+ S H A + ++ D+ R G + D Sbjct: 530 GIETADFVFVNDESGSMSGSPTHYAELAGKRFVGALT------DSERAGRVGYASGANLD 583 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + V + + G T ++ + + + Sbjct: 584 QPLT-----TDHDAVNSSLERLSASGGTNTRAGLRVGLNHLEEEGWENR---------SA 629 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNS 427 ++LL+DG + + + + A G+ I T+ N + E L A + Sbjct: 630 VMILLSDG---KSGSDPLPVAEDAAEAGVEISTVGLGNNINENE-----LREIAAITGGD 681 Query: 428 FFEANSTHELNKIFR 442 F+ +L F Sbjct: 682 FYHVEREEDLPDTFE 696 >gi|110634434|ref|YP_674642.1| hypothetical protein Meso_2084 [Mesorhizobium sp. BNC1] gi|110285418|gb|ABG63477.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 549 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 66/213 (30%), Gaps = 37/213 (17%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M+ + L + G+F +I AL + G G+ VD S L+ A Sbjct: 1 MLSIRRLRSACSALCRDRRGNFAVILALSALPVFGAAGLAVDYTNMSRTRSELQNALDA- 59 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ V+ R + I + ++ KN T+V Sbjct: 60 ----------AVLAVAQRGDKISDAEARSIAASFLTGNLSSAYKNMAVE-----RNGTSV 104 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 +++ + PLS F +G K + + A+ Y I Sbjct: 105 KLSAEAT--------------MPLS-FGGLIGRKEATVGASSTADMAFAYYE------IA 143 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 V+D + SM + + + +N V Sbjct: 144 LVLDTTGSMRGGKLQAMKEAVNGLIDDLSSRVT 176 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 57/187 (30%), Gaps = 55/187 (29%) Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 S L R + + G+T I + + + S E + + +K Sbjct: 368 TPLSNDYAALKREVSR-----FTADGNTNIMEGVAWGMRVL-SPREPFTEGKEPASDVEK 421 Query: 370 YIVLLTDGENTQD-------------------------------------NEEGIAICNK 392 +++LTDG N N +A C Sbjct: 422 IMIVLTDGANNMGLSNNRNHALGSSYSSFGYLVEDRLTRERSQRRVTEEMNRRTLAACEN 481 Query: 393 AKS-------QGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDR 444 AK + I TI L CA+ P +F++ S +LN IF++ Sbjct: 482 AKREYTPSKEDDVTIYTIRLEEPDVATG---TLLQECATGPGYYFDSPSRTQLNAIFKE- 537 Query: 445 IGNEIFE 451 I + I + Sbjct: 538 IRDGITK 544 >gi|24375056|ref|NP_719099.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349804|gb|AAN56543.1|AE015791_7 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 621 Score = 53.4 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 42/266 (15%), Positives = 84/266 (31%), Gaps = 23/266 (8%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + + LN F + K+ + + + + M+ L Sbjct: 165 KEGRLPQKDTLRVEEMLNYFSYNYPQPNKNEAPFSVTTELAPSPYNDDMMLLRIGLKGYE 224 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + +D S K L++ AL + + + + D V+ V GA Sbjct: 225 QSKAELGASNLVFLLDVSGSMASDDKLPLLQTALKMLTQQLDEQDKVSIVVYAGAAG--- 281 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 V+ D G I+ GST + +Q AY + H +K Sbjct: 282 -VVLD-----GAAGNDIKILTYALEQLTAGGSTNGAEGIQLAYQL------AQKHFVKGG 329 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + ++L TDG+ + + K GI + T+ F + + Sbjct: 330 INR---VILATDGDFNVGTTNLDELVDLVEVQKKHGIGLTTLGFGMGNYNDHLMEQLANK 386 Query: 422 CASPNSFFEANSTHELNKIFRDRIGN 447 + +S +E K+ +++G Sbjct: 387 --GNGQYAYIDSVNEARKVLVEQLGA 410 >gi|311253580|ref|XP_003125597.1| PREDICTED: matrilin-2-like [Sus scrofa] Length = 423 Score = 53.1 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 72/215 (33%), Gaps = 25/215 (11%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 L+ + FV S + +V+ + +I S+ I R+G Sbjct: 100 LHSWSGTRGCTEGPVDLVFVIDGSKSLGEENFEIVKQFVTGIIDSL-TISPKA--ARVGL 156 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ +V ++ + + + K A + + A++ ++ + E Sbjct: 157 LQYSTQVRTEFTLRN--FGSAKDMKKAVASMKYMGKGSMTGLALKHMFERSFTQIEGA-- 212 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 R + + IV TDG + ++ +KAK+ GI + + ++ L Sbjct: 213 RPLSARVPRVAIVF-TDG---RAQDDVSEWASKAKANGITMYAVGVGKAIEEE------L 262 Query: 420 SNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 AS F A + I +++ + Sbjct: 263 QEIASEPTDKHLFYAEDFSTM-----GEISDKLQK 292 >gi|301626452|ref|XP_002942405.1| PREDICTED: collagen alpha-6(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2615 Score = 53.1 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 73/196 (37%), Gaps = 21/196 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + G +D ++ +F+ S + ++ + +I + + VR G + Sbjct: 827 AVKEGCVDTEEADIYFLIDGSGSIYPEDFEDMKKFMIELISMFQV---GANRVRFGVVQY 883 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 +D ++ S H + + + E G T +A+ + +++ +D H++ Sbjct: 884 SDVRRTEFFIS--EHNTQKMLKDAISQIEQLGGGTLTGEALTSMKQLFVNAAKDRPHKVP 941 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + +V++TDGE + + ++ GI I I ++ + + Sbjct: 942 QS------LVVITDGE---SQDRVTEAAAEIRNDGITIFAIGVKNAVEEE------IRDI 986 Query: 423 A-SPNSFFEANSTHEL 437 A S F N+ L Sbjct: 987 AGSNEKMFFVNNFDSL 1002 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 63/201 (31%), Gaps = 22/201 (10%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 P ++ F+ SS +++ + S+++ +I V++G F+ Sbjct: 1014 CTPEACKNMKADIVFLVDSSASINSDDYETMKEFMESMVKQ-AEIGPDR--VQIGLIQFS 1070 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + +K I + T + +A++ +S Sbjct: 1071 SETKEEFPL--NRYKRKDEIQSAIRGIQQLSQGTLMGEALKYTLPYFSASKG-------G 1121 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + K+Y++++TDGE + G+ I I QQ L Sbjct: 1122 RVNTKQYLIVITDGEAQDAVGNP---AKAIRDHGVIIYAIGV-----QQANNTQLLEIAG 1173 Query: 424 SPNSFFEANSTHELNKIFRDR 444 + +S L F ++ Sbjct: 1174 KQEQVYYEDSFDSL--AFLNK 1192 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 61/163 (37%), Gaps = 16/163 (9%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A + + +D + +R+G +ND + + I+++ + G Sbjct: 254 KAKNFLYDLVSNLDVGINKIRIGLVTYNDETNPEFLL--NSYSSKTEILESIQNMKYVEG 311 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T A++ + ++ + + + ++++T+G+ ++ + KS Sbjct: 312 YTYTGRALEY----VNTTYFTQAAGSRFEESVAQILIIVTEGD---SSDTLTEPAKELKS 364 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHE 436 +GI + + ++ +Q + P+ F++ + + Sbjct: 365 RGISVYVVGTNIKYDRQ-----LQEASSKPDEKFFYQLDDFDD 402 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 58/179 (32%), Gaps = 23/179 (12%) Query: 276 DALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++ I + VR+G ++ + + + K Sbjct: 1420 TSAKTFMKEIVDSFTISENRVRIGVAQYSANPKKEFFL--NEYYSSSDMKKQIDSISQLK 1477 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 +T ++ +N +Y++++TDG + E A + Sbjct: 1478 ATTYTGKGLRFVKQFFDPANG-------GRKNVPQYLIVMTDGMSNDSVNEDAA---ALR 1527 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 S G++I +I + + + SP + +E + L D I +I +V Sbjct: 1528 SSGVKIFSIGIGLRNS-----FELVMIAGSPKNVYEVETFQAL-----DSIKRQIVAQV 1576 >gi|119961201|ref|YP_948618.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] gi|119948060|gb|ABM06971.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] Length = 354 Score = 53.1 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 57/171 (33%), Gaps = 23/171 (13%) Query: 12 KKLIKS--CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 ++L G +TA+LM +LG+ VDV L+ A ++ I + Sbjct: 2 RRLKHDDRERGAVAPMTAMLMVALLGMTAFAVDVAMMYSEHAQLQNGADSSAIGIAQACA 61 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA-VEMNPRKSA 128 Q+ A S + D V + +N Sbjct: 62 QNAASADCAAPTSAA-------------------TSLAGLNALDGVSNAPQASVNLGTGT 102 Query: 129 YQVVLSSRYDLLLNPLSL-FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 V SR N +L F R++G+++ IQ A+A+ S ++ Sbjct: 103 VDVTTQSRNTSGDNHFTLVFARALGVETANIQASAQAKFGGFSATDAIPLT 153 >gi|220922039|ref|YP_002497340.1| hypothetical protein Mnod_2052 [Methylobacterium nodulans ORS 2060] gi|219946645|gb|ACL57037.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 418 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 53/373 (14%), Positives = 105/373 (28%), Gaps = 23/373 (6%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++L++ +G ++ +LL+P+ LG+ + +D+ +H LK AA A + A L + Sbjct: 2 RRLLRDRSGQITVLASLLLPIGLGIAALAIDLSTLQLVKHRLKVAADAASLAAVAVLPDT 61 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + ++ T A QV Sbjct: 62 TTALDRALSIAADNAGTGAGTVTA--ASDVRFGSYNSAAKSFTPGATPAN------AVQV 113 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS-ML 190 S + F +++G + I A A S +Y V+D S S L Sbjct: 114 TASRNQAHGNPVVLAFAKALGWSTPDISVSAVAVRFSPAYC-------FLVLDPSASDAL 166 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK-----SLYYMLY 245 NC Q + + + N S Sbjct: 167 SVSGTGRLSVPNCGVQVNSTSATAATVGNNSTAQARSFCIVGGYSGTSFSPKPITKCAAA 226 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-ND 304 P PL + + + + I N + ++ G +F N Sbjct: 227 PDPLADIPEPAQPTAGCYYNGLNTGSGMTLPSNVTYCGKITLNGNGDFYLQPGLYYFKNA 286 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 V + S + + + G+ + Y ++ + Sbjct: 287 TVELLQNASLVGSGVTIFLDANSTLKFAASGTVNLKAPSSGPYRGLLIFQSRSAAATTTS 346 Query: 365 LEAKKYIVLLTDG 377 + +L+ DG Sbjct: 347 VVRGSPDILM-DG 358 >gi|258654082|ref|YP_003203238.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258557307|gb|ACV80249.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 618 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 44/254 (17%), Positives = 91/254 (35%), Gaps = 32/254 (12%) Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY-MLYPGPLDPSLSEEHFVDSSSL 264 QP + S V I P ++ ++L+ + P + + + S S Sbjct: 384 QPGEPITSSPYLIADGVTIALNPPGPSVLRDVRALWTQVRKPARVLVVMDVSGSMASESG 443 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL----- 319 K L + A S + + D +MG F + + + + + + Sbjct: 444 YGSESKLDLAKKAATSALGQLTDTD------QMGLWAFTTDLPTPDTITADLVGVGPLAQ 497 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + T + A + A + + + +V+LTDG N Sbjct: 498 TRQPIIDAISSLTPLNGTPLYAATREAAKAMNAQKDPNSINA---------VVVLTDGRN 548 Query: 380 --TQDNEEGI--AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANS 433 T ++ +G+ + A+ G+R+ TIA+ + L AS + ++A + Sbjct: 549 EYTDNDLDGLLRELNASAEEDGVRVFTIAY-----GPDADLATLQEISEASRAAAYDARN 603 Query: 434 THELNKIFRDRIGN 447 ++K+F D + N Sbjct: 604 PTSIDKVFSDVLSN 617 >gi|90408685|ref|ZP_01216835.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] gi|90310199|gb|EAS38334.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] Length = 349 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 71/196 (36%), Gaps = 17/196 (8%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S + + H I + V++ L I++ R+G F D Sbjct: 108 DLSGSMQKKDFVNQQGHKISRLDAVKEVLTDFIKT-------RQGDRLGLILFGDAAFVQ 160 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + +++T ST + DA+ + +++ + +K Sbjct: 161 TPFTADHDVWLDLLMQTRVN--MAGKSTHLGDAIGLTIKRFNEATKNQ----TSEKTREK 214 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCA--SP 425 ++L+DG +T I AK +RI IA K+ +Q ++ A S Sbjct: 215 VAIILSDGNDTGSYVPPIDAAMVAKVNAVRIYMIAIGDPKSVGEQSLDMQTINKIASVSG 274 Query: 426 NSFFEANSTHELNKIF 441 ++A + EL + Sbjct: 275 GQAYQALNQQELLNAY 290 >gi|168186710|ref|ZP_02621345.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295292|gb|EDS77425.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 693 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 69/208 (33%), Gaps = 29/208 (13%) Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I V+D S SM D + +N +P + + +G + + ++ + Sbjct: 85 IVLVVDTSGSMND-RNVEGKYKINYSAEPIFSEEEYRWYRRVSLGNGQKYIIHELIDTPE 143 Query: 239 SLY---------YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 Y+ + + ++ + K ++ A + + + Sbjct: 144 REIKIGKETYNEYIEINKKRYYLKYDYSDGNWYAIAYSEPKIDELQKAAKNFVNKFEIKA 203 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N +G + + S + + ++ +I + G T + D ++ A Sbjct: 204 NTK----IGLVSYGNNGKEVHSLTNELDRINSSIDS----GLSIGGGTNVGDGIRMANGI 255 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + N +A KYIVL+TDG Sbjct: 256 LN-----------NGSDADKYIVLMTDG 272 >gi|84498180|ref|ZP_00996977.1| putative membrane protein [Janibacter sp. HTCC2649] gi|84381680|gb|EAP97563.1| putative membrane protein [Janibacter sp. HTCC2649] Length = 654 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 75/226 (33%), Gaps = 28/226 (12%) Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 + G + P M++ + P + + +S VR A+ Sbjct: 54 RPGEEIVSIDPGMIATLDGRPSPVTSKPATRAQRTTVLLIDTSGSMGRSGMATVRTAVKD 113 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + S K VR+G F + + + + R V+ D G+TA+ Sbjct: 114 FLASAPK------DVRIGVVSFGNTAGPEIAPT-----TARAAVQAVVDDLRADGNTALF 162 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-EGIAICNKAKSQG-I 398 + A + + IVLL+DG+NT + G+A KA + + Sbjct: 163 SGVTQAVRML-------------GSTGDRSIVLLSDGKNTVGDRASGLAAAGKALTASQV 209 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 R+ + F+ + E F A S +A + F+ Sbjct: 210 RVEVVRFTTGENDPEALAAFAK--AGGGSVVQATDAEGVRTAFQTA 253 >gi|170741048|ref|YP_001769703.1| hypothetical protein M446_2844 [Methylobacterium sp. 4-46] gi|168195322|gb|ACA17269.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 61/456 (13%), Positives = 130/456 (28%), Gaps = 96/456 (21%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 +G + LL+ M+ G+ VD R S L AA A+++ + SL+ +S+ Sbjct: 24 RSGSVGFVFGLLLLPMMVAMGVSVDYARVSAARSDLAAAADAAVLSVTNKAAMSLDMLSA 83 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 +A ++ + ++N + + +V D + R + S R Sbjct: 84 QA---------RVRDAFLKNI--QTMPDISGVSADAVVIDL---LGVRAATLSYTASYR- 128 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 + F +G+++ + A +++ Y + ++D S SM ++ Sbjct: 129 -------TAFSGILGMRTLSVSGNAASKSAVPIY-----MDFYLLLDNSPSMGVGATSAD 176 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 + D S Sbjct: 177 ISTMVSRTPDK-----------------------------------CAFACHDLSAGNSD 201 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-----NDRVISDPSF 312 + + V + +VR A ++ + V R + + + Sbjct: 202 YYHLAKSLGVTMRIDVVRQATQRLMDTAANTALVPGQFRTALYTMGADCASVGLTTVSPL 261 Query: 313 SWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 S + + + G + + ++ D L +K + Sbjct: 262 SSDLAAAKTNAQAIDLMTIQKPGYNNDQCTDFDGVFQSLNGKI-DVAGDGSTALTPQKVV 320 Query: 372 VLLTDG--ENTQDNEEGIAI-------------CNKAKSQGIRI---MTIAFSVNKTQ-- 411 L++DG + + C K++GI+I T + Sbjct: 321 FLVSDGVADAYYPSTCTRKTTGGRCQEPLTLANCTTLKNRGIKIAVLYTTYLPLPTNDWY 380 Query: 412 -------QEKARYFLSNCASPNSFFEANSTHELNKI 440 Q + CASP +FE + T + Sbjct: 381 NTWIAPFQATLPSAMQGCASPGLYFEVSPTQGIADA 416 >gi|83717579|ref|YP_440458.1| hypothetical protein BTH_II2270 [Burkholderia thailandensis E264] gi|167579118|ref|ZP_02371992.1| hypothetical protein BthaT_13315 [Burkholderia thailandensis TXDOH] gi|257141105|ref|ZP_05589367.1| hypothetical protein BthaA_18159 [Burkholderia thailandensis E264] gi|83651404|gb|ABC35468.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 418 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/259 (12%), Positives = 83/259 (32%), Gaps = 4/259 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE-E 74 + G I+ AL++ V++G G+ +D+ + L+ +A + + A+ L ++ Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLTGAINLS 76 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 V A + + E + N FTD +A+ +P Y ++ Sbjct: 77 VPEAAGITAGHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAIT-SPSSIKYVKCMT 135 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 S+ ++ + G+ A A + + + + + + Sbjct: 136 SQTGIVNWFIQALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFIC--KAGTQTNPPV 193 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + S+ + N D S ++ + Y P + Sbjct: 194 AGATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGT 253 Query: 255 EEHFVDSSSLRHVIKKKHL 273 + +++ + + Sbjct: 254 PGNKAATTNAYNTRFGIYA 272 >gi|73980136|ref|XP_851163.1| PREDICTED: similar to vitrin isoform 2 [Canis familiaris] Length = 645 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 54/416 (12%), Positives = 130/416 (31%), Gaps = 49/416 (11%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSLEEVSSRAKNSFTFPKQK 89 L G + R+ + L AQT I + PL +Q + +++ + Sbjct: 263 FLIDGSSGIGKRRFRIQKQFLADVAQTLDIGPAGPLMGVVQYGDNPAAQFNLRTHMNSRD 322 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 ++ + + + N + R A V + + Sbjct: 323 LKTAIEKITQRGGLSNAGRAISYVTKNFFSKANGNRGGAPNVAV--------------VI 368 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 G + ++ + + +E G++I ++ + + Q +E N + Sbjct: 369 VDGWPTDKVEEASR-------FARESGINIFFITIEGATENEKQYVTEPNFSNKAVCRTN 421 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + L + ++ + S +D SS Sbjct: 422 GFYSLNVQSWFSLHKTVQPLVKRVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTGN 477 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ +A++ + + D R+GA + F + + I+ Sbjct: 478 FRTVLQF-VANLSKEFEISDTD---TRIGAVQYT--YEQRLEFGFDDYNTKSDILNAIKR 531 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+ A+ A + + ++ +K ++L+TDG + ++ Sbjct: 532 VGYWSGGTSTGAAINYALEQLFKKSKPNK---------RKLMILITDG---RSYDDVRIP 579 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A +G+ I + Q++ ++ A+ +SFF + L K I Sbjct: 580 AMVAHHKGVITYAIGVA--WAAQDELEVIATHPANDHSFF-VDEFDHLYKFVPKVI 632 >gi|46580532|ref|YP_011340.1| von Willebrand factor type A domain-containing protein [Desulfovibrio vulgaris str. Hildenborough] gi|46449951|gb|AAS96600.1| von Willebrand factor type A domain protein [Desulfovibrio vulgaris str. Hildenborough] Length = 420 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 71/445 (15%), Positives = 141/445 (31%), Gaps = 88/445 (19%) Query: 41 VDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 +D L+ A A + S+ L ++++ L+R Sbjct: 34 IDSGMLYLSHSRLQAAVDAAALAGSLQL----------------PYDPQLDKGLVRGAVT 77 Query: 101 -NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 + N+ + + + T S ++F+ ++GI S + Sbjct: 78 QYMDANYPEASLNGVTPGTEER------------SVTVTATATVPTIFMNALGIGSSEVH 125 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 KA A Y+K + + +VID S SM + + S + Sbjct: 126 AKATA-----GYNK---LEVVFVIDNSGSMKGTPIQQTNSAASQLVELIMPEGMMTSVKV 177 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 G V R + P V G L+PS E + +S ++ ++ Sbjct: 178 GLVPFRGKVHLPAGVDGLPDG-CRNADGTLNPSWLHEEYFKTSYRYPSGSSLNVPKNTCT 236 Query: 280 SVIRSIKKIDNVNDTVRMGATFFN-----DRVISDPSFSWGVH--------------KLI 320 S I ++ + +T+ + N + WG H K I Sbjct: 237 S-IPRVQGLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPEAPFTEGSSAKDI 295 Query: 321 RTIVKTFAIDENEMGSTAINDAMQT---AY--DTIISSNEDEVHRMKNNLEAKKYIVLLT 375 R ++ + E G + A+ AY + + H Sbjct: 296 RKVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKL--------- 346 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFFE 430 N + KAK GI + I F + + + + AS + +++ Sbjct: 347 -------NAAMLEEARKAKEAGIEVFAIRFGDSDSVDV---SLMKSIASSKAGTNDHYYD 396 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A S ++++ +F+ +IG ++ R++R Sbjct: 397 APSAYDIDDVFK-KIGRQLGWRLLR 420 >gi|3929911|dbj|BAA34707.1| complement factor B/C2B [Cyprinus carpio] Length = 833 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 66/196 (33%), Gaps = 28/196 (14%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV-KTFAIDENEMG 335 A + I+ ++K+D+ T+R G + S + + + + ++ K + G Sbjct: 360 AKNATIQLVQKLDSYEVTMRFGIISYASEAKEIVSITNDLSQDVHYVMRKLHEFSDKSHG 419 Query: 336 ---STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-------- 384 T ++DA+ Y+ + E++ I++ TDG + Sbjct: 420 NKRGTNLHDALNKVYEELALLRENKRSHFNETQN---VIIIATDGYSNMGPSPINILPKI 476 Query: 385 ----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS----PNSFFEANSTHE 436 + + K + + + A +QE L + AS F + Sbjct: 477 RNLFGYKSSVDHTKEELLDVYVFAVGQQVNKQE-----LQSIASIKKDERHVFVLKDYRQ 531 Query: 437 LNKIFRDRIGNEIFER 452 L +F I + + Sbjct: 532 LGLVFNQMISDSAVTK 547 >gi|171912901|ref|ZP_02928371.1| hypothetical protein VspiD_17015 [Verrucomicrobium spinosum DSM 4136] Length = 339 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 56/190 (29%), Gaps = 33/190 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ + A ++ IK N R+G F + + L+ + + Sbjct: 116 RQVDRMTAAKRVLVDFIKGRPND----RIGIVAFGGAPYNPCPPTLDHDWLLNNMDR-IQ 170 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G I S R+ K I+L+TDG N Sbjct: 171 TGIMEDG------------TAIGSGIAAAARRLDQLEVKSKVILLMTDGANNSGKLSPQD 218 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------EKARYF----LSNCAS--PNSFFEAN 432 A + GIRI I+ + F L A+ SFF A Sbjct: 219 AARLAATLGIRIHAISIGTPGMHPIYMPNGPPINSGRQEFDPETLQEVANIGSGSFFRAE 278 Query: 433 STHELNKIFR 442 L +IF+ Sbjct: 279 DLSTLERIFK 288 >gi|85705211|ref|ZP_01036310.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] gi|85670084|gb|EAQ24946.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] Length = 580 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 55/400 (13%), Positives = 116/400 (29%), Gaps = 82/400 (20%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++ I+ G +++ + + L +GG+ +D +R +L+ A++ + + Sbjct: 16 TEMFIQEEDGTVTVLSFFIFVMFLMMGGIGLDTMRQEMARASLQATLDRAVLAGATASTE 75 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + +E+Y ++ +++ D ++ + V Sbjct: 76 AGART-------------IVEDYFAKSGQSDYLLAQKDGDISTTLNAAKVTAGA------ 116 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 L+ + ++ G+ + A AE + V+D S SM Sbjct: 117 ---------ELSLDTYLMKLAGVPTLSASGTATAEVRIPK------LEAILVLDVSGSMA 161 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + Q + + K G + P+ S P Sbjct: 162 SNSKIQNLQTAAKDFV-------TTVMNSSKPGDTVMSIVPFSFSVT--------PPQSV 206 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF--------- 301 + S K+ L+S S+ VN V Sbjct: 207 FDALAVEETHNYSTCLEFKENDYQHATLSSGSSSLSSGIPVNQMVYTSVYGDFDNLDSGW 266 Query: 302 ---FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA-------YDTII 351 + D I +S + L I G+T+ N+ M + + Sbjct: 267 RSCYTDEYIRILPYSTSITDLHAKID-----ALQPAGNTSGNEGMNWGAALLDPTFREVT 321 Query: 352 SS-------NEDEVHRMKNNLEAK--KYIVLLTDGENTQD 382 +S +E + + E + K I+ + DG NT Sbjct: 322 ASMIAAGHLSETLANVPSDYDEPETLKAIIFMGDGANTTS 361 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 16/71 (22%), Positives = 33/71 (46%), Gaps = 5/71 (7%) Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 +C K++G+ + +I F V + + LS CAS + + S ++ F I Sbjct: 515 NVCKATKTEGVVVYSIGFEVPVNGTAENQ--LSACASSPAHYFRASGTDIKSAF-SAIAA 571 Query: 448 EIFERVIRITK 458 + + +R+T+ Sbjct: 572 NV--KQLRLTQ 580 >gi|330502932|ref|YP_004379801.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328917218|gb|AEB58049.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 566 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 72/207 (34%), Gaps = 23/207 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S H + +V+ L ++ ++ D V+ G + +V+ D Sbjct: 195 PPANLVFLVDVSGSMHRREGLPMVQGTLKLLVDQLRPQDRVSLVTYAG----DSQVLLDS 250 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + K+ I GSTA +Q AY H + + Sbjct: 251 APGSDKAKIRAAID-----QLTAGGSTAGESGIQLAYQQ------ASKHLIDGGINR--- 296 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ D + + + G+ + T+ F V+ + A + Sbjct: 297 ILLATDGDFNVGISDFDSLKQLAADKRKSGVSLTTLGFGVDNYNERLMEQLAD--AGNGN 354 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVI 454 + ++ E K+ D++ + + Sbjct: 355 YAYIDNLREARKVLVDQLSSTLATVAS 381 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + A+CN AK+QGI + TI F + L +CAS ++ E+ F Sbjct: 490 DARTRAVCNAAKNQGIVVYTIGFEAPSS----GTAVLKDCASSDAHHFDVRGLEIRDAFA 545 Query: 443 DRIGNEIFERVIRITK 458 I I R +R+T+ Sbjct: 546 S-IATSI--RQLRLTQ 558 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + A+CN AK+QGI + TI F + L +CAS ++ E+ F Sbjct: 492 DARTRAVCNAAKNQGIVVYTIGFEAPSS----GTAVLKDCASSDAHHFDVRGLEIRDAFA 547 Query: 443 DRIGNEIFERVIRITK 458 I I R +R+T+ Sbjct: 548 S-IATSI--RQLRLTQ 560 >gi|328953621|ref|YP_004370955.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] gi|328453945|gb|AEB09774.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] Length = 376 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 21/166 (12%), Positives = 50/166 (30%), Gaps = 3/166 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ-SLEE 74 + G +ITALL+PV++G G+ +D+ + ++ A A+ + L L Sbjct: 7 RHEEGAIAVITALLLPVLIGFTGLAIDIGNLYVIKTRMQSAVDAAVCGGGLKLPNQGLAM 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS-AYQVVL 133 ++ + + N + + + + + Sbjct: 67 TTANSFITSNGFDPNDATITYTQDTVNNPAGSPEINCSMTNQVPTFFLGLFGYPNISITV 126 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE-TVSRSYHKEHGVS 178 S++ L + + T S + + +S Sbjct: 127 SAKGILQTGGAGGPFNYAIFSDQNLPISGNQKITGSVHTNHQLTIS 172 >gi|126352405|ref|NP_001075268.1| calcium-activated chloride channel regulator 1 precursor [Equus caballus] gi|122142874|sp|Q2TU62|CLCA1_HORSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=eCLCA1; Flags: Precursor gi|46578151|gb|AAT01505.1| putative calcium activated chloride channel-like protein 1 [Equus caballus] Length = 913 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 33/189 (17%) Query: 276 DALASVIRSIKKI------DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 A+ ++ + + V +G F+ + R + + Sbjct: 317 MAIGDRLKRLTQAGKLFLLQTVEQGSWVGMVTFDSAAYVQSALRQIKGGTDRDALTKS-L 375 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+I +++A+ I + + IVLLTDGE+ I+ Sbjct: 376 PTVASGGTSICSGLRSAFTVIRKKYKTDGSE----------IVLLTDGEDN-----TISS 420 Query: 390 C-NKAKSQGIRIMTIAFS----VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD- 443 C N+ K G I T+A + K L AS +A + + L F Sbjct: 421 CFNEVKQSGAIIHTVALGPSAAAELEELSKMTGGLQTYASD----QAQN-NGLIDAFGAL 475 Query: 444 RIGNEIFER 452 GN + Sbjct: 476 SSGNGAVSQ 484 >gi|118087212|ref|XP_424219.2| PREDICTED: similar to matrilin 2 [Gallus gallus] Length = 1799 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 58/164 (35%), Gaps = 11/164 (6%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 L + + F+ SS V++ + ++++ + D R+G + Sbjct: 20 SLLENTCNNKRLDLVFIIDSSRSVRPYDFEKVKEFILTILQFLDVSP---DATRVGLIQY 76 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 V + S + + I + + T A+Q A + S +E Sbjct: 77 GSTVKHEFSLK--TFRRKQEIERAVRRMMHLATGTMTGLAIQYAVNIAFSESEGARPL-- 132 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 N + I+++TDG + + I KA++ GI I I Sbjct: 133 -NQNVPRIIMIVTDG---RPQDPVGEIAAKARNSGILIFAIGVG 172 >gi|89899605|ref|YP_522076.1| hypothetical protein Rfer_0795 [Rhodoferax ferrireducens T118] gi|89344342|gb|ABD68545.1| conserved hypothetical protein [Rhodoferax ferrireducens T118] Length = 424 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 62/218 (28%), Gaps = 5/218 (2%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL--IQSLEEVSS 77 G I+ L+M V++G G+ +D + L+ +A + AS L S+ S Sbjct: 20 GAVAIVVGLMMAVLVGFIGLALDGGHLYLTKTELQNSADACALAASYELTGAPSIAPASF 79 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + ++ + +N + A++ +A R Sbjct: 80 ARAEAAGQAVGQMNKVDFQNSAIASSDIVVSFGTDLSAGNAAIKWVNAGAALPSSKYVRC 139 Query: 138 DLL-LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + N + F++ + + + A G+ + I S Y Sbjct: 140 TITRSNIMPWFMQVLMPSLDTLTVSSLATATLAPAQNNCGIPM--AICSKGSAPSYGMTP 197 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 FG T +L+ + Sbjct: 198 GQWVSGFFGAGGGVTGSFNWIDFTPPAGGTSELAALLT 235 >gi|327261941|ref|XP_003215785.1| PREDICTED: hypothetical protein LOC100567114 [Anolis carolinensis] Length = 1225 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 47/219 (21%), Positives = 69/219 (31%), Gaps = 22/219 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P + FV S K + A+ ++ + + D Sbjct: 259 NGYFVHYFAPRGLPPVQKDVVFVIDISGSMYGTKMKQTKKAMHVILSDLHQDD------- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTI------VKTFAIDENEMGSTAINDAMQTAYDTI 350 FFN SD W + I+ K + G T IN A+ A Sbjct: 312 ----FFNIVTFSDTVNVWKPSQSIQATPQNIKKAKDYVSKMEADGWTDINAALLAAASVF 367 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSV 407 S+ M++ I+ LTDGE T G I + A+ I + +AF Sbjct: 368 NHSSPMAGKIMRDQRIP--LIIFLTDGEPTSGVTTGSRILSNAQQALKGTISLFGLAFGD 425 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + R L N +E K F D I Sbjct: 426 DADYGLLRRLSLENRGVARRIYEDADATLQLKGFYDEIA 464 >gi|292655414|ref|YP_003535311.1| von Willebrand factor type A domain-containing protein [Haloferax volcanii DS2] gi|291372503|gb|ADE04730.1| von Willebrand factor type A domain protein [Haloferax volcanii DS2] Length = 818 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 53/163 (32%), Gaps = 25/163 (15%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ +D + D R+G FN R R G+T I + Sbjct: 427 ALDALDQLGDENRVGIVGFNYRAYDVAPLR--PLGPNRESAADLIRRLESGGATDIAVGL 484 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A + I+L++DG + + +A ++ G+ ++TI Sbjct: 485 DGAAQQLGDRRGT--------------IILISDGHDRFQDAATLA--DQLGRDGVSVITI 528 Query: 404 AFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDR 444 N ++ L AS ++ A+ T L +F Sbjct: 529 GTGPNPNERT-----LRAIARASGGNYLRADETDRLRILFGGS 566 >gi|308070278|ref|YP_003871883.1| hypothetical protein PPE_03528 [Paenibacillus polymyxa E681] gi|305859557|gb|ADM71345.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 695 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 83/237 (35%), Gaps = 29/237 (12%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 V C + G + + FV +S + + ++ + D Sbjct: 26 VICTVNQANAASLGAVPIEGYDAVFVLDTSYSMRDTDPEGISAEVINMFMDLSDAD---- 81 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G +N V++ + ++ ++ N G T + ++ + + + Sbjct: 82 RTRVGFVAYNHNVVASKPLTSIAVAAQKSQIQQDIRTLNRSGYTDLGLGLRRGSELLAAG 141 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGEN-----------TQDNEEGIAICNKAKSQGIRIMT 402 + + +++LL+DGE N + ++ A+++G + T Sbjct: 142 A---------SQGRQPFMILLSDGETDFGASSGSRSKGDSNNDVSSVIKSAQTKGYPVYT 192 Query: 403 IAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 I + + T R L AS + F +S +L +I ++I +++ I Sbjct: 193 IGLNHDGTVN---RQELERIASQTGGASFITSSAEDLPEILNRIFASQIRSKLVPIA 246 >gi|18042139|gb|AAL57848.1|AF454755_1 vitrin [Mus musculus] Length = 650 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 45/389 (11%), Positives = 118/389 (30%), Gaps = 46/389 (11%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + QA V ++Q + +++ Q ++ + + + N Sbjct: 290 DVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQRGGLSNVGRAIS 349 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + R A V + + G + ++ + Sbjct: 350 FVTKTFFSKANGNRGGAPNVAV--------------VMVDGWPTDKVEEVSR-------V 388 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E G+++ ++ + + Q E + + + + L Sbjct: 389 ARESGINVFFITVEGAAEREKQHVVEPVFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVK 448 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ + S +D SS + +++ +A++ + + D Sbjct: 449 RVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTSNFRTVLQF-VANLSKEFEISDTD 503 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+GA + + + I+ G T+ A+Q A + + Sbjct: 504 ---TRVGAVQYTYEQRLQF--GFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF 558 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ +K ++++TDG + ++ A +G+ I + Sbjct: 559 KKSKPNK---------RKVMIIITDG---RSYDDVRIPAMAAYQKGVITYAIGIA--WAA 604 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKI 440 Q++ ++ A +SFF + L KI Sbjct: 605 QDELEVMATHPAKDHSFF-VDDFDNLYKI 632 >gi|312621140|ref|YP_004022753.1| yd repeat protein [Caldicellulosiruptor kronotskyensis 2002] gi|312201607|gb|ADQ44934.1| YD repeat protein [Caldicellulosiruptor kronotskyensis 2002] Length = 2994 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 86/233 (36%), Gaps = 36/233 (15%) Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 K + + +D ++ SS + + I++I + Sbjct: 746 KTEHFSTFLLGDKNMPVDLSKVDIVFVLDNSGSMSSNDPNYYRIEATKK----FIQNIDE 801 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 ++N R+G F+ V + + +KL++ + GST I ++ A Sbjct: 802 LNN-----RVGLVDFDSSVYVRSNLTSDKNKLLQALNAMRWTG----GSTNIGGGLKAAL 852 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + + + ++KK IVLL+DG + + K + I + TIA Sbjct: 853 E------------LFDQEQSKKIIVLLSDGYHNTGIHPNDVLPELIKQE-IVVNTIAL-- 897 Query: 408 NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ R L + A + +F ++T L+ ++ + +I ++TK Sbjct: 898 ---GKDCDRELLHDIADKTKGDYFYVDNTGGLS---QEDVDKQIELIYEKLTK 944 >gi|301784737|ref|XP_002927782.1| PREDICTED: collagen alpha-5(VI) chain-like [Ailuropoda melanoleuca] Length = 2524 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 60/157 (38%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D +D VR+GA ++D G + ++++ + G T A+ Sbjct: 841 VKKADVGSDRVRIGALKYSDYPEILFHL--GKYSNRSSVIEHLRRRRSTGGDTYTARALD 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E + + K+ ++++TDG + D K +++GI I + Sbjct: 899 H-----TNMMFTEEYGSRIQQNVKQMLIVITDGV-SHDRNLLNETALKLRNKGIDIYAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 Q + + + F ++ ++L I+ Sbjct: 953 V-----GQADQLELEAMAGNKSKTFHVDNFNKLKDIY 984 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 78/208 (37%), Gaps = 26/208 (12%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 N+ ++ + + ++ F+ SS ++ ++ + S++ I+ D Sbjct: 609 KNEVVHRICTEKGCEDMKADIMFLVDSSGSIGLENFGKMKTFMKSLLAKIQIGP---DRT 665 Query: 296 RMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G F+D+ + + + +++ I + ID+N + + +IS Sbjct: 666 HIGVIQFSDKTREEFQLNKYFTQNEISDAIDRMSLIDKNTLTG-----------NALISV 714 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 ++ + KK+++L+TDGE + + +G+ I ++ Q Sbjct: 715 DQYFTPAKGARIGIKKFLILITDGEAQDAVRDP---AKALRDKGVVIFSVGVYGANRTQ- 770 Query: 414 KARYFLSNCASPNS-FFEANSTHELNKI 440 L + + F+ S +L I Sbjct: 771 -----LEEISGDGNLVFQVESFDDLKAI 793 >gi|91216720|ref|ZP_01253685.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] gi|91185189|gb|EAS71567.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] Length = 349 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 44/117 (37%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + +T +++ D TAI+ A+ A ++ Sbjct: 130 RVGLVGYAGSAFPQVPITTDYAST-KTFLQSMNTDMVSSQGTAISQAIDLAKSYYNDDDQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K +++L++GE+ N E +A A ++GI+I TI + Sbjct: 189 TN-----------KVLIILSEGEDHDSNVESMA--ETAAAEGIKIYTIGVGTERGDP 232 >gi|188994392|ref|YP_001928644.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] gi|188594072|dbj|BAG33047.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] Length = 339 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 42/260 (16%), Positives = 77/260 (29%), Gaps = 43/260 (16%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + K +NG + + L + S+ + + +S L Sbjct: 48 PEASTKRRIWRNGFLLLAIVFLIGMLARPQISIRVDVPKKEKGIEAMICLDISNSMLCED 107 Query: 268 IK--KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +K + + L + ++ ND V G F + + + + + Sbjct: 108 VKPNRLSFAKQVLGKLFDGLQ-----NDKV--GLVVFAGNAYTQIPITTDLSAAKQFLAD 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G TAI A++ A + + E K I++LTDGEN + N Sbjct: 161 ISPNMVTAQG-TAIGAAIELASKSFSDNKE-----------IGKTIIVLTDGENHEGNA- 207 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ------------------EKARYFLSNCASPNS 427 I +A GIR+ I + + AS Sbjct: 208 -IEAAQQAHEAGIRVNVIGLGTALGAPIPIEEGYLKDETGNPVVTKFDEKMCRDIASAGE 266 Query: 428 --FFEANSTHELNKIFRDRI 445 FF S L + ++ Sbjct: 267 GTFFSGQSASALVRAIESQL 286 >gi|47218989|emb|CAG02027.1| unnamed protein product [Tetraodon nigroviridis] Length = 849 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 57/200 (28%), Gaps = 23/200 (11%) Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 K + E + N + P L FV S K R+A+ Sbjct: 236 KYDVNRENDLGDIQIANGYFVHFFAPKDLPRLPKNVVFVIDMSGSMSGTKMQQTREAMLK 295 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK-----TFAIDENEMG 335 ++ + D + G F+ R+ W T + G Sbjct: 296 ILEDL---DPEDHF---GIILFDHRIQF-----WNTSLSKATKENIDEAMVYVKAIQSYG 344 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T IN + A D + ED + I+LLTDG+ I K+ Sbjct: 345 GTDINAPVLKAVDML---KEDRKAKRLPEKSID-MIILLTDGDPNSGESRIPVIQENVKA 400 Query: 396 Q---GIRIMTIAFSVNKTQQ 412 + + ++ F + Sbjct: 401 AIGGQMSLFSLGFGNDVKYP 420 >gi|293359740|ref|XP_233802.5| PREDICTED: vitrin [Rattus norvegicus] Length = 648 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 82/242 (33%), Gaps = 22/242 (9%) Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 F A + S N + + K +V + S +D SS Sbjct: 415 FASKAVCRTNGFYSFNVQSWLSLHKTVQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSS 474 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + +++ +A++ + + D R+GA + F + + + Sbjct: 475 SVGTSNFRTVLQF-VANLSKEFEISDTD---TRIGAVQYT--YEQRLEFGFDKYNSKADV 528 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + G T+ A+Q A + + ++ +K ++L+TDG + Sbjct: 529 LSAIRRVGYWSGGTSTGAAIQYALEQLFKKSKPNK---------RKVMILITDG---RSY 576 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ A +G+ I + Q++ ++ A +SFF + L K F Sbjct: 577 DDVRIPAMAAYQKGVITYAIGIA--WAAQDELEVIATHPARDHSFF-VDEFDNLYK-FVP 632 Query: 444 RI 445 RI Sbjct: 633 RI 634 >gi|294011131|ref|YP_003544591.1| hypothetical protein SJA_C1-11450 [Sphingobium japonicum UT26S] gi|292674461|dbj|BAI95979.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 418 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 39/81 (48%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L++ TG+ ++ A MP+++G G+ D V+W+ ++ +++ A +A + + + Q Sbjct: 8 RLLRDRTGNVLMMAAASMPLLVGAAGLATDTVQWTLWKRQIQRQADSAALAGAYAVAQGF 67 Query: 73 EEVSSRAKNSFTFPKQKIEEY 93 S + + + Sbjct: 68 NASDSATADISRMALVALTQT 88 >gi|327265811|ref|XP_003217701.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Anolis carolinensis] Length = 914 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 75/234 (32%), Gaps = 13/234 (5%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 ++ + + N + P + FV S + KK +AL ++ Sbjct: 238 VKRSATAGDIQIVNGYFVHYFAPDQMPTLPKNIIFVIDKSGSMIGKKIQQTIEALQKILE 297 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + D+ N V G S + + + K + G T IN A+ Sbjct: 298 DLNPEDHFNLVVFSGEIS-----EWQSSLLKATEENVE-LAKQYVRTIMAQGGTDINGAL 351 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRI 400 TA +++ + E + IVLLTDG+ T +I K + Sbjct: 352 LTAINSLDRATSAE---LLPEQSIS-MIVLLTDGQPTVGETNVNSIQTNIKKANDGNYFL 407 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + F + + + L N +E + + F + I + ++ Sbjct: 408 YCLGFGFDVSYTFLEKLALENRGIARRIYEDSDAALQLQDFYQEVATPILKEIV 461 >gi|301617277|ref|XP_002938060.1| PREDICTED: matrilin-4-like [Xenopus (Silurana) tropicalis] Length = 721 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 77/244 (31%), Gaps = 19/244 (7%) Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 Y R +EG L G+ +G Y C + + Sbjct: 400 YCRCNEGYTLQADGKTCVADDMCNIVDHGCEFKCVSTPGSYYCICPEGQELQADGKTCNK 459 Query: 252 SLS---EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + FV S + LV++ + ++ + +G ++ RV + Sbjct: 460 CSTGYIDLVFVIDGSKSVRPQNFELVKEFVINI---VDSSAISAQGTHIGLVQYSSRVRT 516 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + S + + I + T A++ + S E K Sbjct: 517 EFPLSQYTNG--QDIKTAVKNIQYMEKGTMTGLALKHMVEQSFSEAEGARKN-----VPK 569 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +V TDG + ++ KAK GI + + K +++ S+ + +SF Sbjct: 570 IGLVF-TDG---RSQDDISEWAKKAKEAGITMYAVGVG--KAVEDELNEIASDPVNKHSF 623 Query: 429 FEAN 432 + A+ Sbjct: 624 YTAD 627 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 58/171 (33%), Gaps = 20/171 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I ++ T R+G ++ +V + S + K T Sbjct: 55 MIDIINSLEVGLSTTRVGVVQYSSQVQTVFSLK--TFSNKSDMEKAINEIIPLAQGTMTG 112 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + + E KN + +++TDG + + + +A+ GI I Sbjct: 113 LAIQYAMNVAFTEEEGARPLSKN---IPRVAIIVTDG---RPQDRVTEVAVQAREAGIEI 166 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + Q L AS + F S L+ F+D++ Sbjct: 167 YAVGV------QRADVSSLRAMASHPLDDHVFHVESFDLIQHLSIQFQDKL 211 >gi|77567855|gb|AAI07522.1| Matn3b protein [Danio rerio] Length = 299 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/263 (16%), Positives = 87/263 (33%), Gaps = 26/263 (9%) Query: 201 LNCFGQPADRTVKSYSSQNGKVG---IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 L+ F A T Y+ + ++ R +SP + S+ P + Sbjct: 12 LSAFLMEAQGTYGPYARNHNQLYAGRQRSPHISPNIPGHGNSINIGAPAEPCKSRPLDLV 71 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F+ SS + V+ L+ ++ S+ D +D R+ + V + Sbjct: 72 FIIDSSRSVRPAEFEKVKIFLSEMVDSL---DIGSDATRVALVNYASTVNIEFHLKKYFS 128 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 K + T A++TA + + + N K K +++TDG Sbjct: 129 KAEVKQAFSRIDPL--STGTMTGMAIKTAMEQVFTENAGARPLKKG---IGKVAIIVTDG 183 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS- 433 + ++ + A++ GI I + + + L AS + F + Sbjct: 184 ---RPQDKVEEVSAAARASGIEIYAVGVDRAEVRS------LKQMASQPLDDHVFYVETY 234 Query: 434 --THELNKIFRDRIGNEIFERVI 454 +L FR+ + E + Sbjct: 235 GVIEKLTSKFRETLCEEARADLT 257 >gi|300120207|emb|CBK19761.2| unnamed protein product [Blastocystis hominis] Length = 474 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/240 (17%), Positives = 76/240 (31%), Gaps = 43/240 (17%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL---RHVIKKKHLVRDALASVIRSIKKIDN 290 C + + G + +DSSS + K+ V+ L + Sbjct: 96 YGCYYNYTHYNNTGICQGVMDVVFLIDSSSSITDENYRKEIDFVKSILDYYY-----LHP 150 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 V + F+ V ++ + + I G T I A++ A+ + Sbjct: 151 NYTLVSI--LEFSTDVRVLQELTYDACDVRKAIDSDR-----MSGLTNIAKAIEEAHRIL 203 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENT------------QDNEEGIAICNKAKSQGI 398 +S D + IVL+TDG T N I AK+ I Sbjct: 204 KNSRSDIPDQ----------IVLITDGFQTVHSSINCNDHPHDCNAYAIEKARAAKADDI 253 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 +I TI ++ L AS + +F + + R+++ N V +I Sbjct: 254 QIYTIGVGAASYYEDD----LRQIASSPSDQYFSLVDDYSSIQTVREKLQNSTCPLVTQI 309 >gi|325969627|ref|YP_004245819.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] gi|323708830|gb|ADY02317.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] Length = 498 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 69/190 (36%), Gaps = 23/190 (12%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R + + + + +D L + L + K + +DA++ I+ Sbjct: 296 RKTMMHKLFTNKDIVVKEYANVKTIDIVLCLDVSGSMRELSSGMPKIEIAKDAVSQYIQF 355 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI--VKTFAIDENEMGSTAINDA 342 + K ++ R+ FN R WG+H++ R + G T + +A Sbjct: 356 LSKTND-----RLAMVLFNFRAD----VLWGLHQVRRYWQQMNYMLKYVYAGGGTNLANA 406 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 ++ + + + S + H ++ +TDG T ++ + + + G I T Sbjct: 407 LERSREVLTRSKSNSKH-----------VICVTDGR-TVNSSMCVKEAVRLRRNGTTIST 454 Query: 403 IAFSVNKTQQ 412 IA N + Sbjct: 455 IAIGENSDDE 464 >gi|313681552|ref|YP_004059290.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154412|gb|ADR33090.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 311 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 45/148 (30%), Gaps = 18/148 (12%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F D ++ + I + +TAI + + + S Sbjct: 137 GVVLFGDFAFIATPVTYEKEIVSEMI--GYLSHGMAGQNTAIGEGIAMGVRALRDSKAKS 194 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 K I+LLTDGE+ + + IR+ TI E Sbjct: 195 -----------KVIILLTDGEHNSGSISPKEAVAMVGKEHIRLYTIGIGQKG---EFDNA 240 Query: 418 FLSNCASPNS--FFEANSTHELNKIFRD 443 L A FF A + EL ++ + Sbjct: 241 LLKQLAHDGHGKFFAAANEKELQSVYDE 268 >gi|148706512|gb|EDL38459.1| vitrin, isoform CRA_a [Mus musculus] Length = 650 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 46/389 (11%), Positives = 119/389 (30%), Gaps = 46/389 (11%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + QA V ++Q + +++ Q ++ + + + N Sbjct: 290 DVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQRGGLSNVGRAIS 349 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + R A V + + G + ++ + Sbjct: 350 FVTKTFFSKANGNRGGAPNVAV--------------VMVDGWPTDKVEEVSR-------V 388 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E G+++ ++ + + Q E + + + + L Sbjct: 389 ARESGINVFFITVEGAAEREKQHVVEPGFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVK 448 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ + S +D SS + +++ +A++ + + D Sbjct: 449 RVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTSNFRTVLQF-VANLSKEFEISDTD 503 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+GA + F + + I+ G T+ A+Q A + + Sbjct: 504 ---TRVGAVQYT--YEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF 558 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ +K ++++TDG + ++ A +G+ I + Sbjct: 559 KKSKPNK---------RKVMIIITDG---RSYDDVRIPAMAAYQKGVITYAIGIA--WAA 604 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKI 440 Q++ ++ A +SFF + L KI Sbjct: 605 QDELEVMATHPAKDHSFF-VDDFDNLYKI 632 >gi|294141918|ref|YP_003557896.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578715|dbj|BAG66042.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328387|dbj|BAJ03118.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 689 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 52/391 (13%), Positives = 112/391 (28%), Gaps = 42/391 (10%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + AA+ + + I +V + + + + D Sbjct: 50 QTAAEKERESVAKQEITEARDVELKTSEGNAPQSLPMR-VQANDAMIEHRTLAEQAISDD 108 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRY----DLLLNPLSLFLRSMGIKSWLIQTKAEA----E 165 D +++++ R S R L N + + + G+ ++ + ++ Sbjct: 109 KSADMSIDVSSRSYEAGKQASERMIKMKSLSRNHIMGQMSAPGLPAFREASSSDNFKRQT 168 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 E VS + + S +R L G + +Y + Sbjct: 169 ANGIMVAGEIPVSTFSIDTDTGSYTTLRRWINQGRLPEKGTVRVEEMINYFNYQYSTPST 228 Query: 226 DEKLSPYMVSCNKSLYYMLY------------PGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 E+ S Y + + +D S + K L Sbjct: 229 VEQPFSVNTELAPSPYNDHKMLLRIGLKGYEVDKSQLGASNLVFLLDVSGSMNSRDKLPL 288 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ +L + + + + D+V+ V GA+ GV + + Sbjct: 289 LKTSLKMLSQQLSEQDHVSIVVYAGASGVVLD---------GVKGNDIYAINQALNNLKA 339 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 GST +Q AY + H ++ + ++L TDG+ + A+ + Sbjct: 340 GGSTNGGAGIQQAY------GLAQKHFIQGGVNR---VILATDGDFNVGTTDHQALMDLI 390 Query: 394 ---KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + QGI + T+ F Sbjct: 391 ASKRDQGIALTTLGFGQGNYNDHLMEQLADK 421 >gi|239620965|ref|ZP_04663996.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239516066|gb|EEQ55933.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 816 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/279 (12%), Positives = 75/279 (26%), Gaps = 70/279 (25%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL--NCFGQPAD 209 G + + + + I V+D S SM + QP + + Sbjct: 98 GTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVN 157 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + + +++N K+ + ++ +V + Sbjct: 158 KFINATAAENAKITDQSQRNRIALVKFAGTE----------------------------- 188 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ ND R G + +N + ++ V L T+ Sbjct: 189 -----KTSVG------------NDFYREGWSSYN-YTQIVSNLTYDVSGLTSTV-----N 225 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN------ 383 + G+T+ + A A + KK ++ TDGE + Sbjct: 226 GLSASGATSADYAFNRAQAALTYQPRANA---------KKVVIFFTDGEPNHGSGFDPTV 276 Query: 384 -EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + K G I +I + + Sbjct: 277 AATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKY 315 >gi|226314068|ref|YP_002773964.1| hypothetical protein BBR47_44830 [Brevibacillus brevis NBRC 100599] gi|226097018|dbj|BAH45460.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 677 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 78/201 (38%), Gaps = 23/201 (11%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + + K + A + I R+G +NDR++ S + R + Sbjct: 54 SNSMNKTDPGKTAAEVMSMFIDMS--EATRTRIGFVAYNDRIVQAQSPASMAEARNREQL 111 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----EN 379 K G + + ++ + I + + K +++LL+DG +N Sbjct: 112 KRTIQGLRYSGYSDLGLGLRRGAEMIEKAKDPAR---------KPFLILLSDGGTDLRQN 162 Query: 380 TQ------DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 N++ + +KAK+QG I TI + + + Q++ ++ A+ + F S Sbjct: 163 AGGRSVAASNKDVETVISKAKAQGYPIYTIGLNNDGSVQKEQLKKIAE-ATGGTSFVTQS 221 Query: 434 THELNKIFRDRIGNEIFERVI 454 T +L +IF I +++ Sbjct: 222 TDDLPEIFNQIFAKHIQSQLV 242 >gi|47523388|ref|NP_999313.1| calcium-activated chloride channel regulator 1 precursor [Sus scrofa] gi|75051712|sp|Q9TUB5|CLCA1_PIG RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; AltName: Full=pCLCA1; Flags: Precursor gi|6002646|gb|AAF00077.1|AF095584_1 epithelial chloride channel protein [Sus scrofa] Length = 917 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/271 (14%), Positives = 81/271 (29%), Gaps = 43/271 (15%) Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 F Q D V+ +N +++ + + ++ + + S Sbjct: 239 FAQSIDTVVEFCKEKNHNKEAPNDQNQKCNLRSTWEVIQDSEDFKKTTPMTTQPPAPTFS 298 Query: 264 LRHVIKKKHLV------RDALASVIRSIKKI------DNVNDTVRMGATFFNDRVISDPS 311 L + ++ + + ++ + + V +G F+ Sbjct: 299 LLQIGQRIVCLVLDKSGSMTVGGRLKRLNQAGKLFLLQTVEQGAWVGMVAFDSAAYVKSE 358 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 ++ ++ G T+I +++A+ I + I Sbjct: 359 LV-QINSAAERDALARSLPTAASGGTSICSGLRSAFTVIKKKYPTDGSE----------I 407 Query: 372 VLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYF------LSNCAS 424 VLLTDGE+ I+ C + K G I T+A + ++ L AS Sbjct: 408 VLLTDGEDN-----TISACFPEVKQNGAIIHTVALG--PSAAKELEELSQMTGGLQTYAS 460 Query: 425 PNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 +A + + L F GN + Sbjct: 461 D----QAEN-NGLIDAFGALSSGNRAASQRS 486 >gi|254456981|ref|ZP_05070409.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] gi|207085773|gb|EDZ63057.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] Length = 229 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 67/191 (35%), Gaps = 26/191 (13%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVKTFA 328 + A+ S++ S KK + + +++ F V + + K Sbjct: 33 IDTLNKAVESMLNSFKKAETMETFIKLSIITFGSENGVDLHTPLT--------EVSKIDF 84 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 GST M A+ + ED+ + + + IVLL+DGE D + + Sbjct: 85 KPLTVSGST----PMGAAFKMGKAMIEDK--DIFKGRDYRPTIVLLSDGEPNDDWRQPLD 138 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +++ M +A + F+ C NS F A + F+ Sbjct: 139 DFVSTGRTKKCDRMALAIGA--ADKTVLNMFIEGC--ENSLFYAEDAENIIDEFK----- 189 Query: 448 EIFERVIRITK 458 +I V + TK Sbjct: 190 KITMSVTQRTK 200 >gi|301764008|ref|XP_002917404.1| PREDICTED: matrilin-4-like [Ailuropoda melanoleuca] Length = 594 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 55/165 (33%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D + R+G ++ +V S G + T Sbjct: 59 LVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL--GAFARREDMEHAIRALVPLAQGTMTG 116 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 117 LAIQYAMNVAFSVAEGAR---PPEARVPRIAVIVTDG---RPQDRVAEVAAQARARGIEI 170 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 171 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 208 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 364 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 418 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 419 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 472 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 473 RAKEEGIVMYAVGVGKAVEEE------LRQIAS 499 >gi|294670381|ref|ZP_06735263.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291307845|gb|EFE49088.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 553 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 71/208 (34%), Gaps = 23/208 (11%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P + VD S + +K LV+ L + ++ D V + Sbjct: 185 KKELPPANLVFLVDISGSMNSPEKLPLVKKTLRILTEQLRPQDKVT------LITYASGE 238 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + G +K + GSTA A++ AY+ + +KN + Sbjct: 239 ELVLPPTSGRNK---DEILRAINKLQAGGSTAGESALKMAYEQAQKA------YVKNGIN 289 Query: 367 AKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 I+L TDG+ + ++ + + GI + T+ F ++ A Sbjct: 290 R---ILLATDGDFNVGVSSTDALKSMVAEKRKSGISLTTLGFGTGNYNEDMMEQIAD--A 344 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ ++ E K+ + ++ + + Sbjct: 345 GDGNYSYIDNEKEAKKVLQHQLTSTLAT 372 >gi|281345581|gb|EFB21165.1| hypothetical protein PANDA_005644 [Ailuropoda melanoleuca] Length = 581 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 55/165 (33%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D + R+G ++ +V S G + T Sbjct: 35 LVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL--GAFARREDMEHAIRALVPLAQGTMTG 92 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 93 LAIQYAMNVAFSVAEGAR---PPEARVPRIAVIVTDG---RPQDRVAEVAAQARARGIEI 146 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 147 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 184 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 381 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 435 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 436 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 489 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 490 RAKEEGIVMYAVGVGKAVEEE------LRQIAS 516 >gi|258651542|ref|YP_003200698.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554767|gb|ACV77709.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 681 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 68/193 (35%), Gaps = 33/193 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-ISDPSFSWG---------VHKL 319 + + A+ ++ ++ ++G + +D + G V L Sbjct: 78 RIDAAKAAVTDLLGTLPAP------TQVGLMVYGTSTGSTDAERAAGCQDIKTLAPVGTL 131 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + G T I +A++ A + + + IVL++DGE+ Sbjct: 132 NAATLTSQVAGITASGYTPIGNALRAAAQALPNEGP-------------RSIVLVSDGED 178 Query: 380 TQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T + + QG+ + T+ F V+ T +++ A+ ++ +A + L Sbjct: 179 TCAPPAPCDVARELHEQGVDLTVHTVGFKVDATARDQLSCVAQ--ATGGTYSDAGNATGL 236 Query: 438 NKIFRDRIGNEIF 450 + ++ I Sbjct: 237 TDALQAKVEVAIS 249 >gi|238759128|ref|ZP_04620297.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] gi|238702676|gb|EEP95224.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] Length = 448 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 62/468 (13%), Positives = 142/468 (30%), Gaps = 67/468 (14%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 IK G + +P+ +G+ + ++ + L A + A + ++ + Sbjct: 17 FIKDEIGAILWPFIIFLPLFIGLLYLSFEISHYLQKAAKLSDAIEQATLALTIENNTNNP 76 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + KN + R L V DI+ P Y+ Sbjct: 77 DETQTEKNIS------LVNAYAR---AYLPSESFSAPVIDIISH------PNYIEYRAAT 121 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + Y I +I + ++ + +V+D+S SM Sbjct: 122 TLNYT---PKFLTKELITNIDRRIIVSDNGVAIKNKFTSPGEITDVVFVVDYSVSMDGNF 178 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC------NKSLYYMLYPG 247 D + + + + +N + ++ +Y Sbjct: 179 GDEKKTTKIQELRRIFEDLNNTILKNNNTHTIGFVPFSWGTKKIIGKGIHRKIYCHFPFV 238 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 P P + D S + V++ + +IK I + + N + Sbjct: 239 PKTPMPPSYYLGDLKSYNPAKELTDAVKNNIDY-DETIKSITANYNFI-------NIPID 290 Query: 308 SDPSFSWGVHKLIRTIVKTF---------AIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 S+ + +++ I+ G T I+ + A D ++++ Sbjct: 291 DIKPSSFCLKGSDAYTLRSDDITNDNIQENIEHEVNGLTLISSGILVANDIFRKDSKNK- 349 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIA-----------ICNKAKSQGIRIMTIAFSV 407 K +++++DG + + + + +C + K IR++ I + Sbjct: 350 ---------DKLMIIISDGNDQEISSDLTQEKITKTLIEKGMCERIKENNIRMVFIGIAY 400 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + + +C +++EA + HEL R +G V R Sbjct: 401 TVKEIKW-----EDCVGKRNYYEAQNAHELEADLRQALGTIEASEVGR 443 >gi|291399641|ref|XP_002716222.1| PREDICTED: collagen, type XXIX, alpha 1 [Oryctolagus cuniculus] Length = 2738 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 75/207 (36%), Gaps = 22/207 (10%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 ++ + + ++ F+ S +++ + +++ I+ D ++G Sbjct: 613 VHSICTEKGCEDMKADIMFLVDGSSSIGYANFEKMKNFMQTLLAKIQI---GADKTQIGV 669 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D + + + I G+T A++ DT + ++ H Sbjct: 670 AQFSDYNKEEFPL--NKYFTQKEISDAIDRMLLITGNTLTGSALKF-IDTYFTQSKGARH 726 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +K K+++L+TDGE D E + +G+ I+++ Q L Sbjct: 727 GVK------KFLILITDGEAQDDVREP---AVALRDKGVIILSVGVYGANRTQ------L 771 Query: 420 SNCASPNS-FFEANSTHELNKIFRDRI 445 + S F + +L +I R I Sbjct: 772 EEISGDGSLVFHVENFEDLKEIERKLI 798 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 55/157 (35%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D + I++ + GST A+ Sbjct: 841 VKKSDVGRDRVQFGALRYSDDPDILFYL--NTYSNRSAIIEHLRRRRDTGGSTFTAKALG 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E H + K+ ++++TDGE + D K +++GI I+ + Sbjct: 899 RSATLFE-----EQHGSRIKQNVKQMLIIITDGE-SHDRHLLNDTALKLRNKGITIIAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + + +L ++ Sbjct: 953 V-----GKANQEELEAMAGNKENTIHVKDFDKLKDVY 984 >gi|290769676|gb|ADD61455.1| putative protein [uncultured organism] Length = 816 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 34/279 (12%), Positives = 75/279 (26%), Gaps = 70/279 (25%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL--NCFGQPAD 209 G + + + + I V+D S SM + QP + + Sbjct: 98 GTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVN 157 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + + +++N K+ + ++ +V + Sbjct: 158 KFINATAAENAKITDQSQRNRIALVKFAGTE----------------------------- 188 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ ND R G + +N + ++ V L T+ Sbjct: 189 -----KTSVG------------NDFYREGWSSYN-YTQIVSNLTYDVSGLTSTV-----N 225 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN------ 383 + G+T+ + A A + KK ++ TDGE + Sbjct: 226 GLSASGATSADYAFNRAQAALTYQPRANA---------KKVVIFFTDGEPNHGSGFDPTV 276 Query: 384 -EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + K G I +I + + Sbjct: 277 AATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKY 315 >gi|299535615|ref|ZP_07048936.1| BatA [Lysinibacillus fusiformis ZC1] gi|298728815|gb|EFI69369.1| BatA [Lysinibacillus fusiformis ZC1] Length = 972 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 60/144 (41%), Gaps = 15/144 (10%) Query: 303 NDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 N + ++ + G + K E G+T I + A + ++ Sbjct: 734 NIVIETNTKATVLGEGTTENVLKKDLYKASKEKGATDIFAGIDIALTKFSNDSKTS---- 789 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 K IV+++DG+ ++ + + N+AK QG+++ T++ + LS+ Sbjct: 790 -------KAIVVVSDGKTSKS--KMTKVINEAKKQGVKVYTVSMGKKSQVNDATLMQLSS 840 Query: 422 CASPNSFFEANSTHELNKIFRDRI 445 + ++F A +L+++F+ I Sbjct: 841 -ETSGAYFHAIDNMQLHQVFQKLI 863 >gi|301058344|ref|ZP_07199377.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447580|gb|EFK11312.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 598 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 46/143 (32%), Gaps = 18/143 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A + +K + R+G F+ + + L+ + Sbjct: 107 KPDRLERAKREITDFLKVVKGD----RVGLVAFSGAAYTQCPLTLDYGALMMFLNILHPN 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + T + A+ A +E K I+L+TDGE+ + + G+ Sbjct: 163 NI-PHPGTDLGAAVLGAIKAFDPKSET-----------DKVILLITDGEDNE--KRGLDA 208 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ 412 +A +GI+I Sbjct: 209 AREAVRKGIKIFVFGMGDPAGGP 231 >gi|47216147|emb|CAG10021.1| unnamed protein product [Tetraodon nigroviridis] Length = 1453 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 52/309 (16%), Positives = 90/309 (29%), Gaps = 32/309 (10%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 NP S G S + T + Q ++ + Sbjct: 440 NPASYLTFGPGGSSAGLMTYSAGGHAYFQSQNTG----QILLQSAGHHGGMAAYQSYPWG 495 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + QPA + G S +L + P S S S Sbjct: 496 SLYSQPAMQQRAPCPPAMGGAREHQPLTSTLPPLSFFTLRACVRRVPRTSSSSLTAAAAS 555 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 K L + I+ +D + R+G + RV ++ S H+ Sbjct: 556 DPSEFEQVKVFLAKV--------IEGLDVGPNATRVGVVNYASRVKNEVSLK--THRTKA 605 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++K E T A+Q A + S E R + + K +++TDG + Sbjct: 606 GLIKAVTKIEPLSTGTMTGLAIQFAMNVAFSEAEGARLR---SPDISKVAIVVTDG---R 659 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---TH 435 + + +A+ GI I I + L AS + S Sbjct: 660 PQDNVKDVAQRARDAGIEIFAIGVGRVEMST------LRQMASDPLDDHVDYVESYSVIE 713 Query: 436 ELNKIFRDR 444 +L K F++ Sbjct: 714 KLTKKFQEA 722 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 57/189 (30%), Gaps = 23/189 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + ++ F+ S + LV+ + +I + DN G ++ V Sbjct: 766 ACSNAATDVVFLIDGSKSVRPENFELVKKWINQIIDKLDVSDNKAHV---GLVQYSSAVK 822 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + G + + + + T A++ + ++ + + Sbjct: 823 QEFPL--GRYNNKKDLKEAVKKMAYMERGTMTGQALRY----LTDNSFGPGQGARPGVT- 875 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-- 425 K ++ TDG + + KAK QG ++ + + L AS Sbjct: 876 -KVGIVFTDGRSQDYIGD---AAKKAKDQGFKMYAVGVGNAVEDE------LKEIASEPT 925 Query: 426 -NSFFEANS 433 +F Sbjct: 926 AEHYFYTAD 934 >gi|218672731|ref|ZP_03522400.1| hypothetical protein RetlG_14377 [Rhizobium etli GR56] Length = 323 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 59/162 (36%), Gaps = 33/162 (20%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNE----DEVHRMKNNLEAKKYIVLLTDGEN 379 +K+ + GST ++ + + T+ + DE + + K +V +TDGE Sbjct: 168 LKSVVKNLTSEGSTRLDAGVVAGWYTLSPKWQGVWGDETSPAEVSDSVHKVMVFMTDGEM 227 Query: 380 T-----QDNEEGI------------------AICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 D + I C K GI I T+++S + Sbjct: 228 NTKYDPNDKFDWICSQTQSSACNAFATAAMQTACTAMKKSGIEIYTLSYSADADVVN--- 284 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + NCA+ + F S + ++ + I I +R+T+ Sbjct: 285 --IRNCATNTAHFFTASPATIKTVY-ETIAAAIRGDTLRLTQ 323 >gi|149410251|ref|XP_001508722.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 950 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 64/406 (15%), Positives = 132/406 (32%), Gaps = 52/406 (12%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 TA IT +I + + + + +++ + + R L V+ Sbjct: 60 TAFITNFSMIIDGVTYPGNIKEKAA--AQEQYKTAISRGESAGL------------VKAV 105 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV- 177 ++ + A V S++ L L R +G L+ + + + + + + Sbjct: 106 GRKVETFQVAVNVAPSAKVTFELVYEELLKRQLGKYELLLNIRP--KQLVKHLQMDIHIF 163 Query: 178 -----------SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 S + + ++ Q +++ L +P+ + + +R Sbjct: 164 EPQGITFLETESTFMTSNLTDALSVIQNETKAHILFKPKEPSQGDTILDGNFIVRYDVRQ 223 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + + N + P L FV S +K R+AL V+ + Sbjct: 224 DAIDSGIQIVNGYFVHYFAPTGLPTLPKSVVFVIDKSGSMDGRKIVQTREALLKVLGDLN 283 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 D N FN + + K F +D G T IN+A+ A Sbjct: 284 PEDQFN------LVVFNSMISQWQPSLLKATQENVGSAKKFVLDIRASGGTNINEAVLAA 337 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTI 403 + SN+ E+ + I+LLTDGE T I + + + Sbjct: 338 VHLLDESNQREL----LPENSVSMIILLTDGEPTVGETNPENIQQNIQRSLDGKYALFCL 393 Query: 404 AFSVNKTQQEKARYFLSNCASPNS------FFEANSTHELNKIFRD 443 F + + FL A NS + ++++ +L +++ Sbjct: 394 GFGFDVS-----YSFLEKMALDNSGLARRIYEDSDAALQLQDFYQE 434 >gi|116626306|ref|YP_828462.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116229468|gb|ABJ88177.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 310 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 80/222 (36%), Gaps = 20/222 (9%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ K M + D L +D+S+ K +DA + I+ + + Sbjct: 66 EVIESKKPQSIMEFNAESDLPLRLGILIDTSNSIRDRFKFE--QDAASEFIKGVVHANQD 123 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 F+ + KL I G TA+ DA+ A Sbjct: 124 KAM----LVSFDTKAELVSDLIGDTEKLDHAIRSLR-----PGGGTALYDAIFFACR--- 171 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 D++ + + + ++ IV+++DG++ Q A+ + + +I+ +++K + Sbjct: 172 ----DKLSQDQPKHKFRRAIVIVSDGDDNQSQYTRDQALEMAQKADVVLYSISTNISKIE 227 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + + A + F +L + F + I NE+ + Sbjct: 228 SDGDKVLKYYAAETGGKAFFPFKVEDLEQSF-ENIANELRHQ 268 >gi|51476525|emb|CAH18248.1| hypothetical protein [Homo sapiens] Length = 637 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 45/216 (20%), Positives = 74/216 (34%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + G + + F + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGGYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|56797861|emb|CAG27403.1| matrilin-3b [Danio rerio] gi|220675930|emb|CAX12089.1| matrilin 3b [Danio rerio] Length = 299 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 84/263 (31%), Gaps = 26/263 (9%) Query: 201 LNCFGQPADRTVKSYSSQNGKVG---IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 L+ F A T Y+ + ++ R +SP + S+ P + Sbjct: 12 LSAFLMEAQGTYGPYARNHNQLYAGRQRSPHISPNIPGHGNSINIGAPAEPCKSRPLDLV 71 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F+ SS + V+ + + +D +D R+ + V + Sbjct: 72 FIIDSSRSVRPAEFEKVKI---FLSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFS 128 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 K + T A++TA + + + N K K +++TDG Sbjct: 129 KAEVKQAFSRIDPL--STGTMTGMAIKTAMEQVFTENAGARPLKKG---IGKVAIIVTDG 183 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS- 433 + ++ + A++ GI I + + + L AS + F + Sbjct: 184 ---RPQDKVEEVSAAARASGIEIYAVGVDRAEMRS------LKQMASQPLDDHVFYVETY 234 Query: 434 --THELNKIFRDRIGNEIFERVI 454 +L FR+ + E + Sbjct: 235 GVIEKLTSKFRETLCEEARADLT 257 >gi|295092462|emb|CBK78569.1| von Willebrand factor type A domain. [Clostridium cf. saccharolyticum K10] Length = 2061 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 70/172 (40%), Gaps = 8/172 (4%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + Y + +D SLS ++ +D ++ + +D + + I D + + Sbjct: 1520 TKYPVDLVFVIDKSLSMDYDIDGDEIKWWEDETESRKDIVNDALDEI-IPDLCSQQYDIQ 1578 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + S W + + ++ I +T + A+ A D + + ++ Sbjct: 1579 IAGYQFSGSSTRVLDW--SREEQQVLNNLKISNTSY-NTEPSQALADALDMLKTGSQ--A 1633 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 H+ ++N+ KKY++ +TDGE T+ E +K G I TI S + + Sbjct: 1634 HQNQSNV--KKYLIFMTDGEPTESEELSYYAISKNPVPGASIYTIGVSSDAS 1683 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 62/209 (29%), Gaps = 37/209 (17%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S + ++ +V AL + D G F+D Sbjct: 1112 KSGSMDQSFGSGNSDARREVVNSALELFFNQLSDGDYNIQF---GGYKFSDSGERVNFND 1168 Query: 314 WG------VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 WG + GST + +++A + + ++ Sbjct: 1169 WGWQDKYWETDTSNALSHLKLTSWETDGSTYPSQTLRSAISALEN--------VELGENG 1220 Query: 368 KKYIVLLTDGENTQDNEEGIA----ICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 K+Y++ LTDGE Q++ C A G I + + + F+ Sbjct: 1221 KRYLIFLTDGEPGQNSYSFSKEEAENCYSAIKNLDSGTTFYAIQVANSDS-----HGFME 1275 Query: 421 NCASPNSF--------FEANSTHELNKIF 441 + S +F F NS ELN F Sbjct: 1276 SMVSNANFVDGVTAQKFVGNSADELNAAF 1304 >gi|224077994|ref|XP_002192008.1| PREDICTED: similar to matrilin 4 [Taeniopygia guttata] Length = 580 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 60/176 (34%), Gaps = 18/176 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ I +D + R+G ++ +V + S + + T Sbjct: 55 RFMMDIIGNLDVGPNATRVGVIQYSSQVQNIFSLK--TFFTRADMERAINSIIPLAQGTM 112 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + ++ E + + +++TDG + + + +A++ GI Sbjct: 113 TGLAIQYAMNVAFTTQEGARPL---HKRIPRIAIVVTDG---RPQDRVTEVATQARNAGI 166 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 I + Q L ASP F S EL + F + +++ Sbjct: 167 EIYAVGI------QRADMNSLRAMASPPLEEHVFLVESF-ELIQQFAKQFQDKLCA 215 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 61/180 (33%), Gaps = 23/180 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ ++ + R+G ++ RV ++ + I K E Sbjct: 362 ELVKQFVNRIVDLLEVSPHG---TRVGLVQYSSRVRTEFPL--NKYHSADEIKKAVMDVE 416 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S +E E R + + +V TDG + ++ Sbjct: 417 YMEKGTMTGLALKHMVE--HSFSELEGARPLSYNIPRIGLVF-TDG---RSQDDISEWAR 470 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFF----EANSTHELNKIFRDRI 445 +AK GI + + ++ L AS F + + L + F I Sbjct: 471 RAKESGIVMFAVGVGKAVEEE------LRAIASEPVEQHFSYSADFTTMTHLVENFSLNI 524 >gi|297286920|ref|XP_001082067.2| PREDICTED: collagen alpha-4(VI) chain-like, partial [Macaca mulatta] Length = 1624 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 67/217 (30%), Gaps = 11/217 (5%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + +L + + S G+ + C+ Sbjct: 157 ALRRAGILLYATGVRDAVWAELREIASSPQENFTSFVPYFSGLSNLAQKLRQELCDMLAK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGA 299 + P+ E D L + + + +D +D VR+G Sbjct: 217 AAPRVDHVSPACREAALADIVFLVDSSTSIGPQNFQKVKNFLYSVVLGLDISSDHVRVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +ND + H L T+++ G T A++ + E+ Sbjct: 277 AQYNDNIYPAFQL--NQHPLKSTVLEQIQNLPYRTGGTNTGSALE--FIRTNYLTEESGS 332 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 R K+ + + ++L+TDGE N+E + ++ K Sbjct: 333 RAKDRVP--QIVILVTDGE---SNDEVQEVADRLKED 364 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 38/296 (12%), Positives = 90/296 (30%), Gaps = 25/296 (8%) Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ + S+ + G++I V + S P + P + ++ ++ + Sbjct: 752 DSVSRPASHLRRAGITIYAVGTQNVSESKELEKIASYPHWKYSVPLESFLQLSVVRSKLI 811 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 ++ VS Y + +D S + + + + VI Sbjct: 812 NQLCSEMVDSKVSFRGMSYPLQEGCVHVEKADIYFLIDGSGSINPQDFLEM-KAFIKEVI 870 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + + N V+ G ++D++ S S V + + I Sbjct: 871 KMFQIGPNR---VQFGVIQYSDKIQSQFILS------QYPSVAELKVAID-----NIQQG 916 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 +N +V + +Y++++TDG + ++ + G+ I Sbjct: 917 GGGTATGEALNNMTQVFADTGRINVARYLIVITDG---KSSDPVAEAAEGLRENGVIIYA 973 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 I + L A FF L I ++ + + + +K Sbjct: 974 IGVREANIDE------LKEIAKDKIFF-VYEFDLLKDIQKEVVQDICSSEACKTSK 1022 >gi|34541235|ref|NP_905714.1| batB protein [Porphyromonas gingivalis W83] gi|34397551|gb|AAQ66613.1| batB protein [Porphyromonas gingivalis W83] Length = 339 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 64/210 (30%), Gaps = 43/210 (20%) Query: 258 FVDSSSLRHVIK--KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + +S L +K + + L + ++ ND V G F + + Sbjct: 98 DISNSMLCEDVKPNRLSFAKQVLGKLFDGLQ-----NDKV--GLVVFAGNAYTQIPITTD 150 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + + G TAI A++ A + + E K I++LT Sbjct: 151 LSAAKQFLADISPNMVTAQG-TAIGAAIELASKSFSDNKE-----------IGKTIIVLT 198 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ------------------EKARY 417 DGEN + N I +A GIR+ I + Sbjct: 199 DGENHEGNA--IEAAQQAHEAGIRVNVIGLGTALGAPIPIEEGYLKDETGNPVVTKFDEK 256 Query: 418 FLSNCASPNS--FFEANSTHELNKIFRDRI 445 + AS FF S L + ++ Sbjct: 257 MCRDIASAGEGTFFSGQSASALVRAIESQL 286 >gi|226314609|ref|YP_002774505.1| hypothetical protein BBR47_50240 [Brevibacillus brevis NBRC 100599] gi|226097559|dbj|BAH46001.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 947 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 56/151 (37%), Gaps = 28/151 (18%) Query: 296 RMGATFFN---DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 +G F+ V++ S + ++ G T I A+Q Y+ + + Sbjct: 447 YIGVIAFDDTPWDVVAPQSVT------KLDEIQQQISRIQADGGTDIFPALQLGYERVKA 500 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 N H ++LLTDG+ + +++ + + ++ I + T+A Sbjct: 501 MNTQRKH-----------VILLTDGQ-SALDDDYEGLLQQMTAENITVSTVAL-----GD 543 Query: 413 EKARYFLSNCAS--PNSFFEANSTHELNKIF 441 + R L A ++ AN + KIF Sbjct: 544 DSDRGLLEMIAELGKGRYYFANDAESIPKIF 574 >gi|116487355|ref|NP_001070824.1| chloride channel calcium activated 4-like [Rattus norvegicus] gi|116013527|dbj|BAF34587.1| calcium-activated chloride channel [Rattus norvegicus] Length = 905 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/258 (15%), Positives = 86/258 (33%), Gaps = 27/258 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S+ + N + + S + + + ++ + Sbjct: 242 SLDSVVEFCTEKTHNTEAPNLQNKICNGRSTWDVIKESADFQHAPPMRGTEAPPPPTFSL 301 Query: 248 PLDPSLSEEHFVDSS---SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 +D S S I + L+ A + I + +++ +G F+ Sbjct: 302 LKSRQRVVCLVLDKSGSMSSGDPITRLTLMNQAAELYLIQILEKESL-----VGLVTFDS 356 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 I + ++ + + + + G T+I + ++ ++TI SS++ Sbjct: 357 IAIVQNNLIRMIND-SSYLEISAKLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE--- 412 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 IVLLTDGE ++ I+ C + K G I TIA + + LS+ Sbjct: 413 ------IVLLTDGE-----DDQISSCFEEVKHSGAVIHTIALGPDA---ARELETLSDMT 458 Query: 424 SPNSFFEANSTHELNKIF 441 F+ + + L F Sbjct: 459 GGRRFYASEGINGLIDAF 476 >gi|327400025|ref|YP_004340864.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315533|gb|AEA46149.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 790 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 86/277 (31%), Gaps = 43/277 (15%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 E + + + + G G + N Y Y P +E Sbjct: 433 ENLKIEATPVNGTKELHLWVEDGGLYGPYSSSNGEAYETTNAGGTYTAYVVADFPYGEQE 492 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT--FFNDRVISDPSFSW 314 +++ I K + A + +K D V G + R + + Sbjct: 493 FYLNV-----YIAKIDAAKIAAKTFNGFLKSSDQVGVAYFGGDVPGGYTPRYDVSQTLTN 547 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 +I D G T + ++ A +++ +++L Sbjct: 548 DTLSANNSID-----DLWAYGGTPMGGGIKVARQELVA-------NTAPGNIP--VMIVL 593 Query: 375 TDGENTQDNEEG----------IAICNKAKS-----QGIRIMTIAFSVNKTQQEKARYFL 419 +DG T ++ I K + I I TI F + + L Sbjct: 594 SDGNPTLTSDGTASETLAIQEAIEEAETTKQTTIGGEQILIYTIGFGNDANE-----TLL 648 Query: 420 SNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 A SP+ ++ A ++ EL+ I+R +I E+ E+ + Sbjct: 649 KQIATSPDYYYFAATSEELSSIYR-QIAKELKEKAAK 684 >gi|149026142|gb|EDL82385.1| rCG29121 [Rattus norvegicus] Length = 905 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/258 (15%), Positives = 86/258 (33%), Gaps = 27/258 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S+ + N + + S + + + ++ + Sbjct: 242 SLDSVVEFCTEKTHNTEAPNLQNKICNGRSTWDVIKESADFQHAPPMRGTEAPPPPTFSL 301 Query: 248 PLDPSLSEEHFVDSS---SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 +D S S I + L+ A + I + +++ +G F+ Sbjct: 302 LKSRQRVVCLVLDKSGSMSSGDPITRLTLMNQAAELYLIQILEKESL-----VGLVTFDS 356 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 I + ++ + + + + G T+I + ++ ++TI SS++ Sbjct: 357 IAIVQNNLIRMIND-SSYLEISAKLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE--- 412 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 IVLLTDGE ++ I+ C + K G I TIA + + LS+ Sbjct: 413 ------IVLLTDGE-----DDQISSCFEEVKHSGAVIHTIALGPDA---ARELETLSDMT 458 Query: 424 SPNSFFEANSTHELNKIF 441 F+ + + L F Sbjct: 459 GGRRFYASEGINGLIDAF 476 >gi|89055932|ref|YP_511383.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] gi|88865481|gb|ABD56358.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] Length = 612 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 56/396 (14%), Positives = 125/396 (31%), Gaps = 78/396 (19%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ ++ G +L +M+G G+ +DV+R+ L+ A++ A+ Sbjct: 42 RQFMRREEGTITAFATMLFILMVGASGIAIDVMRYETQRSQLQYTLDRAVLAAAS----- 96 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 P+ + +Y + + + E + R Sbjct: 97 --------LTQPYDPEGVVRDYFAIAGIDGYRLDVRVEEGLNFRR--------------- 133 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + +L + S+F++ G+++ + + + + + + V+D S SM + Sbjct: 134 -VHAYAELEVR--SIFMQMFGVRAM------TSPAIGAAEERVRRIEVSMVLDISGSMGE 184 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 R + +P + V S + + + PY N G L Sbjct: 185 NNRMTNMRPA---AREFVTEVLSANENVNNELLVSVSIVPYNGRVNG--------GDLIE 233 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV--IRSIKKIDNVNDTVRMGATFFNDRVISD 309 S+ + S S + A+ ++ I D N+ + + + Sbjct: 234 SVFTYDDLHSESNCTRFAEADFTSTAIDPAVPLQRIAHWDRGNEEEDESFQWAHCQTDQY 293 Query: 310 PS-FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI--ISSNEDEVHRMKNNLE 366 + W + + N G TAI+ M A + ++ ++ Sbjct: 294 GAILPW---QHTEAALHAHIDSLNTGGWTAIDLGMNWAVGLLDPAAAPALTGLIASGHVH 350 Query: 367 AK----------------------KYIVLLTDGENT 380 + K +VL+TDG+NT Sbjct: 351 PEFSDRPAPYRDGDRATTIDDETIKVVVLMTDGDNT 386 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 32/65 (49%), Gaps = 6/65 (9%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELN 438 +Q + +AIC+ A + GI + I F ++ + + +CAS ++F+ E++ Sbjct: 541 SQADTNLMAICDVANAAGIIVYAIGFEA----PDRGQRVMEHCASVDANYFDVE-GREIS 595 Query: 439 KIFRD 443 + F Sbjct: 596 EAFAS 600 >gi|2623767|gb|AAB86531.1| Lu-ECAM-1 [Bos taurus] Length = 820 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 76/257 (29%), Gaps = 28/257 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK-VGIRDEKLSPYMVSCNKSLYYMLYP 246 S+ + N + + S + D + + M N + Sbjct: 243 SLHSVTEFCTEKTHNTEAPNLQNKMCNGKSTWDVIMNSVDFQNTSPMTEMNPPTHPTFSL 302 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + S S+ + + + A +I+ I+K V G F+ Sbjct: 303 LKSKQRVVCLVLDKSGSMSAEDRLFQMNQAAELYLIQVIEKGSLV------GMVTFDSVA 356 Query: 307 ISDPSFSWGVH-KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + I + G T+I ++ + II S++ Sbjct: 357 EIQNHLTRITDDNVYQKITAKL--PQVANGGTSICRGLKAGFQAIIHSDQSTSGSE---- 410 Query: 366 EAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 I+LLTDGE+ + N C K G I TIA + K LSN Sbjct: 411 -----IILLTDGEDNEINS-----CFEDVKRSGAIIHTIALGPSA---AKELETLSNMTG 457 Query: 425 PNSFFEANSTHELNKIF 441 FF L F Sbjct: 458 GYRFFANKDITGLTNAF 474 >gi|309791336|ref|ZP_07685859.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226646|gb|EFO80351.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 853 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 66/195 (33%), Gaps = 29/195 (14%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + K + ++A S++ D R+G F+ + F L ++ Sbjct: 415 VSKFDMAKEAAQLATESLQPED------RIGLLAFDTETLWVVPFQLISGGLSVAQIQEQ 468 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEG 386 G T I A++ + H VLLTDG + DN Sbjct: 469 IASLPSGGGTRIERALEVGLPALAEQPTKVRHA-----------VLLTDGRSFMNDNALY 517 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHEL--NKIFR 442 + A+SQ I + TIA ++ L A+ ++ A+ ++ + Sbjct: 518 QRLVETARSQQITLSTIAIGLDSDT-----ALLKQLAAWGGGRYYYADQPADIPRLTLLE 572 Query: 443 DRIG--NEIFERVIR 455 +I + E+ +R Sbjct: 573 SKIAGSDPAVEQALR 587 >gi|281341943|gb|EFB17527.1| hypothetical protein PANDA_002811 [Ailuropoda melanoleuca] Length = 652 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 53/151 (35%), Gaps = 17/151 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+GA + F + + I+ G T+ A+ A + + + Sbjct: 506 TRVGAVQYT--YEQRLEFGFDDYHTKSDILNAIKRVGYWSGGTSTGAAINYALEQLFKKS 563 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + +K ++L+TDG + ++ A +G+ I + Q++ Sbjct: 564 KPNK---------RKLMILITDG---RSYDDVRIPAMVAHHKGVTTYAIGVA--WAAQDE 609 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 ++ AS +SFF + L K I Sbjct: 610 LEVIATHPASDHSFF-VDEFDNLYKFVPKVI 639 >gi|221042206|dbj|BAH12780.1| unnamed protein product [Homo sapiens] Length = 888 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 244 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 299 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 300 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 357 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + + + F + + Sbjct: 358 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 410 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 411 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 444 >gi|149187720|ref|ZP_01866017.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] gi|148838600|gb|EDL55540.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] Length = 340 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 36/171 (21%), Positives = 63/171 (36%), Gaps = 17/171 (9%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ SV+ + + R+G F D F+ + A Sbjct: 123 TAVKSVLETFAAKREGD---RLGLILFGDSAYLQSPFT--ADHEAWLALLDQAQVGMAGE 177 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 ST + DA+ T I + E N +K ++LTDG +T I A++ Sbjct: 178 STHLGDAVGLTIKTYIDNPE--------NQTVEKVAIILTDGNDTDSLVPPIDAAKVAQA 229 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLS----NCASPNSFFEANSTHELNKIFR 442 GIR+ +A T ++A F + + F A S +L+ +++ Sbjct: 230 YGIRLYIVAMGSPNTTGDQAIDFSTIETMATVTGGQAFLAMSQEDLDAVYQ 280 >gi|254458660|ref|ZP_05072084.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084426|gb|EDZ61714.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 308 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 66/178 (37%), Gaps = 26/178 (14%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + +V++ +++ I S +K DN MG F ++ + L + Sbjct: 107 DLTRFDVVKEIVSNFISS-RKNDN------MGIVVFGAYSFIASPLTYDSNILKGVVSNL 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + M + + S V+ +K + K +LLTDG NT D+E Sbjct: 160 YI-------------GMAGKFTALFESLAQGVNLLKTSKSKTKIAILLTDGYNTPDSEFP 206 Query: 387 IAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + A QG+++ I E + L A + F A++ EL ++ Sbjct: 207 FDAAIDFANKQGVKVYPIGIG---KSDEYNQKMLEKIAEQTGGVAFGASNASELAIVY 261 >gi|114046077|ref|YP_736627.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113887519|gb|ABI41570.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 335 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 70/199 (35%), Gaps = 26/199 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S + +++ + + + ++ L + I + D R G F D Sbjct: 103 DLSGSMDEADFTTADGSTLTRLNAAKNVLKTFIAK-RSGD------RFGLILFGDAAFIQ 155 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + + + M T + DA+ + + Sbjct: 156 TPFT----ADQQVWLSLLEEAQTGMAGQSTHLGDAIGLGIKVFEQNPQPSE--------- 202 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA-- 423 ++ +++LTDG +T E + A ++GI+I TIA +Q + + Sbjct: 203 QQVMIVLTDGNDTGSFVEPVDAAKIAAARGIKIYTIAMGDPTHVGEQPMDMEVVQRVSQL 262 Query: 424 SPNSFFEANSTHELNKIFR 442 + F A EL+K ++ Sbjct: 263 TQARAFIAIDQAELDKAYQ 281 >gi|117921993|ref|YP_871185.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117614325|gb|ABK49779.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 335 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 70/199 (35%), Gaps = 26/199 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S + +++ + + + ++ L + I + D R G F D Sbjct: 103 DLSGSMDEADFTTADGSTLTRLNAAKNVLKTFIAK-RSGD------RFGLILFGDAAFIQ 155 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + + + M T + DA+ + + Sbjct: 156 TPFT----ADQQVWLSLLEEAQTGMAGQSTHLGDAIGLGIKVFEQNPQPSE--------- 202 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA-- 423 ++ +++LTDG +T E + A ++GI+I TIA +Q + + Sbjct: 203 QQVMIVLTDGNDTGSFVEPVDAAKIAAARGIKIYTIAMGDPTHVGEQPMDMEVVQRVSQL 262 Query: 424 SPNSFFEANSTHELNKIFR 442 + F A EL+K ++ Sbjct: 263 TQARAFIAIDQAELDKAYQ 281 >gi|2623765|gb|AAB86530.1| Lu-ECAM-1 [Bos taurus] Length = 794 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 76/257 (29%), Gaps = 28/257 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK-VGIRDEKLSPYMVSCNKSLYYMLYP 246 S+ + N + + S + D + + M N + Sbjct: 243 SLHSVTEFCTEKTHNTEAPNLQNKMCNGKSTWDVIMNSVDFQNTSPMTEMNPPTHPTFSL 302 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + S S+ + + + A +I+ I+K V G F+ Sbjct: 303 LKSKQRVVCLVLDKSGSMSAEDRLFQMNQAAELYLIQVIEKGSLV------GMVTFDSVA 356 Query: 307 ISDPSFSWGVH-KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + I + G T+I ++ + II S++ Sbjct: 357 EIQNHLTRITDDNVYQKITAKL--PQVANGGTSICRGLKAGFQAIIHSDQSTSGSE---- 410 Query: 366 EAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 I+LLTDGE+ + N C K G I TIA + K LSN Sbjct: 411 -----IILLTDGEDNEINS-----CFEDVKRSGAIIHTIALGPSA---AKELETLSNMTG 457 Query: 425 PNSFFEANSTHELNKIF 441 FF L F Sbjct: 458 GYRFFANKDITGLTNAF 474 >gi|254255255|ref|ZP_04948571.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] gi|124900992|gb|EAY71742.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] Length = 511 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 14/68 (20%), Positives = 31/68 (45%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + G II L + VM+G G+ +D+ + L+ +A ++A+ L + Sbjct: 100 RRGLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 159 Query: 72 LEEVSSRA 79 + + A Sbjct: 160 ISLSVAEA 167 >gi|224046761|ref|XP_002188607.1| PREDICTED: collagen, type XXII, alpha 1 [Taeniopygia guttata] Length = 1598 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 72/207 (34%), Gaps = 21/207 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + G +D+SS + VR +++++ + + D R Sbjct: 23 SGERTEAQRAGCKSVHYDLVFLLDTSSSVG-KEDFEKVRQWVSNLVETFEIGP---DKTR 78 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G ++DR ++ G +K I + + G+T DA++ S Sbjct: 79 VGVVRYSDRPSTEFDL--GKYKTREEIKEAARKIQYYGGNTNTGDALRYITTYSFSK--- 133 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E ++ KK +LLTDG + + A GIRI + E + Sbjct: 134 EAGGRLSDRTVKKVAILLTDGRSQDFVLDP---ATAAHQAGIRIFAVGVG------EALK 184 Query: 417 YFLSNCAS---PNSFFEANSTHELNKI 440 L AS F + + ++KI Sbjct: 185 EELDEIASEPKSAHVFHVSDYNAIDKI 211 >gi|219517748|gb|AAI36393.1| ITIH4 protein [Homo sapiens] Length = 935 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + + + F + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|149184581|ref|ZP_01862899.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] gi|148831901|gb|EDL50334.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] Length = 528 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 17/85 (20%), Positives = 39/85 (45%), Gaps = 2/85 (2%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+L +G+ ++ A+ PV+ G G+ VD+ ++ ++ ++ A + + Sbjct: 8 KRLQADTSGNALMLVAMGAPVLFGSAGLGVDMAQYYMWKREIQYAVDQGALAGAWSRGNG 67 Query: 72 LEEVS--SRAKNSFTFPKQKIEEYL 94 + +RAK F + ++YL Sbjct: 68 DMGLEYKTRAKQEFYINLSETKDYL 92 >gi|119585669|gb|EAW65265.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_b [Homo sapiens] Length = 914 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + + + F + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|110632968|ref|YP_673176.1| hypothetical protein Meso_0611 [Mesorhizobium sp. BNC1] gi|110283952|gb|ABG62011.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 427 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/62 (29%), Positives = 31/62 (50%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + + + L K G +I A++ PV++G G+ V+ W + L+ AA A+ A Sbjct: 9 SNLFRFFRSLAKDQGGSVAVIAAIVFPVVVGAMGLGVESGYWYLKQRKLQHAADVAVYAA 68 Query: 65 SV 66 SV Sbjct: 69 SV 70 >gi|55380211|ref|YP_138060.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] gi|55232936|gb|AAV48354.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] Length = 1562 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 53/392 (13%), Positives = 114/392 (29%), Gaps = 43/392 (10%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + PL + + N ++ ++++T + + T Sbjct: 315 GTDPLKADTDNDGLPDGEEVRLGTDPLVADSNDNGVSDGEESYTTTATNETLGVTLSLTG 374 Query: 124 PRKSAYQVVLSSRYDLLLNPLS-LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 ++ + D N + + + + + + V+ Y++ V Sbjct: 375 NGDIGNGTTIAPQDDPRFNTSRVGNMSASPVVELNSEQEFSSANVTLGYNETG------V 428 Query: 183 IDFSRSMLDYQRDSEG---QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + S+ + + D E PLN + T + ++ + D + + Sbjct: 429 ENESQDLAVFTYDPEAGIFVPLNSTVDATNNTATAETTHFSTFAVFDISNWATTYNATEP 488 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + G ++ S V + + +A ++ D R Sbjct: 489 VRQTDDDGLRPVDVTLVMDTSGSMSSSVKLRNTAGQRFVAGLL----------DVDRAAV 538 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+ + T+ G T I + TA S++ D Sbjct: 539 VDFDSSAYVAQDLTSDFGAANSTLDN-----LGSGGGTDIGSGLSTANSQFASNSNDSR- 592 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + ++LLTDG GI+ A +Q + T+ F + ++K R Sbjct: 593 --------AQVMILLTDGRGN----GGISEAQTAANQNTTVYTVGF--DNANRDKLRDIA 638 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + + F EL +F RI E Sbjct: 639 N--ITDGEFNYVTDRSELPNVF-SRIAENTTE 667 >gi|291544120|emb|CBL17229.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ruminococcus sp. 18P13] Length = 1117 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 50/377 (13%), Positives = 104/377 (27%), Gaps = 60/377 (15%) Query: 108 DREVRDIVRDTAVEMNP--RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + T +E NP R + Q L S L + +S+ + G + + + + Sbjct: 407 GEAVAQGIDPTGIEENPAVRTQSVQTALESTEKPLFDNVSVTMNVGGNLNQHLSIRNIEK 466 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQ-RDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + S + +S I S + + + L + V Y +N + I Sbjct: 467 EDTLSANVVGALSAPIEITSDASFTSAKITFTYDEALLGDTPAENLAVMWYDRENRRYVI 526 Query: 225 RDEKLS-----------------------PYMVSCNKSL--YYMLYPGPLDPSLSEEHFV 259 D+ C + Y G + Sbjct: 527 LDKDTVVDPNAHTVSYTTTHFSTYLVVDREVWYDCWRENIDYRSGDAGTSQLEPYDIGLC 586 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + + AL + I ++ DN F+D + Sbjct: 587 VDVSGSMYGDRLEKAKTALNTFIDAMLPQDNAC------MVSFSDNAYLVAG-----YGA 635 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + ++++ ++ T + + + + K I+++ DG+ Sbjct: 636 SKEVMRSRTNQLRDLYGTNTDVGLSKTISILADQGRSDA---------SKMIIMICDGDV 686 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + AK+ GI + TI L A + ++ A +T E+ Sbjct: 687 NYIQG----TVDAAKAAGIAVYTINVV------SGDNDLLQKIADETGGEYYYAATTEEV 736 Query: 438 NKIFRDRIGNEIFERVI 454 G + Sbjct: 737 VSQVEAIRGETVSAVDT 753 >gi|118353828|ref|XP_001010179.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291946|gb|EAR89934.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 511 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 59/180 (32%), Gaps = 24/180 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ L ++ + D R+ FN + + + K Sbjct: 104 KIQNVKKTLEYLLELLGDND------RLCLILFNSKATKLCHLMR-TNNSNKPAFKEIIN 156 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T IN M+ A+ + H ++ + LL+DG++ + + Sbjct: 157 KIEANGGTDINSGMELAFRVLKDRK---YHNPVSS------VFLLSDGQDGSADLKVRQS 207 Query: 390 CNK-AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIG 446 + + I + F + ++ S +F+ +++++ F D +G Sbjct: 208 LERHLPQECFTIHSFGFGSDHDGP-----LMNKICSLKDGNFYYVEKINQVDEFFVDALG 262 >gi|73974730|ref|XP_539177.2| PREDICTED: similar to collagen, type XXII, alpha 1 [Canis familiaris] Length = 1628 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 58/172 (33%), Gaps = 20/172 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 VR +A+++ + + D R+G ++DR + G+ + Sbjct: 63 EKVRQWVANLVDTFEVGP---DRTRVGVVRYSDRPTTAFEL--GLFGSREAVKAAARHLA 117 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G+T DA++ S + K+ +LLTDG + + + Sbjct: 118 YHGGNTNTGDALRFITRHSFSRQ---AGGRPGDRAFKQVAILLTDG---RSQDLVLDAAA 171 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 A GIRI + ++ L AS F + ++KI Sbjct: 172 TAHRAGIRIFAVGVGAALREE------LEEIASEPKSAHVFHVSDFDAIDKI 217 >gi|4096840|gb|AAD05198.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Homo sapiens] Length = 930 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + + + F + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|330830423|ref|YP_004393375.1| FlpL [Aeromonas veronii B565] gi|328805559|gb|AEB50758.1| FlpL [Aeromonas veronii B565] Length = 460 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 66/449 (14%), Positives = 137/449 (30%), Gaps = 37/449 (8%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L+ G AL++ +L + G+ +++VR + L AA I +A+ + Sbjct: 3 LLSEQRGGLSPAFALMLVGVLALTGVAIELVRGYNGQSLLSAAADAVIYSAADSDTALQD 62 Query: 74 EVSSRAKNSF------TFPKQKIEEYLIRNFENNLKKNFTDREVRDI-VRDTAVEMNPRK 126 + N P+ E + D D + R Sbjct: 63 AQALMQANLSGRNLQVATPRLSQGEQGAQVILQGKVPALMDLSAIDEGEEGMPIAAAARA 122 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV---I 183 S+ + + L ++ IK L++ E ++E VSI + Sbjct: 123 SSARSRIEVALVLDISDSMSGAPMKAIKQGLVEF-GEVLFGRERRNQERVVSIIPATGLV 181 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + + +S P +R + S+ R + + Sbjct: 182 NIGDHPELFHPESIEIPFGLRTLAEERGWLNLLSREVPGRQRKAFCARLPEHVDGIDRVA 241 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS--VIRSIKKIDNVNDTV---RMG 298 L + + K + ++ K + ++ + R Sbjct: 242 EVTPGWIRKLEQAPRDELRPHLFYSTKPPPIARYGDGTPLLAFAPKENPLDRYLENRRDK 301 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS----- 353 F+D H R + +T + + + + Sbjct: 302 LGIFDDPDCGVSPIQ--AHLSTRADYRQALDTLYAAFNTNTAEGVMWGWRLLSPEWQGRW 359 Query: 354 --NEDEVHRMKNNLEAKKYIVLLTDGENTQD----NEEGIAICNKAKSQGIRIMTIAFSV 407 + R + +K +VL +DGE+ + + + + +C + K +GI+I T+AF Sbjct: 360 RQGAAALPRPYELQDNRKIMVLFSDGEHMTEAALRDRKQLLLCREMKRKGIQIYTVAF-- 417 Query: 408 NKTQQEKARYFLSNCASPNSF-FEANSTH 435 E F++ CAS S F+A ++ Sbjct: 418 -----EGDTRFVAQCASDRSLAFKATKSN 441 >gi|226315298|ref|YP_002775194.1| hypothetical protein BBR47_57130 [Brevibacillus brevis NBRC 100599] gi|226098248|dbj|BAH46690.1| hypothetical membrane protein [Brevibacillus brevis NBRC 100599] Length = 424 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 74/234 (31%), Gaps = 26/234 (11%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + Y + K+ + L + + L Y + + +D+S + Sbjct: 74 QRYLGFSWKMLVPSTFLMVGVAALLLQLLYGSSFQQASGANNIVMVLDTSGSMQSSDPDN 133 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDE 331 + A A +++ + + + F+D+ + + +VK Sbjct: 134 QLFKAAADMVQRMDS------DMNIAVVTFHDQTNVLQPLTELSSQSVKDEVVKKLLQFP 187 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T I+ A+Q D + + N A +VL++DG + + A Sbjct: 188 RTDGGTRIDLALQAGLDQLQA-----------NQMANSTVVLMSDG---YSDLDVPAALA 233 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 K + + T+ S L A + S+F ++ IF Sbjct: 234 PYKQNQVIVHTVGMSQID---ADGTALLQKIAAETGGSYFNVEHADQMTGIFGQ 284 >gi|118349484|ref|XP_001008023.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289790|gb|EAR87778.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 646 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 58/180 (32%), Gaps = 22/180 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ L ++ + D R+ FN + V +++ Sbjct: 224 KIQNVKTTLLQLLDMLNSND------RLSLILFNSYPTLLCNL-RKVDDENTPNIQSIIN 276 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T IN M A++ + I LL+DG++ +E+ Sbjct: 277 SITADGGTDINSGMLMAFNILQKRQ---------FFNPVSSIFLLSDGQDNGADEKIKKY 327 Query: 390 CN---KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 N K++ I + F + R +F+ +++++ F D +G Sbjct: 328 INSNQSLKNECFSIHSFGFGSDHDGPLMNRICQLK---DGNFYYVEKINQVDEFFVDALG 384 >gi|73980138|ref|XP_540147.2| PREDICTED: similar to vitrin isoform 1 [Canis familiaris] Length = 649 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 54/416 (12%), Positives = 130/416 (31%), Gaps = 49/416 (11%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSLEEVSSRAKNSFTFPKQK 89 L G + R+ + L AQT I + PL +Q + +++ + Sbjct: 267 FLIDGSSGIGKRRFRIQKQFLADVAQTLDIGPAGPLMGVVQYGDNPAAQFNLRTHMNSRD 326 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 ++ + + + N + R A V + + Sbjct: 327 LKTAIEKITQRGGLSNAGRAISYVTKNFFSKANGNRGGAPNVAV--------------VI 372 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 G + ++ + + +E G++I ++ + + Q +E N + Sbjct: 373 VDGWPTDKVEEASR-------FARESGINIFFITIEGATENEKQYVTEPNFSNKAVCRTN 425 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + L + ++ + S +D SS Sbjct: 426 GFYSLNVQSWFSLHKTVQPLVKRVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTGN 481 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ +A++ + + D R+GA + F + + I+ Sbjct: 482 FRTVLQF-VANLSKEFEISDTD---TRIGAVQYT--YEQRLEFGFDDYNTKSDILNAIKR 535 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+ A+ A + + ++ +K ++L+TDG + ++ Sbjct: 536 VGYWSGGTSTGAAINYALEQLFKKSKPNK---------RKLMILITDG---RSYDDVRIP 583 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A +G+ I + Q++ ++ A+ +SFF + L K I Sbjct: 584 AMVAHHKGVITYAIGVA--WAAQDELEVIATHPANDHSFF-VDEFDHLYKFVPKVI 636 >gi|315080701|gb|EFT52677.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] Length = 320 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R V T Sbjct: 109 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVVVPPT-----TDRAAVSTAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 214 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQL 273 Query: 438 NKIFRD 443 + +++ Sbjct: 274 SDVYKS 279 >gi|314981157|gb|EFT25251.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091980|gb|EFT63956.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 320 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R V T Sbjct: 109 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVVVPPT-----TDRAAVSTAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 214 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQL 273 Query: 438 NKIFRD 443 + +++ Sbjct: 274 SDVYKS 279 >gi|314923047|gb|EFS86878.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966819|gb|EFT10918.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093261|gb|EFT65237.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103481|gb|EFT75457.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327645|gb|EGE69421.1| von Willebrand factor, type A [Propionibacterium acnes HL103PA1] Length = 320 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R V T Sbjct: 109 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVVVPPT-----TDRAAVSTAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 214 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQL 273 Query: 438 NKIFRD 443 + +++ Sbjct: 274 SDVYKS 279 >gi|289425049|ref|ZP_06426826.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154027|gb|EFD02715.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|313764512|gb|EFS35876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772105|gb|EFS38071.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313801850|gb|EFS43084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807459|gb|EFS45946.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809969|gb|EFS47690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313812999|gb|EFS50713.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816053|gb|EFS53767.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818504|gb|EFS56218.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820270|gb|EFS57984.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822921|gb|EFS60635.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825147|gb|EFS62861.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827718|gb|EFS65432.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830298|gb|EFS68012.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833672|gb|EFS71386.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|314915506|gb|EFS79337.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314920024|gb|EFS83855.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925157|gb|EFS88988.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932038|gb|EFS95869.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955908|gb|EFT00308.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958220|gb|EFT02323.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960059|gb|EFT04161.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962858|gb|EFT06958.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967774|gb|EFT11873.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973303|gb|EFT17399.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975981|gb|EFT20076.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978482|gb|EFT22576.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314988184|gb|EFT32275.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] gi|314989987|gb|EFT34078.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078073|gb|EFT50124.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315084373|gb|EFT56349.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085714|gb|EFT57690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088866|gb|EFT60842.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096218|gb|EFT68194.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098476|gb|EFT70452.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101164|gb|EFT73140.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315105440|gb|EFT77416.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] gi|315108385|gb|EFT80361.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326130|gb|EGE67920.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA2] gi|327330198|gb|EGE71947.1| von Willebrand factor, type A [Propionibacterium acnes HL097PA1] gi|327331995|gb|EGE73732.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA3] gi|327443197|gb|EGE89851.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445982|gb|EGE92636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448038|gb|EGE94692.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450840|gb|EGE97494.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453083|gb|EGE99737.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453814|gb|EGF00469.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753528|gb|EGF67144.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754259|gb|EGF67875.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754490|gb|EGF68106.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760648|gb|EGF74215.1| von Willebrand factor, type A [Propionibacterium acnes HL099PA1] Length = 320 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R V T Sbjct: 109 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVVVPPT-----TDRAAVSTAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 214 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQL 273 Query: 438 NKIFRD 443 + +++ Sbjct: 274 SDVYKS 279 >gi|282854077|ref|ZP_06263414.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583530|gb|EFB88910.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] Length = 318 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R V T Sbjct: 107 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVVVPPT-----TDRAAVSTAI 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 156 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 212 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQL 271 Query: 438 NKIFRD 443 + +++ Sbjct: 272 SDVYKS 277 >gi|197295155|ref|YP_002153696.1| hypothetical protein BCAS0306 [Burkholderia cenocepacia J2315] gi|195944634|emb|CAR57238.1| putative membrane protein [Burkholderia cenocepacia J2315] Length = 423 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 14/74 (18%), Positives = 32/74 (43%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++ + G II L + VM+G G+ +D+ + L+ +A ++A+ L Sbjct: 11 TRRGLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS 70 Query: 71 SLEEVSSRAKNSFT 84 ++ + A Sbjct: 71 AISLSVAEADGIAA 84 >gi|50842462|ref|YP_055689.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427042|ref|ZP_06428758.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130539|ref|YP_003581202.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840064|gb|AAT82731.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159511|gb|EFD07699.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375600|gb|ADD99454.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|332675379|gb|AEE72195.1| hypothetical protein PAZ_c10200 [Propionibacterium acnes 266] Length = 318 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R V T Sbjct: 107 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVVVPPT-----TDRAAVSTAI 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 156 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 212 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQL 271 Query: 438 NKIFRD 443 + +++ Sbjct: 272 SDVYKS 277 >gi|1483187|dbj|BAA07602.1| inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) [Homo sapiens] Length = 930 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + + + F + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|119585668|gb|EAW65264.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_a [Homo sapiens] Length = 930 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + + + F + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|31542984|ref|NP_002209.2| inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor [Homo sapiens] gi|229463048|sp|Q14624|ITIH4_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Plasma kallikrein sensitive glycoprotein 120; Short=Gp120; Short=PK-120; Contains: RecName: Full=70 kDa inter-alpha-trypsin inhibitor heavy chain H4; Contains: RecName: Full=35 kDa inter-alpha-trypsin inhibitor heavy chain H4; Flags: Precursor gi|1402590|dbj|BAA07536.1| PK-120 precursor [Homo sapiens] Length = 930 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + + + F + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|262050538|ref|NP_001159921.1| inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 precursor [Homo sapiens] Length = 900 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + + + F + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|240137440|ref|YP_002961911.1| hypothetical protein MexAM1_META1p0705 [Methylobacterium extorquens AM1] gi|240007408|gb|ACS38634.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 473 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 79/482 (16%), Positives = 151/482 (31%), Gaps = 59/482 (12%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G ++ AL + +G+ G+ VD + L AA A + V + + Sbjct: 14 LASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIA 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + ++ + + K + L N K F + + + +Y + Sbjct: 74 ANAQQSDVTASGIKAGESQALKAFNANASKVPFATVSLSQLEIVRTGQTLDATVSYTATV 133 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S F R G+ + + + A SY + ++D S SM Sbjct: 134 QST----------FGRIFGLSATTLTNRVNASVDLASY-----LDFYLMVDVSGSMGLPT 178 Query: 194 RDSEGQPL---------NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 +DS+ + L NC ++ GK+ +R + ++ + K + Sbjct: 179 KDSDAEVLAMQSKEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNNAVCELLKRASTPV 238 Query: 245 YPGPLDPSLS--EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN--DTVRMGAT 300 P + + L +R A K+ + + + G+T Sbjct: 239 VPNQYRIGIYPFINQLATLAPLTDTTTSLAALRTAA-----QCDKVWPLAFTNLLDTGST 293 Query: 301 -FFNDRVISDPSFSWGVH--KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F + + S G H + + T N ST + D + +S Sbjct: 294 QLFTNNDPKTGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDGMQNSQSYS 353 Query: 358 VHRMKN--NLEAKKYIVL-LTDGENTQDNEEGIAICNKAKSQGIRI------MTIAFSVN 408 + + K+ D +Q + + C K+ G I I + N Sbjct: 354 AWKDTKTFSGNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIPYNIVKNYN 413 Query: 409 KT-----QQEKARYF-------LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + F L CASP F+ AN+ ++ + RV RI Sbjct: 414 NDSYIVWENGRVNQFSPTLADPLRKCASPGFFYTANTQDDITASLGAMFKQAL--RVARI 471 Query: 457 TK 458 T+ Sbjct: 472 TQ 473 >gi|198421589|ref|XP_002123523.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 43/292 (14%), Positives = 103/292 (35%), Gaps = 22/292 (7%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 + ++ S + + +++ V + D + + N + + K+ Sbjct: 118 SIVSTSQETSVAGNPLQALTMSMV----GASWDEGQSNLVVCNNMWTRNCKDAYKT---- 169 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF--VDSSSLRHVIKKKHLVRD 276 + + + N+ + L P + F V S + +V+ Sbjct: 170 --EGVCYVTNTTSELNVLNRPKWKQLSPCKRECYRKMIDFMFVVDGSRSVGNESFEVVK- 226 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 I+ + +++ +V++G ++ + ++ K + + + Sbjct: 227 ---HWIQQVTSGFDISSSVQVGVVQYSTYQYRKVVQPFIKTEIRLGEYKDHILFDAAVDK 283 Query: 337 TAIND-AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 +D + TAY + NED M ++++ +VLLTDG++T D E+ + +AK Sbjct: 284 IKYHDRSTFTAYAIRKTVNEDFKGNMSRYPDSRRVMVLLTDGQST-DKEDLSSAAAEAKQ 342 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 +G+ + + + SP+ N +EL I G+ Sbjct: 343 EGVETFAVGV----GSKIILSELVLIAGSPDKVITVNDFNELLGIVNQLQGD 390 >gi|165924896|ref|ZP_02220728.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|167418907|ref|ZP_02310660.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167425152|ref|ZP_02316905.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270487722|ref|ZP_06204796.1| conserved hypothetical protein [Yersinia pestis KIM D27] gi|165923096|gb|EDR40247.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|166962901|gb|EDR58922.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167055915|gb|EDR65696.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|262360782|gb|ACY57503.1| membrane protein [Yersinia pestis D106004] gi|270336226|gb|EFA47003.1| conserved hypothetical protein [Yersinia pestis KIM D27] Length = 492 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 73/487 (14%), Positives = 132/487 (27%), Gaps = 99/487 (20%) Query: 29 LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQ 88 L+PV +G+ + + + L A + A + S + Sbjct: 6 LIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALST---------------ENNYRND 50 Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA-YQVVLSSRYDLLLNPLSLF 147 + + +++ E R Y L Y L L L+ + Sbjct: 51 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLAL--LNSY 108 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 L+ +W + A S + + + +V DFS SM D E Sbjct: 109 LKQTPSPTWDVNENGAARKYLSSIAE--PIDVVFVTDFSGSMDLPFGDIERNNRITKLDE 166 Query: 208 ADRT-VKSYSSQNGKVGIRDEKLSPYMVSCNK--------SLYYMLYPGPLDPSLSEEHF 258 VK + GI P+ + S Y P + + Sbjct: 167 LKAIFVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYL 226 Query: 259 -------------VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND---TVRMGATFF 302 +D+ S + L D +++ I+K N+ R A F Sbjct: 227 QRYTASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNF 286 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM- 361 D+ S + I I+ + I+ M D E + Sbjct: 287 LDKAYKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLN 346 Query: 362 ----------------------------------KNNLEAKKYIVLLTDGENT------- 380 + K +++L+DG++ Sbjct: 347 FDPNSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSP 406 Query: 381 QDNEEGI----------AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 D + GI +C K K GI+++ I + +C +F+ Sbjct: 407 HDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNIIDWEKDCVGTGNFYL 464 Query: 431 ANSTHEL 437 A + HEL Sbjct: 465 AKNAHEL 471 >gi|326335929|ref|ZP_08202106.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691893|gb|EGD33855.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 348 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 57/168 (33%), Gaps = 38/168 (22%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ + + + + ++ D TAI +A+QTA + SN+ Sbjct: 130 RIAFIPYAAQAYPQLPLTSDYS-AAKIFLEAINTDMLSSQGTAIGEAIQTAINYFEDSNQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---- 411 K +++L+DGE+ Q + + + K +GIRI TI + Sbjct: 189 SS-----------KILIILSDGEDHQ--QGATEMIQEVKEKGIRIFTIGLGTTQGTTIPI 235 Query: 412 ------------------QEKARYFLSNCAS--PNSFFEANSTHELNK 439 + + L A +F+ ++T ++ Sbjct: 236 KENGQTFPKRDKDGEVVITKLNQALLEEIAQEGDGKYFDGSNTQQVID 283 >gi|254559618|ref|YP_003066713.1| hypothetical protein METDI1076 [Methylobacterium extorquens DM4] gi|254266896|emb|CAX22695.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 473 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 79/482 (16%), Positives = 150/482 (31%), Gaps = 59/482 (12%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G ++ AL + +G+ G+ VD + L AA A + V + + Sbjct: 14 LASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIA 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + ++ + + K + L N K F + + + + +Y + Sbjct: 74 ANAQQSDVTASGIKAGESQALKAFNANASKVPFATVSLSQLEIVRSGQTLDATVSYTATV 133 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S F R+ G+ + + + A SY + ++D S SM Sbjct: 134 QST----------FGRTFGLSATTLTNRVNASVDLASY-----LDFYLMVDVSGSMGLPT 178 Query: 194 RDSEGQPL---------NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 +DS+ + L NC ++ GK+ +R + ++ + K + Sbjct: 179 KDSDAEALAMQSKEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNNAVCELLKRASTPV 238 Query: 245 YPGPLDPSLS--EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN--DTVRMGAT 300 P + + L +R A KI + + + G+T Sbjct: 239 VPNQYRIGIYPFINQLATLAPLTDTTTSLAALRTAA-----QCDKIWPLAFTNLLDTGST 293 Query: 301 -FFNDRVISDPSFSWGVH--KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F + + S G H + + T N ST + D + +S Sbjct: 294 QLFTNNDPKTGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDGMQNSQSYS 353 Query: 358 VHRMKN--NLEAKKYIVL-LTDGENTQDNEEGIAICNKAKSQGIRI------MTIAFSVN 408 + + K+ D +Q + + C K+ G I I + N Sbjct: 354 AWKDTKTFSGNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIPYNIVKNYN 413 Query: 409 KTQQ------------EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 L CASP F+ AN+ ++ + RV RI Sbjct: 414 NDSYIVWENGRVNQFSPTLADPLRKCASPGFFYTANTQDDITASLGAMFKQAL--RVARI 471 Query: 457 TK 458 T+ Sbjct: 472 TQ 473 >gi|329888194|ref|ZP_08266792.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] gi|328846750|gb|EGF96312.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] Length = 655 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/288 (14%), Positives = 89/288 (30%), Gaps = 45/288 (15%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 M++Y +P + A + S + I L Y + Sbjct: 233 RVEEMINYFDYGYTRPTSAARPFAVTATTTASPWSEGRRIVHVGLQGYEL---------- 282 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L+ VD S + K L + ++ +I ++ D R+ ++ + Sbjct: 283 -PENQRRPLNLTFLVDVSGSMNSPDKLDLAKQSMNLIIDRLRPQD------RVAVAYYAE 335 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + + G KL ++ G TA M AY D+ Sbjct: 336 GAGTTLAPTAGTQKLK---LRCAVASLRASGGTAGATGMTNAY--------DQAQASFGR 384 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSN 421 + + ++ TDG+ + + + + GI + F Q + + Sbjct: 385 NKVNRILMF-TDGDFNVGVTDDKRLEDYVADKRRTGIYLSVYGFGRGNYQDARMQAIAQA 443 Query: 422 CASPNSFFEANSTHELNKIFRD-------RIGNEIFERV----IRITK 458 ++ + E ++F I +++ +V RIT+ Sbjct: 444 GNGTAAY--VDDLKEARRLFGPMFDRGAFPIADDVKIQVEFNPARITE 489 >gi|260781661|ref|XP_002585922.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] gi|229270990|gb|EEN41933.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] Length = 4065 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 79/246 (32%), Gaps = 29/246 (11%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLY---YMLYPGPLDPSLSEEH-FVDSSSLRHVI 268 + NG R + + C + + Y + +L+ + F+ S Sbjct: 1543 NNSPCLNGGTCQRVGLTTEFTCLCPEGYHGPICQFYAACSNRTLNLDVVFLLDGSGSVGS 1602 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 L++ + + D + R+G ++D+ S+ ++ A Sbjct: 1603 ANFDLLKTFTTRIATNF---DVSTNLTRVGVVQYSDQTNSEFVL--NTFSTEAEVLAAIA 1657 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T+ A+ + S + N +++LTDG +++ Sbjct: 1658 AISYQNGGTSTGAALDYVRQNVFISASGDRPDAAN------ILIVLTDG---VSSDDVSF 1708 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGN 447 A++ GI I ++ L A PN +A L D IG Sbjct: 1709 PAMAARNAGITIYSVGI-----GDGVDYNTLQQIAGDPNKVLQATGFSSL-----DDIGG 1758 Query: 448 EIFERV 453 ++ E V Sbjct: 1759 QLEELV 1764 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 40/299 (13%), Positives = 89/299 (29%), Gaps = 22/299 (7%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG--VSIQWVIDFSRSMLDYQRDSEGQ 199 N L + + S +A +Y G ++ ++ + + Sbjct: 239 NVLIVITDGISADSVDAPAEAARADNITTYSIGIGDEINYLTLLSIAGMRERVLNVTTFG 298 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 LN + + + + + + + + + P + + F+ Sbjct: 299 DLNDLDEVLLQILCERPTTTPLPTTTPLPTTELITTEEATTMLLTTVEPCERLEIDVIFL 358 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + L++ L ++ D +D R+G ++ V ++ + Sbjct: 359 IDGSSSISLLNFDLLKTFLQNITMKF---DVSSDITRIGVVQYSTDVNTEFELK--TYAT 413 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTA-YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 ++ + + GST I + ++ + D A +V +TDG Sbjct: 414 EAEVIHAISNITRQRGSTFIGAGINFVRTNSFTVAAGDRP-------LAPNILVTITDGI 466 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + D A+ QGI +I LS + + F S EL Sbjct: 467 SADDVAGP---AQAARDQGILTYSIGIGEEIQWPT----LLSIAGARHRVFNVTSFSEL 518 >gi|220924567|ref|YP_002499869.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] gi|219949174|gb|ACL59566.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] Length = 407 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 58/173 (33%), Gaps = 14/173 (8%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + G +I +++PV+LG +++D + L+ AA +A + A L S Sbjct: 7 RFRADADGTMTMIVGIVLPVLLGTAAVVLDGANLHLSQLRLQNAADSAALGAVQVLPDSA 66 Query: 73 EEVSSRAKN-SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 VS P + + AV++ R+SA Sbjct: 67 TAVSRGVSLVGQNLPPSYGTAAAATDVVVGTYDPGAKAFTAGGAQPNAVKVTARRSAALG 126 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 Y F G +S A+AE+V+ + + + + +D Sbjct: 127 NAIPVY---------FAWIFGYRSLE----AKAESVAVAAGGGNPAACLYALD 166 >gi|322689979|ref|YP_004209713.1| cell surface protein [Bifidobacterium longum subsp. infantis 157F] gi|320461315|dbj|BAJ71935.1| putative cell surface protein [Bifidobacterium longum subsp. infantis 157F] Length = 794 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/279 (12%), Positives = 75/279 (26%), Gaps = 70/279 (25%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL--NCFGQPAD 209 G + + + + I V+D S SM + QP + + Sbjct: 76 GTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVN 135 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + + +++N K+ + ++ +V + Sbjct: 136 KFINATAAENAKITDQSQRNRIALVKFAGTE----------------------------- 166 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ ND R G + +N + ++ V L T+ Sbjct: 167 -----KTSVG------------NDFYREGWSSYN-YTQIVSNLTYDVSGLTSTV-----N 203 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN------ 383 + G+T+ + A A + KK ++ TDGE + Sbjct: 204 GLSASGATSADYAFNRAQAALTYQPRANA---------KKVVIFFTDGEPNHGSGFDPTV 254 Query: 384 -EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + K G I +I + + Sbjct: 255 AATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKY 293 >gi|172065275|ref|YP_001815987.1| hypothetical protein BamMC406_5998 [Burkholderia ambifaria MC40-6] gi|171997517|gb|ACB68434.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 423 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/127 (14%), Positives = 45/127 (35%), Gaps = 1/127 (0%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++ + G II L + VM+G G+ +D+ + L+ +A ++A+ L Sbjct: 11 TRRGLHRQQGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS 70 Query: 71 SLE-EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ +V+ + + + + F+D + TAV Sbjct: 71 AISLQVAEADGIAAGHANFAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVATPANVKYV 130 Query: 130 QVVLSSR 136 + Sbjct: 131 KCTAQLS 137 >gi|253582981|ref|ZP_04860199.1| magnesium chelatase [Fusobacterium varium ATCC 27725] gi|251835187|gb|EES63730.1| magnesium chelatase [Fusobacterium varium ATCC 27725] Length = 632 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 83/217 (38%), Gaps = 28/217 (12%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 K + + S VDSS V K+ V+ A+ S+++ D R+ Sbjct: 435 KKEHIRVKVREKRTGASILFVVDSSGSMGVKKRMEAVKGAVMSLLK-----DAYEKRDRV 489 Query: 298 GATFFN-DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G F D+ + + + + K G T + + + AY I ++ Sbjct: 490 GMVSFRRDKAEELLPITRSIDLAQKKLEK-----LATGGKTPLAEGIAKAYTII----KN 540 Query: 357 EVHRMKNNLEAKKYIVLLTDGENT------QDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 E+ + K + IV L+DG+ +E + + K K++GIR + I Sbjct: 541 EMRKDKEVVPL---IVFLSDGKGNFSASGKDPVKESLEMAEKIKNEGIRAIVIDTEEGFI 597 Query: 411 QQEKARYFLSNCASPNSFFEANS--THELNKIFRDRI 445 + E A+ A +++ + + ++ K+ +D I Sbjct: 598 KLEMAKTLSE--AMKAEYYKLENLRSEDMLKLIKDNI 632 >gi|238782874|ref|ZP_04626903.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] gi|238716297|gb|EEQ08280.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] Length = 530 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 70/518 (13%), Positives = 164/518 (31%), Gaps = 87/518 (16%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 FI S+K IK+ G + +++P + + + ++ + + L A + A + +V Sbjct: 26 FIIISRKFIKNDKGAILLPFIIILPFFIALLFLSFEISQLLQKKAKLSDAIEQATLALTV 85 Query: 67 PLIQSLEEVSSRAKNS--------------FTFPKQKIEE--------------YLIRNF 98 +E+ R F+ P+ I+ Y Sbjct: 86 ENDDLPDELQMRKNVDLVSNFSSAYLPLEHFSVPEIDIKNNCGQLTYNAKITMSYFANFL 145 Query: 99 ENNLKKNF-----TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 N T+ I + + ++ A V+ + Y +N Sbjct: 146 SKTAMTNAITTIGTEDNGAAIKQVSTIQ----DKATDVIFVADYSGSMN---EGFHGKVP 198 Query: 154 KSWLIQTKAEAET-VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + I + ++ S K +++ + FS ++ + C + Sbjct: 199 RGEKINALRDVFNRLNGSILKNSNINLIGFVPFSWGTKRIVIENSQEKKYCHFPFVPKQY 258 Query: 213 ---KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 +Y Q G++ + +K Y L G + + + + + K Sbjct: 259 RADNNYFRQYTVSGLKKFPGLEGLTDIDKINYGELTLGEYNTLTNVIK--NMAKQEYRNK 316 Query: 270 KKHLVRDALA---------------SVIRSIKKIDNVNDTVRMGAT-FFNDRVISDPSFS 313 +R L +IK I++ + + N+ + + S + Sbjct: 317 ALEFLRITLNIPTYMQQMIFITTTIDYDATIKSINSDAQYIDIPLDDIINESICLNNSNA 376 Query: 314 WGV-HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + + I + +G T ++ + A +T+ + + K +V Sbjct: 377 YSLDSHNSHDDLIDKMIAMSPLGQTLVSSGILYA-NTLFKKESNNSNN--------KLMV 427 Query: 373 LLTDGENTQDNEEGIA-------------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +++DG + N+ I +C + K I+++ IA + + ++ Sbjct: 428 IISDGIDVFINDTTIQQSIYISKTLIDKGMCERIKENNIKMVFIAIKDGSNETNEPANYI 487 Query: 420 --SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 C ++++ + HEL R + E V R Sbjct: 488 DWKKCVGEDNYYYVSDAHELEAALRQSLTTTSSEVVGR 525 >gi|260824533|ref|XP_002607222.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] gi|229292568|gb|EEN63232.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] Length = 1897 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 61/197 (30%), Gaps = 14/197 (7%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + SS N G V S + P P S ++ H + S + Sbjct: 1540 TQIYVSSSSNRVWGCGLSHHIYLRVGITWSQGPVDIPAPTCRSKADIHVLVDGSKSVKTR 1599 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 VR + + + N R G F + ++ + + I Sbjct: 1600 NFPAVRQFILKLAAGFEIGPNKA---RFGVYQFAKDMQTEFKM-NQYNNREALLDAIKKI 1655 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + T +++ Y+ +N +K I+L+TDG+ T + Sbjct: 1656 EYMNQYQTKTGQSLKAVYEEFTKANGAR-------DGVEKIIILITDGKATDQVRQPAQY 1708 Query: 390 CNKAKSQGIRIMTIAFS 406 K++G + T+ + Sbjct: 1709 ---VKNKGAHVFTVGVA 1722 >gi|254466920|ref|ZP_05080331.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687828|gb|EDZ48310.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 550 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 68/406 (16%), Positives = 131/406 (32%), Gaps = 79/406 (19%) Query: 12 KKLIKSCTGHFF--IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + ++ G +I L ML VGG+ VD++R L+ A++ A+ Sbjct: 24 RSFLREEDGVLAKPMIGTFLA--MLAVGGIGVDLMRMERDRTELQYTLDRAVLAAADLDQ 81 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + +YL + L++ ++D + + + Y Sbjct: 82 SLDADAV-------------VLDYLTK---AGLEQYYSDPD------------DQKGLGY 113 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + D + L+ G + + + AE + V I V+D S SM Sbjct: 114 K-SVEATIDTDF--EAYLLKFAGGDNMSLYANSRAEEIIG------SVEISMVLDISGSM 164 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 R Q + + N + I S Y + Sbjct: 165 NSGNRLVNLQAAAK--SFVTQITSNTDVSNLSISIIPYATQVNAGEKLLSKYTKVSQEH- 221 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDA-LASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 D S D S + + + L+R A + S+ ID R G+ + Sbjct: 222 DYSYCVNFIKDQFSKHTLNQNEDLIRTAHFDTFTYSMNMIDRPVCPTRPGS--------A 273 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F+ KL I G+T+I+ M+ + + + V+ + ++ Sbjct: 274 ILPFTNDAAKLHAYIDS-----LTASGNTSIDIGMKWGSALLDPTAQPVVNALVDDKVIS 328 Query: 369 ----------------KYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 K I+L++DG+NT + N+ + GI Sbjct: 329 ENFRGRPKAYGSGDTLKIIILMSDGQNTNQ-----YMVNEHRRDGI 369 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 7/76 (9%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 ++ IC+ K QG+ + ++ F L +CAS + F E++ F Sbjct: 482 DQHTKTICDITKDQGVIVYSVGFEA----PSAGIKVLEDCASSPAHFFDVEGLEISDAF- 536 Query: 443 DRIGNEIFERVIRITK 458 I I R +R+T+ Sbjct: 537 SSIATSI--RQLRLTQ 550 >gi|187950343|gb|AAI36394.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Homo sapiens] Length = 930 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ I+LLTDG+ T +I N + + + F + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|218781309|ref|YP_002432627.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218762693|gb|ACL05159.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 336 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/118 (16%), Positives = 38/118 (32%), Gaps = 14/118 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSN 354 R+G F+ + + + + + T + A++ + Sbjct: 129 RLGLVAFSGVAFTQCPLTLDYQAIQMFLDQLTVDLLPLRFQGTDLGAAIEMGMTAFDPKS 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K I+L+TDGE+ ++ G+ KA +GIRI + Sbjct: 189 ST-----------DKVILLITDGEDNEE--AGLKAAEKASDEGIRIFVLGIGDPAGGP 233 >gi|30794326|ref|NP_851361.1| epithelial chloride channel protein [Bos taurus] gi|2623763|gb|AAB86529.1| Lu-ECAM-1 [Bos taurus] Length = 905 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 76/257 (29%), Gaps = 28/257 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK-VGIRDEKLSPYMVSCNKSLYYMLYP 246 S+ + N + + S + D + + M N + Sbjct: 243 SLHSVTEFCTEKTHNTEAPNLQNKMCNGKSTWDVIMNSVDFQNTSPMTEMNPPTHPTFSL 302 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + S S+ + + + A +I+ I+K V G F+ Sbjct: 303 LKSKQRVVCLVLDKSGSMSAEDRLFQMNQAAELYLIQVIEKGSLV------GMVTFDSVA 356 Query: 307 ISDPSFSWGVH-KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + I + G T+I ++ + II S++ Sbjct: 357 EIQNHLTRITDDNVYQKITAKL--PQVANGGTSICRGLKAGFQAIIHSDQSTSGSE---- 410 Query: 366 EAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 I+LLTDGE+ + N C K G I TIA + K LSN Sbjct: 411 -----IILLTDGEDNEINS-----CFEDVKRSGAIIHTIALGPSA---AKELETLSNMTG 457 Query: 425 PNSFFEANSTHELNKIF 441 FF L F Sbjct: 458 GYRFFANKDITGLTNAF 474 >gi|301758046|ref|XP_002914871.1| PREDICTED: LOW QUALITY PROTEIN: vitrin-like [Ailuropoda melanoleuca] Length = 686 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 53/151 (35%), Gaps = 17/151 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+GA + F + + I+ G T+ A+ A + + + Sbjct: 540 TRVGAVQYT--YEQRLEFGFDDYHTKSDILNAIKRVGYWSGGTSTGAAINYALEQLFKKS 597 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + +K ++L+TDG + ++ A +G+ I + Q++ Sbjct: 598 KPNK---------RKLMILITDG---RSYDDVRIPAMVAHHKGVTTYAIGVA--WAAQDE 643 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 ++ AS +SFF + L K I Sbjct: 644 LEVIATHPASDHSFF-VDEFDNLYKFVPKVI 673 >gi|56797859|emb|CAG27402.1| matrilin-3b [Danio rerio] Length = 434 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 73/218 (33%), Gaps = 26/218 (11%) Query: 246 PGPLDPSLSEE---HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P P +P S F+ SS + V+ L+ ++ S+ D +D R+ + Sbjct: 192 PAPAEPCKSRPLDLVFIIDSSRSVRPAEFEKVKIFLSEMVDSL---DIGSDATRVALVNY 248 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 V + K + T A++TA D + + N K Sbjct: 249 ASTVNIEFHLKKYFSKAEVKQAFSRIDPL--STGTMTGMAIKTAMDQVFTENAGARPLKK 306 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K +++TDG + ++ + A++ GI I + + + L Sbjct: 307 G---IGKVAIIVTDG---RPQDKVEEVSAAARASGIEIYAVGVDRAEMRS------LKQM 354 Query: 423 AS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 AS + F + +L FR+ + E + Sbjct: 355 ASQPLDDHVFYVETYGVIEKLTSKFRETLCEEARADLT 392 >gi|296489197|gb|DAA31310.1| epithelial chloride channel protein [Bos taurus] Length = 905 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 76/257 (29%), Gaps = 28/257 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK-VGIRDEKLSPYMVSCNKSLYYMLYP 246 S+ + N + + S + D + + M N + Sbjct: 243 SLHSVTEFCTEKTHNTEAPNLQNKMCNGKSTWDVIMNSVDFQNTSPMTEMNPPTHPTFSL 302 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + S S+ + + + A +I+ I+K V G F+ Sbjct: 303 LKSKQRVVCLVLDKSGSMSAEDRLFQMNQAAELYLIQVIEKGSLV------GMVTFDSVA 356 Query: 307 ISDPSFSWGVH-KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + I + G T+I ++ + II S++ Sbjct: 357 EIQNHLTRITDDNVYQKITAKL--PQVANGGTSICRGLKAGFQAIIHSDQSTSGSE---- 410 Query: 366 EAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 I+LLTDGE+ + N C K G I TIA + K LSN Sbjct: 411 -----IILLTDGEDNEINS-----CFEDVKRSGAIIHTIALGPSA---AKELETLSNMTG 457 Query: 425 PNSFFEANSTHELNKIF 441 FF L F Sbjct: 458 GYRFFANKDITGLTNAF 474 >gi|311252833|ref|XP_003125290.1| PREDICTED: vitrin-like isoform 2 [Sus scrofa] Length = 634 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 52/416 (12%), Positives = 124/416 (29%), Gaps = 49/416 (11%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSLEEVSSRAKNSFTFPKQK 89 L G + R+ + L AQ I + PL +Q + +++ + Sbjct: 252 FLIDGSSSIGKRRFRIQKQFLADVAQALDIGPAGPLMGVVQYGDNPATQFNLKTHMNSRD 311 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 ++ + + + N + R A V + + Sbjct: 312 LKTAIEKITQRGGLSNVGRAISFVTKNFFSKSNGNRGGAPNVAV--------------VM 357 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 G + ++ + +E G++I ++ + Q E N + Sbjct: 358 VDGWPTDKVEEASR-------LARESGINIFFITIEGAVENEKQYVLEPNFANKAVCRTN 410 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + L + ++ + S +D SS Sbjct: 411 GFYSLNVQNWYGLHKSVQPLVKRVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTGN 466 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ +A++ + D R+GA + F + + ++ Sbjct: 467 FRTVLQF-VANLSKEFDISDTD---TRVGAVQYT--YEQRLEFGFDQYTTKPDVLNAIKR 520 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+ A+ A + + ++ +K ++L+TDG + ++ Sbjct: 521 VGYWSGGTSTGAAINYALEQLFKKSKPNK---------RKLMILITDG---RSYDDVRIP 568 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A +G+ I + QE+ ++ A ++FF + L K I Sbjct: 569 AMVAHHKGVITYAIGVA--WAAQEELEIIATHPARDHAFF-VDEFDNLYKSVPKII 621 >gi|312133821|ref|YP_004001160.1| von willebrand factor (vwf) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773110|gb|ADQ02598.1| Von Willebrand factor (VWF) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] Length = 794 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/279 (12%), Positives = 75/279 (26%), Gaps = 70/279 (25%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL--NCFGQPAD 209 G + + + + I V+D S SM + QP + + Sbjct: 76 GTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVN 135 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + + +++N K+ + ++ +V + Sbjct: 136 KFINATAAENAKITDQSQRNRIALVKFAGTE----------------------------- 166 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ ND R G + +N + ++ V L T+ Sbjct: 167 -----KTSVG------------NDFYREGWSSYN-YTQIVSNLTYDVSGLTSTV-----N 203 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN------ 383 + G+T+ + A A + KK ++ TDGE + Sbjct: 204 GLSASGATSADYAFNRAQAALTYQPRANA---------KKVVIFFTDGEPNHGSGFDPTV 254 Query: 384 -EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + K G I +I + + Sbjct: 255 AATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKY 293 >gi|310829014|ref|YP_003961371.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] gi|308740748|gb|ADO38408.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] Length = 684 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/227 (12%), Positives = 65/227 (28%), Gaps = 56/227 (24%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K +++D I + + + ++ + + + Sbjct: 93 TKMEVLKDTAKDFITQLSAQSPASQV---SVVSYASDSKTNIGLTSLDTQENIQSLNRAI 149 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G+T + ++ AY + +++ K++++ LTDGE + Sbjct: 150 DKLWASGATRSDLGLEDAYSVLGAADSGN----------KQFVIFLTDGEPNSYSGFDRE 199 Query: 389 IC-------------NKAKSQG----------------------------IRIMTIAFSV 407 I + K G I +I Sbjct: 200 IAARAESTASIIKGEDLIKRDGRIFGDYDGSLDDGSSHNPDWEFEGDPLSAEIFSIGILK 259 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + Q + +L+ S +S A++ L IF + I ++I Sbjct: 260 SWSSQ-RVHDYLNYIDSQHSAALADTAQALQDIF-EAITHQIAVTAT 304 >gi|255566346|ref|XP_002524159.1| protein binding protein, putative [Ricinus communis] gi|223536577|gb|EEF38222.1| protein binding protein, putative [Ricinus communis] Length = 514 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 59/180 (32%), Gaps = 28/180 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ A+ +I+ + ID R+ F+ + + + ++ Sbjct: 78 KISKVKTAMLFMIKKLSSID------RLSIVTFSGDARRLCPL-RQISETSQRELENLIN 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+T I ++T + + R+ I+L++DGE + Sbjct: 131 GLKAEGATNITAGLKTGLNVLND------RRLSGGRVVG--IMLMSDGEQNAGGDAA--- 179 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIG 446 + + + T F +N L A +F + +T L+K F + Sbjct: 180 --QVPVGNVPVHTFGFGINHEP-----RVLKAIAQNSVGGTFSDVQNTDNLSKAFSQCLA 232 >gi|87200512|ref|YP_497769.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] gi|87136193|gb|ABD26935.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] Length = 631 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/120 (25%), Positives = 52/120 (43%), Gaps = 35/120 (29%) Query: 363 NNLEAKKYIVLLTDGENTQD----------------NEEGI------------AICNKAK 394 N E ++I+ +TDG+ + ++G+ A+C+ AK Sbjct: 517 NGGEVSRHIIFMTDGQMDTNYKVMSTYGIEWHDRRITDDGVTDQDARHTLRFRALCDAAK 576 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 ++G R+ IAF+ + LS CAS +S F A + ELN F++ I + E + Sbjct: 577 AKGFRVWVIAFASDLNDD------LSYCASASSTFPATNATELNTAFQE-IAKNVAELRV 629 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 76/235 (32%), Gaps = 31/235 (13%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 + A +PV++ + G +D+ R + L+ A + + + + +++A+ + Sbjct: 4 LAATCVPVLILLIGSGLDMGRLYKARNRLQSACDAGALAGRRSVSSAGYDDAAKAQAAAF 63 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 F E+ L N + + + + T VEM Sbjct: 64 FNANFNEDDLGATETNFATSSADGGSLVEGIATTDVEM---------------------- 101 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 + + G+ S I + A + + V+D + SM + + ++ Sbjct: 102 -VLMNLFGVISVPINVECSATMDIGNTD------VTMVLDTTGSMSQTLSGTTTKRIDAL 154 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKL-SPYMVSCN-KSLYYMLYPGPLDPSLSEEH 257 + S+ R PY S N L Y L P L + + + Sbjct: 155 RTAMKNFYDTVSAATTGSNARVRYSFVPYSSSVNVGQLIYDLDPDYLVDTWAIQS 209 >gi|46190503|ref|ZP_00121395.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440499|ref|YP_001955580.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] gi|189428934|gb|ACD99082.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] Length = 794 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/279 (12%), Positives = 75/279 (26%), Gaps = 70/279 (25%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL--NCFGQPAD 209 G + + + + I V+D S SM + QP + + Sbjct: 76 GTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVN 135 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + + +++N K+ + ++ +V + Sbjct: 136 KFINATAAENAKITDQSQRNRIALVKFAGTE----------------------------- 166 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ ND R G + +N + ++ V L T+ Sbjct: 167 -----KTSVG------------NDFYREGWSSYN-YTQIVSNLTYDVSGLTSTV-----N 203 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN------ 383 + G+T+ + A A + KK ++ TDGE + Sbjct: 204 GLSASGATSADYAFNRAQAALTYQPRANA---------KKVVIFFTDGEPNHGSGFDPTV 254 Query: 384 -EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + K G I +I + + Sbjct: 255 AATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKY 293 >gi|23466092|ref|NP_696695.1| hypothetical protein BL1539 [Bifidobacterium longum NCC2705] gi|322691915|ref|YP_004221485.1| cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] gi|23326823|gb|AAN25331.1| hypothetical protein with gram positive cell wall anchoring domain [Bifidobacterium longum NCC2705] gi|320456771|dbj|BAJ67393.1| putative cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 794 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/279 (12%), Positives = 75/279 (26%), Gaps = 70/279 (25%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL--NCFGQPAD 209 G + + + + I V+D S SM + QP + + Sbjct: 76 GTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVN 135 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + + +++N K+ + ++ +V + Sbjct: 136 KFINATAAENAKITDQSQRNRIALVKFAGTE----------------------------- 166 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ ND R G + +N + ++ V L T+ Sbjct: 167 -----KTSVG------------NDFYREGWSSYN-YTQIVSNLTYDVSGLTSTV-----N 203 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN------ 383 + G+T+ + A A + KK ++ TDGE + Sbjct: 204 GLSASGATSADYAFNRAQAALTYQPRANA---------KKVVIFFTDGEPNHGSGFDPTV 254 Query: 384 -EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + K G I +I + + Sbjct: 255 AATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKY 293 >gi|292627943|ref|XP_695559.4| PREDICTED: integrin alpha-11 [Danio rerio] Length = 1104 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 58/163 (35%), Gaps = 17/163 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + +RV+++ + + ID+ T + A Sbjct: 186 IQVGVVQYGERVVNEFRLD-DFRTVDEVVAAAKNIDQRGGEETRTALGINVARTQAFKHG 244 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 +AKK ++++TDGE + D+ + A +++ I + IA + Sbjct: 245 --------GRPDAKKVMIVITDGE-SHDSPDLKAAVEESEKDNITLYGIAVLGYYNRRGI 295 Query: 412 -QEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + A+ FF L I D +G +IF Sbjct: 296 NPEAFLREIKFIATDPDEHFFSVTDESALKDI-VDALGEKIFS 337 >gi|120616160|gb|ABG80452.1| collagen [Hydra vulgaris] Length = 2439 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 60/168 (35%), Gaps = 15/168 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I KI+ + VR+G ++D + + + K + T I+ ++ Sbjct: 1867 IDKIELSSSGVRVGVLTYSD--EAKIRIRFDYSFDKEDVKKAIDNIPYDSMGTRIDLGLE 1924 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ--DNEEGIAICNKAKSQGIRIMT 402 A + + + +KK ++LLTDG+ T D ++ + + G+ I Sbjct: 1925 AAKELFLEKSG-------GRGSSKKVLILLTDGQQTYIPDAKDPVDYAKELAEYGVDIFA 1977 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 I ++ + +S P F ++ + L I + Sbjct: 1978 IGIG-DEINKVDLEDLISK---PQHIFLSDDINSLITDLSKDISTALS 2021 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 38/283 (13%), Positives = 86/283 (30%), Gaps = 39/283 (13%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 F D + G + + G+VG+R K + Sbjct: 1173 FGGITGDKGESGKNGTDGKPGSSGLPGQQGRKGEPGEVGLRGPDGVTERGEKGKEGEPGM 1232 Query: 245 -----YPGPLDP------SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 P P ++ F+ SS + ++ + +++ + Sbjct: 1233 PGKDGEDAPPPPGCFTFGDPTDIAFILDSSRSVTRDHFNRQKEFVKTILGEFPL---GEE 1289 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTI-VKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + G + ++G + I ++ + ++ A++ A++ + + Sbjct: 1290 LTQAGIIKYGRTAD--IEINFGEFLTQTDLFNAIDKIKHSQADESRLDLALKKAHEELFT 1347 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-------GIRIMTIAF 405 S +++ + +K IV+L DG + + AK + G+ I TI Sbjct: 1348 SQG-----ARSDKDIEKAIVILGDGYISGGGNRSRDLIESAKKEAAKLRELGVLIFTIGV 1402 Query: 406 SVNKTQQEKARYFLSNCASPNSFF-EANSTHELNKIFRDRIGN 447 L N AS +++ +L +IG Sbjct: 1403 GAEPNS-----LLLQNFASKKTYYITVKDYGQLI----GKIGA 1436 >gi|308511201|ref|XP_003117783.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] gi|308238429|gb|EFO82381.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] Length = 566 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 68/197 (34%), Gaps = 16/197 (8%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 P+ + + SS + ++ A +++ + I R+ F + Sbjct: 383 PVKDCRYDIGIIFDSSGSLEKNFQKQLKFAT-TLVEQMP-ISPNA--TRVAIIQFAGKTK 438 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + K + G+T N A++T + ++ ++K Sbjct: 439 LRVLADFAQKKSAAELKTIIGRSHFFSGTTFTNGALKT-MADLFQKSKRADAKLK----- 492 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +VL TDG E+ KSQG+ + T+ S K+ + SPN Sbjct: 493 ---VVLFTDG---YSAEDTSEGAEALKSQGVVVYTVGISTEKSTGLNMKELHGMATSPNH 546 Query: 428 FFEANSTHELNKIFRDR 444 FF A+ EL+K F Sbjct: 547 FFNASDFVELSKNFPSS 563 >gi|291386938|ref|XP_002709809.1| PREDICTED: vitrin [Oryctolagus cuniculus] Length = 869 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 60/168 (35%), Gaps = 21/168 (12%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 +A++ + + + R+GA + F + + I+ G T Sbjct: 709 VANLSKEFEISETD---TRIGAVQYT--YEQRLEFGFDKYNTKPDILNAIKRVGYWSGGT 763 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 + A+ A + + ++ +K ++L+TDG + ++ A +G Sbjct: 764 STGAAINYALEQLFKKSKPNK---------RKLMILITDG---RSYDDVRIPAMAAHHKG 811 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + I + Q++ + A +SFF + L K F RI Sbjct: 812 VITYAIGVA--WAAQDELEVIATYPAKDHSFF-VDEFDNLYK-FVPRI 855 >gi|269125771|ref|YP_003299141.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310729|gb|ACY97103.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 601 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 56/137 (40%), Gaps = 20/137 (14%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + R++++ + G T + + AY+ + S + +V++TD Sbjct: 478 TEAHRSLLRERLLGLTLSGGTGLYNTTAAAYEKMTGSRRGDAINA---------VVVMTD 528 Query: 377 GENTQDN----EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFE 430 G+N + + IA + + +R+ TI + ++ + L A+ + ++ Sbjct: 529 GKNERPGGLDLDGLIAKLGARREESVRVFTIGY-----GEDADQNVLRRIAEAADGAAYD 583 Query: 431 ANSTHELNKIFRDRIGN 447 ++ + + IF + I N Sbjct: 584 SSDPNTIGDIFTEVISN 600 >gi|163749961|ref|ZP_02157205.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161330235|gb|EDQ01216.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 648 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 70/221 (31%), Gaps = 23/221 (10%) Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 + TV+ S N ++ ++ Y + + + +D S Sbjct: 196 YQYSTPSTVEQPFSVNTELAPSPYNEHKMLLRIGLKGYEV--DKSQLGASNLVFLLDVSG 253 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + K L++ +L + + + + D+V+ V GA+ GV Sbjct: 254 SMNSRDKLPLLKTSLKMLSQQLSEQDHVSIVVYAGASGVVLD---------GVKGNDTQA 304 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + GST +Q AY + H ++ + ++L TDG+ Sbjct: 305 INQALNSLKAGGSTNGGAGIQQAYRL------AQKHFIQGGVNR---VILATDGDFNVGT 355 Query: 384 EEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + A+ + + QGI + T+ F Sbjct: 356 TDHQALMDLIAAKRDQGIALTTLGFGQGNYNDHLMEQLADK 396 >gi|162454786|ref|YP_001617153.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] gi|161165368|emb|CAN96673.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] Length = 381 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 55/174 (31%), Gaps = 24/174 (13%) Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + S+ +L D + + + + L I + Sbjct: 130 DLSGSMRAILDARASDLPGQPKL-----PRGKRLTRLDTAKLVLQDFISRRRTD------ 178 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + H L + + + + +TAI DA+ TA + Sbjct: 179 -RLGVVVFGKAAYVLSPPTLDYHLLTQMVSQMTLNVIDGS-ATAIGDALGTAVARL---- 232 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + + K ++LLTDG++ + A S G ++ TI + Sbjct: 233 -------RRSDAQSKVVILLTDGDSNAGAISPEYATHLATSLGAKVYTIQIGTD 279 >gi|47228042|emb|CAF97671.1| unnamed protein product [Tetraodon nigroviridis] Length = 1071 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/279 (14%), Positives = 87/279 (31%), Gaps = 28/279 (10%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 V D+S +R ++ Q + + + S +N + D L + Sbjct: 810 VKDYSG-----RRPNQEQLEDVVCKNDPKPGFSLVLENFAELLDDTFLQNLTSQICQDKK 864 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKK-KHLVRDALASVIRSIKKID-NVNDTVRMGA 299 Y P+ + S + V S V +K + + + + +VR+G Sbjct: 865 CPDYKCPISFTRSADILVMMDSSASVGQKNFETSKKFVRLLAERFLSAERQGGASVRVGL 924 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ + + + L+ + T + AM+ A ++ + Sbjct: 925 AQYSRIARLEAELTNNLTHLVHHT----EQAAFQNDGTNVLGAMEYAIRSLEGRGDTSGG 980 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 R K +VL +DG + E + + G+ + IA N+ + R Sbjct: 981 RKK--------LVLFSDGRSQAITEALLEKRAREVADAGVEVFVIAVG-NQVNEANLRTL 1031 Query: 419 LSNCASPN------SFFEANSTHELNK-IFRDRIGNEIF 450 +S + F L + +F + + Sbjct: 1032 VSRGRQDDISFAQRHLFRVPDYASLLRGVFHQTVSRRVS 1070 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 77/194 (39%), Gaps = 21/194 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM---GATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + I +K I + D + GA ++D VI + T+ + Sbjct: 27 IDQIKTFTTNFIDELKNIRHQCDRILTWNSGALHYSDEVILVGELM-DMQTQRSTLKTSI 85 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN----TQDN 383 + E T + A++ ++ H +N KYIV++TDG + Sbjct: 86 SGIEYIGKGTYTDCAIKRGLSELLIGGS---HYHEN-----KYIVVVTDGHPLTGYKEPC 137 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 N+A+ G+++ +A S ++ LS A+ +++ + + + +KI + Sbjct: 138 GGVQEAANEARQHGVKVFAVAIS-----PDQEDTRLSLIATDHNYRQNFTAADDSKITKM 192 Query: 444 RIGNEIFERVIRIT 457 + I ++R++ Sbjct: 193 DTIHTIINMIVRVS 206 >gi|194221225|ref|XP_001492399.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Equus caballus] Length = 883 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 49/317 (15%), Positives = 96/317 (30%), Gaps = 39/317 (12%) Query: 147 FLRSMGIKSWLIQTK------AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 GI + + + K+ VS + +D RS Sbjct: 188 IFEPQGISTLDAEASFVTNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------C 236 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 C + N + + ++ Y V + P L FV Sbjct: 237 PTCTDSLLNGDFTITYDVNRESPANVQIVNGYFV-------HFFAPQGLPVVPKNVVFVI 289 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF-FNDRVISDPSFSWGVHKL 319 S +K +DAL ++ +K+ D +N + G + D ++ + Sbjct: 290 DVSGSMYGRKIQQTKDALLKILEDVKEDDYLNFILFSGDVTTWKDNLVQATP-----ENI 344 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + F ++ + G T INDA+ + + E+ + I++LTDG+ Sbjct: 345 QQA--REFVMNIHSQGMTNINDALLRGISMLNKAREENA----VPERSTSIIIMLTDGDA 398 Query: 380 TQDNEEGIAICN---KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + I A + + F N L N +E + + Sbjct: 399 NVGESKPEKIQENVHNAIRGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANL 458 Query: 437 LNKIFRDRIGNEIFERV 453 + F + + N + V Sbjct: 459 QLQGFYEEVANPLLTSV 475 >gi|319956579|ref|YP_004167842.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418983|gb|ADV46093.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 560 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/149 (20%), Positives = 50/149 (33%), Gaps = 18/149 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F D S+ L +TI+ TA+ +A+ + E Sbjct: 116 RIALEVFADYAYLAAPMSYEKKGL-KTILAALEPGVVGGRDTALYEALFLG-ARLFKKEE 173 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + ++LLTDG +T N A + K IR+ T+ + Sbjct: 174 GRS---------NRVMILLTDGIDTVGNIPLEAAIRELKRAHIRVYTVGV-----GDDFR 219 Query: 416 RYFLSNCASP--NSFFEANSTHELNKIFR 442 R L A F++A L I+R Sbjct: 220 RGVLEKIARSTGGRFYDARYPEALANIYR 248 >gi|309361123|emb|CAP30209.2| hypothetical protein CBG_10938 [Caenorhabditis briggsae AF16] Length = 579 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 50/318 (15%), Positives = 100/318 (31%), Gaps = 28/318 (8%) Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 +Y ++ S L P + + ++ I ++ + E T ++S +S V+ Sbjct: 284 TDGYSYDIIESGARLLREVP-NSVVYAVTIGEIYLRKELELITGNKSNVMIGSMSYGTVV 342 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 ++ R + + N S + + + +E + + + Sbjct: 343 KRIKNCEARARAQQLRDENPVELVHPGEFLSDAFSHRQSVQTNENIKKDEPAKDSVTEPT 402 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 D DSS + ++ N ++ F Sbjct: 403 DKLPVNDCQYDVGIIFDSSGSLE------------KNFQTQLQIA---NKLFQVAIVQFA 447 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + +K + K G+T N A++ +S D Sbjct: 448 GKSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTFTNQALKRMALLFEASKRDNCKM--- 504 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K +V TDG + +D EGI K QGI + T+ S +K Sbjct: 505 -----KLLVF-TDGYSAEDTAEGI---EALKRQGITVYTVGISTDKNAGLNVSELKGMAT 555 Query: 424 SPNSFFEANSTHELNKIF 441 SP+ +F+++ L K F Sbjct: 556 SPSHYFDSSDFDNLLKHF 573 >gi|297303947|ref|XP_002808579.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Macaca mulatta] Length = 1313 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 58/182 (31%), Gaps = 22/182 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P L P FV S K + A+ ++ ++ D Sbjct: 265 DGYFIHYFAPRGLPPMEKNVVFVIDVSGSMFGTKMEQTKKAMNVILSDLRAND------- 317 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTI 350 +FN SD W I+ + K + G T IN A+ A + Sbjct: 318 ----YFNIISFSDTINVWKAGGSIQATIQNVHSAKDYLHRMEADGWTDINSALLAAASVL 373 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSV 407 SN++ I+ LTDGE T I + + + + T+AF Sbjct: 374 NHSNQEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQAVGHRVSLFTLAFGD 431 Query: 408 NK 409 + Sbjct: 432 DA 433 >gi|220675931|emb|CAX12090.1| matrilin 3b [Danio rerio] Length = 434 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 70/218 (32%), Gaps = 26/218 (11%) Query: 246 PGPLDPSLSEE---HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P P +P S F+ SS + V+ + + +D +D R+ + Sbjct: 192 PAPAEPCKSRPLDLVFIIDSSRSVRPAEFEKVKI---FLSEMVNSLDIGSDATRVALVNY 248 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 V + K + T A++TA + + + N K Sbjct: 249 ASTVNIEFHLKKYFSKAEVKQAFSRIDPL--STGTMTGMAIKTAMEQVFTENAGARPLKK 306 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K +++TDG + ++ + A++ GI I + + + L Sbjct: 307 G---IGKVAIIVTDG---RPQDKVEEVSAAARASGIEIYAVGVDRAEMRS------LKQM 354 Query: 423 AS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 AS + F + +L FR+ + E + Sbjct: 355 ASQPLDDHVFYVETYGVIEKLTSKFRETLCEEARADLT 392 >gi|116623283|ref|YP_825439.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226445|gb|ABJ85154.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 299 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 59/168 (35%), Gaps = 21/168 (12%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDE--NEMGSTAINDAMQ 344 D V F+ R+ + + +L + G T + DA+ Sbjct: 108 PNKDQVF--LMQFDFRIFMRQPLTNSLRQLSDSLPYVDTPTFNQLRAQSGGGTLLYDAVV 165 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA ++ N +K ++LLTDGE+ + A+ I +I Sbjct: 166 TASQEVML-----------NRTGRKALILLTDGEDYGSDASVGDAIEAAQRADTLIYSIL 214 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 F+ + + + SFFE + ++++IF I E+ + Sbjct: 215 FADQGDGRRPLQRMSKE--TGGSFFEVSKKQDIDQIFT-AIQEELRSQ 259 >gi|295132199|ref|YP_003582875.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980214|gb|ADF50679.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 347 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/117 (21%), Positives = 43/117 (36%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + ++ D TAI DA+ A N+ Sbjct: 130 RVGIIAYAGSAFPQLPITTDYA-AAKMFLQALNTDMISSQGTAIGDAIDLATTYYDDDNQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +++DGE+ + N E IA ++A +GIRI TI K Sbjct: 189 TN-----------RVLFIISDGEDHEGNVESIA--DEAAEKGIRIYTIGVGTEKGGP 232 >gi|312883317|ref|ZP_07743043.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309368933|gb|EFP96459.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 432 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 59/411 (14%), Positives = 136/411 (33%), Gaps = 78/411 (18%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 F +L SC FI L++PV++ V + + + + + +A++ + S Sbjct: 4 FDMNRLKNSCGAAAFI-FILILPVLICVMALSLQASQILLAQSKITEASEVTSLALSA-- 60 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 S +QK+ Y R ++ L D+ ++ + + Sbjct: 61 ------------LSEERAQQKLSSYATRVLKHYLV------GTDDVKGQATMQSSTFQFQ 102 Query: 129 YQVVLSSRYDLLLN--PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 +V + ++ P + + G + HK + + ++ D S Sbjct: 103 TDLVGEATHEFWFKHKPQADTFKVSGAST-------------SRKHKPQPMDVYFITDLS 149 Query: 187 RSMLDYQ-------RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR------------DE 227 SM + +D+ Q ++ + + Y+++N K+ R + Sbjct: 150 ESMNRSEPSRLTIVKDAIRQVVSKLPKGSRAAFIGYNTENVKLTGRYFDKRTGREITSKK 209 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + Y P E+ D + + + + Sbjct: 210 PTELQGPNIWAEKSIYDYLTGRHPDFVIENLFDIALSKKKKSSIDKLLEGYGP------- 262 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 +++ +R T + D S + + TI N G+T ++ Sbjct: 263 -PDLDQKIRDFETKYPFY---DISLTTDLGSFKETIKS---NAINANGNT-------HSW 308 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA--ICNKAKSQ 396 + II++ + + + ++ VLLTDGE+++ +G +C K ++ Sbjct: 309 NGIIAAAREAHRQPSSVFNPQQVFVLLTDGEDSKKFPKGYYAPLCEKIRND 359 >gi|126334040|ref|XP_001370580.1| PREDICTED: similar to Integrin, alpha M (complement component 3 receptor 3 subunit) [Monodelphis domestica] Length = 1156 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/265 (14%), Positives = 79/265 (29%), Gaps = 25/265 (9%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 L Q L G RT + NG + L + Sbjct: 98 LTLTAKENSQKLLACGPVVRRTCNENAYVNG-------LCFMLDSDLKQELNFPESLQEC 150 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 ++ F+ S + +++ + +V+ K D + ++D Sbjct: 151 PKQANDIVFLIDGSGSIRPLQFVQMKNFVMTVMDQFKGTD-----TQFSLMQYSD--DFK 203 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F++ K T E G T ++ + K K Sbjct: 204 THFTFNNFKNDPTSKNLVGPIEQLNGKTHTASGIRKVVRELFQEWNGARKDAK------K 257 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PN 426 ++++TDG+ D+ + +A+ +G+ I + L AS Sbjct: 258 ILIVITDGQIQGDSLNYRDVIPEAEKEGVIRYAIGVGYAFNTPSARQE-LRTIASQPAQE 316 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ N+ L I ++++ +IF Sbjct: 317 HVFQVNNFDALKNI-QNQLQEKIFA 340 >gi|313159754|gb|EFR59111.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 340 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 50/143 (34%), Gaps = 19/143 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + A+ + + + R+G F + +++ + K Sbjct: 111 RLERTKYAINKLFDGLHQD-------RVGLIVFAGEPKVQLPITSD-YRMAKAFAKRIDP 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI A+ A + E+ H + ++L+TDGEN +D+ A Sbjct: 163 SLVPVQGTAIGKALSQALMSFSGETEEN-H--------SRVVILITDGENHEDDALAAAR 213 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ 412 A GIRI TI + Sbjct: 214 --HAAEMGIRIYTIGIGTPEGAP 234 >gi|156396520|ref|XP_001637441.1| predicted protein [Nematostella vectensis] gi|156224553|gb|EDO45378.1| predicted protein [Nematostella vectensis] Length = 177 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 60/191 (31%), Gaps = 27/191 (14%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 FV +S + + + + ++ S + R+G ++ R Sbjct: 4 FVLDASGSVRANRFKMCLNFINKLVNSFHIGPHN---TRIGIVRYSTR---PSGIFRFTS 57 Query: 318 KLIRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + K G T A+ A + H + + +K ++++TD Sbjct: 58 YRNKHSTKHRVNRIRYTGGWTRTGAAINYARRYLY------QHNRRRGV--RKVLIVMTD 109 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 G + + + K GI + I + R L+ A+ + + Sbjct: 110 G---KSQDSVVGASRSVKRMGIEVFAIGI-----GRGYRRSELNQMATDRNHVLTARFRD 161 Query: 437 LNKIFRDRIGN 447 L+KI IG Sbjct: 162 LHKI----IGK 168 >gi|113866742|ref|YP_725231.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] gi|113525518|emb|CAJ91863.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] Length = 417 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 56/164 (34%), Gaps = 3/164 (1%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE-EVSSR 78 G II L + V++G G+ +D+ + + L+ + + A+ + + VS Sbjct: 19 GAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVTGATPLLVSEA 78 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A + + + NL +++D V + K Y + R Sbjct: 79 AGLATGTSNAALFQGKAVEMFENLNVSYSDTPDSTFYTKNNVPFSLDKVKYVKCTAERKG 138 Query: 139 LL--LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + L + IK+ + A A T S V + Sbjct: 139 IAHWFIHMLNLLPGIDIKASTVNAMAVATTTSAQAACAIPVYVC 182 >gi|74011918|ref|XP_848765.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Canis familiaris] Length = 826 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 74/216 (34%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ +K + Sbjct: 251 NGYFVHYFAPEGLPTIPKNVIFVIDKSGSMSGRKIQQTREALIKILDDLK---PNDQF-- 305 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ V K +A + G T INDAM TA + S+N+ Sbjct: 306 -NLISFSGDVTHWKPLLVPASPENVDQAKRYAANIEAHGGTNINDAMLTAVRLLQSANQK 364 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQE 413 E+ + + I+LLTDG+ T I KA + + F + + Sbjct: 365 EL--LSDGSV--SLIILLTDGDPTVGETSPARIQKNVQKAIDGQYSLFCLGFGFDVS--- 417 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N + +++S +L +++ Sbjct: 418 --YVFLEKLALDNGGLARRIYEDSDSALQLQDFYQE 451 >gi|307354884|ref|YP_003895935.1| hypothetical protein Mpet_2754 [Methanoplanus petrolearius DSM 11571] gi|307158117|gb|ADN37497.1| conserved hypothetical protein [Methanoplanus petrolearius DSM 11571] Length = 316 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 57/188 (30%), Gaps = 35/188 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + A +I + D +G F + S +I + Sbjct: 109 RLEATKSAAEELINDLDPKD------YVGIVVFESGASTASYLSPDKDSVIENLENIMEK 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D G+TAI D + + S KK ++LL+DG N Sbjct: 163 D----GATAIGDGLSLGINMADSIP-----------NRKKVVILLSDGVNNAGVISPDEA 207 Query: 390 CNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCA--SPNSFFEANSTH 435 AK I++ TI N E L A + +F++ Sbjct: 208 IQFAKDSDIQVFTIGMGSEQPVVMGYDWFGNPQYAELDEATLKEIADETGGKYFKSVDDQ 267 Query: 436 ELNKIFRD 443 LN+I+ + Sbjct: 268 TLNEIYSN 275 >gi|186896818|ref|YP_001873930.1| hypothetical protein YPTS_3520 [Yersinia pseudotuberculosis PB1/+] gi|186699844|gb|ACC90473.1| conserved hypothetical protein [Yersinia pseudotuberculosis PB1/+] Length = 518 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 70/509 (13%), Positives = 136/509 (26%), Gaps = 113/509 (22%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 IK+ G + L+PV +G+ + + + L A + A + S Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALST------- 69 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA-YQVV 132 + + + +++ E R Y Sbjct: 70 --------ENNYRNDRASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNEILGYTEYNAS 121 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS---- 188 L Y L L L+ +L+ +W + A S + + + +V DFS S Sbjct: 122 LQMNYQLAL--LNSYLKQTPSPTWDVNENGAARKYLSSIAE--PIDVVFVTDFSGSMNLP 177 Query: 189 ----------------------MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 + + ++G F + T + ++ Sbjct: 178 FGDIELNNRITKLDELKAIFVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCH 237 Query: 227 EKLSPYMVSCNK---SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 SP + N Y + + + + + K H + + Sbjct: 238 FPYSPKKIDGNGHYLQRYTASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAI-------LS 290 Query: 284 SIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 I+K N+ R A F D+ S + I I+ + I+ Sbjct: 291 EIEKKHRDNEIPTKTRDQAKNFLDKAYKVNQISTITKIVEEHIDYKETINSIDRNGETID 350 Query: 341 DAMQTAYDTIISSNEDEVHRM-----------------------------------KNNL 365 M D E + + Sbjct: 351 IPMDDILDPFFCLKETNAKSLNFDPNSKGDINEILNMKAEGGTLASSGILVGNKMLTESQ 410 Query: 366 EAKKYIVLLTDGENT-------QDNEEGI----------AICNKAKSQGIRIMTIAFSVN 408 K +++L+DG++ D + GI +C K K GI+++ I Sbjct: 411 NNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYV 470 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHEL 437 + +C +F+ A + HEL Sbjct: 471 PD--NNIIDWEKDCVGTGNFYLAKNAHEL 497 >gi|148652289|ref|YP_001279382.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] gi|148571373|gb|ABQ93432.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] Length = 571 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 71/209 (33%), Gaps = 21/209 (10%) Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 N A + + + +V + +V +L P + VD Sbjct: 164 NYDFTAAKKQANAPFLVSTEVVNSPWHPTNQIVKVGIKAEDLLTAKQKQPPANLVFLVDV 223 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S K L + +L + + ++ D++ G N +V+ + K++ Sbjct: 224 SGSMDTEDKLQLAKSSLKMLTKQLRAQDSITLITYAG----NTKVVLPSTPGNQTQKILN 279 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT- 380 I GST A++ AY H K+ + I++LTDG+ Sbjct: 280 AIDN-----LTASGSTNGEAAIKLAYQQAT------EHFKKDGINR---ILMLTDGDFNV 325 Query: 381 --QDNEEGIAICNKAKSQGIRIMTIAFSV 407 ++ + I + +GI + T+ F Sbjct: 326 GVSSVKDMLQIIRSNRDKGISLSTLGFGQ 354 >gi|22761666|dbj|BAC11648.1| unnamed protein product [Homo sapiens] Length = 451 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/127 (18%), Positives = 49/127 (38%), Gaps = 8/127 (6%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHPSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E ++ N+ + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAE-GARPLRENVP--RVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSV 407 I Sbjct: 192 FAIGVGQ 198 >gi|284029570|ref|YP_003379501.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808863|gb|ADB30702.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 654 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 61/185 (32%), Gaps = 31/185 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI-------- 320 + + A+ S++ + V + + G + Sbjct: 55 TRMDAAKRAVGSMVDGL----PAGAQVGLAIYGAGTGSSGAEKVA-GCKDVRVVQPVGPV 109 Query: 321 -RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + +K G T I A++TA + E ++ IVL++DGE+ Sbjct: 110 NKPALKRAVTATKASGYTPIGQALRTAAAQL-------------PKEGQRSIVLVSDGED 156 Query: 380 TQDNEEGIAICNKAKSQGIR--IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T + + + QG+ + TI F V+ + + N + ++ +A+ L Sbjct: 157 TCAPPQPCEVAKELSKQGVDLHVHTIGFRVDAKARAQLACIAQN--TGGTYHDASDADSL 214 Query: 438 NKIFR 442 + Sbjct: 215 LGVLG 219 >gi|145299122|ref|YP_001141963.1| von Willebrand factor type A domain-containing protein [Aeromonas salmonicida subsp. salmonicida A449] gi|142851894|gb|ABO90215.1| von Willebrand factor type A domain protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 331 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 75/235 (31%), Gaps = 47/235 (20%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 R + P +V+ S +L D +++ + + A+ I Sbjct: 66 CRPQWQEPPLVTYQSSRDLILAVDLSDSMRTQDMLDEGEQRDRL--------SAVRQQIS 117 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI-NDA 342 + + + R+ F D + + T +D + +G T +A Sbjct: 118 RLIETRPGD---RIALVVFADHAYLLSPL---TQETKALLTLTRELDFDLVGRTTALGEA 171 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKK--YIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + A + + K+ ++L+TDG NT + + ++ +A + G+ + Sbjct: 172 ILLARQ---------------HADPKRPTALLLVTDGRNTAGSADPLSEARRAAASGMTL 216 Query: 401 MTIAFSVNKT-------------QQEKARYFLSNCAS--PNSFFEANSTHELNKI 440 T+ + E L A +F A + +L+ I Sbjct: 217 YTLGVGADPDTFVEALQPAQSDPSAELDEALLQQLAKVGQGRYFRARTQGDLDAI 271 >gi|281357358|ref|ZP_06243847.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281316389|gb|EFB00414.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 342 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 66/222 (29%), Gaps = 42/222 (18%) Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + S+ P ++ + + V + + + + + + + I + Sbjct: 91 DMSGSMEAYDVPRNINDARTLIAAVKNKEVEN---RIEVAKKEIRRFIEQ----RPND-- 141 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D+ S + L+ + T I + + + + Sbjct: 142 -RIGLIGFADQAYSFAPPTLDHAWLLAHL--EQLEPGMIGQQTGIAAPLASGVNRL---- 194 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNE-EGIAICNKAKSQGIRIMTIAF-----SVN 408 K + ++ +VL TDG N DN K + I T+ V Sbjct: 195 -------KKSDAPRRVLVLFTDGRNNVDNRLTPEQAAALGKEFDVVIHTVGIGSRNAFVL 247 Query: 409 KTQQEKARYF-----------LSNCA--SPNSFFEANSTHEL 437 T + F L + A + ++F A + Sbjct: 248 VTDPFGRQQFQGIEDEFDEKLLRSLAEITGGTYFHAADADGM 289 >gi|219847249|ref|YP_002461682.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541508|gb|ACL23246.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 842 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 58/183 (31%), Gaps = 25/183 (13%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S S + K L ++A + +++ D R+G F+ I F Sbjct: 406 SMSASFGVSKFDLAKEAAILALTALQAGD------RIGVLAFDTDTIWVIPFQAVGEGAA 459 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++T G T I A+ + + H VLLTDG + Sbjct: 460 VAELQTRIATMAIGGGTNIERALAVGLPALAAEPHSVRHA-----------VLLTDGRSY 508 Query: 381 QDNE-EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 +N + A++ I + TIA + L A ++ +L Sbjct: 509 SNNYPRYQQLVETARAAQITLSTIAIGTDADT-----DLLEQLARWGNGRYYFVPDAADL 563 Query: 438 NKI 440 +I Sbjct: 564 PRI 566 >gi|323136279|ref|ZP_08071361.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398353|gb|EFY00873.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 577 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 60/194 (30%), Gaps = 63/194 (32%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT---- 375 ++ + G T +++ + A+ +I + N +K +VL+T Sbjct: 380 TQSTITAKIAQLTAAGDTNLHEGVMWAWRSISPNPPFSAGSAYNTAGVRKILVLMTDGYN 439 Query: 376 ---------------------------------------------DGENTQ--------- 381 DG Sbjct: 440 NWTSNTNTVGGSYYEALGYYSYNGAKNRRLPDGTQGNGVDYQSQLDGAANSWTDYKSVSR 499 Query: 382 --DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYFLSNCAS-PNSFFEANSTHEL 437 +E C AK++GI I +IAFSV+ + L +CA+ + + A + ++ Sbjct: 500 QAQDELTRQSCENAKAKGIEIYSIAFSVSTNPIDAAGINLLKSCATNADHYLLATDSTQI 559 Query: 438 NKIFRDRIGNEIFE 451 ++ F +I + + Sbjct: 560 DRAF-SQIAMNLSK 572 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 59/186 (31%), Gaps = 30/186 (16%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G+ ++ L + ++ + G VD R L+Q A TA + + + + Sbjct: 7 EFVSDHRGNVVMLFGLSVIPVMMMAGAAVDYARGVTTHKVLQQGADTAALAVASRITAAT 66 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + N+ ++ + R + + + + + Sbjct: 67 STADAIKQ------------------AQNVLRSASQRLAAATISNATISADRKTFCIDAQ 108 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +S ++ ++ I S + AE S + E +D S SM + Sbjct: 109 VSIP--------TMIMKIARIDSMAPAVMSCAEIGGGSTNYEIA----LALDNSGSMNES 156 Query: 193 QRDSEG 198 + Sbjct: 157 AGGATK 162 >gi|224046544|ref|XP_002198814.1| PREDICTED: matrilin 2 [Taeniopygia guttata] Length = 902 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 68/205 (33%), Gaps = 23/205 (11%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + + F+ SS V++ + ++++ + I V G + V Sbjct: 7 NSCNNKHLDLVFIIDSSRSVSHYDFEKVKEFILTILQFLD-ISPDATHV--GLIQYGSTV 63 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + S + + I + + T A+Q A + S E ++ N+ Sbjct: 64 KQEFSLK--TFRRKQDIERAVKRMMHLGTGTMTGLALQYAVNIAFSETE-GARPLRQNVP 120 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-- 424 + I+++TDG + + I KA++ GI I I L + S Sbjct: 121 --RIIMIVTDG---RPQDPVAEIAAKARNSGILIFAIGVGRVDM------NTLKSIGSEP 169 Query: 425 -PNSFFEANS---THELNKIFRDRI 445 F + L F+ ++ Sbjct: 170 HEEHVFLVANFSQIETLTSAFQTKL 194 >gi|297286914|ref|XP_001113364.2| PREDICTED: collagen alpha-6(VI) chain-like [Macaca mulatta] Length = 2262 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 56/166 (33%), Gaps = 15/166 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + ++ ++ D N VR+GA F+D + G + I Sbjct: 1018 KKMKEFMVSVVQDFDVSNKRVRIGAAQFSDAYRPEFPL--GTFIGAKEISIQIENITQIF 1075 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T I A++ D R+ + +++LTDG Q +E + Sbjct: 1076 GNTHIGAALRKVEHYF---RPDMGSRINTGTP--QVLLVLTDG---QSQDEVAQAAEALR 1127 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +GI I ++ + + + ++ EL K+ Sbjct: 1128 RRGIDIYSVGI-----GDVDDQQLMQITGTAEKKLTVHNFDELKKV 1168 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 69/230 (30%), Gaps = 27/230 (11%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 S SQN I+D + + P D + ++ S K Sbjct: 194 SMFSQNMTQIIKDVAKYKEGAVDDIFVEACQGPSMADVVFLLDMSINGSDENLDYLK--- 250 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + S+ +D + +R+G +++ S S G++K +++ Sbjct: 251 -----GFLEESVSALDIKENCMRIGLVAYSNETKVINSLSMGINK--SEVLQHIQNLSPR 303 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G A++ + S+ KN + VL+T + + Sbjct: 304 TGKAYTGAAIKKLRKEVFSARNGSR---KNQGVPQ-IAVLVT---HRDSEDNVTKAAVNL 356 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + +G+ I T+ Q L AS A K F D Sbjct: 357 RREGVTIFTLGIKGASDTQ------LEKIAS----HPAEQYVSKLKTFAD 396 >gi|294054316|ref|YP_003547974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293613649|gb|ADE53804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 678 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/144 (15%), Positives = 48/144 (33%), Gaps = 22/144 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A+ ++ + ++ R+G F + ++ T A Sbjct: 110 TRLERAKLAILDLVEQL-------ESDRIGLVAFAGSAFLQTPPTLDYGAFRESLDAT-A 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D G + + A++ A + K +VLLTDGE+ + I Sbjct: 162 PDMMSRGGSDLGVALREAT------------KAFPVENNYKAVVLLTDGEDLGGHA--ID 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 KA +G+++ I + Sbjct: 208 EAKKASKEGVKVFAIGLGTPEGDY 231 >gi|266620637|ref|ZP_06113572.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] gi|288867752|gb|EFD00051.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] Length = 2963 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 54/322 (16%), Positives = 110/322 (34%), Gaps = 42/322 (13%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 K +L + +++ S E+ V + + ++ + + + E + R+VK Sbjct: 465 KDFLTYAGSSQDSIVISSQTENRVDLYY-----NAIAEEKSEDEVTITGAIPRSYFRSVK 519 Query: 214 SYSSQN-GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + + GKV + + + +Y P+ +DSS +K Sbjct: 520 ADDNTTPGKVFPTKTAEWVDEANGIGKINFTIYGNPIRRGSDVILVIDSSGSMEG-EKWS 578 Query: 273 LVRDALASVIRSI-KKIDNVNDTVRMGATFFND----------------RVISDPSFSWG 315 + A I ++ + D V R+ F+ +V + Sbjct: 579 TAKTAAKGFIDNLYQNKDGVVSDDRIAIVDFDSSAKAYPGTNSGSETFLKVDDKITIKNK 638 Query: 316 VHKLIRTIVKTFAID-ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + + G T N A+QTA ++I++ D YIV + Sbjct: 639 TYSAKDYLKSYVLDSQMKDTGGTDYNKALQTA-QSVINNRRDSSRPA--------YIVFM 689 Query: 375 TDGENTQDNE-------EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +DGE + +G + KS G+ I ++ ++ T K L++ + Sbjct: 690 SDGEPNGYWDWLTYRYYDGQKYATELKSDGVTIYSLGLNIGSTNFNKFIVPLASDPTSTY 749 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 T +L I+ D I + I Sbjct: 750 AKNIVKTSDLVGIY-DAIASSI 770 >gi|307943468|ref|ZP_07658812.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773098|gb|EFO32315.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 479 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 3/62 (4%) Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNK 439 + +A+C K Q + I T+ F T + + + +CAS P+ F+ A L Sbjct: 406 APATQALALCEAMKEQDVVIYTVYF--ETTGAKFGKDLMKSCASDPDKFYLAEDRDGLKA 463 Query: 440 IF 441 F Sbjct: 464 AF 465 >gi|114587344|ref|XP_001172675.1| PREDICTED: similar to PK-120 precursor isoform 2 [Pan troglodytes] Length = 914 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 74/216 (34%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA+ +G T INDAM A + SSN+ Sbjct: 312 --LIVFSTEASQWRPSLVPASAENVNKARSFAVGIQALGGTNINDAMLMAVQLLDSSNQ- 368 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E + ++ I+LLTDG+ T +I N + + + F + + Sbjct: 369 EEQLPEGSVSL---IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|206901991|ref|YP_002251775.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] gi|206741094|gb|ACI20152.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] Length = 890 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 48/377 (12%), Positives = 114/377 (30%), Gaps = 57/377 (15%) Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + +S+ + + I +++ LK+ + E+ Y + Sbjct: 222 DGTSQTHDLKNLQEDNIISLSLKSQNTGLKE--IEAEIYSPEDTYRDNNVTSSYIY-IQG 278 Query: 134 SSRYDLLL----NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ------WVI 183 + L+ NP F +S+ ++ W + ++ + + V + I Sbjct: 279 KPKVLYLIGRDFNP--TFAKSLAVQGWDVVFDSKPYPQILNLNSYQAVIMDNIPQEDLSI 336 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 D + ++ + G L G + + ++ + + N ++ + Sbjct: 337 DKMEILKNFVTEKGGTLLVLGGDKSFSAGNYQGTPLEEILPVTLRPEQILKKSNVAIVIV 396 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 L S S K L +++ V+ +++ D G F+ Sbjct: 397 LDASGSMGSYSGGDM-----------KMELAKESAQLVLDLLEEKD------YFGLIAFD 439 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 +K + + G TA+ +++A + +I + H Sbjct: 440 HSYQWIVPLQPLTNKE---ETASLISKISPGGGTALYPPLKSAGEALIKAPIKSKH---- 492 Query: 364 NLEAKKYIVLLTDGENTQDN-EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+ +TDG+ + + K K I + TI + L + Sbjct: 493 -------IIAITDGQTEGGDFYNLVKYLAKYK---ITVSTIGIGEDANIP-----LLKDI 537 Query: 423 AS--PNSFFEANSTHEL 437 A+ F+ + L Sbjct: 538 ANWGNGRFYHTWNIRNL 554 >gi|308270598|emb|CBX27210.1| hypothetical protein N47_A12390 [uncultured Desulfobacterium sp.] Length = 312 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 40/117 (34%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ + + + + + + T + A++ + + Sbjct: 109 RIGLIAFSGVAFVQCPLTLDYGAIQMFLDELKP-ELIPVAGTDLGAAIEAGISSFDFKS- 166 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K I+L+TDGE+ + +G+ KAK +G++I Sbjct: 167 ----------VTDKVIMLITDGEDNEG--KGLIAAQKAKEKGVKIFVFGMGDPSGGP 211 >gi|198436156|ref|XP_002124087.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1702 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 75/243 (30%), Gaps = 22/243 (9%) Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 P K S + P + + SL + + F+ Sbjct: 318 PKTGQILSCAPNWKYNCSGEVHAPGMCYFIQPDLSLSSISLLSPCRTQVCNSGKLDLIFL 377 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + ++ L + I S + ++G ++D + Sbjct: 378 IDESTSVLENDFDGIKVWLRNTISSF---PIGEEYTQIGLATYSDNPRIIFHL--NKYHK 432 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + I K E+ G TA A+ + +N H AK+ +V+LTDG Sbjct: 433 LDDIRKAVLEVEHTSGGTATGKAI------LYLTNNMFTHENGVRPNAKRLVVVLTDG-- 484 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSF-FEANSTHEL 437 + ++ I AK GI + I + L AS P+ + ++ N L Sbjct: 485 -KSQDDVIVPSRIAKESGIVMFAIGVGKVVMGE------LRAIASDPDRYVYKINDFSAL 537 Query: 438 NKI 440 I Sbjct: 538 ESI 540 >gi|114587340|ref|XP_516522.2| PREDICTED: similar to PK-120 precursor isoform 5 [Pan troglodytes] gi|114587342|ref|XP_001172703.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 4 [Pan troglodytes] Length = 930 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 74/216 (34%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA+ +G T INDAM A + SSN+ Sbjct: 312 --LIVFSTEASQWRPSLVPASAENVNKARSFAVGIQALGGTNINDAMLMAVQLLDSSNQ- 368 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E + ++ I+LLTDG+ T +I N + + + F + + Sbjct: 369 EEQLPEGSVSL---IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|94498567|ref|ZP_01305122.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] gi|94422010|gb|EAT07056.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] Length = 678 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 58/184 (31%), Gaps = 44/184 (23%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 T + +R + + ++ + +G T + M + Sbjct: 499 LTSYTNRTSTPTG--------QSSSFNSYIDNLIAVGGTYHDIGMLWGARFLSPKGIFAS 550 Query: 359 HRMK--NNLEAKKYIVLLTDGEN----------------------TQDNEEGIAI----- 389 N ++IV +TDG+ + + AI Sbjct: 551 DNNSAPNGFNISRHIVFMTDGDMSAYQQVYGAYGYQQLDARVAPGNTSDTDLTAIHNTRL 610 Query: 390 ---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE-ANSTHELNKIFRDRI 445 CN K++GI I I F + + L NCA+ ++ + A L++ F+D I Sbjct: 611 QMLCNAIKAKGITIWVIGFR--NQSEGNIQTPLQNCATSSNHWTMAYDATSLSQKFKD-I 667 Query: 446 GNEI 449 I Sbjct: 668 AKNI 671 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/208 (11%), Positives = 63/208 (30%), Gaps = 28/208 (13%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + +L ++ G+ + A + + + G +D+ R ++QA A + + Sbjct: 11 FMSRLARNQKGNVMAMVAAAIIPLAALIGGGLDMGRAYMARARMQQACDAAALAGRRAMT 70 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 S +++ + + F+ N + T V + Sbjct: 71 TSSMTQANK-------------DEAKKFFDFNFPQGTFQAATF-----TPVIRSKPGETT 112 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V +++ + + ++ ++ + EA + + V+D + SM Sbjct: 113 TVQVTASTTMP----TTVMKIFRYETLPLSVTCEARFDIGNTD------VMLVLDTTGSM 162 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSS 217 D +G Y + Sbjct: 163 AYAISDGKGGSTTRLAALKQAVKDFYDT 190 >gi|192289227|ref|YP_001989832.1| hypothetical protein Rpal_0799 [Rhodopseudomonas palustris TIE-1] gi|192282976|gb|ACE99356.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 468 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 67/495 (13%), Positives = 137/495 (27%), Gaps = 88/495 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ + +ITAL+M ++ + GM +D + + L AA A I A P + L Sbjct: 8 RFVRDRKANIAVITALVMIPIIFLLGMTLDFTQALRKKQQLDAAADAAAIAAVRPAM--L 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + A+N+ L + T +V Q Sbjct: 66 MQTDAVAQNTAYAIFMSTANRLASGLTSVPTPTITITDV----------------GLQRT 109 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI--QWVIDFSRSML 190 + Y+ L+ + A + + + ++ ++D S SM Sbjct: 110 VKVSYNAA--------SLNNFPQLLMNNVSWAISGASTAQASSAPNMNFYLLMDDSPSMG 161 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 ++ N A + K+ SSQN + ++ + L + Sbjct: 162 IGATATD--ISNLIASTAPKYQKASSSQNCGFACHETNIAH-----DGGTKDNLAIARAN 214 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 V S+ + + + + ++ + N R F+ + + Sbjct: 215 NITLRIDLVTSAVNQLLNTWSNCPQSGVSGGVMQCMSALNNTTY-RAALYTFDLSLNTLA 273 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGS---------TAINDAMQTAYDTIIS-SNEDEVHR 360 S + V A+ + T + A ++ + Sbjct: 274 SLT--TPTTAGAQVSNIALMPVAYQNCVVPTTNCKTDNGTDIAGALTSLNGIMPSPGLGS 331 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEE----------------GIAICNKAKSQGIRI---M 401 + ++ + L+TDG + AIC K +GI+I Sbjct: 332 NASGDTPQEVVFLVTDGVEDKIASSCPNGSYASYSRCQQPLDTAICTTIKKRGIKIAILY 391 Query: 402 TI---------------------AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 T A+ + L CASP F + + + Sbjct: 392 TEYLQLKTPNVPVTDTWYMSWIDAYDEPTSSTGAIAKNLQACASPGFFSNVQTGGNITQA 451 Query: 441 FRDRIGNEIFERVIR 455 D Sbjct: 452 LTDLFLKVASSTASL 466 >gi|305663382|ref|YP_003859670.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377951|gb|ADM27790.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 323 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 64/187 (34%), Gaps = 24/187 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ AL +++ N+T+ +G F+ + S + R+ + Sbjct: 119 MDSVKYALRTMVSLF------NNTIDIGLVEFSHSIKSAIPPT-----PNRSYIDMVIDR 167 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + A+ TA + E L + V +TDG ++ + Sbjct: 168 MEAGGGTMYSFALSTALSWLRPYRE---------LNVSAFTVFITDGLPGDP-QDYRPLL 217 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 ++ GI I T+ + ++ + S + F S L+ + I ++I Sbjct: 218 DEYNKLGIPIYTVFIGEDPRGIDETKLIASK--TGGEQFTVESIDRLSDTL-NTIASKIN 274 Query: 451 ERVIRIT 457 + I Sbjct: 275 TIIANIN 281 >gi|162454787|ref|YP_001617154.1| hypothetical protein sce6505 [Sorangium cellulosum 'So ce 56'] gi|161165369|emb|CAN96674.1| putative membrane protein [Sorangium cellulosum 'So ce 56'] Length = 384 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 54/149 (36%), Gaps = 17/149 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + +A +I+ ++ R GA F + + + + + Sbjct: 109 SRIFRAKVEVARLIKDLEGA-------RFGAVAFAGEPMG-FPLTADGAAIAQFFRQLDP 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D +G TAI A+ A + + + H K+ I+L+TDGE+ + +A Sbjct: 161 NDM-PIGGTAIARALDQANELLKRDPKSAEH--------KRIILLVTDGEDLEGYPLSVA 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 A+ I ++ I + E + Sbjct: 212 QAIGAQGTTIHVVQIGGRTPEPIPEIGQD 240 >gi|114587346|ref|XP_001172688.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Pan troglodytes] Length = 900 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 74/216 (34%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA+ +G T INDAM A + SSN+ Sbjct: 312 --LIVFSTEASQWRPSLVPASAENVNKARSFAVGIQALGGTNINDAMLMAVQLLDSSNQ- 368 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E + ++ I+LLTDG+ T +I N + + + F + + Sbjct: 369 EEQLPEGSVSL---IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|6469599|gb|AAF13350.1|AF121336_1 unknown [Eufolliculina uhligi] Length = 494 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 67/226 (29%), Gaps = 25/226 (11%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 C +L S + V S +K LV+ L ++ + D Sbjct: 71 CTINLESPAQTSEASRSGVDIVCVIDVSGSMQGEKIQLVQTTLNFMVERLSPAD------ 124 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F++ K + + G T I ++ + Sbjct: 125 RICLISFSNDATKISRLVQMSPKGKKQLKSMIP-RLVASGGTNIVGGLEYGLQAL----- 178 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI---MTIAFSVNKTQQ 412 R N L + I+LL+DG++ + KA I I ++ Sbjct: 179 -RQRRTINQLSS---IILLSDGQDNNGT--TVLQRAKATMDSIVIRDDYSVHTFGYGHGH 232 Query: 413 EKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + L+ A P +F+ + F + +G + +I Sbjct: 233 DS--TLLNALAEPKNGAFYYVKDEETIATAFANCLGELMSVVADQI 276 >gi|332827795|gb|EGK00530.1| hypothetical protein HMPREF9455_03173 [Dysgonomonas gadei ATCC BAA-286] Length = 603 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 72/204 (35%), Gaps = 23/204 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D S K LV+ ++ ++ +++ ID V V GA + Sbjct: 240 PATNFVFLIDVSGSMDWDGKLDLVKSSMKLLVNNLRPIDRVAIVVYAGAAG----QVLPS 295 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 ++ + GSTA + + AY + E+ + N Sbjct: 296 -----TPGSEKSKILESLNGLTAGGSTAGGEGIVLAYKI---AKENLIEGGNNR------ 341 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ N+ + + G+ + + + + + +K + A + Sbjct: 342 IILCTDGDFNVGVSSNDGLEKLIENERKSGVFLSILGYGMGNYKDDKMQTLAQ--AGNGN 399 Query: 428 FFEANSTHELNKIFRDRIGNEIFE 451 ++ E NK+ G +F Sbjct: 400 HAYIDNMQEANKVLVSEFGGTMFT 423 >gi|260814261|ref|XP_002601834.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] gi|229287136|gb|EEN57846.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] Length = 863 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 52/346 (15%), Positives = 102/346 (29%), Gaps = 22/346 (6%) Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 V T N K + V S+ L L R +G ++ + + Sbjct: 85 VSATPKRENKFKVSVNVAAQSKVTFNLTYEELLQRRLGSYELVLSIRPQQVVRHLKIDVR 144 Query: 175 HGVSIQWV-IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + V +D + + + +P R + GI + L Y Sbjct: 145 IIETRDIVMLDNTYGSGELEGVEIARPSPNRAHIQYRPTDMEQMRMSPSGISGDFLVRYD 204 Query: 234 VS----------CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 V N + P L F+ S K + A+ ++++ Sbjct: 205 VKRDLSVGDIQIVNGYFVHYFAPSGLPVVPKNIVFIIDKSGSMGGTKMRQTKQAMNTILK 264 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ +D + ++ + + I + N G T IN A+ Sbjct: 265 DLR----DHDRFNVMPFSYSSTMWRPNEMVLATRENIESARTYVRRSINAGGGTNINQAI 320 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRI 400 A D + +D+ N+ + I+ LTDG + + I K+ + + Sbjct: 321 IDAADLLRRVTDDQP----NSPRSASLIIFLTDGLPSVGESKPRNIMVNVKNAIREQVSL 376 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + F + + L N +E + K F D + Sbjct: 377 FCLGFGKDVDFPFLEKMALENRGLARRIYEDSDAALQLKGFYDEVA 422 >gi|154488145|ref|ZP_02029262.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] gi|154083618|gb|EDN82663.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] Length = 835 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 61/337 (18%), Positives = 103/337 (30%), Gaps = 61/337 (18%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL----NCFGQPADRTVKSY 215 T + +T G + + D M D + + N D T Sbjct: 199 TADDGKTGDADDADNDGNTAEDADDAEHIMRDRFTFNRKTVMRAARNVAVPQPDHTKSIT 258 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL---RHVIKKKH 272 + GK + + +++ + LD S S + +D S + K+ Sbjct: 259 YNNGGKYTLNLNVVGKDTRESHETTEKIEVVLVLDTSGSMNYCMDGSQRGCNKSNPKRLT 318 Query: 273 LVRDALASVIRSI----KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 +++A S I + I + N VR+ F S + +K+ Sbjct: 319 ALKEAATSFIDATETTNDTIQDENSKVRIAIAQFGQTSGVVSSLTSDTA-----ALKSSV 373 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G+T + M A ++ + KK ++ DG T N Sbjct: 374 SRLSANGATPADKGMAAAQTALLRARPGA----------KKVVIFFADGVPTTQNTFSTR 423 Query: 389 ICNKA-------KSQGIRIMTIAFS--VNKTQQ-------EKARYFLSNCASP------- 425 + N A KS G I +I N QQ ++A F+ +S Sbjct: 424 VANDAVTTALAMKSAGTLIYSIGIFEGANPEQQSFGNRENDQANQFMHAVSSNYPNATAY 483 Query: 426 -----------NSFFEANSTHELNKIFRDRIGNEIFE 451 + NS +L KIF D I EI Sbjct: 484 NKTNWGTGSNLGYYKATNSADDLTKIF-DDIQKEITT 519 >gi|307292639|ref|ZP_07572485.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] gi|306880705|gb|EFN11921.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] Length = 540 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 54/155 (34%), Gaps = 32/155 (20%) Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSN--EDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 TF G T ++ + A + E + + +Y++ +TDG + + Sbjct: 380 TFNNGFIPNGGTWLDVGLLWAARLLSRDGLWSTENDELYHTYPVSRYVIFMTDGYMSIGS 439 Query: 384 EEGIAI---------------------------CNKAKSQGIRIMTIAFSVNKTQQEKAR 416 A C K+ +I TI+F T Sbjct: 440 SNYAAYAQEDYWRRVAAAGASKNDNHYARMLMTCTAIKNMDTKIYTISFGAGSTLDSNLI 499 Query: 417 YFLS--NCASPNSFFEANSTHELNKIFRDRIGNEI 449 S N +P ++A+S+ +LN++FRD IG I Sbjct: 500 NCSSSTNTTNPEFAYKADSSSDLNRVFRD-IGENI 533 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 62/208 (29%), Gaps = 27/208 (12%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + +F +L ++ G+ I A M + G+ G +D+ R + L+QA ++ Sbjct: 6 KRALFILMRLYRNQAGNTLAIVAAAMLPLAGMVGGALDISRGYLAKTRLQQACDAGVLAG 65 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + S + + + + N + + T Sbjct: 66 ------------RKVMGSSGVLSDSVRDEVRKYVSFNYPSGYLGSTL-----ATTDINPT 108 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 S Q+ LS L + +R G + I A + + I V+D Sbjct: 109 LGSNDQIALS----LTTAIPTAVMRLFGRNNMSITASCTARNDYSN------IDIVLVLD 158 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + SM ++ G Sbjct: 159 TTGSMACKPERNDSDCSTWAGSRYVTQW 186 >gi|218528581|ref|YP_002419397.1| hypothetical protein Mchl_0537 [Methylobacterium chloromethanicum CM4] gi|218520884|gb|ACK81469.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 480 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 62/189 (32%), Gaps = 10/189 (5%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G +I AL ++G+ G +D R + ++QA ++ L + Sbjct: 32 QDRGGTVTVIVALAATTLMGLVGGAIDYARLVSAQRHIQQATDAGVMAGGNALKLVVSNT 91 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA-------VEMNPRKSA 128 +S + + +I++ + V +V T V M+ K + Sbjct: 92 ASVIGLTTQTIQDEIKDSAKNPVTIQVDVASDKTSVTAVVEQTVHLSFGPFVGMSESKVS 151 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + S + L L+L + G + + A + + + + + + S Sbjct: 152 AKAKASVVGKMRLCMLALDPAAAGAFNLEKSAQVTAYDCALYSNSSNSGGM---VGRNNS 208 Query: 189 MLDYQRDSE 197 M Q Sbjct: 209 MARAQTICS 217 >gi|332216213|ref|XP_003257239.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Nomascus leucogenys] Length = 930 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 74/216 (34%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTAMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ I ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEAIQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ + I+LLTDG+ T +I + + + F + + Sbjct: 370 E--RLPDGSV--SLIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|114587348|ref|XP_001172665.1| PREDICTED: similar to PK-120 precursor isoform 1 [Pan troglodytes] Length = 849 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 59/340 (17%), Positives = 120/340 (35%), Gaps = 22/340 (6%) Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA--ETVSRSY 171 +V+ T M + + V +++ L L R +G+ L++ + + + + Sbjct: 117 LVKATGRNMEQFQVSVSVAPNAKITFELVYEELLKRRLGVYELLLKVRPQQLVKHLQMDI 176 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 H I ++ S M + D+ N Q + ++ N + ++ Sbjct: 177 HIFEPQGISFLETESTFMTNQLVDALTTWQNKTKQKSPEQQETVLDGNLIIRYDVDRAVS 236 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEH--FVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 ++ Y++ Y P + ++ FV S +K R+AL ++ + D Sbjct: 237 GGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRD 296 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N F+ ++FA+ +G T INDAM A Sbjct: 297 QFN------LIVFSTEASQWRPSLVPASAENVNKARSFAVGIQALGGTNINDAMLMAVQL 350 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + SSN++E + I+LLTDG+ T +I N + ++ Sbjct: 351 LDSSNQEEQLPEGSVSL----IILLTDGDPTVGETNPRSIQNNVREAVSGRYSL--FCLG 404 Query: 410 TQQEKARYFLSNCASPNS------FFEANSTHELNKIFRD 443 + + FL A N +++S +L +++ Sbjct: 405 FGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 444 >gi|168702184|ref|ZP_02734461.1| hypothetical protein GobsU_21830 [Gemmata obscuriglobus UQM 2246] Length = 638 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 69/205 (33%), Gaps = 23/205 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P + VD+S + LV+ +L ++ + + D V+ G + R Sbjct: 265 PAEKLPPRNLVFLVDTSGSMQQENRLPLVQKSLELLVEKLTEKDRVSVVTYAG----DSR 320 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V P + + G T ++ AY + D Sbjct: 321 VALPP-----TSGADKKAILDVVTGLQANGGTNGEGGIKKAYQFARDTFLDGGVNR---- 371 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 ++L TDG+ DN E + + + + + + + + + + ++ + ++ Sbjct: 372 -----VILCTDGDFNVGVVDNGELVKLIEEQRKSKVFLTVLGYGMGNYKDDRLKELANH- 425 Query: 423 ASPNSFFEANSTHELNKIFRDRIGN 447 ++ E K+F ++ G Sbjct: 426 -GNGHHAYIDTLDEAKKVFVEQGGA 449 >gi|74315933|ref|NP_001028276.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|72679321|gb|AAI00122.1| Inter-alpha (globulin) inhibitor H3 [Danio rerio] Length = 892 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 55/359 (15%), Positives = 108/359 (30%), Gaps = 40/359 (11%) Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET--- 166 E +V + K++ V +S+ L L R +G LI + Sbjct: 107 ESAGLVSAVGRTLEEFKTSVTVAANSKVTFELTYEELLKRRLGKYKLLINAQPMQPVADF 166 Query: 167 -VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP-------------ADRTV 212 + H+ G+S+ ++ + + D Sbjct: 167 KIDIHIHESAGISL---LEVKGGLNTKDLANAVTTTRAQEDAWVKFYPTRDQQKDCDDCT 223 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH--FVDSSSLRHVIKK 270 K+ + N + E++ + Y++ Y P D ++ F+ S K Sbjct: 224 KNGLNGNLVIMYDVERVKQSGDFKVANGYFVHYFAPTDVQRIPKNVVFIIDQSGSMQGNK 283 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 R A+ ++ + K D G F+ + + KTF Sbjct: 284 IEQTRMAMLRILSDLAKDD------YFGLITFSSHIQAWKPELLKATAENVEEAKTFVKQ 337 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T IN A+ A + I + ++LLTDG+ T + I Sbjct: 338 IRSGGATDINGAVLNAVNMI---------NQYTQEGSASILILLTDGDPTSGVTNPVTIQ 388 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K+ + + F N + + L N + +E + + F + + Sbjct: 389 QNVKTAIGGKYPLYCLGFGFNVRFEFLEKMSLENNGAARRIYEDSDADLQLQGFYEEVA 447 >gi|126653689|ref|ZP_01725608.1| BatA [Bacillus sp. B14905] gi|126589726|gb|EAZ83861.1| BatA [Bacillus sp. B14905] Length = 973 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 48/262 (18%), Positives = 99/262 (37%), Gaps = 34/262 (12%) Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + PLN G P S GK + ++P Y + G D ++S Sbjct: 626 TNTSDPLNNTGSPGTAVAYFDSIIYGKSKVEATLVNPI-----DPRYATILKGLKDKTVS 680 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIR---SIKKIDNVN-------DTVRMGATFFND 304 ++ F + + K + +A V+ S+K +D N + + N Sbjct: 681 KDIFTN----PYFSKNSCSLATEVAYVVDYSSSMKAVDPTNYRGKKMIEFINQLKAKNNI 736 Query: 305 RVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + ++ + G + K E G+T I + A + + Sbjct: 737 VIETNTKATILGEGTTDAVLKKDLYKASKEKGATDIFAGIDIALTKFSNDTKTA------ 790 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K IV+++DG+ ++ + + N+AK QG++I T++ K+Q A + Sbjct: 791 -----KAIVVVSDGKTSKS--KMTKVINEAKKQGVKIYTVS-MGKKSQINDATLMQVSTE 842 Query: 424 SPNSFFEANSTHELNKIFRDRI 445 + +++ A +L+++F+ I Sbjct: 843 TGGAYYYALDNLQLHQVFQKLI 864 >gi|332216215|ref|XP_003257240.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Nomascus leucogenys] Length = 900 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 74/216 (34%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTAMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ I ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEAIQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ + I+LLTDG+ T +I + + + F + + Sbjct: 370 E--RLPDGSV--SLIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|254420933|ref|ZP_05034657.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] gi|196187110|gb|EDX82086.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] Length = 646 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 66/175 (37%), Gaps = 41/175 (23%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS----NEDEVHRMKNNLEAKKYIVLLT 375 ++ + +GSTA + + + + + + ++ + K +VL+T Sbjct: 471 VKKTLTDAVDGMTAVGSTAGHIGLAWGWYLVSPNFGLWSGLGAPAAYDSSKTLKAVVLMT 530 Query: 376 DGE------------------------------NTQDNEEGIAICNKAKSQGIRIMTIAF 405 DGE N E+ +C K+ + + T+ F Sbjct: 531 DGEFNTPYFRGVIASDAGNGSGGADTHINQPATNGSSFEQAYRLCENMKAADVIVYTVGF 590 Query: 406 SVNKTQQ-----EKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + + A ++ CA+ P+ F+A+S+ +L+ FRD IG +I I Sbjct: 591 DIGAARNMTGPIDSAGELMARCATNPDRAFQASSSTDLSDAFRD-IGRDITRLRI 644 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 84/249 (33%), Gaps = 26/249 (10%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +L G+ +I L +PV++ + VD+ R + + A Sbjct: 12 RLRKLVSRLRDDRRGNVAMIFGLSLPVIVMLALGGVDLHRITTARSQFQDAL-------- 63 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + ++ + K L N + + D +V + D +V + Sbjct: 64 ---DAATLAAARSSETTPAGLKSVALATLHGNIQGTEVEPINDADVEVAMNDKSVVIATA 120 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + +++ +++L P L + I ++E SR V + V+D Sbjct: 121 QGRVKTLVA---NIVLPPYGQLLD----DTLPISARSEVNRSSRD------VEVALVLDI 167 Query: 186 SRSMLDYQRDS-EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN-KSLYYM 243 + SM D G+ ++ A + N L+PY + N Y Sbjct: 168 TGSMNDCADSCSSGRKIDNLKSAAKELIDIVVQTNQSPFYSKVALAPYSMGVNVGDTYAS 227 Query: 244 LYPGPLDPS 252 G LD + Sbjct: 228 RARGSLDSN 236 >gi|124485081|ref|YP_001029697.1| hypothetical protein Mlab_0254 [Methanocorpusculum labreanum Z] gi|124362622|gb|ABN06430.1| von Willebrand factor, type A [Methanocorpusculum labreanum Z] Length = 313 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 71/239 (29%), Gaps = 40/239 (16%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKI 288 +V + L + +L V +S + + + +IRS+ + Sbjct: 67 GCVVVGAADPHIPLVSASENVNLVVALDVSASMSASDYSPTRVEAAKGSSEILIRSLSES 126 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D G F S S ++++ + + G TA+ D + A D Sbjct: 127 DT------AGVVIFESGASSAAYLSSDKNRVVSRLEQVSVK----TGKTALGDGLALAVD 176 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + + +VLL+DG + AK+ G+ + TI Sbjct: 177 MVTAIP-----------AGTYIVVLLSDGVSNSGMITPQEAAEYAKNSGVVVYTIGVGSE 225 Query: 409 KTQQEKARYFLSNCA------------SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + + + + +F + L +I N I +IR Sbjct: 226 SPVEVSSDGVQQYASLDEETLRSIAEITGGEYFRSVDEKTLV-----QIQNTIQTSIIR 279 >gi|120602151|ref|YP_966551.1| von Willebrand factor type A [Desulfovibrio vulgaris DP4] gi|120562380|gb|ABM28124.1| von Willebrand factor, type A [Desulfovibrio vulgaris DP4] Length = 420 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 70/445 (15%), Positives = 140/445 (31%), Gaps = 88/445 (19%) Query: 41 VDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 +D L+ A A + S+ L ++++ L+R Sbjct: 34 IDSGMLYLSHSRLQAAVDAAALAGSLQL----------------PYDPQLDKGLVRGAVT 77 Query: 101 -NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 + N+ + + + T S ++F+ ++GI S + Sbjct: 78 QYMDANYPEASLNGVTPGTEER------------SVTVTATATVPTIFMNALGIGSSEVH 125 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 KA A Y+K + + +VID S SM + + S + Sbjct: 126 AKATA-----GYNK---LEVVFVIDNSGSMKGTPIQQTNSAASQLVELIMPEGMMTSVKV 177 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 G V R + P V G L+PS E + +S ++ ++ Sbjct: 178 GLVPFRGKVHLPAGVDGLPDG-CRNADGTLNPSWLHEEYFKTSYRYPSGSSLNVPKNTCT 236 Query: 280 SVIRSIKKIDNVNDTVRMGATFFN-----DRVISDPSFSWGVH--------------KLI 320 S I ++ + +T+ + N + WG H K I Sbjct: 237 S-IPRVQGLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPEAPFTEGSSAKDI 295 Query: 321 RTIVKTFAIDENEMGSTAINDAMQT---AY--DTIISSNEDEVHRMKNNLEAKKYIVLLT 375 R ++ + E G + A+ AY + + H Sbjct: 296 RKVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKL--------- 346 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFFE 430 N + K K GI + I F + + + + AS + +++ Sbjct: 347 -------NAAMLEEARKVKEAGIEVFAIRFGDSDSVDV---SLMKSIASSKAGTNDHYYD 396 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A S ++++ +F+ +IG ++ R++R Sbjct: 397 APSAYDIDDVFK-KIGRQLGWRLLR 420 >gi|197099226|ref|NP_001126843.1| inter-alpha-trypsin inhibitor heavy chain H4 [Pongo abelii] gi|55732844|emb|CAH93116.1| hypothetical protein [Pongo abelii] Length = 896 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S KK R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGKKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 E R+ + I+LLTDG+ T +I + + + F + + Sbjct: 370 E--RLPDGSV--SLIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|39933805|ref|NP_946081.1| hypothetical protein RPA0728 [Rhodopseudomonas palustris CGA009] gi|39647652|emb|CAE26172.1| conserved unknown protein [Rhodopseudomonas palustris CGA009] Length = 468 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 66/495 (13%), Positives = 137/495 (27%), Gaps = 88/495 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ + +I AL+M ++ + GM +D + + L AA A I A P + L Sbjct: 8 RFVRDRKANIAVIAALVMIPIIFLLGMTLDFTQALRKKQQLDAAADAAAIAAVRPAM--L 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + A+N+ L + T +V Q Sbjct: 66 MQTDAVAQNTAYAIFMSTANRLASGLTSVPTPTITITDV----------------GLQRT 109 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI--QWVIDFSRSML 190 + Y+ L+ + A + + + ++ ++D S SM Sbjct: 110 VKVSYNAA--------SLNNFPQLLMNNVSWAISGASTAQASSAPNMNFYLLMDDSPSMG 161 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 ++ N A + K+ SSQN + ++ + L + Sbjct: 162 IGATATD--ISNLIASTAPKYQKASSSQNCGFACHETNIAH-----DGGTKDNLAIARAN 214 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 V S+ + + + + ++ + N R F+ + + Sbjct: 215 NITLRIDLVTSAVNQLLNTWSNCPQSGVSGGVMQCMSALNNTTY-RAALYTFDLGLNTLA 273 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGS---------TAINDAMQTAYDTIIS-SNEDEVHR 360 S + V A+ + T + A ++ + Sbjct: 274 SLT--TPTTAGAQVSNIALMPVAYQNCVVPTTNCKTDNGTDIAGALTSLNGIMPSPGLGS 331 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEE----------------GIAICNKAKSQGIRI---M 401 + ++ + L+TDG + + AIC K +GI+I Sbjct: 332 NASGDTPQEVVFLVTDGVEDKISSSCPNGSYASYSRCQQPLDTAICTTIKKRGIKIAILY 391 Query: 402 TI---------------------AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 T A+ + L CASP F + + + Sbjct: 392 TEYLQLKTPNVPVTDTWYMSWIDAYDEPTSSTGAIAKNLQACASPGFFSNVQTGGNITQA 451 Query: 441 FRDRIGNEIFERVIR 455 D Sbjct: 452 LTDLFLKVASSTASL 466 >gi|170574976|ref|XP_001893043.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158601129|gb|EDP38122.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 664 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 56/150 (37%), Gaps = 9/150 (6%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + ++ + F+ ++++ ++ G+T+ + A+ AY + + D Sbjct: 1 LALITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLTDT--D 58 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + ++ + KK I++ TDG + + K +G+ I I + E Sbjct: 59 NENGVREGV--KKMIIIFTDG---HSQRSPQDMALRLKDKGVEIFAITLTPAPYADEG-- 111 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIG 446 LS + + F + + F IG Sbjct: 112 ELLSITQNTDHIFTPVNLKDFEIKFLPYIG 141 >gi|283778201|ref|YP_003368956.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436654|gb|ADB15096.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 786 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 52/152 (34%), Gaps = 18/152 (11%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 G +D + FV S KK R+A+ V+ ++ + D N ++ V Sbjct: 300 GEVDLTKKTVIFVVDRSGSMQGKKIEQAREAMRYVLNNLHEGDTFN------IVAYDSTV 353 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 S R + GST I+ A+ +A+ + S+ Sbjct: 354 ESFKPELQKFDDATRKSALAYVDGLYAGGSTNISGALDSAFAMLTGSDRPN--------- 404 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 YI+ LTDG T I AK + + Sbjct: 405 ---YILFLTDGLPTAGETNEGKIVELAKQKNV 433 >gi|304412560|ref|ZP_07394165.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303576|ref|ZP_07583329.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349036|gb|EFM13449.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306912474|gb|EFN42897.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 627 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 56/414 (13%), Positives = 128/414 (30%), Gaps = 27/414 (6%) Query: 42 DVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENN 101 D + + + A A + + + SL + ++ E + Sbjct: 14 DPTLYLQRGNGIPSATNMAAL---LLVAVSLTACGGKGAEVEHRQAEQQAEQRHQEASQR 70 Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS-RYDLLLNPLSLFLRSMGIKSWLIQT 160 + +V ++M+ + + ++ D + PL+ +++ ++ Sbjct: 71 QAEMRDAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAVIPLAQNKFEQQVQNGIMVA 130 Query: 161 KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 + + + + + + LN F K+ + + Sbjct: 131 GEIPVSTFSIDVDTGSYATLRRMLREGRLPEKGIVRVEEMLNYFAYDYPLPAKNAAPFSV 190 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 + + M+ L P + + +D S K L++ AL Sbjct: 191 TTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSMASADKLPLLQTALKL 250 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + D V+ V GA V+ D G + + GST Sbjct: 251 LTAQLSAQDKVSIVVYAGAAG----VVLD-----GASGNDTQTLTYALEQLSAGGSTNGG 301 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQG 397 + AY + H + N + ++L TDG+ D ++ IA+ K K G Sbjct: 302 QGITQAYQL------AKKHFIPNGINR---VILATDGDFNVGVTDFDDLIALIEKEKDHG 352 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I + T+ F + + ++ ++ +E K+ D + + +F Sbjct: 353 IGLTTLGFGLGNYNDQLMEQLADK--GNGNYAYIDTLNEARKVLVDELSSTLFT 404 >gi|302336993|ref|YP_003802199.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301634178|gb|ADK79605.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 333 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 61/195 (31%), Gaps = 40/195 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A+ + + ++ D +G F D + + + Sbjct: 112 TRFDAAKHAIRTFVEG-REHDP------LGLVIFGDEAALVTPPTLDYTSFLSRMDAVRV 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +A+ M A VH K++ E +K +V+++DGEN + Sbjct: 165 MKL--GRGSALGLGMAVA----------TVHLEKSSAE-RKVMVIVSDGENNAGEITPES 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKT------QQEKARY------------FLSNCASP--NSF 428 A S GIRI + + E + L Sbjct: 212 AARVAASLGIRIYAVGVGGEGSVATEFTDPETGKSYRGTYEGKIDMELLKAVTESTRGQA 271 Query: 429 FEANSTHELNKIFRD 443 F A S L+++FR+ Sbjct: 272 FLAGSPGALSQVFRE 286 >gi|33596464|ref|NP_884107.1| hypothetical protein BPP1839 [Bordetella parapertussis 12822] gi|33566233|emb|CAE37141.1| putative exported protein [Bordetella parapertussis] Length = 571 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 72/211 (34%), Gaps = 27/211 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+++ +D+S K L++ AL ++ ++ D R+ + Sbjct: 207 PAVNLVLLIDTSGSMADRAKLPLLKSALRQLVTQMRAQD------RVAIVAYAGSAGLVL 260 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + G + GST ++ AY E +K+ + Sbjct: 261 PSTPGDRHAQ---ILAAIDGLQASGSTNGGAGLELAY------AEAAKGLVKDGVNR--- 308 Query: 371 IVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 IVL +DG+ + + + + +GI + T+ A + A S Sbjct: 309 IVLASDGDFNVGRTDLAQLKDYVGSQRKRGIALTTLGLGSGNYNDAMAMQLAN--AGDGS 366 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + +S + K+F +E+ ++ I K Sbjct: 367 YHYIDSLLQARKVF----ASELSATLLTIAK 393 >gi|288921527|ref|ZP_06415802.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347095|gb|EFC81397.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 587 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 55/135 (40%), Gaps = 21/135 (15%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 T + A TAI A++ AY + S + + IVL+TDGEN Q Sbjct: 466 TAISAAADGLTLGSGTAIYSALEAAYRYVADSAAAPA----DGVAPLTSIVLMTDGENNQ 521 Query: 382 DN------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 +A+ + A+S +R T+ F + + + A + + F+A Sbjct: 522 GTTADAFHSSYLALPDAARS--VRTFTVVFGDARVDE------MRTIADWTGGAMFDAR- 572 Query: 434 THELNKIFRDRIGNE 448 T L++ FR+ G + Sbjct: 573 TSSLSEAFREIRGYQ 587 >gi|229495775|ref|ZP_04389503.1| BatB protein [Porphyromonas endodontalis ATCC 35406] gi|229317349|gb|EEN83254.1| BatB protein [Porphyromonas endodontalis ATCC 35406] Length = 338 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 25/117 (21%), Positives = 45/117 (38%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + + + +V D TAI A+ + + + Sbjct: 130 KVGVVVFAGNAYMQLPITTDLA-MAKKMVDDANPDMLSNQGTAIASAIDLSLGSFSDRH- 187 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + K I+L TDGEN + + + KAKSQG+++ TIA + Sbjct: 188 ----------DVGKAIILFTDGENHEGDA--LEAAKKAKSQGVKVYTIAVGSEEGAP 232 >gi|162424746|gb|ABX90059.1| hedgling [Amphimedon queenslandica] Length = 2416 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 62/214 (28%), Gaps = 20/214 (9%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 M + P D +L +D S + AL + + ++ Sbjct: 164 MYDKKGYARMSVIPDACDTNLDVVFVLDQSGSIGYYNH----QLALNFLSKVVEFFKIGA 219 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + ++G ++ + TI+ + G TA + A + Sbjct: 220 NKTQVGLITYSTHAYVQFDL--NDYHSKSTILNRISRIYYTGGWTATALGLFQAGVILNP 277 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + + ++LLTDG + + + + GI++ T+ + Sbjct: 278 QQMRGARPISQGVP--RVVILLTDGRSNRVPID--EVAPSLHDFGIQVYTVGVGNIYLPE 333 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 L AS + L F D G Sbjct: 334 ------LKFIASDPDPYHIF----LLDSFSDASG 357 >gi|261415941|ref|YP_003249624.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372397|gb|ACX75142.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326143|gb|ADL25344.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 228 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 33/88 (37%), Gaps = 6/88 (6%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE----GI 387 G T + +AM A D + ++ + +IVL+TDG+ D+ E Sbjct: 91 FANGGTPMGEAMNMALDLLEKRKG--EYKASGVDYYQPWIVLMTDGKPNGDSSEYARAVQ 148 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKA 415 C K++ + I I + A Sbjct: 149 RTCEMIKNRKLTIFPIGIGEDADMNALA 176 >gi|167617233|ref|ZP_02385864.1| hypothetical protein BthaB_13083 [Burkholderia thailandensis Bt4] Length = 396 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 81/252 (32%), Gaps = 4/252 (1%) Query: 23 FIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE-EVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A + + A+ L ++ V A Sbjct: 2 SILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLTGAINLSVPEAAGI 61 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + + E + N FTD +A+ +P Y ++S+ ++ Sbjct: 62 TAGHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAIT-SPSSIKYVKCMTSQTGIVN 120 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + G+ A A + + + + + + + Sbjct: 121 WFIQALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFIC--KAGTQTNPPVAGATYNI 178 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + S+ + N D S ++ + Y P + + + Sbjct: 179 GDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAAT 238 Query: 262 SSLRHVIKKKHL 273 ++ + + Sbjct: 239 TNAYNTRFGIYA 250 >gi|313792199|gb|EFS40300.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|314984000|gb|EFT28092.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] Length = 320 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R V T Sbjct: 109 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVVVPPT-----PDRAAVSTAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 214 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQL 273 Query: 438 NKIFRD 443 + +++ Sbjct: 274 SDVYKS 279 >gi|323488845|ref|ZP_08094085.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] gi|323397543|gb|EGA90349.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] Length = 857 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 59/174 (33%), Gaps = 30/174 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR-TIVKTFA 328 K L ++A A + ++ D +G F+D+ K+ Sbjct: 422 KMELAKEAAARSVELLRSDDT------LGVIAFDDQPWEILP----TGKVDDPKKAADKI 471 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T I +++ AY + H I+LLTDG+++ N + A Sbjct: 472 LSITPGGGTEIYRSLEQAYTELEDLELQRKH-----------IILLTDGQSSTSN-DYDA 519 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + K I + T++ Q+ R L A F++ + I Sbjct: 520 LIENGKDHNITLSTVSI-----GQDADRNLLEQLAGTGSGRFYDVTDATTIPAI 568 >gi|228472734|ref|ZP_04057492.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] gi|228275785|gb|EEK14551.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] Length = 353 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 56/174 (32%), Gaps = 38/174 (21%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ + + + + + G TAI +A+Q A + S++ Sbjct: 130 RVAFIPYAAQAYPQLPLTSDYSSAKIFLEGINTNMLSSQG-TAIGEAIQMAINYFEESSQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---- 411 K +++L+DGE+ Q + + +AK +GIR+ TI + Sbjct: 189 TS-----------KILIILSDGEDHQQGVDTVI--QEAKDKGIRLFTIGLGTAQGATIPV 235 Query: 412 ------------------QEKARYFLSNCASP--NSFFEANSTHELNKIFRDRI 445 + + L A +F +T E+ + + Sbjct: 236 SENGQIVAKRDNNGQVVITKLNQALLEEIAQEGGGKYFNGANTKEVLDALQKAL 289 >gi|304407684|ref|ZP_07389335.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304343167|gb|EFM09010.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 966 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 66/196 (33%), Gaps = 31/196 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND-RVISDPSFSWGVHKLIRTIVKTFA 328 K ++A + + + R+ F+ +I + F+ + I Sbjct: 92 KMLNAKEAAKGFVDLMDLTKH-----RVAIVDFSSSNMIGNLPFTTNPTEAKNYIDT--- 143 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 N GSTA DA+ +A + + EA+ IV++TDG+ TQ + + Sbjct: 144 --INANGSTATGDAIDSAIALLANHRP----------EAQPVIVIMTDGDATQPSTDPYG 191 Query: 389 ICNK----AKSQGIRIMTIAFSVNKTQQ--EKARYFLSNCA--SPNSFFEANST--HELN 438 + AK GI TIA + L A S + F ST ++ Sbjct: 192 YAKQKALLAKDNGIIFYTIALLKSTDDPVTSGPNILLKEMATTSDHHHFVLGSTGLSQIY 251 Query: 439 KIFRDRIGNEIFERVI 454 IG V Sbjct: 252 AAIVKEIGMASAYDVT 267 >gi|169829413|ref|YP_001699571.1| BatA [Lysinibacillus sphaericus C3-41] gi|168993901|gb|ACA41441.1| BatA [Lysinibacillus sphaericus C3-41] Length = 973 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 57/144 (39%), Gaps = 15/144 (10%) Query: 303 NDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 N + ++ + G + K + G+T I + A + + Sbjct: 735 NIVIETNTKATILGEGTTDAVLKKDLYKASKDKGATDIFAGIDIALTKFSNDTKTA---- 790 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 K IV+++DG+ ++ + I N AK QG++I T++ + LS Sbjct: 791 -------KAIVVVSDGKTSKS--KMIKAINDAKKQGVKIYTVSMGKKSQINDATLMQLST 841 Query: 422 CASPNSFFEANSTHELNKIFRDRI 445 + +++ A +L+++F+ I Sbjct: 842 -ETGGAYYHALDNLQLHQVFQKLI 864 >gi|311747444|ref|ZP_07721229.1| putative BatB protein [Algoriphagus sp. PR1] gi|126574803|gb|EAZ79174.1| putative BatB protein [Algoriphagus sp. PR1] Length = 321 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 54/173 (31%), Gaps = 35/173 (20%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 ++ + K + R+G F+ ++ L I T +N Sbjct: 107 LKELTKSFPSD---RIGLIIFSSEAFMQCPLTFDQSVLQLYIDGLN-TGLVPNFGTDLNA 162 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 ++ A D + EV K ++L++DGEN D E I + K+ G+++ Sbjct: 163 PLRIALDRFQNDESQEVK--------SKSVILISDGENFGDELENIGS--ELKNLGVKVF 212 Query: 402 TIAFSVNK-----------TQQEKA--------RYFLSNCA--SPNSFFEANS 433 + + + L A + +FE + Sbjct: 213 ALGIGTESGSTIPRGNGIVMDPQTGEPAQTVLDKRPLQQIAAETDGQYFEISD 265 >gi|33602243|ref|NP_889803.1| hypothetical protein BB3267 [Bordetella bronchiseptica RB50] gi|33576682|emb|CAE33759.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 571 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 72/211 (34%), Gaps = 27/211 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+++ +D+S K L++ AL ++ ++ D R+ + Sbjct: 207 PAVNLVLLIDTSGSMADRAKLPLLKSALRQLVTQMRAQD------RVAIVAYAGSAGLVL 260 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + G + GST ++ AY E +K+ + Sbjct: 261 PSTPGDRHAQ---ILAAIDGLQASGSTNGGAGLELAY------AEAAKGLVKDGVNR--- 308 Query: 371 IVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 IVL +DG+ + + + + +GI + T+ A + A S Sbjct: 309 IVLASDGDFNVGRTDLAQLKDYVGSQRKRGIALTTLGLGSGNYNDAMAMQLAN--AGDGS 366 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + +S + K+F +E+ ++ I K Sbjct: 367 YHYIDSLLQARKVF----ASELSATLLTIAK 393 >gi|221042208|dbj|BAH12781.1| unnamed protein product [Homo sapiens] Length = 900 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 72/216 (33%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQE 413 R+ I+LLTDG+ T +I N + + + F + + Sbjct: 370 G--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS--- 422 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 423 --YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|224012789|ref|XP_002295047.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220969486|gb|EED87827.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 818 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 60/144 (41%), Gaps = 21/144 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G+ +TI + GST A+ +++ +S+++ KK+I+L+ Sbjct: 690 GLSSADKTINVLDQL-IYSGGSTNHGQAINACQESLFTSDQNINR--------KKFIMLI 740 Query: 375 TDG----ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 TDG ++ + I AKS GI I I ++ A F+S+ ++ + Sbjct: 741 TDGVSATDDLNPEADAIDAAETAKSSGITI--IPIFISPYNDIDAVSFMSSLSNDGEVY- 797 Query: 431 ANSTHELNKIFRDRIGNEIFERVI 454 + L D + +++ E+V Sbjct: 798 VTNFDSL-----DSLKDQLVEQVS 816 >gi|291398577|ref|XP_002715569.1| PREDICTED: Epithelial chloride channel protein-like [Oryctolagus cuniculus] Length = 958 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 45/258 (17%), Positives = 79/258 (30%), Gaps = 29/258 (11%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI----RDEKLSPYMVSCNKSLYYML 244 M + +E N A + + D + + M N + + Sbjct: 295 MQNLDSVTEFCTANTHNTEAPNLQNKMCNHRSTWDVIMDSEDFQNASSMAGTNPPPHPIF 354 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + S S+ + + + A +I+ I++ V G F Sbjct: 355 SLLRAKQRVVCLVLDKSGSMDSEDRLLRMNQAAALYLIQIIERESLV------GMVTFES 408 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + + T + + G T+I ++ + I SN++ Sbjct: 409 TAKIQNNLT-KITDDDTYQKITANLPQVAGGGTSICSGLKAGFQAITYSNQNTSGSE--- 464 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 IVLLTDGE+ GI C + K G I TIA + K LS+ Sbjct: 465 ------IVLLTDGEDN-----GIHSCFEEVKQSGAIIHTIALGPSA---AKELEILSSMT 510 Query: 424 SPNSFFEANSTHELNKIF 441 F+ + L F Sbjct: 511 GGYRFYANKDINGLIDAF 528 >gi|159036783|ref|YP_001536036.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157915618|gb|ABV97045.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 319 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 66/197 (33%), Gaps = 25/197 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + + D +G F L I + Sbjct: 107 RLTAAKEAARRFVDGLP------DEFNVGLVAFAGSAAVLVPPDTDREALDEGIDRL-VE 159 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI +A+ T+ + + + + IVLL+DG NT + + Sbjct: 160 GATGVQGTAIGEAINTSLGAVKALDGEAAK-----DPPPARIVLLSDGANTSG-MDPMEA 213 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHELN 438 A + + + TIAF ++ L A + F EA+S EL Sbjct: 214 ATDAVAMDVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDEVARETGGQFHEADSAKELR 273 Query: 439 KIFRDRIGNEIFERVIR 455 ++ D IG+ + R R Sbjct: 274 AVY-DDIGSSVGYRTKR 289 >gi|78060312|ref|YP_366887.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] gi|77964862|gb|ABB06243.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] Length = 423 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 44/129 (34%), Gaps = 7/129 (5%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + G II L + VM+G G+ +D+ + L+ +A ++A+ L Sbjct: 12 RRSLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDL--- 68 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN----FTDREVRDIVRDTAVEMNPRKS 127 +S + + +N + N F+D + +AV Sbjct: 69 TSAISLSVAEADGIAAGHLNFVFFQNKSVQMSTNANVTFSDSLTDPFLTRSAVTTPSSIK 128 Query: 128 AYQVVLSSR 136 Q + Sbjct: 129 YVQCTATLS 137 >gi|315502365|ref|YP_004081252.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408984|gb|ADU07101.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 319 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 71/197 (36%), Gaps = 25/197 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + + D +G F S L I + A Sbjct: 107 RLTAAKEAGRRFVDGLP------DEFNVGLVAFAGSAAVLVPPSTDREALHDGIGRL-AE 159 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI +A+ T ++ + + K+ + I+ L+DG NT + + Sbjct: 160 GITGVQGTAIGEAIST---SLGAVKSLDATAAKDP-PPARIII-LSDGANTSG-MDPMEA 213 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHELN 438 ++A + + + TI+F ++ L A + F EA++T EL Sbjct: 214 ADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVDGQTLRAVAEQTGGGFHEASTTAELK 273 Query: 439 KIFRDRIGNEIFERVIR 455 ++ D IG + R R Sbjct: 274 DVYED-IGTSVGYRTER 289 >gi|302865820|ref|YP_003834457.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568679|gb|ADL44881.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 319 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 71/197 (36%), Gaps = 25/197 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + + D +G F S L I + A Sbjct: 107 RLTAAKEAGRRFVDGLP------DEFNVGLVAFAGSAAVLVPPSTDREALHDGIGRL-AE 159 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI +A+ T ++ + + K+ + I+ L+DG NT + + Sbjct: 160 GITGVQGTAIGEAIST---SLGAVKSLDATAAKDP-PPARIII-LSDGANTSG-MDPMEA 213 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHELN 438 ++A + + + TI+F ++ L A + F EA++T EL Sbjct: 214 ADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVDGQTLRAVAEQTGGGFHEASTTAELK 273 Query: 439 KIFRDRIGNEIFERVIR 455 ++ D IG + R R Sbjct: 274 DVYED-IGTSVGYRTER 289 >gi|167752251|ref|ZP_02424378.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] gi|167660492|gb|EDS04622.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] Length = 344 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 50/143 (34%), Gaps = 19/143 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + A+ + +K+ R+G F + + +++ + + + Sbjct: 111 RLERTKYAIDKLFDGLKQD-------RVGLVVFAGDAVVQLPITSD-YRMAKAFARRISP 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + T I A+ A + K + A + IVL+TDGE A Sbjct: 163 SMVSVQGTDIGQALSLATMSFSE---------KGDNPAGRVIVLITDGEGHDSGAIEAAE 213 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ 412 +A QGIRI TI + Sbjct: 214 --RAAEQGIRIFTIGIGTPEGAP 234 >gi|78186669|ref|YP_374712.1| hypothetical protein Plut_0797 [Chlorobium luteolum DSM 273] gi|78166571|gb|ABB23669.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 349 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/160 (23%), Positives = 59/160 (36%), Gaps = 8/160 (5%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+L G I+ AL +PV+LG + VD+ R L+ AA A + + L S Sbjct: 11 KRLQSERGGA-AILFALTLPVLLGFAALAVDLARIHLTRVELQNAADAAALGGARSLSDS 69 Query: 72 LEEV---SSRAKNSFTFPKQKIEE-YLIRNFENNL-KKNFTDREVRDIVRDTA-VEMNPR 125 S+ + ++ + I++ N D T V Sbjct: 70 GGNPYNWSAAGSAALDIARRNVANGAGIQDALIETGYWNIQDPSEGLRAPGTPGVPAAGD 129 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLR-SMGIKSWLIQTKAEA 164 +A QV ++ L PL LF +GI +Q + A Sbjct: 130 VAAVQVTITISRTLNNGPLRLFFAPVLGIAEQDVQGSSVA 169 >gi|328907235|gb|EGG27001.1| von Willebrand factor type A domain protein [Propionibacterium sp. P08] Length = 318 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 63/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R +V T Sbjct: 107 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASSQVVVPPT-----TDRAVVSTAI 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 156 ANLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 212 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAVAKASGGEKFSAESLGQL 271 Query: 438 NKIFRD 443 + +++ Sbjct: 272 SDVYKS 277 >gi|313837214|gb|EFS74928.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927768|gb|EFS91599.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314971985|gb|EFT16083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 320 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 63/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + R +V T Sbjct: 109 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASSQVVVPPT-----TDRAVVSTAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 158 ANLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 214 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAVAKASGGEKFSAESLGQL 273 Query: 438 NKIFRD 443 + +++ Sbjct: 274 SDVYKS 279 >gi|255550407|ref|XP_002516254.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] gi|223544740|gb|EEF46256.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] Length = 755 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 47/265 (17%), Positives = 84/265 (31%), Gaps = 42/265 (15%) Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS--- 252 S + + S SS + G+ + S + V Y L+PG Sbjct: 264 SYESEVLTWTDIDFAVSYSVSSSHIFGGVMLQSPSAHDVDQRDMFYLYLFPGDQPNMKVF 323 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 E F+ S K +++A++ + + D+ N FN S Sbjct: 324 RKEIVFIVDISGSMEGKPLEGMKNAMSGALAKLNPKDSFN------IIAFNGETYLFSSL 377 Query: 313 -SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 K + V+ ++ G T I+ + A + + ++ I Sbjct: 378 MELATEKTVERAVEWMNLNFIAGGGTNISVPLNQAMEMVSNTQGSLP-----------VI 426 Query: 372 VLLTDGENTQDNEEGIAICNKAKS----QGIR---IMTIAFSVNKTQQEKARYFLSNCAS 424 L+TDG E+ IC+ K +G I T YFL A+ Sbjct: 427 FLVTDGA----VEDERHICDSMKKYVRGKGAICPRIYTFGIGTYCN-----HYFLRMLAT 477 Query: 425 P-----NSFFEANSTHELNKIFRDR 444 ++ ++ +S +IF R Sbjct: 478 VCRGQYDAAYDVDSVQARMEIFFSR 502 >gi|254458905|ref|ZP_05072328.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084176|gb|EDZ61465.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 309 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 67/179 (37%), Gaps = 27/179 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +V+D +++ I+ ++ DN +G F ++ ++ IV Sbjct: 109 TRFDVVKDIVSNFIKE-RQNDN------IGLVVFGAYSFIASPLTYD-ENILNKIVSQLY 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 I M Y + +S V+ +K + K +LLTDG +T + ++ Sbjct: 161 I------------GMAGKYTALFTSLAQGVNLLKMSESKSKVGILLTDGFSTPEVDKIPF 208 Query: 389 --ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + A + I+I I + L A + F A S EL +++++ Sbjct: 209 DVALDMAIKEKIKIYPIGIGMPHEYN---IEVLRKIAEKTGGKAFGAASATELKEVYKE 264 >gi|308050057|ref|YP_003913623.1| hypothetical protein Fbal_2347 [Ferrimonas balearica DSM 9799] gi|307632247|gb|ADN76549.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799] Length = 457 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 72/213 (33%), Gaps = 21/213 (9%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K + G I+ + M +L +G + +D + L+ A A ++ +V + + Sbjct: 9 KGRRRQRGAVIIMITIAMFAILAMGALALDGGHLLLNKARLQNAVDAAALSGAVAIQKEY 68 Query: 73 EEVSSRAKNSFTFPKQKIEEYLI--RNFENNLKKNFTDREVRDIVR-------DTAVEMN 123 + + +R + TF + + + NF EV + D V + Sbjct: 69 DYLRARQEGLVTFTSALGAQDFAELNDRVSLSVLNFASDEVSPQITVEFSERPDPFVPVL 128 Query: 124 PRKSAY-QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + Y +V +S D+ LN + + + + + + + + + Sbjct: 129 TPGAQYVRVTVS---DVPLNNFFAQVMGVDKRVSAVAVAGPSTSTPQCSTDLLPMMVC-- 183 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 D D+ G PLN + ++ Sbjct: 184 ------AEDLGEDNFGYPLNKMMAMKISSQQNT 210 >gi|296393889|ref|YP_003658773.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296181036|gb|ADG97942.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 73/205 (35%), Gaps = 31/205 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + R A + ++ T+++G F + + S H++++ + Sbjct: 120 TRVDAARQAAIKFVDEME------PTLQLGLVTFAGTAQTLIAPSSD-HEIVKHALDEAI 172 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 + TA + + TA I + + + K IVL +DG+ T ++ Sbjct: 173 RPDKLAARTATGEGIYTALQQIETLSSILGGKSK---APSARIVLESDGKETVPDDLNAP 229 Query: 385 -EGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFLSNCA--SPNSFFE 430 +AK++ + I +I+F + L A S FF Sbjct: 230 RGAFTAAKEAKAKEVPIYSISFGTTRPIPYVNIQGSRVPVPADDASLQKVAELSGGKFFT 289 Query: 431 ANSTHELNKIFRD---RIGNEIFER 452 A S +L+ ++ IG ++ ++ Sbjct: 290 AGSLDQLSDVYSSLNAEIGYDLVKQ 314 >gi|292618048|ref|XP_699485.3| PREDICTED: integrin alpha-1-like [Danio rerio] Length = 1201 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 60/166 (36%), Gaps = 19/166 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + D V + S + I + T + TA Sbjct: 202 RVGIVSYGDDVGHVFNLS-QFSNTKELVKNAADIRQRTGHKTMTALGIDTARKEAF---- 256 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + + KK +V++TDGE + D+ ++ ++ + GI +A + +Q K+ Sbjct: 257 TVERGARPGV--KKVMVIVTDGE-SHDHHNLKSVIDQCQEDGIERFAVAVLGDYNRQNKS 313 Query: 416 RYFLS-------NCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 + AS + FF + L I D +G++IF Sbjct: 314 IDEIKKFIEEIEYIASETKSDHFFNVSDERALVTI-VDTLGSKIFA 358 >gi|118353830|ref|XP_001010180.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291947|gb|EAR89935.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 544 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 56/180 (31%), Gaps = 24/180 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ L ++ + D R+ FN + + K Sbjct: 137 KIQNVKKTLEYLLELLGDND------RLCLILFNSYSTRLCHLM-KTNNSNKPAFKEIIN 189 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T IN M+ A+ + + + I LL+DG++ + Sbjct: 190 KIQATGGTDINSGMELAFRVL---------KERKYQNPVSSIFLLSDGQDGSADLRVRQS 240 Query: 390 CNK-AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIG 446 + + I + F + ++ S +F+ +++++ F D +G Sbjct: 241 LERHLPQECFTIHSFGFGSDHDGP-----LMNKICSLKDGNFYYVEKINQVDEFFVDALG 295 >gi|125548980|gb|EAY94802.1| hypothetical protein OsI_16587 [Oryza sativa Indica Group] Length = 708 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 61/191 (31%), Gaps = 33/191 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ V++ + D R+ F+ H + ++ Sbjct: 289 TKLALLKRAMGFVVQHLGPSD------RLSVIAFSSSARRLFHLQRISHHGRQQALQAI- 341 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A I ED ++ I+LL+DG++T + + Sbjct: 342 NSLGASGGTNIADALKKAMKVI----EDRSYKNSVCS-----IILLSDGQDTYNISSSVQ 392 Query: 389 -------------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 I N A+ + + F + + AS +F Sbjct: 393 GASPDYKSLVPSSIINDARHT-VPLHAFGFGADH-DSDSLHSIAQ--ASGGTFSFIEDEG 448 Query: 436 ELNKIFRDRIG 446 + F IG Sbjct: 449 VMQDAFAQCIG 459 >gi|309266594|ref|XP_003086799.1| PREDICTED: collagen alpha-5(VI) chain-like [Mus musculus] Length = 2601 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 56/157 (35%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 832 VKKADIGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 889 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + + + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 890 HS-NILF----TEEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 943 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + ++ +L I+ Sbjct: 944 V-----GNANQDELETMAGKKENTVHVDNFDKLRDIY 975 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 73/216 (33%), Gaps = 20/216 (9%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 VS + G +D ++ +F+ S K+ ++ ++SVI N Sbjct: 412 VSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRKKEFEQIQVFMSSVIDMFPIGPNK-- 469 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 VR+G ++ + + S + + K + G T A+ I Sbjct: 470 -VRVGVVQYSHKNEVEFPVSRYTDGI--DLKKAVFNIKQLKGLTFTGKALDFILPLIKKG 526 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + R Y+++LTDG + N+ + N+ +++ I I I Q Sbjct: 527 KTERTDRAPC------YLIVLTDG---KSNDSVLEPANRLRAEQITIHAIGIGEANKTQ- 576 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 L A + + K ++ I + I Sbjct: 577 -----LRQIAGKDERVNFGQNFDSLKSIKNEIVHRI 607 >gi|301604858|ref|XP_002932077.1| PREDICTED: collagen alpha-1(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 1025 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 34/149 (22%), Positives = 62/149 (41%), Gaps = 24/149 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 GA ++D VI S + + L + I + T + A++ + ++ Sbjct: 118 GALHYSDEVIMISSLTRDMKTLRDDVETVEYIGK----GTHTDCAIKRGIEEVLIGG--- 170 Query: 358 VHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIA-ICNKAKSQGIRIMTIAFSVNKTQQE 413 ++ + KY++++TDG E ++ G+ N+AK GI++ ++A S N + Sbjct: 171 -----SHQKENKYLIVVTDGHPLEGYKEPCGGLEDAANEAKHLGIKVFSVAISPNHLEPR 225 Query: 414 KARYFLSNCASPNSF---FEANSTHELNK 439 LS AS S F A S L Sbjct: 226 -----LSVIASDASHRRNFTATSAAGLTD 249 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 81/230 (35%), Gaps = 29/230 (12%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 C + Y GP D +L + + +H K V+ +++ K + + Sbjct: 815 CPDYSCPITYEGPADITLLVDSSTRVGN-QHFQTSKSFVKLLAQRFLKA--KAPP-SGSA 870 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID--ENEMGSTAINDAMQTAYDTIISS 353 R+ ++ + + V +D + G+T + A++ A + Sbjct: 871 RVSVVQYSGLNQQKVEAQF----VSNYTVLEVPVDNMQFINGATNVVSALR-AVTELY-- 923 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 ED + + KK +V +DG NTQD ++ + A++ GI I +A ++ Sbjct: 924 REDSLAGVS-----KKLLVF-SDG-NTQDEKDMLKAVQDARAAGIEIYVLAVG-SRLNYP 975 Query: 414 KARYFLSNCASP-------NSFFEANSTHELNKIFR-DRIGNEIFERVIR 455 + L+ A+ F L + R I I + + Sbjct: 976 NLQVMLTGSAADITAPFPEERLFRVPDYPSLLQGVRYQSISRRISLKSSQ 1025 >gi|148689167|gb|EDL21114.1| mCG140659 [Mus musculus] Length = 1670 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 56/157 (35%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 620 VKKADIGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 677 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + + + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 678 HS-NILF----TEEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 731 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + ++ +L I+ Sbjct: 732 V-----GNANQDELETMAGKKENTVHVDNFDKLRDIY 763 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 73/216 (33%), Gaps = 20/216 (9%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 VS + G +D ++ +F+ S K+ ++ ++SVI N Sbjct: 200 VSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRKKEFEQIQVFMSSVIDMFPIGPNK-- 257 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 VR+G ++ + + S + + K + G T A+ I Sbjct: 258 -VRVGVVQYSHKNEVEFPVSRYTDGI--DLKKAVFNIKQLKGLTFTGKALDFILPLIKKG 314 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + R Y+++LTDG + N+ + N+ +++ I I I Q Sbjct: 315 KTERTDRAPC------YLIVLTDG---KSNDSVLEPANRLRAEQITIHAIGIGEANKTQ- 364 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 L A + + K ++ I + I Sbjct: 365 -----LRQIAGKDERVNFGQNFDSLKSIKNEIVHRI 395 >gi|108759903|ref|YP_633800.1| BatB protein [Myxococcus xanthus DK 1622] gi|108463783|gb|ABF88968.1| batB protein [Myxococcus xanthus DK 1622] Length = 343 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 15/117 (12%), Positives = 41/117 (35%), Gaps = 12/117 (10%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ ++ + G T + A++ + + +++ Sbjct: 134 RVGLVVFAGDAFIQSPLTSDYS-AVKLFLRAVDPEVMPQGGTNVGAALRLSRQVLENADR 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 ++ +VLLTDGE+ + K G++++ + + Sbjct: 193 GSK---------ERVVVLLTDGEDLVGDVA--EATEALKDSGVQVLAVGVGSESGEP 238 >gi|115459346|ref|NP_001053273.1| Os04g0508800 [Oryza sativa Japonica Group] gi|32489531|emb|CAE04734.1| OSJNBa0043L24.22 [Oryza sativa Japonica Group] gi|113564844|dbj|BAF15187.1| Os04g0508800 [Oryza sativa Japonica Group] gi|116310776|emb|CAH67569.1| OSIGBa0101P20.12 [Oryza sativa Indica Group] gi|125590953|gb|EAZ31303.1| hypothetical protein OsJ_15416 [Oryza sativa Japonica Group] Length = 708 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 61/191 (31%), Gaps = 33/191 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ V++ + D R+ F+ H + ++ Sbjct: 289 TKLALLKRAMGFVVQHLGPSD------RLSVIAFSSSARRLFHLQRISHHGRQQALQAI- 341 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A I ED ++ I+LL+DG++T + + Sbjct: 342 NSLGASGGTNIADALKKAMKVI----EDRSYKNSVCS-----IILLSDGQDTYNISSSVQ 392 Query: 389 -------------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 I N A+ + + F + + AS +F Sbjct: 393 GASPDYKSLVPSSIINDARHT-VPLHAFGFGADH-DSDSLHSIAQ--ASGGTFSFIEDEG 448 Query: 436 ELNKIFRDRIG 446 + F IG Sbjct: 449 VMQDAFAQCIG 459 >gi|28374313|gb|AAH45465.1| Matn1 protein [Danio rerio] Length = 507 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 62/165 (37%), Gaps = 12/165 (7%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + G + ++ F+ SS + V+ LA VI + + R+G Sbjct: 43 AAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVKVFLAKVIDGL-SVGPDA--TRVGVVN 99 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + RV ++ S HK +VK + E T A+Q A + S E Sbjct: 100 YASRVKNEVSLK--SHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAE----GG 153 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K +++TDG + + I +A+ GI I I Sbjct: 154 RKSPDISKVAIIVTDG---RPQDNIRDIAARAREAGIEIFAIGVG 195 >gi|308175402|ref|YP_003922107.1| hypothetical protein BAMF_3511 [Bacillus amyloliquefaciens DSM 7] gi|307608266|emb|CBI44637.1| conserved hypothetical protein YwmC [Bacillus amyloliquefaciens DSM 7] gi|328555380|gb|AEB25872.1| hypothetical protein BAMTA208_18610 [Bacillus amyloliquefaciens TA208] gi|328913751|gb|AEB65347.1| hypothetical protein LL3_03821 [Bacillus amyloliquefaciens LL3] Length = 229 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 69/184 (37%), Gaps = 15/184 (8%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF--FNDRVISDPSFSWGVHKL 319 + + K ++ +D + IK V TV G+ N + GV+ Sbjct: 50 AKRIDGVSKYNMAKDEIVRFADQIKSKSQVRMTV-FGSEGNNKNSGKVQSCESIRGVYGF 108 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGE 378 R ++F N +G T I + ED + K + LLTDGE Sbjct: 109 QRFDKQSFLNSLNGIGPTGWTP--------IAKALEDAKASFNGVHKLGSKSVYLLTDGE 160 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 T + I + + Q I++ I F N+ + A ++EA+S ++N Sbjct: 161 ETCGG-DPIKTAKELRKQHIKVNVIGFDFNEGFNGQLHAIAG--AGGGKYYEAHSQKDMN 217 Query: 439 KIFR 442 +IF+ Sbjct: 218 RIFK 221 >gi|225418703|ref|ZP_03761892.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] gi|225041758|gb|EEG52004.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] Length = 1360 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 43/267 (16%), Positives = 80/267 (29%), Gaps = 33/267 (12%) Query: 184 DFSRSMLDY--QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 FS M DY + S + + DE S V + Sbjct: 477 QFSGYMTDYVSMQRSAVNISSLDASAFSEITAYVQIETPVDYSIDELKSHITVEDCGAQI 536 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 ++ S + S + R A+ S+ S+ + R+G Sbjct: 537 SEYNLEKVEYSSANMLLCCDVSGSMQGRPIEDSRAAVISMAESM------SGNARLGVIL 590 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 FN V F+ + ++++ A G T I D + ++ Sbjct: 591 FNSSVQGLTDFT-----VQPDVIRSTAESMTANGGTNIFDTVVHGLESF----------P 635 Query: 362 KNNLEAKKYIVLLTDG-ENTQDNEEGIAIC--NKAKSQGIRIMTIAFSVNKTQQEKARYF 418 KN E +V+++DG EN + E I AK + I + + + Sbjct: 636 KNGPEVLNTLVVMSDGQENNAHSAEEIQTAIGQAAKDKSILVHCLGLGSEVDA-----NY 690 Query: 419 LSNCASP--NSFFEANSTHELNKIFRD 443 L A ++ + L +++ Sbjct: 691 LQTIAQSAGGTYQYVTDSSSLAVFYQN 717 >gi|194220813|ref|XP_001500011.2| PREDICTED: vitrin [Equus caballus] Length = 662 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 76/234 (32%), Gaps = 21/234 (8%) Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 + S N + K +V + S +D SS + Sbjct: 437 TNGFYSLNVQSWFSLHKTVQPLVKRVCDTERLACSKTCFNSADIGFVIDGSSSVGTSNFR 496 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +++ +A++ + + D R+GA + F + + I+ Sbjct: 497 TVLQF-VANLSKEFEISDTD---TRIGAVQYT--YEQRLEFGFDDYNTKPDILNAIKRVG 550 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T+ A+ A + + ++ +K ++L+TDG + ++ Sbjct: 551 YWSGGTSTGAAINYALEQLFKKSKPNK---------RKLMILITDG---RSYDDVRIPAM 598 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A +G+ I + Q++ ++ A +SFF + L K I Sbjct: 599 VAHHKGVITYAIGVA--WAAQDELEVIATHPAKDHSFF-VDEFDNLYKSVPKII 649 >gi|107028246|ref|YP_625341.1| hypothetical protein Bcen_5496 [Burkholderia cenocepacia AU 1054] gi|116687157|ref|YP_840404.1| hypothetical protein Bcen2424_6782 [Burkholderia cenocepacia HI2424] gi|105897410|gb|ABF80368.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054] gi|116652872|gb|ABK13511.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424] Length = 423 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 58/175 (33%), Gaps = 7/175 (4%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++ + G II L + VM+G G+ +D+ + L+ +A ++A+ L Sbjct: 11 TRRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS 70 Query: 71 SLEEVSSRAKNSFT-----FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 ++ + A QK + N + T+ + T + Sbjct: 71 AISLSVAEADGIAAGHLNFVFFQKTSVQMSTNANVTFSDSLTNPFLTKNAVTTPANIKYV 130 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + + + + + L++ + + + A A V + Sbjct: 131 QCTATLSNIAHWFIEV--LNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVC 183 >gi|226306560|ref|YP_002766520.1| hypothetical protein RER_30730 [Rhodococcus erythropolis PR4] gi|226185677|dbj|BAH33781.1| conserved hypothetical membrane protein [Rhodococcus erythropolis PR4] Length = 326 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 73/201 (36%), Gaps = 33/201 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S D + + +G F S + R K Sbjct: 109 TRLAAAQEAAKSF------ADGLTPGINLGLVAFAGTASVLVSPT-----TNRDATKVAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + TA +A+ + ++ S + ++ IVLL+DG EN+ D Sbjct: 158 DNLKLSERTATGEAI---FTSLQSIDTLSAVLGGSDQAPPARIVLLSDGKQTVPENSDDP 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNCA--SPNSFFEAN 432 G +AK +G+ I TI+F + E + L A S SFF A+ Sbjct: 215 RGGFTAARQAKDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIANLSGGSFFTAS 274 Query: 433 STHELNKIF---RDRIGNEIF 450 S EL +++ ++IG E Sbjct: 275 SLEELRQVYDTLEEQIGFETT 295 >gi|162147499|ref|YP_001601960.1| hypothetical protein GDI_1715 [Gluconacetobacter diazotrophicus PAl 5] gi|161786076|emb|CAP55658.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 571 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 35/75 (46%), Gaps = 2/75 (2%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-ARYFLSNCAS-PNSFFEANSTHELNKI 440 N +C+ K+ GI I I ++ + + + L NCAS P ++++A + + + Sbjct: 497 NSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNYYDAPTAASMKQA 556 Query: 441 FRDRIGNEIFERVIR 455 F D G R+ + Sbjct: 557 FSDLGGQLSALRISQ 571 >gi|56797849|emb|CAF33009.1| matrilin-1 [Danio rerio] Length = 320 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 62/165 (37%), Gaps = 12/165 (7%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + G + ++ F+ SS + V+ LA VI + + R+G Sbjct: 25 AAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVKVFLAKVIDGL-SVGPDA--TRVGVVN 81 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + RV ++ S HK +VK + E T A+Q A + S E Sbjct: 82 YASRVKNEVSLK--SHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAE----GG 135 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K +++TDG + + I +A+ GI I I Sbjct: 136 RKSPDISKVAIIVTDG---RPQDNIRDIAARAREAGIEIFAIGVG 177 >gi|269315863|ref|NP_001161395.1| collagen alpha-5(VI) chain precursor [Mus musculus] Length = 2640 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 56/157 (35%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 873 VKKADVGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 930 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + + + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 931 HS-NVLF----TEEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 984 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + ++ +L I+ Sbjct: 985 V-----GNANQDELETMAGKKENTVHVDNFDKLRDIY 1016 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 78/239 (32%), Gaps = 20/239 (8%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + S +G + VS + G +D ++ +F+ S K+ Sbjct: 430 SYSHLESYSGNFLKKIRNEIWTQVSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRKKE 489 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ ++SVI N VR+G ++ + + S + + K Sbjct: 490 FEQIQIFMSSVIDMFPIGPNK---VRVGVVQYSHKNEVEFPVSRYTDGI--DLKKAVFNI 544 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T A+ I + R Y+++LTDG + N+ + Sbjct: 545 KQLKGLTFTGKALDFILPLIKKGKTERTDRAPC------YLIVLTDG---KSNDSVLEPA 595 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 N+ +++ I I I Q L A + + K ++ I + I Sbjct: 596 NRLRAEQITIHAIGIGEANKTQ------LRQIAGKDERVNFGQNFDSLKSIKNEIVHRI 648 >gi|189082901|sp|A6H584|CO6A5_MOUSE RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; Flags: Precursor Length = 2640 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 56/157 (35%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 873 VKKADVGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 930 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + + + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 931 HS-NVLF----TEEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 984 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + ++ +L I+ Sbjct: 985 V-----GNANQDELETMAGKKENTVHVDNFDKLRDIY 1016 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 78/239 (32%), Gaps = 20/239 (8%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + S +G + VS + G +D ++ +F+ S K+ Sbjct: 430 SYSHLESYSGNFLKKIRNEIWTQVSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRKKE 489 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ ++SVI N VR+G ++ + + S + + K Sbjct: 490 FEQIQIFMSSVIDMFPIGPNK---VRVGVVQYSHKNEVEFPVSRYTDGI--DLKKAVFNI 544 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T A+ I + R Y+++LTDG + N+ + Sbjct: 545 KQLKGLTFTGKALDFILPLIKKGKTERTDRAPC------YLIVLTDG---KSNDSVLEPA 595 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 N+ +++ I I I Q L A + + K ++ I + I Sbjct: 596 NRLRAEQITIHAIGIGEANKTQ------LRQIAGKDERVNFGQNFDSLKSIKNEIVHRI 648 >gi|170740935|ref|YP_001769590.1| hypothetical protein M446_2717 [Methylobacterium sp. 4-46] gi|168195209|gb|ACA17156.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 50/483 (10%), Positives = 118/483 (24%), Gaps = 105/483 (21%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + +G ++ + + +L + G VD + L A A++ A Sbjct: 17 RVFAADRSGSIGMMFVVTLVPVLLLVGAAVDFTSYQKARTELDAVADQAVLAAVSAAGMK 76 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + A + F + R ++ Sbjct: 77 MSQADAEAAMAKLFTDAA--------------AALPNVSASPRAATAPTTDGVRTASLTY 122 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + R + +R G + A T + + ++D S SM Sbjct: 123 SATIR--------TGIMRLAGFSTV-----AFGGTATAASPNPIFTDFYLLLDNSPSMGV 169 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 ++ + D Sbjct: 170 AATTADIATMVANTSD-----------------------------------QCAFACHDM 194 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND-----RV 306 S + + V + +VRDA ++ + RM F + Sbjct: 195 SAGGNDYYAKAKNLGVKMRIDVVRDATQQLMDTASAKAIAAGQYRMAIYSFGTSCSGIGL 254 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDE-----NEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + + + + N T + + + Sbjct: 255 NQVSALTANLSTSKTDAGALDLMTVPYQNYNNDQCTDFDGIFARLNSAVPNPGSGA---- 310 Query: 362 KNNLEAKKYIVLLTDG--ENTQDNEE-------------GIAICNKAKSQGIRI---MTI 403 + +K + ++DG + + +A C K +GIR+ T Sbjct: 311 -SAASPQKVVFFVSDGVADANYPSTCTKPTTNGRCQEPITLANCQALKDRGIRVAVLYTT 369 Query: 404 AFSVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + ++ CASP+ ++ + + + + ++ + Sbjct: 370 YLPLPTNGWYNTWIAPFSSQIATNMAACASPDLYWPVSPSEGIADAMKGLF-KKVVDSQR 428 Query: 455 RIT 457 RIT Sbjct: 429 RIT 431 >gi|260461186|ref|ZP_05809435.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259033220|gb|EEW34482.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 523 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 50/154 (32%), Gaps = 38/154 (24%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--------------- 380 T +++ + + + K +VLLTDGEN Sbjct: 372 GTNVSEGLSWGMRVLSPAPPYTDGAPWKTPNTSKIVVLLTDGENVVYGASAEPEKSDYTS 431 Query: 381 -------------------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + + +C+K K+Q ++I TI ++ R Sbjct: 432 YGYLSSGRFGTSNQTDAARSVDRWTLDVCDKLKAQQVQIYTITL---QSDTAANRTLYGK 488 Query: 422 CAS-PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 CA+ P ++ N +L +F+ G +++ Sbjct: 489 CATNPADYYAVNDPSKLPNVFQTIAGKFTTLQLV 522 >gi|189518186|ref|XP_001331201.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Danio rerio] Length = 969 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 51/221 (23%), Positives = 79/221 (35%), Gaps = 27/221 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P L FV +S + K + AL ++I ++ DN N Sbjct: 298 DGHFVHYFAPRDLPVVPKNVVFVIDTSASMLGTKMKQTKQALFTIINELRPNDNFN---- 353 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTI------VKTFAIDENEMGSTAINDAMQT--AYD 348 F++R+ W KL+ K F + G T IN +QT A Sbjct: 354 --FVTFSNRIRV-----WQPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALL 406 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAF 405 + S++DE H +L I+ LTDG T + I + K+ + TI Sbjct: 407 SDYLSSKDESHHHSVSL-----IIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGM 461 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + R L NC + E + K F D IG Sbjct: 462 GDDVDYRLLERMSLDNCGTMRRIPEDADASLMLKGFYDEIG 502 >gi|329744607|ref|NP_001193278.1| inter-alpha-trypsin inhibitor heavy chain H3 [Sus scrofa] Length = 889 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 52/317 (16%), Positives = 96/317 (30%), Gaps = 39/317 (12%) Query: 147 FLRSMGIKSWLIQTK------AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 GI + + + K+ VS + +D RS Sbjct: 188 IFEPQGISTLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------C 236 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 C N + + ++ Y V + P L FV Sbjct: 237 PTCTDSLLKGDFIITYDVNRESPANVQIVNGYFV-------HFFAPQGLPVVPKNVVFVI 289 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF-FNDRVISDPSFSWGVHKL 319 S +K RDAL ++ IK+ D +N + G + D ++ + Sbjct: 290 DVSGSMYGRKMEQTRDALLKILDDIKEDDYLNFVLFSGDVTTWKDSLVQATP-----ENI 344 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + F + + G T IND + T + + E H K + I++LTDG+ Sbjct: 345 QKA--REFVRNIRDQGMTNINDGLLTGISMLNKARE--EH--KVPERSTSIIIMLTDGDA 398 Query: 380 TQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + I ++ + + F N L N +E + + Sbjct: 399 NMGVSKPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANL 458 Query: 437 LNKIFRDRIGNEIFERV 453 + F + + N + V Sbjct: 459 QLQGFYEEVANPLLTSV 475 >gi|157818579|ref|NP_001100919.1| calcium-activated chloride channel regulator 1 [Rattus norvegicus] gi|149026146|gb|EDL82389.1| chloride channel calcium activated 3 (predicted) [Rattus norvegicus] Length = 910 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 42/116 (36%), Gaps = 17/116 (14%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ + R ++ + G T+I +Q A+ +I + Sbjct: 346 GMVTFDSTAYVQSELTQLNSGADRDLL-IKRLPTVASGGTSICSGLQAAFTSIKKKYPTD 404 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ 412 IVLLTDGE+ I+ C + K+ G I T+A + ++ Sbjct: 405 GAE----------IVLLTDGEDN-----TISSCFDLVKNSGAIIHTVALGPSAAKE 445 >gi|313838674|gb|EFS76388.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] Length = 320 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 63/186 (33%), Gaps = 26/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + NV+ F + + R V T Sbjct: 109 SRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVVVAPT-----TDRAAVSTAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG T + Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGA-TNVGRPSLE 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPNSFFEANSTHEL 437 +A Q + + TIA+ + Y L+ AS F A S +L Sbjct: 214 AAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQL 273 Query: 438 NKIFRD 443 + +++ Sbjct: 274 SDVYKS 279 >gi|294678572|ref|YP_003579187.1| hypothetical protein RCAP_rcc03056 [Rhodobacter capsulatus SB 1003] gi|294477392|gb|ADE86780.1| conserved hypothetical protein [Rhodobacter capsulatus SB 1003] Length = 647 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 72/416 (17%), Positives = 139/416 (33%), Gaps = 80/416 (19%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L+++ G I++ + VML G+ +D+VR ++ A++ A+ Sbjct: 23 RLLRNEDGALIILSLQVFLVMLITTGIAIDLVRVEERRTLIQNTIDRAVLAAAS------ 76 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 P +++YL + + D VE + +V Sbjct: 77 -------LTQKRDPTLVVKDYLTKAGLGYI--------ASDSSFTPKVEGSIALGWRRVS 121 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + D+ ++F +G+ S A + + V I V+D S SM +Y Sbjct: 122 VEVDDDMP----TIFGPLLGVSSL------AATGDTTAMQAVGNVEISLVLDLSGSMTEY 171 Query: 193 QRDSEGQPLNC--------FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY--- 241 +D+ NC + Q A ++ + + G+ + S +V + ++Y Sbjct: 172 VKDNPSCTKNCTSSKTRFQYLQVAAKSFINTVFASSGSGVAAGRTSVSVVPYSTNVYLGS 231 Query: 242 -----YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI-----KKIDNV 291 Y L S + + + +V D + R++ K D++ Sbjct: 232 EMQEGYTLSSDFSVTGSSFAMPQCADFVANDYN--TMVIDGTGPLTRTMYGSSYKYSDSL 289 Query: 292 NDTVRMGATFFNDRVI----------SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + V G+T N S L F G T+I+ Sbjct: 290 SALVSDGSTSNNPGQDWHNCMNTPQNRVIPLSSDPTFLAAD-KTGFIDKLTAGGWTSIDV 348 Query: 342 AMQTAYDTIISSNEDEVHRMKN------NLEAK---------KYIVLLTDGENTQD 382 + + S DEV +M + + + K +VL+TDG NT + Sbjct: 349 GAKWGLALLDPSARDEVAKMTSVSSAFRETKPRPINYDGDTMKVLVLMTDGANTTN 404 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 31/75 (41%), Gaps = 7/75 (9%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + +C+ AKS+GI I ++A + L C+S S++ L+ F Sbjct: 579 DARTKKLCDLAKSKGIYIFSVA----ADAPSGGKTLLKYCSSGTSYYYEVQGSNLSTAFA 634 Query: 443 DRIGNEIFERVIRIT 457 I I +R+T Sbjct: 635 S-IAASISS--LRLT 646 >gi|218666515|ref|YP_002427074.1| hypothetical protein AFE_2697 [Acidithiobacillus ferrooxidans ATCC 23270] gi|218518728|gb|ACK79314.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 590 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 33/79 (41%), Gaps = 1/79 (1%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G II A++MP+M+ +D+ +Y + L++ A A I + + + Sbjct: 11 RAGRGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQ 70 Query: 73 E-EVSSRAKNSFTFPKQKI 90 + KN +I Sbjct: 71 SLATGNAVKNGLQTSSTQI 89 >gi|229493542|ref|ZP_04387327.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229319503|gb|EEN85339.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 326 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 73/201 (36%), Gaps = 33/201 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S D + + +G F S + R K Sbjct: 109 TRLAAAQEAAKSF------ADGLTPGINLGLVAFAGTASVLVSPT-----TNRDATKVAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + TA +A+ + ++ S + ++ IVLL+DG EN+ D Sbjct: 158 DNLKLSERTATGEAI---FTSLQSIDTLSAVLGGSDQAPPARIVLLSDGKQTVPENSDDP 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNCA--SPNSFFEAN 432 G +AK +G+ I TI+F + E + L A S SFF A+ Sbjct: 215 RGGFTAARQAKDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIANLSGGSFFTAS 274 Query: 433 STHELNKIF---RDRIGNEIF 450 S EL +++ ++IG E Sbjct: 275 SLEELRQVYDTLEEQIGFETT 295 >gi|193216292|ref|YP_001997491.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193089769|gb|ACF15044.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 346 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/119 (21%), Positives = 45/119 (37%), Gaps = 8/119 (6%) Query: 296 RMGATFFNDRVISDPSFSWGVH--KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G F + + KL IV T AI T + A++ + + Sbjct: 129 RVGLVVFAGQSFVQCPITSDKSALKLFMDIVSTDAIPTQ---GTNFSSAIRESIRALERI 185 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 E KN + K ++ +DGE+ + + + +A S+ IRI T+ + Sbjct: 186 EEGAEAEEKNRVRNKVILIF-SDGEDHEAGID--EVLEEAASKNIRIYTVGVGSAEPTP 241 >gi|170734866|ref|YP_001773980.1| hypothetical protein Bcenmc03_6370 [Burkholderia cenocepacia MC0-3] gi|169820904|gb|ACA95485.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3] Length = 423 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 56/174 (32%), Gaps = 5/174 (2%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++ + G II L + VM+G G+ +D+ + L+ +A ++A+ L Sbjct: 11 TRRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDL-T 69 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD----IVRDTAVEMNPRK 126 S +S + + ++ + + N T + N + Sbjct: 70 SAISLSVAEADGIAAGHLNFVFFQKKSVQMSTNANVTFSDSLTNPFLTKNAVTTPANIKY 129 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 LS+ + L++ + + + A A V + Sbjct: 130 VQCTATLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVC 183 >gi|209545606|ref|YP_002277835.1| hypothetical protein Gdia_3496 [Gluconacetobacter diazotrophicus PAl 5] gi|209533283|gb|ACI53220.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 568 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 35/75 (46%), Gaps = 2/75 (2%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-ARYFLSNCAS-PNSFFEANSTHELNKI 440 N +C+ K+ GI I I ++ + + + L NCAS P ++++A + + + Sbjct: 494 NSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNYYDAPTAASMKQA 553 Query: 441 FRDRIGNEIFERVIR 455 F D G R+ + Sbjct: 554 FSDLGGQLSALRISQ 568 >gi|198284406|ref|YP_002220727.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198248927|gb|ACH84520.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] Length = 596 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 33/79 (41%), Gaps = 1/79 (1%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G II A++MP+M+ +D+ +Y + L++ A A I + + + Sbjct: 17 RAGRGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQ 76 Query: 73 E-EVSSRAKNSFTFPKQKI 90 + KN +I Sbjct: 77 SLATGNAVKNGLQTSSTQI 95 >gi|296200542|ref|XP_002747689.1| PREDICTED: matrilin-4 [Callithrix jacchus] Length = 770 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 69/208 (33%), Gaps = 26/208 (12%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 +S + GPLD FV SS + +R L ++R + N R+ Sbjct: 21 QSAGPRCHTGPLDL-----VFVIDSSRSVRPFEFETMRQFLVGLVRGLNVGPNA---TRV 72 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G ++ +V S +V+ T A+Q A + S E Sbjct: 73 GVIQYSSQVQSVFPLR--AFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGA 130 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + V++TDG + + + +A+++GI I + Q Sbjct: 131 R---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGIEIYAVGV------QRADVG 178 Query: 418 FLSNCASP---NSFFEANSTHELNKIFR 442 L ASP F S +L + F Sbjct: 179 SLRAMASPPLEEHVFLVESF-DLIQEFG 205 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 402 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 456 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 457 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 510 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 511 RAKEEGIVMYAVGVGKAVEAE------LREIAS 537 >gi|149412375|ref|XP_001507696.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 691 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 43/396 (10%), Positives = 116/396 (29%), Gaps = 52/396 (13%) Query: 53 LKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 + QA + ++Q ++ ++ + ++ + + + N Sbjct: 332 MVQALDVGPAGPLLGIVQYGDDPTTEFNLKTHTNSRDLKAAIEKITQKGGLSNVGRALSF 391 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 + R A V + + G + ++ + Sbjct: 392 VNKNFFSDANGNRGGAANVAV--------------VMVDGWPTDRVEESSR-------LA 430 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 +E G++I ++ + + Q E ++ + + L+ Sbjct: 431 RESGINIFFITIEGAAESEKQNVVEPNFVDKAVCRRNGFYSLNVPSWFGLQKVARPLAKR 490 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + + + S +D SS + +++ +A++ + + D Sbjct: 491 VCDT----HRLACSKTCLNSADVGFVIDGSSSVGTGNFRTVLQF-VANISKEFEVSDTD- 544 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+GA + F + H+ ++ G T+ A++ A + + Sbjct: 545 --TRVGAVQYT--YEQRLEFGFDQHRTKSDLLSAIKRVNYWSGGTSTGAAIRYALERLFE 600 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 ++ +K ++++TDG + ++ A +G+ I + Sbjct: 601 KSKPNK---------RKLMIVITDG---RSYDDVRIPALAAHRKGVITYAIGIT------ 642 Query: 413 EKARYFLSNCASP---NSFFEANSTHELNKIFRDRI 445 A+ L AS + F + L I Sbjct: 643 WAAQDELEVMASDPDKDHAFFVDEFDNLYTFVPQII 678 >gi|134093121|gb|ABO52981.1| matrilin 4 isoform 1 precursor [Callithrix jacchus] Length = 580 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 69/208 (33%), Gaps = 26/208 (12%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 +S + GPLD FV SS + +R L ++R + N R+ Sbjct: 21 QSAGPRCHTGPLDL-----VFVIDSSRSVRPFEFETMRQFLVGLVRGLNVGPNA---TRV 72 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G ++ +V S +V+ T A+Q A + S E Sbjct: 73 GVIQYSSQVQSVFPLR--AFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGA 130 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + V++TDG + + + +A+++GI I + Q Sbjct: 131 R---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGIEIYAVGV------QRADVG 178 Query: 418 FLSNCASP---NSFFEANSTHELNKIFR 442 L ASP F S +L + F Sbjct: 179 SLRAMASPPLEEHVFLVESF-DLIQEFG 205 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/135 (16%), Positives = 48/135 (35%), Gaps = 11/135 (8%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 361 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 415 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 416 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 469 Query: 392 KAKSQGIRIMTIAFS 406 +AK +GI + + Sbjct: 470 RAKEEGIVMYAVGVG 484 >gi|285808587|gb|ADC36107.1| putative chloride channel [uncultured bacterium 126] Length = 869 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/147 (21%), Positives = 55/147 (37%), Gaps = 22/147 (14%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G FND D G + R + G TAI A++ AYD + + Sbjct: 470 LGVLTFNDASNWDIPL--GRVRESRPELHDAIGRIKASGPTAIFPALRNAYDALANVRVR 527 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 H ++LL+DG++ ++ EG+ K + I + T+A + Sbjct: 528 AKH-----------VILLSDGQSDPEDFEGLV--RKMSAAHITVSTVAL-----GPDADA 569 Query: 417 YFLSNCAS--PNSFFEANSTHELNKIF 441 L N AS + ++ +IF Sbjct: 570 ALLRNLASWGGGRSYVVQDAQQIPEIF 596 >gi|269926840|ref|YP_003323463.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] gi|269790500|gb|ACZ42641.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] Length = 918 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 50/381 (13%), Positives = 105/381 (27%), Gaps = 98/381 (25%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + S L+ + + + + + + N + +V A + Sbjct: 290 SPSRVLVAEGTPGEASGLVAALKAGKLVVDTVDSNDIPKDISTLAKYDAVVLVNVPANSL 349 Query: 123 NPRKSAYQVVLSS-RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 QV + L+ G + ++ ++ R+ +E V++ Sbjct: 350 QDAGKTLQVYVHDLGKGLVAIGGDRAFALGGYFNTPLEQTLPVDSQIRNPDEEPQVAVVM 409 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 ID S SM C S Sbjct: 410 AIDKSGSMAACH------------------------------------------CEGSKL 427 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 YPG + K + +++ I S + + + G Sbjct: 428 LEQYPGGIP-------------------KVDIAKESA---ILSSETLGPNDIF---GVVA 462 Query: 302 FNDR---VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ V+ + ++ + G T I + A D++I Sbjct: 463 FDTAPRWVVRPEPVT------DKSSIAEKVAGIQGSGGTNIYGGLAEAIDSLIKVKAKNK 516 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 H ++LLTDG + N + + +KA+ GI I T+ + + Sbjct: 517 H-----------VILLTDGWSNVGNYD--ELISKARRHGITISTV------SAAGGSAQL 557 Query: 419 LSNCASP--NSFFEANSTHEL 437 L + A +F+ + ++ Sbjct: 558 LRSIAEKGGGTFYNTRDSADI 578 >gi|217966673|ref|YP_002352179.1| von Willebrand factor A [Dictyoglomus turgidum DSM 6724] gi|217335772|gb|ACK41565.1| von Willebrand factor type A [Dictyoglomus turgidum DSM 6724] Length = 888 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 54/376 (14%), Positives = 117/376 (31%), Gaps = 55/376 (14%) Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + +S+ ++ + I ++ LK+ DT E N S + Sbjct: 222 DGTSQIQDLKDLQEDNIISLSLKAKNTGLKE---IEAEIYSPEDTYKENNMASSYIYIQG 278 Query: 134 SSRYDLL----LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ------WVI 183 + L NP F +S+ I+ W + + + + + GV + + Sbjct: 279 KPKILYLAGKDFNP--TFAKSLVIQGWNVVIDFKPYSQISNLNSYQGVIMDNIPQEDLPL 336 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 D + ++ D G L G + + + ++ K + N ++ + Sbjct: 337 DKMELLKNFVIDKGGTLLILGGDKSFSAGNYHGTPLEEILPLTLKPEQILKKSNVAIIIV 396 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 L S S K L +++ V+ ++ D G F+ Sbjct: 397 LDASGSMGSYSGGDM-----------KMELAKESAQLVLDLLEDKD------YFGLIAFD 439 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 +K + + G TA+ +++A ++++ H Sbjct: 440 HSYQWIVPLQPLTNKEEA---ASLISRISPGGGTALYPPLKSAGESLLKVPIKSKH---- 492 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 I+ +TDG+ + + N AK + I + TI + L + A Sbjct: 493 -------IIAITDGQTEGGDFYNLVR-NLAKYK-ITVSTIGIGEDANIP-----LLKDIA 538 Query: 424 S--PNSFFEANSTHEL 437 + F+ + L Sbjct: 539 NWGNGRFYHTWNIRNL 554 >gi|123228966|emb|CAI21016.2| novel protein similar to vertebrate inter-alpha (globulin) inhibitor H5 (ITIH5) [Danio rerio] Length = 906 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 51/221 (23%), Positives = 79/221 (35%), Gaps = 27/221 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P L FV +S + K + AL ++I ++ DN N Sbjct: 235 DGHFVHYFAPRDLPVVPKNVVFVIDTSASMLGTKMKQTKQALFTIINELRPNDNFN---- 290 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTI------VKTFAIDENEMGSTAINDAMQT--AYD 348 F++R+ W KL+ K F + G T IN +QT A Sbjct: 291 --FVTFSNRIRV-----WQPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALL 343 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAF 405 + S++DE H +L I+ LTDG T + I + K+ + TI Sbjct: 344 SDYLSSKDESHHHSVSL-----IIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGM 398 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + R L NC + E + K F D IG Sbjct: 399 GDDVDYRLLERMSLDNCGTMRRIPEDADASLMLKGFYDEIG 439 >gi|310643461|ref|YP_003948219.1| protein [Paenibacillus polymyxa SC2] gi|309248411|gb|ADO57978.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 696 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 83/235 (35%), Gaps = 35/235 (14%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 V C + G + FV +S + + S+ + D Sbjct: 26 VICTVNQANAASLGTASIEGYDAVFVLDTSYSMRDTDPEGIAAEVISMFMDLSDAD---- 81 Query: 294 TVRMGATFFNDRVISDPSFSW-GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+G +N V++ + GV I + + N G T + ++ + + + Sbjct: 82 RTRVGFVAYNHHVVASKPLTSIGVAAQKSQIQQEIRM-LNRSGYTDLGLGLRKGSELLAA 140 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGEN-----------TQDNEEGIAICNKAKSQGIRIM 401 + + +++LL+DGE N + ++ A+++G + Sbjct: 141 GA---------SQGRQPFMILLSDGETDFGVSSGSRSKGDSNNDVSSVIKSAQTKGYPVY 191 Query: 402 TIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHE----LNKIFRDRIGNEIF 450 TI + + T R L AS + F +S + LN+IF +I +++ Sbjct: 192 TIGLNHDGTVN---RQELERIASQTGGASFITSSAEDLPEILNRIFASQIRSKLV 243 >gi|118590977|ref|ZP_01548377.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] gi|118436499|gb|EAV43140.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] Length = 608 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 56/411 (13%), Positives = 110/411 (26%), Gaps = 35/411 (8%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 I +A LM + + G+ + W L + A S VS+ Sbjct: 12 ISSASLM--TIAIAGIASQTIDWQDLGKKLTDTTEAAGRMTSSGKPDGDASVSTAELPKQ 69 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA----VEMNPRKSAYQVVLSSRYDL 139 + E + + D A + + + + + Sbjct: 70 EETHTVVAEIARPVATPQPAPAPALPQKQRSRSDGAGGGLMTFSSGAGGAVLNSGIQLEP 129 Query: 140 LLNPLSLFLRSMGIKSWLIQ--TKAEAETVSRSYHKEHGVSIQWVIDFS----RSMLDYQ 193 P S + A+ VS S +V S + + Sbjct: 130 PAMPAVQLEDRERFASAEANPLRRTSADPVSTFSVDVDTASYSYV--RSTLSGGRLPNPD 187 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + +N F K + V + D + + + P PS Sbjct: 188 AVRVEEMVNYFDYNYPVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLPSQ 247 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D+S K L++ + ++ S++ D V G++ Sbjct: 248 NLVFLIDTSGSMADANKLPLLQQSFRLLLSSLRDEDEVAIVTYAGSSG---------VLL 298 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +T + GSTA ++ ++ AY + D I+L Sbjct: 299 EPTKVADKTRILEKINALTSGGSTAGHEGLKGAYALAETMTGDGEQTR---------IIL 349 Query: 374 LTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 TDG+ D + + + G + + F E + N Sbjct: 350 ATDGDFNVGLSDPDSLKRYVAEQRENGTALSVLGFGRGNYNDELMQTLAQN 400 >gi|149709406|ref|XP_001496048.1| PREDICTED: similar to calcium-activated chloride channel [Equus caballus] Length = 904 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 82/256 (32%), Gaps = 26/256 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL-SPYMVSCNKSLYYMLYP 246 S+ + + N + +Y S + D+ + M N + Sbjct: 243 SLDSVTKFCTAKTHNTEAPNLQNKMCNYRSTWDVIMDSDDFQNASPMPGTNLPPHPTFSL 302 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + S S+ + + + A +I+ I+K V G F+ Sbjct: 303 LKSKQRVVCLVLDKSGSMDSDDRLLRMNQAAELYLIQIIEKESLV------GMVTFDSSA 356 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + + + G T+I + ++ + I+ SN+ Sbjct: 357 EIQNNLT-KITDDNAYQNIIAKLPQFAGGGTSICNGLKAGFQAIVYSNQSTSGSE----- 410 Query: 367 AKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 I+LLTDGE+ Q + C +AK+ G I +IA + K LSN Sbjct: 411 ----IILLTDGEDNQMSS-----CFEEAKASGAIIHSIALGPSA---AKELETLSNMTGG 458 Query: 426 NSFFEANSTHELNKIF 441 F + L F Sbjct: 459 LRFSANKDINGLIDAF 474 >gi|229587743|ref|YP_002869862.1| hypothetical protein PFLU0165 [Pseudomonas fluorescens SBW25] gi|229359609|emb|CAY46451.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25] Length = 551 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 73/203 (35%), Gaps = 23/203 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P + VD S + LV+ L ++ +++ D V+ V G R Sbjct: 188 PVAELAPANLVFLVDVSGSMDRREGLPLVKSTLKLLVDQLREQDRVSLVVYAG----ESR 243 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V+ P+ K+ I GSTA ++ AY + E + N Sbjct: 244 VVLKPTSGRDKVKIRNAID-----QLTAGGSTAGASGIELAYQM---AREGFIDNGINR- 294 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ D + + + + G+ + T+ F V+ + Sbjct: 295 -----ILLATDGDFNVGISDFDSLKQMAVEQRKSGVSLTTLGFGVDNYNEHLMEQLAD-- 347 Query: 423 ASPNSFFEANSTHELNKIFRDRI 445 A ++ ++ E +K+ D++ Sbjct: 348 AGDGNYAYIDNLREAHKVLVDQL 370 >gi|153792263|ref|NP_001093210.1| matrilin 1 [Danio rerio] gi|148726249|emb|CAN88321.1| matrilin 1 [Danio rerio] gi|148726497|emb|CAN88267.1| matrilin 1 [Danio rerio] Length = 489 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 62/165 (37%), Gaps = 12/165 (7%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + G + ++ F+ SS + V+ LA VI + + R+G Sbjct: 25 AAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVKVFLAKVIDGL-SVGPDA--TRVGVVN 81 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + RV ++ S HK +VK + E T A+Q A + S E Sbjct: 82 YASRVKNEVSLK--SHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAE----GG 135 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K +++TDG + + I +A+ GI I I Sbjct: 136 RKSPDISKVAIIVTDG---RPQDNIRDIAARAREAGIEIFAIGVG 177 >gi|239833540|ref|ZP_04681868.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] gi|239821603|gb|EEQ93172.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] Length = 637 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 82/228 (35%), Gaps = 23/228 (10%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + K G+F +I AL++ +L G + VD + ++ + A + Sbjct: 36 RFGKDERGNFAMIAALVLVPLLLAGMVAVDTANLMRVRNNVQASLDAAALAVGKRFSTGE 95 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +++Y R F N T I A + + QV Sbjct: 96 SHT-------------VVQDYGARIF----YANVTALSADAINFQIAFP-QDKTTDQQVQ 137 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++ + SLF + T++ S ++ + + V+D S+SM + Sbjct: 138 ATAAFTYK----SLFGVVASRLTGD-NWDKHQYTLTASVRLKNTIEVALVLDNSKSMDET 192 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S + ++ A + V++ +SQ+ + ++ + +V S+ Sbjct: 193 RSGSSKKRIDLLKDAASQLVETMASQSALITYVEKPVQFSLVPFAGSV 240 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 46/337 (13%), Positives = 109/337 (32%), Gaps = 11/337 (3%) Query: 62 ITASVPLIQSLEEV----SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 +TASV L ++E +S++ + K L+++ + L + + + Sbjct: 167 LTASVRLKNTIEVALVLDNSKSMDETRSGSSKKRIDLLKDAASQLVETMASQSALITYVE 226 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 V+ + A V + +Y S+ ++++ + +A G Sbjct: 227 KPVQFSLVPFAGSVNVGPQYLNAAWMDPEGKSSVNLENFTLPVTIDASRKIEEKPAGSG- 285 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 V + + + ++ + + G ++P + Sbjct: 286 RYYKVGTGWGDRNNKPFSRAELYADLTRRSSEPWLAWQGCVESRPGPFALDVTPPSDNNP 345 Query: 238 KSLYYMLY-PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 +L+ ++ P S + + + L S ++ D+++ R Sbjct: 346 DTLFVPMFGPAEYYNVDSRGNVISTVLNSWWQDDMSLAYSPRQSDLKKYYLRDSLDKIYR 405 Query: 297 MGAT-----FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 G + ++ + + + V+T G T + +AM + TI+ Sbjct: 406 KGRSEGGGPNYSCTTLPLTPLTDVTTEQGMKTVQTAIKAMVPNGGTNVPEAMAWGWRTIV 465 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 R K +++LTDG NT +G+A Sbjct: 466 QGAPFTEARASTERGNDKVVIVLTDGANTYYKYDGLA 502 >gi|284030499|ref|YP_003380430.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809792|gb|ADB31631.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 317 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 60/188 (31%), Gaps = 27/188 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + + + + NV F + + R+I Sbjct: 107 RLEAAKKSAKNFVNQLPSKFNVA------LVNFAGTASIIVPPTTDRATVQRSID----- 155 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 STA + + T+ + D H N A IVLL+DG+ T Sbjct: 156 GLELAESTATGEGIFTSLQALTQVPPDPEH---PNDPAPARIVLLSDGKRTVG-RTAQEG 211 Query: 390 CNKAKSQGIRIMTIAFSVNKT---------QQEKARYFLSNCA--SPNSFFEANSTHELN 438 AK + I TI F + + R L + A + + A S EL Sbjct: 212 AQAAKEKNTPIYTITFGTDSGFIEMDGIRQRVPPDRAELRSVAEITGGEAYTAESAGELE 271 Query: 439 KIFRDRIG 446 +++D IG Sbjct: 272 DVYKD-IG 278 >gi|308462096|ref|XP_003093334.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] gi|308250345|gb|EFO94297.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] Length = 384 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 55/154 (35%), Gaps = 10/154 (6%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 I N T R+G +N + + + + N + ST+ + Sbjct: 73 SIPNQPKTTRLGLVTYNWNATIQAGL--DKFQSQQDVFENIFNALNSVSSTSES----YL 126 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + ++++ +K IVL + + IAI ++ K GI I+T+ + Sbjct: 127 ANGLVAAENVLARGPNRGNNYQKVIVLF--AASYSSHSNPIAIADRLKQAGITIITMGY- 183 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 N L+ ASPN F S ++ I Sbjct: 184 -NNVGDPNFYQNLAKIASPNKSFTEKSLSQIGDI 216 >gi|311252831|ref|XP_003125289.1| PREDICTED: vitrin-like isoform 1 [Sus scrofa] Length = 656 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 52/416 (12%), Positives = 124/416 (29%), Gaps = 49/416 (11%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSLEEVSSRAKNSFTFPKQK 89 L G + R+ + L AQ I + PL +Q + +++ + Sbjct: 274 FLIDGSSSIGKRRFRIQKQFLADVAQALDIGPAGPLMGVVQYGDNPATQFNLKTHMNSRD 333 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 ++ + + + N + R A V + + Sbjct: 334 LKTAIEKITQRGGLSNVGRAISFVTKNFFSKSNGNRGGAPNVAV--------------VM 379 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 G + ++ + +E G++I ++ + Q E N + Sbjct: 380 VDGWPTDKVEEASR-------LARESGINIFFITIEGAVENEKQYVLEPNFANKAVCRTN 432 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + L + ++ + S +D SS Sbjct: 433 GFYSLNVQNWYGLHKSVQPLVKRVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTGN 488 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ +A++ + D R+GA + F + + ++ Sbjct: 489 FRTVLQF-VANLSKEFDISDTD---TRVGAVQYT--YEQRLEFGFDQYTTKPDVLNAIKR 542 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+ A+ A + + ++ +K ++L+TDG + ++ Sbjct: 543 VGYWSGGTSTGAAINYALEQLFKKSKPNK---------RKLMILITDG---RSYDDVRIP 590 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A +G+ I + QE+ ++ A ++FF + L K I Sbjct: 591 AMVAHHKGVITYAIGVA--WAAQEELEIIATHPARDHAFF-VDEFDNLYKSVPKII 643 >gi|170739508|ref|YP_001768163.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193782|gb|ACA15729.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 342 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/197 (20%), Positives = 70/197 (35%), Gaps = 24/197 (12%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S + + + + + V+ L + +K D R G F D + Sbjct: 101 DLSGSMDTKDFTDASGRTVDRLTAVKAVLDDFLSR-RKGD------RAGVVVFGDAPFAL 153 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ + L R +++ + TA+ DA+ + + + K Sbjct: 154 VPFTTDLD-LCREMLRDTVVGM-AGPRTALGDAIGLGI-ALFDRSTVKA----------K 200 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCASP-- 425 I+ LTDG +T AK +GI I T+A + + L + AS Sbjct: 201 TIIALTDGNDTASQVPPTEAAGVAKDKGIVIHTVAIGDPSTVGEDKLDETALKDVASATG 260 Query: 426 NSFFEANSTHELNKIFR 442 FF A EL +I+ Sbjct: 261 GGFFRALDRDELARIYG 277 >gi|48374067|ref|NP_001001537.1| inter-alpha-trypsin inhibitor heavy chain H4 [Sus scrofa] gi|3024051|sp|P79263|ITIH4_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Major acute phase protein; Short=MAP; Flags: Precursor gi|1836014|gb|AAB46821.1| IHRP [Sus scrofa] gi|4096979|gb|AAD00024.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Sus scrofa] gi|1588326|prf||2208343A inter-alpha-trypsin inhibitor Length = 921 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 57/160 (35%), Gaps = 12/160 (7%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P FV +S +K R+AL ++ + D N Sbjct: 254 NGYFVHYFAPEVWSAIPKNVIFVIDTSGSMRGRKIQQTREALIKILGDLGSRDQFN---- 309 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ + + + K++A + + G T INDAM A + +N + Sbjct: 310 --LVSFSGEAPRRRAVAASAENVEEA--KSYAAEIHAQGGTNINDAMLMAVQLLERANRE 365 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 E+ ++ +I+LLTDG+ T I + Sbjct: 366 ELLPARSVT----FIILLTDGDPTVGETNPSKIQKNVREA 401 >gi|307294184|ref|ZP_07574028.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] gi|306880335|gb|EFN11552.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] Length = 417 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 37/73 (50%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L++ TG+ ++ A MP+++G G+ D V+W+ ++ +++ A +A + + + Q Sbjct: 8 RLLRDRTGNVLMMAAACMPILIGAAGLATDTVQWTLWKRQVQRQADSAALAGAYAVAQGF 67 Query: 73 EEVSSRAKNSFTF 85 S + Sbjct: 68 SASDSATSDINRL 80 >gi|328470527|gb|EGF41438.1| hypothetical protein VP10329_07002 [Vibrio parahaemolyticus 10329] Length = 461 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 54/382 (14%), Positives = 126/382 (32%), Gaps = 67/382 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K G II +P+++ V + +++AA+ A + + Sbjct: 2 KGFTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDN 61 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ ++Y R + + N +D E + ++ +K Y+ Sbjct: 62 KDD---------------QDYAQRIVDLYITDNISDIE---------ISVSTKKCEYKDG 97 Query: 133 LSSRYDLLLNPLSLFLRSMGIK--SWLIQT--------KAEAETVSRSYHKEHGVSIQWV 182 R + L+P + F + SW+ K ++++R Y + V I ++ Sbjct: 98 CVQRNN-ELSPFADFTVVATAEHDSWISHNEIGVEPKFKVSGDSITRKYLPQ-PVDIYFI 155 Query: 183 IDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S+SM + + + + R VK + + K P S L Sbjct: 156 LDTSQSMSNPWYGERNKTQMQVVKDTITRVVK---------ELENFKTGPDKKSRVALLT 206 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS--IKKIDNVNDTVRMGA 299 Y Y D D +S + + + +++ + A Sbjct: 207 YNAYNAKFDKGAGRVKLYDYASEFSHTE--ASFESIVDKMFDESVVEQKPHY-------A 257 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + +N + ++ I+ + + G T + +I++ ++ Sbjct: 258 SDYNKSQD--IPLT-DKYQEFIDILNSNKVMPARGGGTQS-------WLGLIAAAKEADK 307 Query: 360 RMKNNLEAKKYIVLLTDGENTQ 381 K + ++ ++L+DG +T Sbjct: 308 VKKEDRNPEQVFIILSDGADTD 329 >gi|62088356|dbj|BAD92625.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) variant [Homo sapiens] Length = 699 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 72/213 (33%), Gaps = 18/213 (8%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 241 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 296 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 297 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 354 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E R+ I+LLTDG+ T +I N + ++ + + Sbjct: 355 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSL--FCLGFGFDVSY 408 Query: 417 YFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 409 AFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 441 >gi|77465024|ref|YP_354527.1| hypothetical protein RSP_3006 [Rhodobacter sphaeroides 2.4.1] gi|77389442|gb|ABA80626.1| conserved hypothetical protein containing Von Willebrand factor, type A domain [Rhodobacter sphaeroides 2.4.1] Length = 222 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 56/148 (37%), Gaps = 13/148 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDT-VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +++ +++ + D+V + + F V F+ + + Sbjct: 37 ITELQEGVSTFFAQL-LADDVAEYSAEVAVVTFGGNVDMAVDFA--------AVTRQTVP 87 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + +A++TA + + + E ++ + ++V++TDG T + + + Sbjct: 88 SLTAGGMTPMGEAVETALELLHTRKE--EYKRAGVDYYQPWLVIMTDGAPTDNISKASRL 145 Query: 390 CNK-AKSQGIRIMTIAFSVNKTQQEKAR 416 + + + + + I + E A+ Sbjct: 146 VDDLVREKKLAVFAIGIGKDADMNELAK 173 >gi|312958282|ref|ZP_07772803.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] gi|311287346|gb|EFQ65906.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] Length = 546 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 74/200 (37%), Gaps = 23/200 (11%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + VD S + LV+ L ++ +++ D V+ V G RV+ P Sbjct: 191 NLVFLVDVSGSMDRREGLPLVQSTLKLLVDQLREQDRVSLVVYAG----ESRVVLPP--- 243 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + ++T GSTA +Q AY + E + + N I+L Sbjct: 244 --TSGRDKAKIRTAIDQLTAGGSTAGASGIQLAYQM---AREGFIDKGINR------ILL 292 Query: 374 LTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 TDG+ D + + + + G+ + T+ F V+ + A ++ Sbjct: 293 ATDGDFNVGISDFDSLKQMAAEQRKSGVSLTTLGFGVDNYNEHLMEQLAD--AGDGNYAY 350 Query: 431 ANSTHELNKIFRDRIGNEIF 450 ++ E K+ D++ + + Sbjct: 351 IDTLREARKVLVDQLSSTLT 370 >gi|255566338|ref|XP_002524155.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223536573|gb|EEF38218.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 43/263 (16%), Positives = 76/263 (28%), Gaps = 36/263 (13%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S D ++ Q P +S N +E M+ Sbjct: 2 SFNDDEQIVAPQDSGSRPTPIVPGRVQLTSINNNTAPLEESKLKVMLELTGG------DS 55 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 D + V S K V+ A+ VI+ + ID R+ F+ Sbjct: 56 TNDRPGLDLVAVLDVSGSMAGDKIAKVKTAMLFVIKKLSPID------RLSVVKFSADAS 109 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + N G+T I +QT + + + Sbjct: 110 RLCPLRQITEDSQKDLENLI-NGLNADGATNITAGLQTGLKVLNDRSLSSGRVVG----- 163 Query: 368 KKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP- 425 I+L++DGE N + + I N + + T F +N L A+ Sbjct: 164 ---IILMSDGEQNAGGDAAQVPIGN------VPVYTFGFGINHEP-----RVLKAIANNS 209 Query: 426 --NSFFEANSTHELNKIFRDRIG 446 +F + +T L+ F + Sbjct: 210 MGGTFSDVQNTDNLSLAFSQCLA 232 >gi|84387243|ref|ZP_00990264.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] gi|84377890|gb|EAP94752.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] Length = 421 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 37/278 (13%), Positives = 81/278 (29%), Gaps = 10/278 (3%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 +ITA L+ V L V + VD+ + L+ A +A + A+ L S ++ + A+ Sbjct: 19 MITAALL-VFLAVSALAVDINHMVVNKTRLQNAVDSATLAAATILDNSKDKDAVDAEVGT 77 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 + + + + D T S V + R D L Sbjct: 78 AL--NAMAASTGNQEIDFSTASISIDYSNDPKDFTGT--ATFDSTDDVYVRVRVD-ALEM 132 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW---VIDFSRSMLD-YQRDSEGQ 199 F++ G++ + + + +Y+ + + D S D + ++ + Sbjct: 133 DEFFIQLFGLEKVVSASAVAGPSSGLAYNNVVPIGVCIGDGTSDNDISPEDGFHDETGEE 192 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 + FG + + + + G D + + Sbjct: 193 ITSVFGYEIGTVHALKVGDSSLSEMGNGNYHLLDFGSGGNTIKEGLGGSYDQPIKIGEDI 252 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 + V + + D +D V Sbjct: 253 TTKPGGTVGPTGDGLNTRFGEYGGGLSASDYPSDYVTT 290 >gi|312092300|ref|XP_003147289.1| hypothetical protein LOAG_11723 [Loa loa] gi|307757546|gb|EFO16780.1| hypothetical protein LOAG_11723 [Loa loa] Length = 422 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 71/192 (36%), Gaps = 18/192 (9%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + +++ K + + + S++ R+ F+ S F+ + Sbjct: 239 DLSTTTNPIYRKYIEMAEELVNSLVI-------GRRFSRIALITFSSVGKSRTQFNLDRY 291 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + IV E+ G+TAI + ++ + + +D+ H + AKK +++ TDG Sbjct: 292 FDGKDIVTAIRRLESSGGTTAIGEGIR-----LGTEQKDKQHGGRPVEIAKKIMLVFTDG 346 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAF----SVNKTQQEKARYFLSNCASPNSF-FEAN 432 + + + + AK G + TI + V+ Y + + + Sbjct: 347 WSNKGP-DVEEMTRNAKGAGFTLYTIVYEGNGRVDANSPGLNLYTIETMVDDHKHVYSER 405 Query: 433 STHELNKIFRDR 444 + +L + R R Sbjct: 406 NFTQLIQELRQR 417 >gi|271964702|ref|YP_003338898.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] gi|270507877|gb|ACZ86155.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] Length = 514 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 67/203 (33%), Gaps = 29/203 (14%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG----ATFFNDRVISDPS-- 311 FV +S ++ +R AL ++ + R F Sbjct: 329 FVLDTSGSMEGERIEALRQALVTLTGADTSASGTFSRFRSRENVIMIPFGGSAGLPQPFI 388 Query: 312 -FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + I + +A G TAI D ++ AY + D H Sbjct: 389 LPERDPQPALAQI-RAYAERLRAAGGTAIYDGLRAAYGQAGDAGRD--HYTS-------- 437 Query: 371 IVLLTDGENTQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 IVL+TDGENT + +A+ Q +R + F + + + L+ Sbjct: 438 IVLMTDGENTDGSSYEDFEAYYRSLPEARRQ-VRTFVVLFGESDADEMERIATLTR---- 492 Query: 426 NSFFEANSTHELNKIFRDRIGNE 448 + F+A T L F++ G + Sbjct: 493 GAVFDAR-TGSLASAFKEIRGYQ 514 >gi|134291855|ref|YP_001115624.1| hypothetical protein Bcep1808_6472 [Burkholderia vietnamiensis G4] gi|134135044|gb|ABO59369.1| conserved hypothetical protein [Burkholderia vietnamiensis G4] Length = 423 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 44/127 (34%), Gaps = 1/127 (0%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++ + G I+ L + VM+G G+ +D+ + L+ +A ++A+ L Sbjct: 11 TRRSLHRQRGAVAIVVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS 70 Query: 71 SLE-EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ V+ + + + F+D + TAV Sbjct: 71 AISLSVAEADGIAAGHVNFAFFQKSAVQMLTDSNVTFSDALTNPFLTKTAVSTPANVKYV 130 Query: 130 QVVLSSR 136 + + Sbjct: 131 KCTATLS 137 >gi|55741484|ref|NP_001006980.1| cartilage matrix protein [Rattus norvegicus] gi|54035339|gb|AAH83869.1| Matrilin 1, cartilage matrix protein [Rattus norvegicus] gi|149024105|gb|EDL80602.1| matrilin 1, cartilage matrix protein [Rattus norvegicus] Length = 498 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 62/170 (36%), Gaps = 20/170 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + IK +D + R+G + V + K ++++ + T Sbjct: 66 LSQVIKSLDVGPNATRVGLVNYASTVKPEFPLRAHTSKA--SLLQAVHRIQPLSTGTMTG 123 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A I + D + + K ++++TDG + + + +A++ GI + Sbjct: 124 LALQFA---ITKALSDAEGGRSRSSDISKVVIVVTDG---RPQDSVRDVSERARASGIEL 177 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNS-----FFEANST-HELNKIFRDR 444 I + L AS + E+ + +L K F++ Sbjct: 178 FAIGVG------RVDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQEA 221 >gi|163846842|ref|YP_001634886.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222524662|ref|YP_002569133.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163668131|gb|ABY34497.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222448541|gb|ACM52807.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 545 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 61/165 (36%), Gaps = 15/165 (9%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 L+ ++ V +S K +V+ + + + I D R+G F+ + Sbjct: 364 KLNRKRADILLVVDTSGSMEGDKMTMVKAGIETFLMRILPED------RLGLITFDSQAR 417 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + R ++ + G TA+ DA+ A T+ + E R+ Sbjct: 418 LVVPMA--PLSENRIDLQIAVQEMRASGRTALFDALDLARQTLEALPPAEDDRI------ 469 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + IVLL+DG + + + GI I +A+ + +Q Sbjct: 470 -RAIVLLSDGADNASRLTLEEVRRQFDESGITIFPVAYGSDADRQ 513 >gi|17558302|ref|NP_505768.1| hypothetical protein C29A12.6 [Caenorhabditis elegans] gi|3874606|emb|CAA98248.1| C. elegans protein C29A12.6a, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|3877573|emb|CAB01217.1| C. elegans protein C29A12.6a, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 643 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 38/260 (14%), Positives = 92/260 (35%), Gaps = 16/260 (6%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVK--SYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + S + + ++ +P +K + ++ V + P+ N ++ Sbjct: 385 TESFGTTRSPAPVSTIDSPLKPEVPVIKPMDFMVRSRSVQFAMTEKPPFTTVMNPMKFFT 444 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKK----HLVRDALASVIRSIKKIDNVNDTVRMGA 299 P+ + ++ + ++ + L +I + DTVR+G Sbjct: 445 TTRTPITKPKPLIPYSCTADVFFLVDVSQGTGDKSQQYLDIAASAISSLPISQDTVRVGL 504 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ + H +++ + E G+T DA++ A E + H Sbjct: 505 ISYSGPGRTHVRVFLDKHNEKEKLIEEMFLMERHGGTTRTADAIRYATKIF----EGKAH 560 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + N+ KK +V+ TDG + A+++GI+++ +A ++ Sbjct: 561 PARKNV--KKVLVVFTDG---YSQDNPKEASRMARAKGIQLIAVAVK-DRLAPPDTEQLT 614 Query: 420 SNCASPNSFFEANSTHELNK 439 + S F + S +L + Sbjct: 615 EIGGNGRSIFISPSGRDLRE 634 >gi|7770149|gb|AAF69610.1|AF119917_18 PRO1851 [Homo sapiens] Length = 644 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 69/199 (34%), Gaps = 24/199 (12%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 FV S +K R+AL ++ + D N F+ Sbjct: 3 KNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQWRPSL 56 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++FA +G T INDAM A + SSN++E R+ I+L Sbjct: 57 VPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEE--RLPEGSV--SLIIL 112 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVNKTQQEKARYFLSNCASPNS--- 427 LTDG+ T +I N + + + F + + FL A N Sbjct: 113 LTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS-----YAFLEKLALDNGGLA 167 Query: 428 ---FFEANSTHELNKIFRD 443 +++S +L +++ Sbjct: 168 RRIHEDSDSALQLQDFYQE 186 >gi|189485267|ref|YP_001956208.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287226|dbj|BAG13747.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 330 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 45/117 (38%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +MG F+ + ++ + ++ +++ +G T I+ A+ A Sbjct: 127 KMGIVVFSGTAMWQCPLTFDL-HALKMFLQSVETTNLPLGGTRISSAIMLA--------- 176 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + ++L++DGEN + + N AK G+RI++I + Sbjct: 177 --SKAASCESAGSRVMILISDGENH--DSKIKEAVNAAKKAGLRIISIGIGKKEGAP 229 >gi|170700850|ref|ZP_02891839.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|170134258|gb|EDT02597.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] Length = 423 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 46/127 (36%), Gaps = 1/127 (0%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++ + G II L + +M+G G+ +D+ + L+ +A + ++A+ L Sbjct: 11 TRRGLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSCALSAARDLTS 70 Query: 71 SLE-EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ +V+ + + + + F+D + TAV Sbjct: 71 AISLQVAEADGIAAGHANFAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVANPANVKYV 130 Query: 130 QVVLSSR 136 + Sbjct: 131 KCTAQLS 137 >gi|149481218|ref|XP_001506719.1| PREDICTED: similar to matrilin 4, partial [Ornithorhynchus anatinus] Length = 312 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 54/170 (31%), Gaps = 22/170 (12%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V S ++ LV+ + ++ S+ R+G F+ RV ++ G Sbjct: 136 VLVIDGSKSVGAQQFELVKRWVGELVGSLDVSPAG---TRVGLVQFSSRVRTEFPL--GR 190 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI-VLLT 375 H T A++ + S E + + +L+T Sbjct: 191 HGTKAEAEAAVRAVTPMDKGTMTGLALRHLVERGFSEAE-------GARPGSRRVGLLVT 243 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 DG + ++ +AK +GI + + ++ L AS Sbjct: 244 DG---RSQDDVSPWAARAKDRGIVMFAVGVGKAVEEE------LREIASD 284 >gi|146292146|ref|YP_001182570.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|145563836|gb|ABP74771.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 633 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 39/264 (14%), Positives = 83/264 (31%), Gaps = 23/264 (8%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S+ + + LN F K+ + + + + M+ L Sbjct: 177 KEGSLPEKGTIRIEEMLNYFTYDYPLPNKNAAPFSATTELAPSPYNDDMMLLRIGLKGYE 236 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + +D S K L++ AL + + + D V+ V GA Sbjct: 237 LTKSELGASNLVFLLDVSGSMASADKLPLLQTALKMLTQQLSAQDKVSIVVYAGAAGVVL 296 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 G + GST + + AY + H ++ Sbjct: 297 D---------GASGDDIQALTYALEQLRAGGSTNGSQGILQAYQL------AQKHFIQGG 341 Query: 365 LEAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + ++L TDG+ ++ I++ K K +GI + T+ F ++ + Sbjct: 342 INR---VILATDGDFNVGVTNFDQLISLIEKEKQRGIGLTTLGFGMDNYNDQLMEQLADK 398 Query: 422 CASPNSFFEANSTHELNKIFRDRI 445 + ++ +E K+ D + Sbjct: 399 --GNGHYAYIDTLNEARKVLVDEL 420 >gi|74209191|dbj|BAE24978.1| unnamed protein product [Mus musculus] Length = 902 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 96/309 (31%), Gaps = 37/309 (11%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 182 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 241 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 242 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 302 LESRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 361 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 362 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 412 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 LTDGE+ GI C +A G I TIA + + LS+ F+ Sbjct: 413 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSA---ARELETLSDMTGGLRFYANK 464 Query: 433 STHELNKIF 441 + L F Sbjct: 465 DLNSLIDAF 473 >gi|139439379|ref|ZP_01772820.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] gi|133775158|gb|EBA38978.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] Length = 2432 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 58/334 (17%), Positives = 92/334 (27%), Gaps = 95/334 (28%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + T + + S + + I V+D S SM D + + A T Sbjct: 93 LTTLSAISSTSDTTISGKPLDIVMVLDASGSMDDPMGTGDNTKRIDALKTAANTF----- 147 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 +D ++ + +S +H R A Sbjct: 148 -------------------------------IDAIAAQNQSITDASKQH--------RVA 168 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + K NDT R G +N S +K + N GST Sbjct: 169 IVKFAGKKKTDKVGNDTYRDGRYTYNYSQTMKNLTS--CKGKDADSLKDTVGNINPAGST 226 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-------EEGIAIC 390 + ++ A + I+S + KK +V TDG T + + IA Sbjct: 227 QADYGLELAENITINSGRADA---------KKIVVFFTDGSPTSSSGFQASVADSAIASA 277 Query: 391 NKAKSQGIRIMTIAFSV--------NKTQQEKARYFLSNCAS------------------ 424 K+ G I TI K F+ +S Sbjct: 278 KSLKANGADIYTIGIFSGANPSADPTAEGTSKVNKFMHAVSSNYPGATSSISFWGEWVID 337 Query: 425 -------PNSFFEANSTHELNKIFRDRIGNEIFE 451 + + A S EL KIF + G+ I Sbjct: 338 YGTRAENSDYYKSATSASELEKIFEEISGSIIQT 371 >gi|73960095|ref|XP_547299.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Canis familiaris] Length = 911 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 47/329 (14%), Positives = 96/329 (29%), Gaps = 44/329 (13%) Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ--- 193 YD N +L + ++ + V + ++ +D + + Sbjct: 166 YDEYNNDQKFYLSNGKKEAVRCSAGISGKNVIKKCQGGSCITKSCKLDKVTGLYEEGCEF 225 Query: 194 ---RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 F Q D VK + +N + + + + Sbjct: 226 IPYDRQTEPASIMFSQSIDSVVKFCTEKNHNKDAPNPQNRRCNLRSTWEVIQDSEDFKKT 285 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLV------RDALASVIRSIKKIDN------VNDTVRMG 298 ++ + + SL + ++ + A ++ + + V +G Sbjct: 286 TPMTTQPLTPTFSLLQIGQRIVCLVLDKSGSMATGDRLKRLNQAGKLFLLQIVEQGSWVG 345 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ R + + G T+I +++A+ I + Sbjct: 346 MVTFDSAAQVQSELIQINSGTERDALTKS-LPTVATGGTSICSGLRSAFAVIKKKYPTDG 404 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ----E 413 IVLLTDGE+ I+ C N+ K G I T+A + ++ Sbjct: 405 AE----------IVLLTDGEDN-----TISSCFNEVKQSGAVIHTVALGPSAAKELEELS 449 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFR 442 K L AS +A + + L F Sbjct: 450 KMTGGLQTYASD----QAQN-NGLIDAFG 473 >gi|330829742|ref|YP_004392694.1| von Willebrand factor type A domain-containing protein [Aeromonas veronii B565] gi|328804878|gb|AEB50077.1| von Willebrand factor type A domain protein [Aeromonas veronii B565] Length = 330 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 69/233 (29%), Gaps = 46/233 (19%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 R + ++ S +L D + + + + + ++ +A Sbjct: 61 CRPQWWGEPVIQYEGSRDLLLAVDLSDSMRTPDMLDNGEQQARLTAVRQQIKALIAKRAG 120 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI-NDA 342 R+G F D ++ + + +D + +G T +A Sbjct: 121 D-----------RVGIIVFADHAYLLSPL---TQEIPALLTLSDELDFDLVGRTTALGEA 166 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + A H A ++L+TDG NT N + + A +QGIRI T Sbjct: 167 ILLA----------RQHGDPGRPTA---LLLVTDGRNTAGNADPLQEAKLAAAQGIRIYT 213 Query: 403 IAFSVNKT----------------QQEKARYFLSNCASPNS--FFEANSTHEL 437 + + E L A +F A + +L Sbjct: 214 LGVGADPDTFIQPYDEAGSGQADPSSELDEPLLKELAQTGQGRYFRARTQSDL 266 >gi|118353832|ref|XP_001010181.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291948|gb|EAR89936.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 542 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 56/180 (31%), Gaps = 24/180 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ L ++ + D R+ FN + + K Sbjct: 149 KIQNVKKTLEYLLELLGDND------RLCLILFNSYATRLCHLM-KTNNSNKPAFKEIIN 201 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T IN M+ A+ + + + + LL+DG++ + Sbjct: 202 KIYSTGGTDINSGMELAFRVL---------KDRKYQNPVSSVFLLSDGQDGSADLRVRQS 252 Query: 390 CNK-AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIG 446 + + I + F + ++ S +F+ +++++ F D +G Sbjct: 253 LERHLPQECFTIHSFGFGSDHDGP-----LMNKICSLKDGNFYYVEKINQVDEFFVDALG 307 >gi|260576512|ref|ZP_05844501.1| conserved hypothetical protein [Rhodobacter sp. SW2] gi|259021235|gb|EEW24542.1| conserved hypothetical protein [Rhodobacter sp. SW2] Length = 529 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/142 (21%), Positives = 51/142 (35%), Gaps = 13/142 (9%) Query: 322 TIVKTFAIDENEMGSTAINDAMQT----AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 G T + M A ++ +D + E + + Sbjct: 396 YWFSDSKWHTTIDGGTTGSVQMTWPEVWAKWSVRYVAKDIYTKALGGSENSWFETFTDEI 455 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 Q + IC+ AK GI I +I F E R L +CAS P+++F A + Sbjct: 456 SYGQKDVRLQQICDAAKDSGIVIFSIGFEA----PENGRNQLRDCASQPSNYFNATGV-Q 510 Query: 437 LNKIFRDRIGNEIFERVIRITK 458 + FR I ++ +R+T+ Sbjct: 511 ITTAFR-AIATQLSH--LRLTQ 529 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 57/162 (35%), Gaps = 37/162 (22%) Query: 39 MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 M +D++R L+Q + + A+ P+ + +Y + Sbjct: 55 MALDLMRHEQKRTTLQQTLDRSTLAAAS-------------LQQSLDPESVVRDYFAK-- 99 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 N T + V ++ + +V + D NP F++ +GI S+ Sbjct: 100 -----ANMTQ-------YLSGVTVDEGMNYREVNALAAAD--TNPF--FMQMVGIDSFDA 143 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 + + AE + V + V+D S SM R + +P Sbjct: 144 KAASTAEQRISN------VEVSMVLDISGSMASNSRLTRLRP 179 >gi|261409634|ref|YP_003245875.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261286097|gb|ACX68068.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 968 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 49/148 (33%), Gaps = 22/148 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + K++ N G TA +A+ A + HR EA+ IVL+TDG Sbjct: 133 VDKDAAKSYINTINSGGGTATGNAIDAAVALLAD------HR----TEAQPVIVLMTDGA 182 Query: 379 NTQDNEEG------IAICNKAKSQGIRIMTIAFSVNKTQQ--EKARYFLSNCASPNSFFE 430 T+ + + AK G+ TIA + N A+ + Sbjct: 183 ATESPKNTDPFDYALQRAQAAKDAGVIFYTIALLNPNEDPITSAPNVLMKNMATTATHHH 242 Query: 431 ----ANSTHELNKIFRDRIGNEIFERVI 454 + +++ IG V Sbjct: 243 FVLGSKGLNQIYAAIVKEIGMASAYDVT 270 >gi|126277540|ref|XP_001376725.1| PREDICTED: similar to integrin alpha 11 subunit [Monodelphis domestica] Length = 1530 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 59/164 (35%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 541 IQVGVVQYGEDVVHEFHL--NDYRSVKEVV-EAASHIEQRGGTETRTAFGIEFAR----- 592 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + ++ + +A + Sbjct: 593 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGI 650 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 651 NPEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 693 >gi|47217883|emb|CAG05005.1| unnamed protein product [Tetraodon nigroviridis] Length = 647 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 38/254 (14%), Positives = 86/254 (33%), Gaps = 20/254 (7%) Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 P F + + + G + + ++++ PLD FV Sbjct: 4 PALVFLSLLFFCLSQTGANKHRPGSAAARGQNETTAQSRAVGNPCKAVPLD-----FVFV 58 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 SS + V+ + +++ ++ R+G + V + S S Sbjct: 59 IDSSRSIRPRDYEKVKTFIVNLV-QFLEVGPEA--TRVGLLQYGSVVQPEFSLS--TFST 113 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + ++ T A+Q A +T + D ++ + V++TDG Sbjct: 114 KAEVEQAVRNMKHLATGTMTGLAIQYAAETSFTEA-DGARPAHLHIP--RIAVVVTDG-- 168 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA--NSTHEL 437 + + + +A+ GI+I I + + + S S + A + L Sbjct: 169 -RPQDRVEEVAAQARQAGIQIFAIGVG--RVDMKTLKTIGSEPHSEHVHLVASFSQMETL 225 Query: 438 NKIFRDRIGNEIFE 451 +F+ ++ E+ E Sbjct: 226 VSVFQSKLCREMCE 239 >gi|301604540|ref|XP_002931918.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Xenopus (Silurana) tropicalis] Length = 929 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 69/216 (31%), Gaps = 18/216 (8%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV +S + K ++AL ++++ ++ D+ N Sbjct: 350 NGYFVHYFAPTNLPPLPKNVVFVIDTSASMLGNKMKQTKEALFTILKDLRPQDHFN---- 405 Query: 297 MGATFFNDRV---ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 F+ RV + + + K F G T IN+ + + + Sbjct: 406 --IIGFSKRVKVWQQNQMVKVSPNNIRDA--KKFIYSLYPTGETNINEGIHVGAQLLNNY 461 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 K+ ++ LTDG T E I K+ + +I F + Sbjct: 462 LASNGKHEKSVSL----MIFLTDGRATIGEIESPKILGNTKNAIQEKFCLFSIGFGNDVD 517 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + L NC E K F D IG Sbjct: 518 FNLLEKLSLENCGMMRRIQENEDAASQLKGFYDEIG 553 >gi|166796269|gb|AAI59125.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 973 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 69/216 (31%), Gaps = 18/216 (8%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV +S + K ++AL ++++ ++ D+ N Sbjct: 318 NGYFVHYFAPTNLPPLPKNVVFVIDTSASMLGNKMKQTKEALFTILKDLRPQDHFN---- 373 Query: 297 MGATFFNDRV---ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 F+ RV + + + K F G T IN+ + + + Sbjct: 374 --IIGFSKRVKVWQQNQMVKVSPNNIRDA--KKFIYSLYPTGETNINEGIHVGAQLLNNY 429 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 K+ ++ LTDG T E I K+ + +I F + Sbjct: 430 LASNGKHEKSVSL----MIFLTDGRATIGEIESPKILGNTKNAIQEKFCLFSIGFGNDVD 485 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + L NC E K F D IG Sbjct: 486 FNLLEKLSLENCGMMRRIQENEDAASQLKGFYDEIG 521 >gi|47218379|emb|CAG01900.1| unnamed protein product [Tetraodon nigroviridis] Length = 683 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 80/223 (35%), Gaps = 24/223 (10%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + Y + + I + + + +Y+ VD+S + Sbjct: 188 TGLARYYPASPWMDISNSANKIDLYDVRRRPWYIQGAAS---PKDMLILVDASGSVSGLT 244 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT--IVKTF 327 L++ +++ ++ ++ D VN +FND+ F V +R ++K Sbjct: 245 -LKLIQISVSKMLETLSDDDYVN------VVYFNDKAKYASCFENLVQANVRNKRMLKKA 297 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G+T + + A++ + N + K I+L TDG + E Sbjct: 298 VQNITAKGTTNYSGGFELAFEQLAQMNVSRANCN-------KIIMLFTDGGEEKAEEIFK 350 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 +Q +RI T FSV + +K CA+ ++E Sbjct: 351 KY---NPNQEVRIFT--FSVGQHNYDKGPIQWMACANKGYYYE 388 >gi|220908581|ref|YP_002483892.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219865192|gb|ACL45531.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 421 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 53/154 (34%), Gaps = 25/154 (16%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A ++ + D R+ F+ R +KT Sbjct: 59 LETVKRAAQKLVDRLLPSD------RLAVIVFDHVAKVLIP---NQPVTDRDKIKTRISH 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-IAI 389 MG TAI++ +Q +I++ K ++ I LLTDGEN N + + Sbjct: 110 LAAMGGTAIDEGLQLGLTELIAA--------KAGAISQ--IFLLTDGENEHGNNSRCLQL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 +A + I + T+ F + L A Sbjct: 160 AEEAAKENITLNTLGF-----GYHWNQDVLEQIA 188 >gi|116284252|gb|AAI24051.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 954 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 69/216 (31%), Gaps = 18/216 (8%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV +S + K ++AL ++++ ++ D+ N Sbjct: 299 NGYFVHYFAPTNLPPLPKNVVFVIDTSASMLGNKMKQTKEALFTILKDLRPQDHFN---- 354 Query: 297 MGATFFNDRV---ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 F+ RV + + + K F G T IN+ + + + Sbjct: 355 --IIGFSKRVKVWQQNQMVKVSPNNIRDA--KKFIYSLYPTGETNINEGIHVGAQLLNNY 410 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 K+ ++ LTDG T E I K+ + +I F + Sbjct: 411 LASNGKHEKSVSL----MIFLTDGRATIGEIESPKILGNTKNAIQEKFCLFSIGFGNDVD 466 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + L NC E K F D IG Sbjct: 467 FNLLEKLSLENCGMMRRIQENEDAASQLKGFYDEIG 502 >gi|324231484|emb|CBZ42120.1| C. elegans protein C29A12.6b, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|324231581|emb|CBZ42125.1| C. elegans protein C29A12.6b, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 980 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 38/260 (14%), Positives = 92/260 (35%), Gaps = 16/260 (6%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVK--SYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + S + + ++ +P +K + ++ V + P+ N ++ Sbjct: 722 TESFGTTRSPAPVSTIDSPLKPEVPVIKPMDFMVRSRSVQFAMTEKPPFTTVMNPMKFFT 781 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKK----HLVRDALASVIRSIKKIDNVNDTVRMGA 299 P+ + ++ + ++ + L +I + DTVR+G Sbjct: 782 TTRTPITKPKPLIPYSCTADVFFLVDVSQGTGDKSQQYLDIAASAISSLPISQDTVRVGL 841 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ + H +++ + E G+T DA++ A E + H Sbjct: 842 ISYSGPGRTHVRVFLDKHNEKEKLIEEMFLMERHGGTTRTADAIRYATKIF----EGKAH 897 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + N+ KK +V+ TDG + A+++GI+++ +A ++ Sbjct: 898 PARKNV--KKVLVVFTDG---YSQDNPKEASRMARAKGIQLIAVAVK-DRLAPPDTEQLT 951 Query: 420 SNCASPNSFFEANSTHELNK 439 + S F + S +L + Sbjct: 952 EIGGNGRSIFISPSGRDLRE 971 >gi|146340337|ref|YP_001205385.1| hypothetical protein BRADO3364 [Bradyrhizobium sp. ORS278] gi|146193143|emb|CAL77155.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 470 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 72/495 (14%), Positives = 142/495 (28%), Gaps = 96/495 (19%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + + +I AL+M + + GM +D + + L AA A I A P + S+ Sbjct: 8 RFARDRCANVAVIFALMMVPTIYLLGMALDYTQALRKQGQLDAAADAAAIAAVRPAMLSV 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S +++N + V + + + Sbjct: 68 TDTS-----------------VVKNTAAAVFATKAAMNGLTAVPTPDITVTDSGLQRTIQ 110 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +S + ++ F +G SW ++ A A+ S ++ ++D S SM Sbjct: 111 VS----YVARSINNFPSVLGSPSWQVKGSATAQASSAPN-----MNFYLLLDDSPSMAIA 161 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ L K+ +V + + N S+ + Sbjct: 162 ATQTDIDNLIAATSKQPGGSKNCGFACHEVHPNLDSGASSSTVDNLSI-----ARSKSIT 216 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN--VNDTVRMGATFFNDRVISDP 310 L + V++ V V++ + I+N + N Sbjct: 217 LRIDLVVNAVKQLLVGPYTCPQAGISGGVMQCMSAINNTTYKAAIYTFDYNLNTIQTLTS 276 Query: 311 SFSWGVHK------LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 S G + N T I+ A+ + I+ ++ Sbjct: 277 PSSAGTKISNIQLLTVDHQNCVTTAICNTDFGTDISGALG-GVNAIMPDPGTGT--NQSG 333 Query: 365 LEAKKYIVLLTDG-----ENTQDNEEGIAI--------------------CNKAKSQGIR 399 ++ + L+TDG + + A C+ K + IR Sbjct: 334 DTPQEVVFLVTDGVEDKLIASSSGCDPKATYPLPAAGSQVRCQQPLNTAVCDTIKKRNIR 393 Query: 400 I---MT--------------IA-FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I T IA F+ + L CAS FF T Sbjct: 394 IAILYTEYLQLTTDNWYNSRIAQFNSPSSLSGTIAQRLQACASSPDFFATVQT------- 446 Query: 442 RDRIGNEIFERVIRI 456 G +I + + ++ Sbjct: 447 ----GGDISDALTKL 457 >gi|56797994|emb|CAG27564.2| matrilin-3b [Danio rerio] gi|220675932|emb|CAX12091.1| matrilin 3b [Danio rerio] Length = 343 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 40/254 (15%), Positives = 82/254 (32%), Gaps = 26/254 (10%) Query: 201 LNCFGQPADRTVKSYSSQNGKVG---IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 L+ F A T Y+ + ++ R +SP + S+ P + Sbjct: 12 LSAFLMEAQGTYGPYARNHNQLYAGRQRSPHISPNIPGHGNSINIGAPAEPCKSRPLDLV 71 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F+ SS + V+ + + +D +D R+ + V + Sbjct: 72 FIIDSSRSVRPAEFEKVKI---FLSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFS 128 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 K + T A++TA + + + N K K +++TDG Sbjct: 129 KAEVKQAFSRIDPL--STGTMTGMAIKTAMEQVFTENAGARPLKKG---IGKVAIIVTDG 183 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS- 433 + ++ + A++ GI I + + + L AS + F + Sbjct: 184 ---RPQDKVEEVSAAARASGIEIYAVGVDRAEMRS------LKQMASQPLDDHVFYVETY 234 Query: 434 --THELNKIFRDRI 445 +L FR+ + Sbjct: 235 GVIEKLTSKFRETL 248 >gi|221044732|dbj|BAH14043.1| unnamed protein product [Homo sapiens] Length = 560 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 72/213 (33%), Gaps = 18/213 (8%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 168 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 223 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 224 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 281 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E R+ I+LLTDG+ T +I N + ++ + + Sbjct: 282 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSL--FCLGFGFDVSY 335 Query: 417 YFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 336 AFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 368 >gi|218190303|gb|EEC72730.1| hypothetical protein OsI_06342 [Oryza sativa Indica Group] Length = 585 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 63/179 (35%), Gaps = 16/179 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTF 327 + ++++A+ +IR + D R+ FNDR V + + R I + Sbjct: 75 SRLDVLKEAMKFIIRKLDDGD------RLSIVAFNDRPVKEYSTGLLNISGNGRRIAEKK 128 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TA+ A++ A + D + + +I+LLTDG++T Sbjct: 129 VDWLEARGGTALMPALEEAIRVLDCRPGDSRNSVG-------FILLLTDGDDTSGFRWSR 181 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + N A + + T + + SF + + ++ IG Sbjct: 182 DVINGAVGK-YPVHTFGLGAAHSSEALLH-IAQESRGTYSFVDDENMDKIAGALAVCIG 238 >gi|163754424|ref|ZP_02161546.1| aerotolerance-related membrane protein [Kordia algicida OT-1] gi|161325365|gb|EDP96692.1| aerotolerance-related membrane protein [Kordia algicida OT-1] Length = 344 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 40/117 (34%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + R + + + + G TAI+DA++ A + Sbjct: 128 RIGIIAYAGRAVPQLPITTDFSAAKMFLNNLNTNMLSSQG-TAIDDAIRLAKTYYDDVEQ 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + +V+++DGE+ + +A +GI+ TI K Sbjct: 187 TN-----------RVLVIISDGEDHTGGAG--QLAEEATKEGIKTYTIGVGTTKGGP 230 >gi|307260995|ref|ZP_07542677.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306869297|gb|EFN01092.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 539 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 46/401 (11%), Positives = 123/401 (30%), Gaps = 33/401 (8%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 72 LEE-----VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD----TAVEM 122 ++ S +N ++ + + +K + +++ T Sbjct: 70 RKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTDEKNMHLTPLCKTINNN 129 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQTKAEAETVSRSYHK-EHGVSIQ 180 + + ++ + S F L+ ++ Q +++ + + + + Sbjct: 130 SGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVNVASKSKALKKNTFNIPIDLM 189 Query: 181 WVIDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKS------YSSQNGKVGIRDEKLSPYM 233 V D S SM D + + N +K ++ ++ + Sbjct: 190 VVADLSGSMKDGIKGEKLEGGTNSKIYILREVLKELADKSLFTQESNEYNRIGITAFAMG 249 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA-LASVIRSIKKIDNVN 292 K +L D L + L R + + + + +N Sbjct: 250 AEHPKENKCVLPFVLQDNLHGMSKIKIKQYLASRYNRTTLKRTEFVDNFVALLDIEKTLN 309 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTI--------VKTFAIDENEMGSTAINDAMQ 344 +G T ++ G+ + + G T + + Sbjct: 310 S---IGQTNYDITFPKSSICLEGLKNASQFWYTKTENDKFRDRVNSLKANGGTLASSGLL 366 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 TA + ++S N E K+ I++L+DG + N Sbjct: 367 TASNQMLSEKSRSEEL---NQETKRVILVLSDGNDDMSNLN 404 >gi|254409659|ref|ZP_05023440.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196183656|gb|EDX78639.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 413 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 57/160 (35%), Gaps = 25/160 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ A +I +K+ D R+ F+ R +K+ Sbjct: 57 RPLETVKKAAMQLIERLKEGD------RICVIAFDHRAKVLVP---NQAIDNLNTIKSQI 107 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGI 387 + G TAI++ ++ + + D V + + LLTDGEN DNE + Sbjct: 108 RQLSADGGTAIDEGLKLGIEEVAKGKADAVSQ----------VFLLTDGENEHGDNERCL 157 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + A + I T+ F + + L A S Sbjct: 158 KLAHFAVEHKLTINTLGFGASWN-----QDVLEKIADSGS 192 >gi|117618496|ref|YP_856674.1| von Willebrand factor type A domain-containing protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559903|gb|ABK36851.1| von Willebrand factor type A domain protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 337 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 59/185 (31%), Gaps = 39/185 (21%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ I + + R+ F D + + L+ T +D +G Sbjct: 110 SAVRQQIDRLIAARPGD---RIALIVFADHAYLLSPLTQETNALLGL---TRELDFELVG 163 Query: 336 STAI-NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 T +A+ A H+ A ++L+TDG NT + + + +A Sbjct: 164 RTTALGEAILLA----------RQHKAPERSTA---LLLVTDGRNTAGSADPLREAKQAA 210 Query: 395 SQGIRIMTIAFSVNKTQQEKA-----------------RYFLSNCASPNS--FFEANSTH 435 + GIR+ T+ + +A L A +F A + Sbjct: 211 AAGIRLYTLGVGADPDTFAEAMTPAQTPAQSDPSAELDEALLQQLAEVGHGRYFRARTQG 270 Query: 436 ELNKI 440 +L I Sbjct: 271 DLEAI 275 >gi|71896057|ref|NP_001025613.1| matrilin 1, cartilage matrix protein [Xenopus (Silurana) tropicalis] gi|60552391|gb|AAH91071.1| MGC108367 protein [Xenopus (Silurana) tropicalis] Length = 490 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 62/170 (36%), Gaps = 20/170 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + ++ +D + R+G + V ++ S HK +++ + T Sbjct: 61 LSQVVESLDVGANATRVGLVNYASTVKNEFSLK--THKAKPALLQAVKKVQPLSTGTMTG 118 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + + E R+K+ K I+ +TDG + + I +A+ G+ I Sbjct: 119 LAIQYAINIAFTEPEGA--RLKSPGINKVAII-VTDG---RPQDAVKDISARARESGLEI 172 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 I + L AS S +L+K F++ Sbjct: 173 YAIGVG------RVDKNTLRQIASEPLDEHVDYVESYSLIEKLSKKFQEA 216 >gi|233142080|gb|ACQ91095.1| complement factor B-like protein [Ruditapes decussatus] Length = 697 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 47/311 (15%), Positives = 92/311 (29%), Gaps = 29/311 (9%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + T+ ++ + G V+D S + EG+ + Sbjct: 109 HVSTEGRKVGNKATFRCDEGF----VLDGS----TTRTCREGRRWDGIMPVCKSRRTIED 160 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 N + L+ L PG + V SS ++ Sbjct: 161 VANHLKDTMIDSLAGATTGSKTGAQSRLSPGKSGLDVVLLVDVSSSIGDRSMESAKKFMK 220 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 L + + + R F++ + + G + + + +N G Sbjct: 221 LLVDIFGVSNETSGGKNGTRFALLTFSNEADIVFNLNDGTARSKEEVKRRIDEIQNTGGG 280 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-----GIAICN 391 T A+ I + + + N+ + + LLTD E T E+ N Sbjct: 281 TNFRAALLKVVGGIFFNVIKKESQRLNHAT--RAVFLLTDAEETSTLEKDRLPRIRQAAN 338 Query: 392 KAKSQG-IRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 K++G I I N + L+ AS F + +L +++G+ Sbjct: 339 DLKNEGHFEIFCIGVGQNIDETT-----LAEIASTPHIEHVFTLSKFDDL-----EKVGD 388 Query: 448 EIFERVIRITK 458 I E+ I + Sbjct: 389 IIAEKNIDYGR 399 >gi|149637338|ref|XP_001510478.1| PREDICTED: similar to ITI-like protein [Ornithorhynchus anatinus] Length = 1374 Score = 50.0 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 74/435 (17%), Positives = 122/435 (28%), Gaps = 53/435 (12%) Query: 44 VRWSYYEHALKQA---AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 + K+A + ++ A+ +KI + R + Sbjct: 93 TTMTNPHSEAKEAIFDLDLPSLAFISNFTMTINNKVYVAEVKEKHQAKKIYDEARRQGKT 152 Query: 101 NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD-LLLNPLSLFLRSMGIKSWLIQ 159 DRE T+VE V + Y+ LL L + ++ ++ + Sbjct: 153 AAHVGVKDRETEKFRVSTSVEAGG-----TVTFTLSYEELLQRHLGKYQHAVSVRPQQVV 207 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 E + + + SR + + R P + + ++ + + Sbjct: 208 KNLSVEVTISERTGIDYIHV-LPLRTSRLLTNTLRGEADIPPSTKIEKGEKCARIIFTPT 266 Query: 220 -------GKVGIRDEKLSPYMVSC----------NKSLYYMLYPGPLDPSLSEEHFVDSS 262 GI + + Y VS N + P L P FV Sbjct: 267 PQEQAAYSSSGIMGDFVVQYDVSMKDIIGDVQIYNGYFVHYFAPRGLPPVQKNVVFVIDV 326 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S K + A+ ++ + D +FN SD W I+ Sbjct: 327 SGSMFGTKMKQTKKAMHVILNDLHHDD-----------YFNIVTFSDAVSVWKASGSIQA 375 Query: 323 I------VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLL 374 K + G T IN A+ A S + KK I+ L Sbjct: 376 TPPNIKSAKVYVNKMEADGWTDINAALLVAASVFNQSTGETGR----GKGLKKIPLIIFL 431 Query: 375 TDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 TDGE T I + AK I + +AF + R L N +E Sbjct: 432 TDGEATAGVTVASRILSNAKQSLKGNISLFGLAFGDDADYHLMRRLSLENRGVARRIYED 491 Query: 432 NSTHELNKIFRDRIG 446 K F D I Sbjct: 492 ADATLQLKGFYDEIA 506 >gi|117618125|ref|YP_856000.1| hypothetical protein AHA_1462 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559532|gb|ABK36480.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 460 Score = 50.0 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 56/440 (12%), Positives = 135/440 (30%), Gaps = 50/440 (11%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G AL++ +L + G+++++VR Y +L AA A++ ++ + E+ ++ Sbjct: 7 QGGGLSPTFALMLTGVLALTGVVIELVR-GYSGQSLLSAAADAVLYSAADSDSAAEDAAA 65 Query: 78 --RAKNSFTF---------PKQKIEEYLIRNFENNLKK--NFTDREVRDIVRDTAVEMNP 124 RA + ++ + +++ L + Sbjct: 66 LVRANLAGRHLQVGPPALSQNEQEAQVILQGEVPALMALSAIGTSGDLPVAAAARASSAR 125 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV-- 182 + +VL + P+ + + E +++ VSI Sbjct: 126 TRIEIALVLDVSNSMSGAPMKAIKQG-------LAEFGEVLFGRERRNQDRVVSIIPATG 178 Query: 183 -IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 ++ + +S P +R + ++ R + + Sbjct: 179 LVNIGDHPELFHPESLTFPFGLQTLAHERGWSNLLTREVPGRQRKAFCARLPEHVDGIDR 238 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR-----DALASVIRSIKKIDNVNDTVR 296 L ++ H K ++ L + ++ + R Sbjct: 239 LAELTPGWIRKLELAPRGEAQPRLHYSTKPPAIQQYEDGTPLRAFAPRENPLERYLENRR 298 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII----- 351 F+D H R + + +T + + + + Sbjct: 299 DKLGIFDDPDCGVSPIQ--AHLSTRAAYRQALDTLHAAFNTNTAEGVMWGWRLLSPQWQG 356 Query: 352 --SSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-----NEEGIAICNKAKSQGIRIMTIA 404 E+ R + +K +VL +DGE+ + + + +C + K +GI++ T+A Sbjct: 357 RWQQGAAELPRPYGQADNRKILVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVA 416 Query: 405 FSVNKTQQEKARYFLSNCAS 424 F + F++ CAS Sbjct: 417 FEGDA-------RFVAQCAS 429 >gi|332266371|ref|XP_003282183.1| PREDICTED: collagen alpha-4(VI) chain-like, partial [Nomascus leucogenys] Length = 1020 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 66/217 (30%), Gaps = 11/217 (5%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + +L + + S G+ + C+K Sbjct: 157 ALRRAGILLYVVGVRDAVWAELMEIASSPQENFTSFLPNFSGLSNLAQKLRQELCDKLAK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGA 299 + P+ E D L + + +D +D VR+G Sbjct: 217 AAPRVDHVSPACREAALADIVFLVDSSTSIGPQNFLKVKNFLYSVVLGLDISSDRVRVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +ND + H L +++ G T A++ + E+ Sbjct: 277 AQYNDNIYPAFQL--NQHPLKSMVLEQIQNLPYRPGGTNTGSALE--FIRTNYLTEESGS 332 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 R K+ + + ++L+TDGE N+E + ++ K Sbjct: 333 RAKDRVP--QIVILVTDGE---SNDEVQEVADRLKED 364 >gi|261251272|ref|ZP_05943846.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] gi|260938145|gb|EEX94133.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] Length = 436 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 70/397 (17%), Positives = 126/397 (31%), Gaps = 79/397 (19%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I+ G I+ L+P+M+ + + + + +AA+ A + LI S +E Sbjct: 4 IRKNRGVAGILFIGLLPIMVIFMAFSMQMSQQMLAHARVLEAAEVASLA----LIASPKE 59 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ---V 131 DR V D + D VE+ RK Y+ V Sbjct: 60 SEDDNVKYAR--------------------QLVDRYVVDNINDVDVEVYTRKCEYKDGCV 99 Query: 132 VLSSRYDLLLNPLSLFLRSMGI--KSWLIQTKAE--------AETVSRSYHKEHGVSIQW 181 S + P S F+ S KSW+ +AE ++V+R Y + V + + Sbjct: 100 QASG----EVAPFSDFVVSAKAEHKSWIAYEEAELKPEFEVAGKSVTRKYLPQ-PVDVYF 154 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKS 239 + DFS SM L+ Q +R V+ + N + R L +P V + + Sbjct: 155 IGDFSGSMTG-HWKGGKTKLDVVKQTIERVVEDIADFNTEEKSRVALLGYNPLHVKQSGT 213 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR--- 296 +Y Y + + +A + V + VR Sbjct: 214 VYLNAYGYRRSWPKKIAYDYARGTSAQ----------TVAKMFDPPSVYSRVQEYVRGMS 263 Query: 297 ----MGATFFNDRVI-----SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 NDR + D + + + G T+ + + A Sbjct: 264 RLDVENLVVNNDRFVDYYKFYDIPLTEDYTHFKARLASAW---LGAEGGTSSWNGIIAAA 320 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + N+ ++ ++L+DG + N Sbjct: 321 QEANRA---------TNINPEQVFIVLSDGADNDTNY 348 >gi|217974408|ref|YP_002359159.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217499543|gb|ACK47736.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 627 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 88/270 (32%), Gaps = 23/270 (8%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + + + LN F K+ + + + + M+ L Sbjct: 155 REGRLPEKGIVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYD 214 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P + + +D S K L++ AL + + D V+ V GA Sbjct: 215 LPKSQLGASNLVFLLDVSGSMASTDKLPLLQTALKLLTAQLSAQDKVSIVVYAGAAG--- 271 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 V+ D G + + GST + AY + H + N Sbjct: 272 -VVLD-----GASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQL------AKKHFIPNG 319 Query: 365 LEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + ++L TDG+ D ++ IA+ K K GI + T+ F + + Sbjct: 320 INR---VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADK 376 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ ++ +E K+ D + + +F Sbjct: 377 --GNGNYAYIDTLNEARKVLVDELSSTLFT 404 >gi|317508725|ref|ZP_07966378.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316252973|gb|EFV12390.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 350 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 69/205 (33%), Gaps = 31/205 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + R A + ++ T+++G F + + S + R + Sbjct: 127 TRVDAARQAAIKFVDEME------PTLQLGLVTFAGTAQTLIAPSSDHEVVKRALDDAIR 180 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 D+ TA + + TA I + K + IVL +DG+ T ++ Sbjct: 181 PDKLAA-RTATGEGIYTALQQIETLKGILGGASK--APPAR-IVLESDGKETVPDDLNAP 236 Query: 385 -EGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFLSNCA--SPNSFFE 430 +AK++ + I +I+F L A S FF Sbjct: 237 RGAFTAAKEAKAKEVPIYSISFGTASPIPYVNIQGSRVPVPADDASLQKVAELSGGKFFT 296 Query: 431 ANSTHELNKIFRD---RIGNEIFER 452 A+S +L ++ IG ++ ++ Sbjct: 297 ASSLDQLTDVYSSLNAEIGYDLVKQ 321 >gi|256028718|ref|ZP_05442552.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289766627|ref|ZP_06526005.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289718182|gb|EFD82194.1| von Willebrand factor type A [Fusobacterium sp. D11] Length = 218 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 62/171 (36%), Gaps = 20/171 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A+ ++ ++K+ +++ + + F + +H + + D E Sbjct: 30 LNTAIRDMLGTLKEQESLKAEIHISFITFGNGGA-------NLHTALTPVSNIEFNDFTE 82 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--CN 391 G T + A++ A + + + + + I+LL+DG + E N Sbjct: 83 GGMTPLGGALRIAKEMVEN------REIIPSKSYAPIILLLSDGAPNDNGWENEMYRFIN 136 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 +S+ M++ ++ L +S +EA + + F+ Sbjct: 137 DGRSKKCMRMSLGI-----GRDYDYDVLKGFSSNGEVYEAKDSMNIIDFFK 182 >gi|218680121|ref|ZP_03528018.1| hypothetical protein RetlC8_15005 [Rhizobium etli CIAT 894] Length = 168 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 35/94 (37%), Gaps = 4/94 (4%) Query: 2 VFDTKF---IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 VF F + L + TG+ II AL + ML G D +R ++ Sbjct: 4 VFSYSFGRAFAALRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLD 63 Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEE 92 A+I +V I + + + + ++E Sbjct: 64 AALIA-AVKQINNTGDTDALKLKVTDWFHAQVEN 96 >gi|238789155|ref|ZP_04632943.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] gi|238722687|gb|EEQ14339.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] Length = 522 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/311 (14%), Positives = 107/311 (34%), Gaps = 32/311 (10%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 + F+ FIF IK+ G + ++P+ +G+ + ++ + + L A + A Sbjct: 9 LSFNKLFIF-----IKNENGAILLSFIFILPIFIGLVFLSFEISHFIQKKARLSDAIEQA 63 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + +V + + + K+K +++I + L ++ + ++ T+ Sbjct: 64 TLALTVD--------NDESPDDDNIKKEKNSKFIINYAKAYLPN---EKFSKPVINITSH 112 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 YQV ++ Y P + + S + A + + + Sbjct: 113 SDYIN---YQVDMTIYY-----PTKILNKIFQTVSPEVSISDNARALKYTTTDSKPTDVV 164 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +V D+S SM +Y +S+ A R + K + + ++ + Sbjct: 165 FVADYSGSMNEYFDESDESDEKKIV--ALRRIFKDIQNEIKYNNVNIDIIGFVPFSWGTK 222 Query: 241 YYMLYPGPLDPSLSEEHF----VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + S HF S + K + S ++ ++ N++ R Sbjct: 223 NFYSSTLANMEKESFCHFPFVPNQFSPSSDYLAKYNFSEKYNISDLKKFPELKNLDIVDR 282 Query: 297 M--GATFFNDR 305 + G +N+ Sbjct: 283 IKNGECTYNNY 293 Score = 43.0 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 82/235 (34%), Gaps = 35/235 (14%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + K+ Y V ++S Y LD + + +F + + + V Sbjct: 289 TYNNYRKIINEINDKLHYEVDLDES-YVNEIYDSLDLACNMAYFDEIADI--VESNIDYT 345 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + SI K DN + + + + +F++ H T + + Sbjct: 346 KT-----LESINKADNTINIKMVDMPNNSICLRGSKAFTFDRHNRNNTSISKI-LGTRAT 399 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI----- 389 G T I+ + T + ++ + K +++++DG++++ ++ Sbjct: 400 GGTLISSGILTGNNIF----------LETDNSYNKLMIIISDGDDSRQTDKEKRYYNISK 449 Query: 390 -------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 C K K GI+ +AF + + C +F+ A + HEL Sbjct: 450 NLIKDGMCEKIKDNGIK---MAFIAIGYVPREDIDW-RRCVGEENFYFAKNAHEL 500 >gi|56797855|emb|CAG27023.1| matrilin-3a [Danio rerio] Length = 460 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 72/223 (32%), Gaps = 27/223 (12%) Query: 235 SCNKSLYYMLYPGPLDPSLSEE----HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 N + L P D F+ SS + V+ LA ++ ++ Sbjct: 40 RNNGLPHRTLNPAATDSQCRSRPLDLVFIIDSSRSVRPGEFEKVKIFLADMVDTLDVGP- 98 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDT 349 D R+ + V + L + +K + T A++ A D Sbjct: 99 --DATRVAVVNYASTVKIESLLK---SHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDE 153 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + KN K +++TDG + ++ + A++ GI I + Sbjct: 154 AFTEKSGARPKSKN---ISKVAIIVTDG---RPQDQVEEVSAAARASGIEIYAVGV---- 203 Query: 410 TQQEKARYFLSNCASP--NSFFEANS---THELNKIFRDRIGN 447 + R ++P + F + +L FR+ + + Sbjct: 204 -DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETLCD 245 >gi|16331837|ref|NP_442565.1| hypothetical protein sll0103 [Synechocystis sp. PCC 6803] gi|2496792|sp|Q55874|Y103_SYNY3 RecName: Full=Uncharacterized protein sll0103 gi|1208467|dbj|BAA10635.1| sll0103 [Synechocystis sp. PCC 6803] Length = 420 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 59/179 (32%), Gaps = 33/179 (18%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVHKLIRTIVKTF 327 V+ A +I +++ D R+ F+ R VI + G + Sbjct: 59 LETVKSAALGLIDRLEEDD------RLSVIAFDHRAKIVIENQQVRNGAA------IAKA 106 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEG 386 G TAI++ ++ ED V +I LLTDGEN DN+ Sbjct: 107 IERLKAEGGTAIDEGLKLGIQEAAKGKEDRV----------SHIFLLTDGENEHGDNDRC 156 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRD 443 + + A + + T+ F + L A+ S + E FR Sbjct: 157 LKLGTVASDYKLTVHTLGF-----GDHWNQDVLEAIAASAQGSLSYIENPSEALHTFRQ 210 >gi|148726250|emb|CAN88322.1| matrilin 1 [Danio rerio] gi|148726498|emb|CAN88268.1| matrilin 1 [Danio rerio] Length = 277 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 62/165 (37%), Gaps = 12/165 (7%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + G + ++ F+ SS + V+ LA VI + + R+G Sbjct: 25 AAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVKVFLAKVIDGL-SVGPDA--TRVGVVN 81 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + RV ++ S HK +VK + E T A+Q A + S E Sbjct: 82 YASRVKNEVSLK--SHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAE----GG 135 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K +++TDG + + I +A+ GI I I Sbjct: 136 RKSPDISKVAIIVTDG---RPQDNIRDIAARAREAGIEIFAIGVG 177 >gi|28900585|ref|NP_800240.1| hypothetical protein VPA0730 [Vibrio parahaemolyticus RIMD 2210633] gi|28808965|dbj|BAC62073.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 466 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 56/386 (14%), Positives = 128/386 (33%), Gaps = 68/386 (17%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 F K K G II +P+++ V + +++AA+ A + Sbjct: 4 FNMKGFTK-QKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASP 62 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + ++ ++Y R + + N +D E + ++ +K Sbjct: 63 GKDNKDD---------------QDYAQRIVDLYITDNISDIE---------ISVSTKKCE 98 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIK--SWLIQT--------KAEAETVSRSYHKEHGVS 178 Y+ R + L+P + F + SW+ K ++++R Y + V Sbjct: 99 YKDGCVQRNN-ELSPFADFTVVATAEHDSWISHNEIGVEPKFKVSGDSITRKYLPQ-PVD 156 Query: 179 IQWVIDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 I +++D S+SM + + + + R VK + + K P S Sbjct: 157 IYFILDTSQSMSNPWYGERNKTQMQVVKDTITRVVK---------ELENFKTGPDKKSRV 207 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS--IKKIDNVNDTV 295 L Y Y D D +S + + + +++ + Sbjct: 208 ALLTYNAYNAKFDKGAGRVKLYDYASEFSHTE--ASFESIVDKMFDKSVVEQKPHY---- 261 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 A+ +N + ++ I+ + + G T + +I++ + Sbjct: 262 ---ASDYNKSQD--IPLT-DKYQEFIDILNSNKVMPARGGGTQS-------WLGLIAAAK 308 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K + ++ ++L+DG +T Sbjct: 309 EADKVKKEDRNPEQVFIILSDGADTD 334 >gi|332860822|ref|XP_001152090.2| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Pan troglodytes] Length = 1312 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 57/178 (32%), Gaps = 22/178 (12%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + P L P FV S K + A+ ++ ++ D Sbjct: 269 IHYFAPRGLPPMEKNVVFVIDVSSSMFGTKMEQTKMAMNVILSDLQAND----------- 317 Query: 301 FFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +FN SD W I+ + K + G T IN A+ A + SN Sbjct: 318 YFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLHCMEADGWTDINSALLAAASVLNHSN 377 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNK 409 ++ I+ LTDGE T I + + + + ++AF + Sbjct: 378 QEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDA 433 >gi|70733679|ref|YP_257319.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68347978|gb|AAY95584.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 582 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 71/195 (36%), Gaps = 23/195 (11%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + VD S + LV+ L ++ ++ D V+ V G RV+ +P Sbjct: 201 NLVFLVDVSGSMDRREGLPLVKSTLKLLVDQLRDQDRVSLVVYAG----ESRVVLEP--- 253 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + ++T GSTA +Q AY + + + I+L Sbjct: 254 --TSGRDKAKIRTAIDQLTAGGSTAGASGIQLAYQ--------MAQQGFIDQGINR-ILL 302 Query: 374 LTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 TDG+ D + A+ + + G+ + T+ F V+ + A ++ Sbjct: 303 ATDGDFNVGVSDFDSLKAMAAEKRKSGVSLTTLGFGVDNYNEHLMEQLAD--AGDGNYAY 360 Query: 431 ANSTHELNKIFRDRI 445 ++ E K+ D++ Sbjct: 361 IDNLREARKVLVDQL 375 >gi|313126713|ref|YP_004036983.1| mg-chelatase subunit chld [Halogeometricum borinquense DSM 11551] gi|312293078|gb|ADQ67538.1| Mg-chelatase subunit ChlD [Halogeometricum borinquense DSM 11551] Length = 785 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 44/210 (20%), Positives = 76/210 (36%), Gaps = 30/210 (14%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + SL ML D + V S + K V+ A++ + ++K++ + N+ Sbjct: 354 SSSLASMLPVTTGDGRGQSTNIVLSIDVSGSSKGGMRVQKAVS--LSALKQLGDENEV-- 409 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G FN R S R + G+T I ++ A + Sbjct: 410 -GIVGFNHRTYSVAE--RQPLGPNREALADRIRRLQAGGATDIAGGLRGAGKMLGDDPGT 466 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 ++L++DG EE I+ + +S+G RI+ I N ++ Sbjct: 467 --------------VILISDG--HDRVEESISYAKQLRSEGKRIIAIGAGKNPNEKN--- 507 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A S S+F A T+ LN +F Sbjct: 508 --LRTIARASGGSYFRATETNRLNILFGGA 535 >gi|223670962|dbj|BAH22728.1| complement factor B precursor [Nematostella vectensis] Length = 858 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 46/154 (29%), Gaps = 16/154 (10%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 A + + + ++++ + G TA A+ T ++ Sbjct: 435 AAITYGTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQVVP----- 489 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 ++K + +TDG + K +G +I I ++ + R Sbjct: 490 ----FTREGSQKALFFITDGHSNIGG-SPRKAAKILKDKGFQIYAIGV----GKKVRRRE 540 Query: 418 FLSNCASPNSFF--EANSTHELNKIFRDRIGNEI 449 + + P + +L + + +I Sbjct: 541 LMEIASEPEDEYVISVRKYKQLLSAVKKAVHIKI 574 >gi|156390865|ref|XP_001635490.1| predicted protein [Nematostella vectensis] gi|156222584|gb|EDO43427.1| predicted protein [Nematostella vectensis] Length = 851 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 46/154 (29%), Gaps = 16/154 (10%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 A + + + ++++ + G TA A+ T ++ Sbjct: 428 AAITYGTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQVVP----- 482 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 ++K + +TDG + K +G +I I ++ + R Sbjct: 483 ----FTREGSQKALFFITDGHSNIGG-SPRKAAKILKDKGFQIYAIGV----GKKVRRRE 533 Query: 418 FLSNCASPNSFF--EANSTHELNKIFRDRIGNEI 449 + + P + +L + + +I Sbjct: 534 LMEIASEPEDEYVISVRKYKQLLSAVKKAVHIKI 567 >gi|55251329|emb|CAH69127.1| novel protein similar to vertebrate matrilin 3 (MATN3) [Danio rerio] Length = 454 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 73/229 (31%), Gaps = 28/229 (12%) Query: 235 SCNKSLYYMLYPGPLDPSLSEE----HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 N + L P D F+ SS + V+ LA ++ ++ Sbjct: 40 RNNGLPHRTLNPAATDSQCRSRPLDLVFIIDSSRSVRPGEFEKVKIFLADMVDTLDVGP- 98 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDT 349 D R+ + V + L + +K + T A++ A D Sbjct: 99 --DATRVAVVNYASTVKIEFLLK---SHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDE 153 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + KN K +++TDG + ++ + A++ GI I + Sbjct: 154 AFTEKSGARPKSKN---ISKVAIIVTDG---RPQDQVEEVSAAARASGIEIYAVGV---- 203 Query: 410 TQQEKARYFLSNCASP--NSFFEANS---THELNKIFRDRI-GNEIFER 452 + R ++P + F + +L FR+ + G + Sbjct: 204 -DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETLCGMDACAM 251 >gi|120599917|ref|YP_964491.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|120560010|gb|ABM25937.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] Length = 638 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 84/264 (31%), Gaps = 23/264 (8%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S+ + + LN F K+ + + + + M+ L Sbjct: 182 KEGSLPEKGTIRIEEMLNYFTYDYPLPNKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYE 241 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + +D S K L++ AL + + + D V+ V GA Sbjct: 242 LTKSELGASNLVFLLDVSGSMASADKLPLLQTALKMLTQQLSAQDKVSIVVYAGAAG--- 298 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 V+ D G + GST + + AY + H ++ Sbjct: 299 -VVLD-----GASGDDIQALTYALEQLRAGGSTNGSQGILQAYQL------AQKHFIQGG 346 Query: 365 LEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + ++L TDG+ I++ K K +GI + T+ F ++ + Sbjct: 347 INR---VILATDGDFNVGVTNFDLLISLIEKEKQRGIGLTTLGFGMDNYNDQLMEQLADK 403 Query: 422 CASPNSFFEANSTHELNKIFRDRI 445 + ++ +E K+ D + Sbjct: 404 --GNGHYAYIDTLNEARKVLVDEL 425 >gi|311252837|ref|XP_003125292.1| PREDICTED: vitrin-like [Sus scrofa] Length = 595 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 52/416 (12%), Positives = 124/416 (29%), Gaps = 49/416 (11%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSLEEVSSRAKNSFTFPKQK 89 L G + R+ + L AQ I + PL +Q + +++ + Sbjct: 213 FLIDGSSSIGKRRFRIQKQFLADVAQALDIGPAGPLMGVVQYGDNPATQFNLKTHMNSRD 272 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 ++ + + + N + R A V + + Sbjct: 273 LKTAIEKITQRGGLSNVGRAISFVTKNFFSKSNGNRGGAPNVAV--------------VM 318 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 G + ++ + +E G++I ++ + Q E N + Sbjct: 319 VDGWPTDKVEEASR-------LARESGINIFFITIEGAVENEKQYVLEPNFANKAVCRTN 371 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + L + ++ + S +D SS Sbjct: 372 GFYSLNVQNWYGLHKSVQPLVKRVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTGN 427 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ +A++ + D R+GA + F + + ++ Sbjct: 428 FRTVLQF-VANLSKEFDISDTD---TRVGAVQYT--YEQRLEFGFDQYTTKPDVLNAIKR 481 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+ A+ A + + ++ +K ++L+TDG + ++ Sbjct: 482 VGYWSGGTSTGAAINYALEQLFKKSKPNK---------RKLMILITDG---RSYDDVRIP 529 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A +G+ I + QE+ ++ A ++FF + L K I Sbjct: 530 AMVAHHKGVITYAIGVA--WAAQEELEIIATHPARDHAFF-VDEFDNLYKSVPKII 582 >gi|312196190|ref|YP_004016251.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227526|gb|ADP80381.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 319 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 59/195 (30%), Gaps = 30/195 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + + + + +G F+ + R VK Sbjct: 107 SRLAAAKQGAQAFVDQL---PPKIN---LGLVSFSGTAAVLVPPT-----TDRDAVKAGI 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL-EAKKYIVLLTDGENTQDNEEGI 387 +TAI + + Y + + N + + IVLL+DGE T+ Sbjct: 156 NGLQLGPATAIGEGI---YAGLSAINTVSSQFVNSGQAVPPAAIVLLSDGETTRGRPNN- 211 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYF---------LSNCA--SPNSFFEANSTHE 436 AK I + TIA+ + LS A + S A S E Sbjct: 212 QAAQAAKDAHIPVSTIAYGTPNGTLDVGGQLIPVPVNEPALSQIAEQTGGSHHRATSGDE 271 Query: 437 LNKIFR---DRIGNE 448 L I++ IG Sbjct: 272 LTSIYKGLGSSIGYR 286 >gi|89068992|ref|ZP_01156373.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] gi|89045361|gb|EAR51426.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] Length = 669 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/305 (14%), Positives = 91/305 (29%), Gaps = 28/305 (9%) Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS----RS 188 + DL L P + ++ ++ AE + S + W + S Sbjct: 193 AAPGDDLALAPPAGTEAFPDVEPSSLKIAAEEPVSTFSIDVDTA---SWAVVRSSLTRGQ 249 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + +N F + +G+ D + + L L P Sbjct: 250 LPPRDAVRIEEMVNYFPYDYPAPAAGEAPFRATLGVMDSPWAEGRQLVHIGLQGALPPVE 309 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 P L+ VD+S K L++ + ++ ++ D + G+ V++ Sbjct: 310 ARPPLNLVFLVDTSGSMDAPDKLGLLKQSFRLMLSELRPQDEIAIVAYAGSAG---EVLA 366 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 TI+ GST ++ AY T + ED Sbjct: 367 PTP-----AGERATILAALE-RLAAGGSTNGAGGLEQAYATAEAMTEDGEVSR------- 413 Query: 369 KYIVLLTDGENTQDNEEGIAICN---KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 I+L TDG+ + A+ + + G + + F + N Sbjct: 414 --ILLATDGDFNVGLSDPSALEDFIADKRDSGTYLSVLGFGRGNLDDATMQALAQNGNGT 471 Query: 426 NSFFE 430 ++ + Sbjct: 472 AAYID 476 >gi|291407509|ref|XP_002720066.1| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Oryctolagus cuniculus] Length = 1320 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 65/218 (29%), Gaps = 22/218 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P L P FV S K + A+ ++ ++ D Sbjct: 265 DGYFIHYFAPRGLPPVEKNVVFVIDVSGSMFGTKMQQTKKAMNVILSDLQAND------- 317 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTI 350 +FN SD W I+ K + G T IN A+ A + Sbjct: 318 ----YFNIISFSDTVSVWRAGGSIQATSQNVHSAKNYLDHMEAAGWTDINAALLEAASVL 373 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSV 407 SN++ I+ LTDGE T I + + + + ++AF Sbjct: 374 NHSNQEPGRSPGVGRTP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGD 431 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + R L N +E + + I Sbjct: 432 DADFPLLRRLSLENRGVARRIYEDTDAALQLEGLYEEI 469 >gi|194288834|ref|YP_002004741.1| flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] gi|193222669|emb|CAQ68672.1| putative flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] Length = 418 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 53/163 (32%), Gaps = 1/163 (0%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE-EVSSR 78 G II L + V++G G+ +D+ + + L+ + + A+ + + VS Sbjct: 19 GAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVTGATPLLVSEA 78 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A + + + NL +++D V + K Y + R Sbjct: 79 AGLTTGTRNAALFQGKAVEMFENLNVSYSDTPDNTFYTKDKVPYSLDKIKYVKCTAERTG 138 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + + G+ A A + S + + Sbjct: 139 IAQWFIQVLNTLPGMNIQPSTVNAMAVATTTSAQTACAIPVYI 181 >gi|297290486|ref|XP_001113553.2| PREDICTED: complement factor B isoform 1 [Macaca mulatta] Length = 1266 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 49/152 (32%), Gaps = 9/152 (5%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEV 358 ++ P+ ++ T ++E G T A+ + Y + + + Sbjct: 146 YSYDFPEDVAPALGTSFSHMLGATNPTQRTKDHENGTGTNTYAALNSVYLMMNNQMQLLG 205 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKT 410 + E + I+LLTDG++ A+ N+ ++ + I I Sbjct: 206 MKTMAWQEIRHAIILLTDGKSNMGGSPKTAVDQIREILNINQKRNDYLDIYAIGVGKLDV 265 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + S F T L+++F Sbjct: 266 DWRELNELGSKKDGERHAFILQDTKALHQVFE 297 >gi|301627727|ref|XP_002943021.1| PREDICTED: complement factor B-like, partial [Xenopus (Silurana) tropicalis] Length = 705 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 43/303 (14%), Positives = 79/303 (26%), Gaps = 29/303 (9%) Query: 164 AETVSRSYHKEHGVSI--QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 A SY E VS Q + S +R+ +P+ R +Y + Sbjct: 151 ASKSGTSYKMESKVSYTCQQGLVMFGS---KERECLENKDWSGTEPSCRQWYTYDTPREV 207 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + + + N L L V +S K + ++A Sbjct: 208 AKGFSSSMLENVDTTNLEGRSDRKVQILKDGLMNIFIVLDTSKSVGKDKFNEAKEASILF 267 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRTIVKTFAIDENEMGST 337 I + D + I S S +I + + T Sbjct: 268 IEKVSSYDIKPQYC---IISYASEAIPVVSLRDQDSKNADAVIEHLENFVYDSHADKQGT 324 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY---IVLLTDGENTQDNE---------- 384 A+ + Y +I E N K I+L+TDG+ + Sbjct: 325 NTRAALHSIYQQLI-EQELVYKNNNNKESFMKIHNVILLMTDGKFNMGGDPREEMKVIRR 383 Query: 385 --EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + + + + + Q E S F + ++ + F Sbjct: 384 FLNVGTSKDDLREEYLDVYVFGLGSDIDQPE-INDLASKKDKEVHTFHLENVDKMKEFFE 442 Query: 443 DRI 445 I Sbjct: 443 LMI 445 >gi|293356227|ref|XP_574569.3| PREDICTED: rCG40584-like [Rattus norvegicus] Length = 1182 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/350 (12%), Positives = 102/350 (29%), Gaps = 64/350 (18%) Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 ++ ++ T K Y + L + P ++ + + A Sbjct: 47 SWVVVGAPKEIQATNQIGGLYKCGYHTGMCESISLQVPPEAVNMS--------LGLSLTA 98 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 S W++ ++ R++ CF + + Sbjct: 99 TNPS------------WLLACGPTVHHICRENTYLTGLCFLLDSSFRQNQNFPAAQQGCP 146 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R ++ +++ + S+ Y + L + + SS R + + Sbjct: 147 RQDQDIVFLIDGSGSITYTDFEKMLAFVKAVMSQLQQSSTRFSLMQF------------- 193 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + T R FND + + + GST A++ Sbjct: 194 -------SHTFRTHF-TFNDFISTSSPL------------RLLDFVNQLRGSTRTASAIK 233 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + ++ + K ++++TDG D + + A++ GI I Sbjct: 234 HVITELFTTQKGARKDAT------KILIVITDGRKEGDRLDYGDVIPMAEAAGIIRYAIG 287 Query: 405 FSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 Q + + L + AS F + L I ++++ +IF Sbjct: 288 VGQAFYQAQSRQE-LKDIASSPSREYVFSVENFDALKDI-QNQLKEKIFA 335 >gi|293344414|ref|XP_001080404.2| PREDICTED: rCG40584-like [Rattus norvegicus] gi|149067645|gb|EDM17197.1| rCG40584, isoform CRA_b [Rattus norvegicus] Length = 1182 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/350 (12%), Positives = 102/350 (29%), Gaps = 64/350 (18%) Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 ++ ++ T K Y + L + P ++ + + A Sbjct: 47 SWVVVGAPKEIQATNQIGGLYKCGYHTGMCESISLQVPPEAVNMS--------LGLSLTA 98 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 S W++ ++ R++ CF + + Sbjct: 99 TNPS------------WLLACGPTVHHICRENTYLTGLCFLLDSSFRQNQNFPAAQQGCP 146 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R ++ +++ + S+ Y + L + + SS R + + Sbjct: 147 RQDQDIVFLIDGSGSITYTDFEKMLAFVKAVMSQLQQSSTRFSLMQF------------- 193 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + T R FND + + + GST A++ Sbjct: 194 -------SHTFRTHF-TFNDFISTSSPL------------RLLDFVNQLRGSTRTASAIK 233 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + ++ + K ++++TDG D + + A++ GI I Sbjct: 234 HVITELFTTQKGARKDAT------KILIVITDGRKEGDRLDYGDVIPMAEAAGIIRYAIG 287 Query: 405 FSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 Q + + L + AS F + L I ++++ +IF Sbjct: 288 VGQAFYQAQSRQE-LKDIASSPSREYVFSVENFDALKDI-QNQLKEKIFA 335 >gi|149067644|gb|EDM17196.1| rCG40584, isoform CRA_a [Rattus norvegicus] Length = 1084 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/350 (12%), Positives = 102/350 (29%), Gaps = 64/350 (18%) Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 ++ ++ T K Y + L + P ++ + + A Sbjct: 47 SWVVVGAPKEIQATNQIGGLYKCGYHTGMCESISLQVPPEAVNMS--------LGLSLTA 98 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 S W++ ++ R++ CF + + Sbjct: 99 TNPS------------WLLACGPTVHHICRENTYLTGLCFLLDSSFRQNQNFPAAQQGCP 146 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R ++ +++ + S+ Y + L + + SS R + + Sbjct: 147 RQDQDIVFLIDGSGSITYTDFEKMLAFVKAVMSQLQQSSTRFSLMQF------------- 193 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + T R FND + + + GST A++ Sbjct: 194 -------SHTFRTHF-TFNDFISTSSPL------------RLLDFVNQLRGSTRTASAIK 233 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + ++ + K ++++TDG D + + A++ GI I Sbjct: 234 HVITELFTTQKGARKDAT------KILIVITDGRKEGDRLDYGDVIPMAEAAGIIRYAIG 287 Query: 405 FSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 Q + + L + AS F + L I ++++ +IF Sbjct: 288 VGQAFYQAQSRQE-LKDIASSPSREYVFSVENFDALKDI-QNQLKEKIFA 335 >gi|126733209|ref|ZP_01748956.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] gi|126716075|gb|EBA12939.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] Length = 632 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 52/305 (17%), Positives = 108/305 (35%), Gaps = 50/305 (16%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + ++ G ++T LL+ ML VGGM VD +R+ L+ A A++ A+ Sbjct: 57 FIRQFRNDEDGGLIVLTLLLLISMLVVGGMAVDFMRFESERTKLQSVADRAVLAAA---- 112 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 +++ +I +F FT + T + ++ Sbjct: 113 --------------NLNQEREAADVITDF-------FTAEGFGGSIVGTP-SIQKNRNGS 150 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + L S D+ + +LR +GI + + A A + + + E V+D S SM Sbjct: 151 TIRLESIVDVD----TFYLRLVGIDT--LSAPANATAIEGTGNVEVS----LVLDISGSM 200 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + +D ++ N R E +++ L L Sbjct: 201 GSRMTG------DAYLYDSDGEIRLDPDGNPLTERRTEDRMFFLIQEANKFIGDL----L 250 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + ++ + ++ + AL + SI + DN+ + G+ D + Sbjct: 251 KDEYRDRVSINLVAYSQHVRLGDDLYTALNTTPDSIDEDDNLGSSY--GSIT--DGYTAP 306 Query: 310 PSFSW 314 +++W Sbjct: 307 FTYTW 311 >gi|170727657|ref|YP_001761683.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169813004|gb|ACA87588.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 640 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 38/264 (14%), Positives = 80/264 (30%), Gaps = 23/264 (8%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + + +N F + + +P+ + L Sbjct: 186 GVLPERGTVRVEELINYFAYQYPAPDAGEQPFSVNTELAPSPYNPHKMLLRIGLKGFEKE 245 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + +D S K L+++AL + + + + D ++ V GA+ V Sbjct: 246 KADLGASQLVFLLDVSGSMSSQDKLPLLKNALKMLSQQLDEGDRISIVVYAGASG----V 301 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + D GV + GST ++ AY + H + + Sbjct: 302 VLD-----GVKGNDTLAISQALDKLKAGGSTNGGAGIELAYQL------AQKHFIAGGVN 350 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 ++L TDG+ + A+ + + QGI + T+ F Sbjct: 351 R---VILATDGDFNVGVSDQQALEDMIEEKRKQGIALTTLGFGQGNYNDHLMEQLADK-- 405 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 + ++ +E K+ D I Sbjct: 406 GNGHYAYIDTLNEARKVLVDEISA 429 >gi|56797853|emb|CAG26904.1| matrilin-1 [Danio rerio] Length = 277 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 62/165 (37%), Gaps = 12/165 (7%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + G + ++ F+ SS + V+ LA VI + + R+G Sbjct: 25 AAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVKVFLAKVIDGL-SVGPDA--TRVGVVN 81 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + RV ++ S HK +VK + E T A+Q A + S E Sbjct: 82 YASRVKNEVSLK--SHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAE----GG 135 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K +++TDG + + I +A+ GI I I Sbjct: 136 RKSPDISKVAIIVTDG---RPQDNIRDIAARAREAGIEIFAIGVG 177 >gi|218515243|ref|ZP_03512083.1| hypothetical protein Retl8_16920 [Rhizobium etli 8C-3] Length = 94 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 34/88 (38%), Gaps = 11/88 (12%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ----------AAQTAIIT 63 I +G+F I+TALLM ++G GM VD L A ++ + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIE 91 A++ + +S ++ ++ Sbjct: 68 AAMAM-NGNGTISLGKTDARDIFMSQVS 94 >gi|168698099|ref|ZP_02730376.1| von Willebrand factor type A domain protein [Gemmata obscuriglobus UQM 2246] Length = 311 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 52/151 (34%), Gaps = 17/151 (11%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNED 356 G TFF + V+ + V + G T I A++ +I E Sbjct: 125 GLTFFGNEVLHWCPLTTDVSAINCATPFMRPGQLPPWFGGTLIAKALRACKAELIKRPE- 183 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + IVL+TDG++ N + + K++GI + + ++ Q Sbjct: 184 ----------GDRMIVLITDGDSQDFANGADAEVAEELKAEGITVFAVVIGNDRQFQNPI 233 Query: 416 RYFLS----NCASPNSFFEANSTHELNKIFR 442 S + FEA + L +F+ Sbjct: 234 IRNGSVQTVTARTGGESFEAGDPNALATVFK 264 >gi|52138687|ref|NP_001004392.1| collagen alpha-1(XX) chain [Gallus gallus] gi|14280020|gb|AAK58847.1| collagen type XX alpha 1 precursor [Gallus gallus] Length = 1472 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 51/349 (14%), Positives = 102/349 (29%), Gaps = 57/349 (16%) Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD-------REVRDIVRDTA 119 + L + + + R F KN + T+ Sbjct: 78 ATVGGLSPTKEYTLQVYVLNGSQEALFAKRKFVIEDLKNASQTRNNRRNSGAAPGKNLTS 137 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 V + + + + + PLS L + + +AE + + K G ++ Sbjct: 138 VGSSTAEQSLGMETAP------PPLSTVL------THPAKDRAEKKRHKGTQPKGSGETM 185 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 QP Q T KS S Sbjct: 186 -----------------RSQPSVGVTQTPPPTTKSSQRATANTEREPPGKEKPTRD---S 225 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 L ++++ + S L+++ L+++I D +R+G Sbjct: 226 LRRGSQLQCDTSAMTDIVLLVDGSWSIGRSNFKLIKEFLSALISPFNIA---QDKIRVGL 282 Query: 300 TFFNDRVISDPSFSWGVHK--LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 + ++ DP W + +++ + G+T A+ + + D Sbjct: 283 SQYSS----DPRTEWDLSAYATRDQVLEAVRNLRYKGGNTFTGLALTHVLEQ--NLKPDA 336 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 R+ EA+K ++LLTDG + ++ K+ GI I I Sbjct: 337 GARL----EAEKLVILLTDG---KSQDDANLAAQTLKNMGIEIFAIGVK 378 >gi|221042220|dbj|BAH12787.1| unnamed protein product [Homo sapiens] Length = 648 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 72/213 (33%), Gaps = 18/213 (8%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T INDAM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E R+ I+LLTDG+ T +I N + ++ + + Sbjct: 370 E--RLPEGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSL--FCLGFGFDVSY 423 Query: 417 YFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N +++S +L +++ Sbjct: 424 AFLEKLALDNGGLARRIHEDSDSALQLQDFYQE 456 >gi|147901111|ref|NP_001079801.1| matrilin 1, cartilage matrix protein [Xenopus laevis] gi|32450626|gb|AAH54272.1| MGC64509 protein [Xenopus laevis] Length = 490 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 63/170 (37%), Gaps = 20/170 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + I+ +D + R+G + V ++ S HK +++ + T Sbjct: 61 LSQVIESLDVGANATRVGLVNYASTVKNEFSLK--THKAKPALLQAVKKVQPLSTGTMTG 118 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + + +E R+K+ K IV +TDG + + I +A+ G+ I Sbjct: 119 LAIQYAMNNAFTESEGA--RIKSPGINKVAIV-VTDG---RPQDTVKDISARARESGLEI 172 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 I + L AS S +L+K F++ Sbjct: 173 YAIGVG------RVDKNTLRQIASEPLDEHVDYVESYSLIEKLSKKFQEA 216 >gi|26352386|dbj|BAC39823.1| unnamed protein product [Mus musculus] Length = 902 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 96/309 (31%), Gaps = 37/309 (11%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 182 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 241 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 242 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 302 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 361 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 362 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 412 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 LTDGE+ GI C +A G I TIA + + LS+ F+ Sbjct: 413 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSA---ARELETLSDMTGGLRFYANK 464 Query: 433 STHELNKIF 441 + L F Sbjct: 465 DLNSLIDAF 473 >gi|19031201|gb|AAL17974.1| proximal thread matrix protein 1 [Mytilus galloprovincialis] Length = 453 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 52/395 (13%), Positives = 121/395 (30%), Gaps = 44/395 (11%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L + T + + + S K +F K K +E + + ++ + +N E+ Sbjct: 82 ELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEI 141 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 ++ + ++ ++ + G K +AE V+ Sbjct: 142 GTGLKHVRENSFSGAEG---GGNPDKQKIVILMTDGKSNAGAPPQHEAHKLKAEGVTVIA 198 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 ++ ++ +M +Y + + D Sbjct: 199 IGIGQGFVKTELEQIATMKNYVLTTNSFSELSTLLKL---------------VIDLACEV 243 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 +V C +++++ + L+++ + ++ K Sbjct: 244 CVVDCAGHADIAFVFDASSS-------INANNPNNY----QLMKNFMKDIVDRFNKTGPD 292 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + F DR + + +G TAI D ++ A + Sbjct: 293 G--TQFAVVTFADRATKQFGLK-DYSSKADIKGAIDKVSPSIIGQTAIGDGLENARLEVF 349 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + E +K ++LLTDG+N ++ + + +G+ I+ I Sbjct: 350 PN-----RNGGGREEVQKVVILLTDGQNN-GHKSPEHESSLLRKEGVVIVAIGV-----G 398 Query: 412 QEKARYFLSNCASPNSF-FEANSTHELNKIFRDRI 445 + L N AS + F +S +L+KI D + Sbjct: 399 TGFLKSELINIASSEEYVFTTSSFDKLSKIMEDVV 433 >gi|198422181|ref|XP_002120553.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 1038 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 70/232 (30%), Gaps = 11/232 (4%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK--SLYYM 243 S S L+ + G C + + + G+ S C + Sbjct: 121 SNSPLNSTKLQNGFKNACIQNYQNLPSLKWQYFGSEQGVTTLFPSLRATDCGSFDNRCRP 180 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG-ATFF 302 Y P + V S + +L ++A SV+ ++ D F Sbjct: 181 WYVQANVPKPKQIVIVIDKSGSMGVTNMNLAKEAAKSVVNTLNPQDRFAVMAFSSIFVPF 240 Query: 303 NDRVISDPSFSW---GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 V SD F+ + V+ F + G T A+Q A+ Sbjct: 241 QSTVASDQCFATTFADASPQNKKKVEDFVDTISSGGGTNYAPALQKAFSFFQQEPSVSDF 300 Query: 360 RMK--NNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFS 406 +K + E + I+ ++DG ++ N+ + + I+T Sbjct: 301 NIKKIDPSEIDRVILFMSDGIPNDPGSTILSAQIRANEQLNNSVIILTYGLG 352 >gi|332524448|ref|ZP_08400660.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] gi|332107769|gb|EGJ08993.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] Length = 343 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 66/210 (31%), Gaps = 37/210 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + + S+ + VR+G F + + + I + Sbjct: 107 RLVAAQEAARAFVESLPR------EVRVGVVSFAGTAAVVQAPTTSRDDVFAAIERFQLQ 160 Query: 330 DENEMGS------TAINDAMQTAYDTIISSNED-------EVHRMKNNLEAKKY----IV 372 +GS I I E + Y ++ Sbjct: 161 RGTAIGSGIVLSLATIFPDAGIDIQQITGQRTMPRMLGDPEKKAEFTPVPPGSYASAAMI 220 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMT----------IAFSVNKTQQEKARYFLSNC 422 LLTDG+ T + I A +GIR+ T I F + L Sbjct: 221 LLTDGQRTTGP-DPIDAAKMAADRGIRVYTVGVGTTQGEIIGFEGWSMRVRLDEDTLRQI 279 Query: 423 A--SPNSFFEANSTHELNKIFRDRIGNEIF 450 A + +F A + +L K+++ R+G+ + Sbjct: 280 AQMTTGEYFYAGTAEDLKKVYQ-RLGSRMV 308 >gi|118086119|ref|XP_426008.2| PREDICTED: similar to alpha 3 type VI collagen [Gallus gallus] Length = 2533 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/305 (13%), Positives = 93/305 (30%), Gaps = 28/305 (9%) Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 N R Q S+ ++++ S +R + Q T + + + + Sbjct: 685 NNVRFGVVQYASESKTEIIIGQHSQMMRLTEAIENINQIGGGTRTGNALRSMKSLFQMAY 744 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + ++ +N + + + K ++ E L + N+ + Sbjct: 745 RENVPQILIVITDGKSEDKVNQAARDLRQQGIVIYAIGIKDAVQQE-LEEIAETKNRMFF 803 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKK-KHLVRDALASV-IRSIKKIDNVNDT----- 294 + V S+++ ++ + D+ S+ +KI + + Sbjct: 804 VNDFDSLKHIKHEIVQEVCSTNVCKNVRADIVFLVDSSNSIRAAEFQKIKDFMQSFVIKV 863 Query: 295 ------VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 VR+G F+ + + + I + + + T A+ A Sbjct: 864 DVGLDNVRIGLIQFSSEIREEFQLDR--YSTIADVQRAIQEMQQIKLGTLTGKALTFAAS 921 Query: 349 TIISSNEDEVHRMKNNLE-AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 R K K+Y++++TDGE + + +GI I I Sbjct: 922 YF--------DRPKGGRPELKQYLIVITDGEAQDSVKSP---ARAIRDKGITIYAIDMLQ 970 Query: 408 NKTQQ 412 Q Sbjct: 971 ANNSQ 975 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 71/212 (33%), Gaps = 29/212 (13%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 L G +D ++ +F+ S ++ + VIR + + VR G + Sbjct: 638 SLKKGCMDTEEADIYFLIDGSGSIYPSDFKDMKTFMNEVIRIFQL---GANNVRFGVVQY 694 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 S G H + + + G T +A++ S + Sbjct: 695 AS--ESKTEIIIGQHSQMMRLTEAIENINQIGGGTRTGNALR-------SMKSLFQMAYR 745 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 N+ + ++++TDG + ++ + QGI I I ++ + L Sbjct: 746 ENVP--QILIVITDG---KSEDKVNQAARDLRQQGIVIYAIGI------KDAVQQELEEI 794 Query: 423 A-SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 A + N F N L I +EI + V Sbjct: 795 AETKNRMFFVNDFDSL-----KHIKHEIVQEV 821 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 79/219 (36%), Gaps = 16/219 (7%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++++ F+ +S + V++ L+S++ S+ D D +R+G + Sbjct: 34 AQQTACSKATVADVVFIVDTSTSIAQENFQKVKNFLSSLVSSL---DIGLDMIRVGLAQY 90 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 +D + L ++ G T A+ + E R K Sbjct: 91 SDEAYQVFLL--NQYLLKSDVLDQIGNLPYRGGETYTGRALDFVSTRYFT--ESAGSRAK 146 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---QQEKARYFL 419 + + VL+T GE+ + E+ K + +GI I + + T QQ ++ F Sbjct: 147 GYVP--QLAVLITSGESNDEVEQP---AKKLRYRGISIYVVGIGIQNTTELQQIASKPFR 201 Query: 420 SNCASPNSFFEANS-THELNKIFRDRIGNEIFERVIRIT 457 S SF + + L + F I ++I R Sbjct: 202 RYLYSIGSFDDLPDLSTRLLQNFCIAIESQIQAFAKRYA 240 >gi|110667707|ref|YP_657518.1| hypothetical protein HQ1753A [Haloquadratum walsbyi DSM 16790] gi|109625454|emb|CAJ51881.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790] Length = 799 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 55/163 (33%), Gaps = 25/163 (15%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ + + D+ +G FN + R + G T I + + Sbjct: 409 ALNALGQLGDSTSVGVVGFNRQAYEVVGLEQLTEN--RDTTRQRIRQLRAGGGTNIANGL 466 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + A + + ++L++DG + + +A +G+R++T+ Sbjct: 467 RGAEEMLDGQRGT--------------VILISDGVDARSRATVVA--ESLGRRGVRVITV 510 Query: 404 AFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A S ++F+AN T L +F Sbjct: 511 GAGQRVNEP-----LLEQIADISGGTYFQANETDRLRILFGGS 548 >gi|116622501|ref|YP_824657.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225663|gb|ABJ84372.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 337 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 68/187 (36%), Gaps = 28/187 (14%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + A+V + + ++ FNDR F+ +L + Sbjct: 126 LAKSRAAVAAFLSSANPEDEF---SLVLFNDRAQLVSGFNRQTDELQSKL-----FYAQS 177 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G TA+ DA+ A D + + +K +++++DG + + N+ Sbjct: 178 KGRTALLDAIYLAMDQMKHAKHS-----------RKAVLVISDGGDNCSRYSMREVKNRV 226 Query: 394 KSQGIRIMTI------AFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRI 445 K +I +I F ++ L + AS FE ++ +EL+ + +I Sbjct: 227 KEGDAQIYSIGILEAMGFRGRSAEELAGPALLDDIASQSGGRLFEIDNLNELSDV-ASKI 285 Query: 446 GNEIFER 452 G + + Sbjct: 286 GMALRNQ 292 >gi|327441394|dbj|BAK17759.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 961 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 61/158 (38%), Gaps = 17/158 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + +D+ + R G FN V + + + + + + F + + + ++ ++ Sbjct: 72 LSLMDDASSKDRFGFVGFNKEVTKELALTNNIVQAKSKL-NEFGKNISPYMANDLSKGLE 130 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A D + K + K IV++T G N+ NE + KA + I I TI+ Sbjct: 131 KAVDELT----------KKSTSNDKVIVIMTVG-NSIYNEVSKKLAAKAYEEDITIHTIS 179 Query: 405 FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 F FL+ A + ++ + + L + Sbjct: 180 FGDPLYADAP---FLTEIAKLTGGNYTHSPNAAFLKDV 214 >gi|120437735|ref|YP_863421.1| von Willebrand factor (vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579885|emb|CAL68354.1| membrane protein containing von Willebrand factor (vWA) type A domain [Gramella forsetii KT0803] Length = 354 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/117 (20%), Positives = 43/117 (36%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + ++ D TAI+DA++ A + Sbjct: 130 RVGIIAYAGGAYPQLPITTDFS-AAKMFLQALNTDMISSQGTAISDAIELATTYYDDDQQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +++DGE+ + N E IA +A +GIRI TI K Sbjct: 189 TN-----------RVLFIISDGEDHEGNVEDIA--EQAAEKGIRIFTIGVGTEKGGP 232 >gi|72162840|ref|YP_290497.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916572|gb|AAZ56474.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 609 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 68/195 (34%), Gaps = 34/195 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP----------SFSWGVHK 318 + L ++A + + D R+G F+ + + V+ Sbjct: 432 TRLELAKEAAITSLDEFSDSD------RVGLWMFSTDLEDNGQDWRELVPLGPLGASVNG 485 Query: 319 L-IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R + + G T + D A+ + + + +V LTDG Sbjct: 486 TPRREELAERISNLPPGGGTGLYDTALAAHTLVAEHSRPDAINA---------VVFLTDG 536 Query: 378 ENTQDNEEGI-----AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 +N N + +I + QG+RI TI++ + + + A+ + ++A+ Sbjct: 537 KNEDLNGISLEKLLDSITPEPGQQGVRIFTISYGEDADLKTMTQI---AEATNAAAYDAS 593 Query: 433 STHELNKIFRDRIGN 447 ++++F I N Sbjct: 594 DPQSIDEVFEAVISN 608 >gi|254519993|ref|ZP_05132049.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226913742|gb|EEH98943.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 960 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 51/366 (13%), Positives = 108/366 (29%), Gaps = 47/366 (12%) Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 K K+ + RN + + + +D + V + + N S Sbjct: 224 TKAKLTLFSGRNKVGEQQVQIQKGKNSFVFKDKQSSGGFKGYRVLVEAEGDTNKVNNEFS 283 Query: 146 LFLRSMGIKSWLIQTKAEAET--------VSRSYHKEHGVSI------QWVIDFSRSMLD 191 F M + L+ + ++ S + K+ S + + S + D Sbjct: 284 TFTNVMDKPNILLINGVKGDSEALEGILSNSGANIKKIAPSASPSTLNELLEYKSIVLND 343 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 RD N F + VK Y G D + +Y Sbjct: 344 VHRD---DLSNGFMDNIEAYVKDYGGGLITFGGEDSYALGGYKDTSLEKVLPVYMDKRGK 400 Query: 252 SLSEEHFVD--------SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + ++ S+ + K L ++A + +++++D + F+ Sbjct: 401 NEVPAISINLIIDKSGSMSAEGGGVSKLTLAKEAAMKALENLREVDE------ISVIAFD 454 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D + +K G T+I A++ Y+ + S+ H Sbjct: 455 DTYDEVVPLQ---KVGDKEAIKELISGIQIRGGTSIYPALEQGYNMQMQSSAKIKHT--- 508 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 +LLTDG++ + + I + T+A + L++ Sbjct: 509 --------ILLTDGQDGYGLDNYATLLQNFIDNNITLSTVAVG--EGANAGLLNQLASIG 558 Query: 424 SPNSFF 429 S++ Sbjct: 559 KGRSYY 564 >gi|157375507|ref|YP_001474107.1| hypothetical protein Ssed_2370 [Shewanella sediminis HAW-EB3] gi|157317881|gb|ABV36979.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 461 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 14/160 (8%), Positives = 53/160 (33%), Gaps = 2/160 (1%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G ++ + + ++ V + +D + L+ A + + A+ L Sbjct: 19 RPYRKQGGAILVMFTIGLFSLIAVAALALDGGHLLLNKGRLQNAVDASALYAAKELQDGA 78 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-V 131 +R + T Q ++ +++ + D + + +E + + + Sbjct: 79 SLYEAREA-ATTLLLQNLQYQENGELNSSIDLSAPDYNSTQVAANLFIEFSEWPDPFSPI 137 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 ++ + + ++ L + + A ++ Sbjct: 138 LVEGSEYVRIRIENVGLTNFLAQIMNFDKSVRASAIAGRS 177 >gi|297285600|ref|XP_001085586.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Macaca mulatta] Length = 935 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 55/160 (34%), Gaps = 10/160 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T IN+AM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNEARSFAAGIQALGGTNINEAMLVAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 E R+ + I+LLTDG+ T +I + Sbjct: 370 E--RLPDGSV--SLIILLTDGDPTVGETNPRSIQKNVREA 405 >gi|149636528|ref|XP_001511995.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ornithorhynchus anatinus] Length = 800 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/145 (22%), Positives = 55/145 (37%), Gaps = 17/145 (11%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F++R I + + R + + E G T+I +Q A+ I Sbjct: 345 GIVLFDERAIIRNPLIQIISEDDRNYL-MTRLPEAAGGGTSICSGVQAAFQAI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 + K IVLLTDGE+ ++ C + K G I T+A + Q+ + Sbjct: 397 --KQKFQTTDGSEIVLLTDGEDV-----TVSSCFEEVKQSGATIHTVALGTSAAQELERL 449 Query: 417 YFLSNCASPNSFFEANSTHELNKIF 441 ++ S EA + + L F Sbjct: 450 SDMTGGISTAPSDEAQN-NGLIDAF 473 >gi|148680077|gb|EDL12024.1| mCG3350, isoform CRA_d [Mus musculus] Length = 902 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 96/309 (31%), Gaps = 37/309 (11%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 182 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 241 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 242 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 302 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 361 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 362 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 412 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 LTDGE+ GI C +A G I TIA + + LS+ F+ Sbjct: 413 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSA---ARELETLSDMTGGLRFYANK 464 Query: 433 STHELNKIF 441 + L F Sbjct: 465 DLNSLIDAF 473 >gi|3560547|gb|AAC35003.1| chloride channel CaCC [Mus musculus] Length = 901 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 96/309 (31%), Gaps = 37/309 (11%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 182 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 241 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 242 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 302 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 361 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 362 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 412 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 LTDGE+ GI C +A G I TIA + + LS+ F+ Sbjct: 413 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSA---ARELETLSDMTGGLRFYANK 464 Query: 433 STHELNKIF 441 + L F Sbjct: 465 DLNSLIDAF 473 >gi|134093095|gb|ABO52955.1| matrilin 4 isoform 1 precursor [Gorilla gorilla gorilla] Length = 581 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + VR L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETVRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 362 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 416 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 417 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 470 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 471 RAKEEGIVMYAVGVGKAVEAE------LREIAS 497 >gi|157105665|ref|XP_001648969.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] gi|108868963|gb|EAT33188.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] Length = 541 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 46/150 (30%), Gaps = 24/150 (16%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S + + V+ L+ S + R+ F+ ++ + Sbjct: 146 SVGKANFYSEIKFVKKLLSDFNVSY-------NYTRVAVITFSSQMKIFRHIDQISTSVE 198 Query: 321 RT-----IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + E G T A++ A + ++ D KK I L+T Sbjct: 199 DNDKCLLLNYQIPKIEFSGGGTYTYGALKEAEEIFQNARADS----------KKIIFLIT 248 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 DG + + I + K + + I +I Sbjct: 249 DGFSNG--RDPIPLAESLKRKNVVIYSIGI 276 >gi|86741605|ref|YP_482005.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86568467|gb|ABD12276.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 534 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/133 (24%), Positives = 54/133 (40%), Gaps = 24/133 (18%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 V TF TAI A++ Y ++ E + + + IVL+TDGEN Sbjct: 418 VNTFVDGLRLQDGTAIYSALEAGYRAAGAAVEADPGYLTS-------IVLMTDGENN--- 467 Query: 384 EEGIAICN--------KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 GI+ + A ++ +R TIAF + R ++ + + F+A T Sbjct: 468 -SGISAADFRSSYQRLPAAARAVRTFTIAFG--EADPAALRDISAD--TGGAVFDAR-TS 521 Query: 436 ELNKIFRDRIGNE 448 L F+D G + Sbjct: 522 SLADAFKDIRGYQ 534 >gi|297285598|ref|XP_002802831.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Macaca mulatta] Length = 900 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 55/160 (34%), Gaps = 10/160 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T IN+AM A + SSN++ Sbjct: 312 --LIVFSTEATQWRPSLVPASAENVNEARSFAAGIQALGGTNINEAMLVAVQLLDSSNQE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 E R+ + I+LLTDG+ T +I + Sbjct: 370 E--RLPDGSV--SLIILLTDGDPTVGETNPRSIQKNVREA 405 >gi|145224243|ref|YP_001134921.1| hypothetical protein Mflv_3659 [Mycobacterium gilvum PYR-GCK] gi|189040172|sp|A4T9I4|Y3659_MYCGI RecName: Full=UPF0353 protein Mflv_3659 gi|145216729|gb|ABP46133.1| von Willebrand factor, type A [Mycobacterium gilvum PYR-GCK] Length = 335 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 64/201 (31%), Gaps = 33/201 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + N +G + S + I K Sbjct: 118 RLTAAQEAAKQFADQLTPGIN------LGLIAYAGTATVLVSPTTNRESTKTAIDKLQLA 171 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNE 384 D TA + + TA I + + +VL++DG+ N + + Sbjct: 172 DR-----TATGEGIFTALQAIATVG---AVIGGGDEPPPARVVLMSDGKETVPSNPDNPK 223 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANS 433 AK QG+ I T++F E L A S F A+S Sbjct: 224 GAYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPVDDEMLKKIADLSGGEAFTASS 283 Query: 434 THELNKIF---RDRIGNEIFE 451 +L ++F +++IG E + Sbjct: 284 LEQLKQVFTNLQEQIGYETIK 304 >gi|222528069|ref|YP_002571951.1| YD repeat-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222454916|gb|ACM59178.1| YD repeat protein [Caldicellulosiruptor bescii DSM 6725] Length = 3027 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 84/233 (36%), Gaps = 36/233 (15%) Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 K + + +D ++ SS + + I++I + Sbjct: 746 KTEHFSTFLLGDKNMPVDLSKVDIVFVLDNSGSMSSNDPNYYRIEATKK----FIQNIDE 801 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 ++N R+G F+ V + + KL++ + GST I ++ A Sbjct: 802 LNN-----RVGLVDFDSSVSVRSNLTSDKSKLLQALNAMRWTG----GSTNIGGGLKAAL 852 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + + ++KK IVLL+DG + + K + I + TIA Sbjct: 853 ------------GLFDQEQSKKIIVLLSDGYHNTGIHPNDVLPELIKQE-IVVNTIAL-- 897 Query: 408 NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ R L + A + +F ++T L+ ++ + +I ++TK Sbjct: 898 ---GKDCDRELLHDIADKTKGGYFYVDNTGGLS---QEDVDKQIELIYEKLTK 944 >gi|149909538|ref|ZP_01898192.1| TadG-like protein [Moritella sp. PE36] gi|149807443|gb|EDM67394.1| TadG-like protein [Moritella sp. PE36] Length = 405 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 64/453 (14%), Positives = 134/453 (29%), Gaps = 73/453 (16%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I+ +GH ++ A+++P G+ + D R + L+ AA+ A++ + + Sbjct: 4 IRKQSGHAAMLFAMMIPAFFGIFTLASDGARALQSKARLEDAAEAAVLAIAAHNADNSGS 63 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S A N I +Y+ + K T DI N +Q + Sbjct: 64 SSGSAIN-KKIASDWIGQYMQDMQAISDIK-ITKLNCNDIAECKEGLENGESRYFQYEIL 121 Query: 135 SRYD-LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 ++ + L P + G ++ + + + V + +V DFS SM + Sbjct: 122 AKTNHLSWFPGNNSTAGFG------ESFDVVGSATARKFQSESVDVMFVSDFSGSMNNKW 175 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 V + + + D S Sbjct: 176 SGGSNSRRYKDLIKIIGDVIKELDKFNNAHTTTTNRVGFTGFNTYTR------KTADNSC 229 Query: 254 SEEHFVDSSSLRHVIKKKHLVR-----DALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 ++ + S+ + K V+ + + I DN N+ Sbjct: 230 YQDQYDRSAGRT--VNKIFEVKGCKSRSSGGAKFHDIAMTDNYNEF-------------- 273 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 K G TA + + ++ E + Sbjct: 274 ----------------KNTIKYFKPGGGTASYQGIIRGAQMMDAAPEPR---------PR 308 Query: 369 KYIVLLTDGENTQDNEE------------GIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + +++L+DG +++ + + + N S G I T V + Sbjct: 309 RIMIILSDGIDSKRSRANKLVEEGMCSKILLKLGNANTSDGKAIKTKMAVVGFDYNPASN 368 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 L+ C ++ + AN+ ++ + I EI Sbjct: 369 PSLAKCVGEHNVYGANNPEDVLNKILELISEEI 401 >gi|225850253|ref|YP_002730487.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] gi|225645927|gb|ACO04113.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] Length = 304 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 64/193 (33%), Gaps = 21/193 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + +D S+ K + ++ L + + D R+G F++ Sbjct: 84 YNIIIALDVSNSMKEKNKLKISKEILRDFLLKRDEED------RIGILVFDNLPFRLMPL 137 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + L+R ++ ++G TA+ D + A + + + K I+ Sbjct: 138 TSDRGALLR-VISIIRPAMVDVGGTAMYDGLVEALNMFMKDRRN------------KIII 184 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 LLTDG + + + G +I TI V+ LS +FF Sbjct: 185 LLTDGGDINSKYTLEDVVRFNQDIGAKIYTIG--VSSGMNFYVLERLSEATGGKAFFVTK 242 Query: 433 STHELNKIFRDRI 445 + + D I Sbjct: 243 DYQKALRSVFDEI 255 >gi|313235286|emb|CBY10850.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 56/174 (32%), Gaps = 24/174 (13%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ I + + + +I S V +L ++K I Sbjct: 602 LKTWTGEFIDKLGVQEYGAQV---ALVKYATSIIKVSELSSDVDELKEKLMKVPFIQ--- 655 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A++ A + + K I+L+TDG+ T I K Sbjct: 656 -GKTNTGGALERAQQML----------AEGRPSVPKIILLITDGDATDKERLDAQI-EKL 703 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDRIG 446 K I I TI + E L+ A+ F +E ++KI +G Sbjct: 704 KKSNILIYTIGVGDLIDRNE-----LNRIATDEDFVYETRDFDSISKIKSSLLG 752 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 53/149 (35%), Gaps = 21/149 (14%) Query: 278 LASVIRSIKKI-DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + + ++ + D + V++G ++ V SD + G V+ + G Sbjct: 791 AKNFVANVSSVFDLRSGDVQVGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGD 850 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-AICNKAKS 395 T A++ I ++N E K ++ +TDG T + + A + Sbjct: 851 THTGTALRY----ISTNNRWRE-------EVPKILIFVTDG--TPQDRAIVPAAARSLRD 897 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +G+RI I + L AS Sbjct: 898 KGVRIFAIGVGNAVESE------LKEIAS 920 >gi|194221585|ref|XP_001495200.2| PREDICTED: collagen, type XXIX, alpha 1 [Equus caballus] Length = 2617 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 59/159 (37%), Gaps = 15/159 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++K + V+ GA ++ I + + G T A++ Sbjct: 842 VEKANVGRGQVQFGALKYSKEPEDLFYL--NTFSKGAAITENLRRRRDTYGETYTAKALE 899 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A +S E H + K+ ++++TDG + D E+ K +++GI I + Sbjct: 900 HA-----NSQFTEEHGSRIKQNVKQMLIVITDGV-SHDREQLSDTALKLRNKGIIIYAVG 953 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFR 442 E +Y L A N+ ++ +L I++ Sbjct: 954 VG------EADQYELEAIAGDKNNTRHVDNFDKLKDIYQ 986 >gi|325964113|ref|YP_004242019.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] gi|323470200|gb|ADX73885.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] Length = 352 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 49/156 (31%), Gaps = 19/156 (12%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 I+ A+LM +LG + VDV + ++ A + + + + A Sbjct: 28 ILVAVLMVALLGCAALAVDVGAMYAEKAQIQNGADATSLAIAEECANGVNCAVAMAA--- 84 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 P ++ + + + + T I +T + + L Sbjct: 85 --PANRLADANANDGATGVF-SVTQPSPSTIRVETNAREAGSG-------DNHFSLF--- 131 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 F R +GI + I AEA S ++ Sbjct: 132 ---FARVLGIDTSQITAAAEASWGPPSSGSTLPWTV 164 >gi|212634223|ref|YP_002310748.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212555707|gb|ACJ28161.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 342 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 72/189 (38%), Gaps = 24/189 (12%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 AL S++ S + + + R+G F F + +D G Sbjct: 134 ALKSLLSSFSQQRDGD---RLGLIVFGSGAYLQVPF---TEDVRLWQTLLEQMDTQMAGP 187 Query: 337 -TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI DA+ + SN +++ ++L+TDG +T + + A + Sbjct: 188 ATAIGDAVGLSIRAFERSN-----------TSQRILLLVTDGSDTSSRLDPVDAARVAAA 236 Query: 396 QGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFR--DRIGNEI 449 +GI I T+ T + F L+ A + FE NS+ + +I D+I Sbjct: 237 EGIEIFTLGMGSVDTVGDDQVDFNTLNKIAKITNGRAFEGNSSTAIAEILAQIDKIAPAK 296 Query: 450 FERVIRITK 458 +++ + K Sbjct: 297 YQQNSFLPK 305 >gi|297285602|ref|XP_002802832.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Macaca mulatta] Length = 888 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 55/160 (34%), Gaps = 10/160 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 244 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 299 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA +G T IN+AM A + SSN++ Sbjct: 300 --LIVFSTEATQWRPSLVPASAENVNEARSFAAGIQALGGTNINEAMLVAVQLLDSSNQE 357 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 E R+ + I+LLTDG+ T +I + Sbjct: 358 E--RLPDGSV--SLIILLTDGDPTVGETNPRSIQKNVREA 393 >gi|296208407|ref|XP_002751080.1| PREDICTED: calcium-activated chloride channel regulator 1 [Callithrix jacchus] Length = 914 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 51/163 (31%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I Sbjct: 345 GMVTFDSAAYVQSELVQINSGSDRDTLAKR-LPTAAAGGTSICTGLRSAFTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 R K + + IVLLTDGE+ I+ C N+ K G I T+A A+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAVIHTVAL-----GPSAAQ 443 Query: 417 YFLSNCASPNSF--FEANSTHE--LNKIFRD-RIGNEIFERVI 454 F + ++ L F GN + Sbjct: 444 ELEQLSKMTGGFQTYASDQAQNNGLIDAFGALSSGNGADSQRS 486 >gi|32964827|ref|NP_034029.2| chloride channel calcium activated 1 [Mus musculus] gi|3925281|gb|AAC79982.1| calcium-sensitive chloride conductance protein-1 [Mus musculus] gi|74183411|dbj|BAE36582.1| unnamed protein product [Mus musculus] gi|124376304|gb|AAI32343.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 96/309 (31%), Gaps = 37/309 (11%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 182 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 241 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 242 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 302 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 361 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 362 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 412 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 LTDGE+ GI C +A G I TIA + + LS+ F+ Sbjct: 413 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSA---ARELETLSDMTGGLRFYANK 464 Query: 433 STHELNKIF 441 + L F Sbjct: 465 DLNSLIDAF 473 >gi|326926921|ref|XP_003209645.1| PREDICTED: integrin alpha-11-like [Meleagris gallopavo] Length = 1195 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 61/164 (37%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T +TAY + + Sbjct: 202 IQVGVVQYGEDVVHEFHL--NDYRSVKDVVAA-ASHIEQRGGTET----RTAYGIEFARS 254 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + K+ ++++TDGE + D+ + + ++ + +A + Sbjct: 255 EAFQKGGRKGA--KRVMIVITDGE-SHDSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGI 311 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G IF Sbjct: 312 NPEAFLNEIKFIASDPDDKHFFNVTDEAALKDI-VDALGERIFS 354 >gi|118096010|ref|XP_413930.2| PREDICTED: similar to integrin alpha 11 subunit [Gallus gallus] Length = 1191 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 61/164 (37%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T +TAY + + Sbjct: 202 IQVGVVQYGEDVVHEFHL--NDYRSVKDVVAA-ASHIEQRGGTET----RTAYGIEFARS 254 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + K+ ++++TDGE + D+ + + ++ + +A + Sbjct: 255 EAFQKGGRKGA--KRVMIVITDGE-SHDSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGI 311 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G IF Sbjct: 312 NPEAFLNEIKFIASDPDDKHFFNVTDEAALKDI-VDALGERIFS 354 >gi|274320027|ref|NP_001162099.1| matrilin-4 [Macaca mulatta] gi|134093113|gb|ABO52973.1| matrilin 4 isoform 1 precursor [Macaca mulatta] Length = 581 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ ++ + R+G ++ +V S + + T Sbjct: 57 LVGLLRGLNVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRDLVPLAQGTMTG 114 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 115 LAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGIEI 168 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 169 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 60/197 (30%), Gaps = 20/197 (10%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLS---EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 Y C + D + + S + LV+ + ++ + Sbjct: 318 SYRCLCPEGRQLQADGKSCDRCREGHVDLVLLVDGSKSVRPQNFELVKRFVNQIVDFLDV 377 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + R+G F+ RV ++ G + + + E T A++ Sbjct: 378 SP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMV 432 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + S + R N + +V TDG + ++ +AK +GI + + Sbjct: 433 EHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAARAKEEGIAMYAVGVGK 486 Query: 408 NKTQQEKARYFLSNCAS 424 + L AS Sbjct: 487 AVEAE------LREIAS 497 >gi|281183022|ref|NP_001162498.1| matrilin-4 [Papio anubis] gi|134093054|gb|ABO52914.1| matrilin 4 isoform 1 precursor [Papio anubis] Length = 581 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ ++ + R+G ++ +V S + + T Sbjct: 57 LVGLLRGLNVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRDLVPLAQGTMTG 114 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 115 LAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGIEI 168 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 169 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 60/197 (30%), Gaps = 20/197 (10%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLS---EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 Y C + D + + S + LV+ + ++ + Sbjct: 318 SYHCLCPEGRQLQADGKSCDRCREGHVDLVLLVDGSKSVRPQNFELVKRFVNQIVDFLDV 377 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + R+G F+ RV ++ G + + + E T A++ Sbjct: 378 SP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMV 432 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + S + R N + +V TDG + ++ +AK +GI + + Sbjct: 433 EHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAARAKEEGIVMYAVGVGK 486 Query: 408 NKTQQEKARYFLSNCAS 424 + L AS Sbjct: 487 AVEAE------LREIAS 497 >gi|85374478|ref|YP_458540.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] gi|84787561|gb|ABC63743.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] Length = 626 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 55/185 (29%), Gaps = 52/185 (28%) Query: 316 VHKLIRTIVK---------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK--NN 364 HKL ++ + + G T + M A I N Sbjct: 447 THKLQEYLLNGSARNADFVSRINALSPKGGTMHDIGMIWAGRLISPDGIFAADNASAPNG 506 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAI----------------------------------C 390 ++++ +TDGE A C Sbjct: 507 DPISRHVIFMTDGEMGASPSNTTAYGNYDMDGRMAGFAASGSWTENQLAAIHNLRLEAIC 566 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 +++ + I +IAF + + + CA+ + A ++ EL+ FRD G+ Sbjct: 567 KAIRNKNVTIWSIAFGLPHSAYTQG------CATGTSRALTAANSSELDSRFRDIAGSIA 620 Query: 450 FERVI 454 R++ Sbjct: 621 ELRLV 625 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 39/160 (24%), Gaps = 25/160 (15%) Query: 42 DVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENN 101 D R + L+QA A + A L S + N + Sbjct: 21 DASRMYLAKSRLQQACDAATLAARKELAGSSISNGTIPANIQDKADNFFDTNFPSGMYGT 80 Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 +T D A + + +V + D+ +N S + Sbjct: 81 TNVGYTLSAGTATQMDGAATASVPTTLMKVFNVPQIDIAVN-----------CSAELDLP 129 Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + + V+D S SM + Sbjct: 130 N--------------IDVVLVLDMSGSMNSNGTTGSKRIT 155 >gi|148655541|ref|YP_001275746.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567651|gb|ABQ89796.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 504 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/282 (14%), Positives = 80/282 (28%), Gaps = 40/282 (14%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + P ++ E SP + Sbjct: 70 ATTAPTSPPVAPTVAPFTPATPTATGADAPTTVPESSSPPT-DTTTIFRPAEGEAAQVTT 128 Query: 253 LSEEHFVDSSSLRHVI---KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI-- 307 + F S S+ I K R A+ +I ++ N + +G F Sbjct: 129 NIQLVFDASGSMAQRIGGETKIQAARRAMERIIDTL----PDNPDLNVGFRVFGHEGDSS 184 Query: 308 --------SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + + + + +++ A G T I+ A+Q A + + Sbjct: 185 EAQKARSCQSTALLVPMQGVNKALLRQQAQAWQPTGWTPISLALQRAGEDFQA------- 237 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-----QGIRIMTIAFSVNKTQQEK 414 + I+++TDGE T + C AK+ +RI + F + Sbjct: 238 ----GENVRNVIIMVTDGEETCGGDP----CAVAKALAESQAEVRIDVVGFGTTPDVAKT 289 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 R N S + +A + L + + I + +R Sbjct: 290 LRCIAEN--SGGVYTDAQNGDALVQTLEELIAATLKRSTLRF 329 >gi|153836414|ref|ZP_01989081.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|260365465|ref|ZP_05778002.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] gi|260877530|ref|ZP_05889885.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|260897529|ref|ZP_05906025.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|260901731|ref|ZP_05910126.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|149750316|gb|EDM61061.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|308087122|gb|EFO36817.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|308090607|gb|EFO40302.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|308108829|gb|EFO46369.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|308114384|gb|EFO51924.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] Length = 461 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 54/382 (14%), Positives = 126/382 (32%), Gaps = 67/382 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K G II +P+++ V + +++AA+ A + + Sbjct: 2 KGFTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDN 61 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ ++Y R + + N +D E + ++ +K Y+ Sbjct: 62 KDD---------------QDYAQRIVDLYITDNISDIE---------ISVSTKKCEYKDG 97 Query: 133 LSSRYDLLLNPLSLFLRSMGIK--SWLIQT--------KAEAETVSRSYHKEHGVSIQWV 182 R + L+P + F + SW+ K ++++R Y + V I ++ Sbjct: 98 CVQRNN-ELSPFADFTVVATAEHDSWISHNEIGVEPKFKVSGDSITRKYLPQ-PVDIYFI 155 Query: 183 IDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S+SM + + + + R VK + + K P S L Sbjct: 156 LDTSQSMSNPWYGERNKTQMQVVKDTITRVVK---------ELENFKTGPDKKSRVALLT 206 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS--IKKIDNVNDTVRMGA 299 Y Y D D +S + + + +++ + A Sbjct: 207 YNAYNAKFDKGAGRVKLYDYASEFSHTE--ASFESIVDKMFDKSVVEQKPHY-------A 257 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + +N + ++ I+ + + G T + +I++ ++ Sbjct: 258 SDYNKSQD--IPLT-DKYQEFIDILNSNKVMPARGGGTQS-------WLGLIAAAKEADK 307 Query: 360 RMKNNLEAKKYIVLLTDGENTQ 381 K + ++ ++L+DG +T Sbjct: 308 VKKEDRNPEQVFIILSDGADTD 329 >gi|308068881|ref|YP_003870486.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858160|gb|ADM69948.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 600 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 55/178 (30%), Gaps = 30/178 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P V +S K ++ + + K+ G Sbjct: 33 ASSPSASKVDAVLVVDVSNSMNTSDPGKIGNEAMKMFIDMLSTQNDKV---------GIV 83 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + D V + + + + +KTF N T + ++ A + + H Sbjct: 84 AYTDVVQREKALLNITSEADKQELKTFIDGLNRGAYTDTSVGVKEALRILQD-GKTAGHA 142 Query: 361 MKNNLEAKKYIVLLTDGEN----------TQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 IV+L DG N +Q +++ +AK+ G+ I TI + + Sbjct: 143 P--------MIVMLADGNNDFNKTTGRTESQSDQDMAQAVAEAKNSGVPIYTIGLNAD 192 >gi|260841562|ref|XP_002613981.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] gi|229299371|gb|EEN69990.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] Length = 2122 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 64/181 (35%), Gaps = 18/181 (9%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D +D+S + + + +++ + I R+ F+D + Sbjct: 42 DSRADIVFLLDNSGSVGRYNFEEVEIAFVENLLSQL-TISPQAS--RVAVVSFDDVARTH 98 Query: 310 PS-FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + +K + ++ + T DA + A + + + KN K Sbjct: 99 IDYIKYPKNKC--SFLRELKTVKYIGEWTNTEDAFRLAQELLRPPS-----AFKNERPVK 151 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + ++LLTDG T+ + + N KS I +I N +Q+ L +CA+ Sbjct: 152 QVVILLTDGRPTRGG-DPVKRANNLKSVYNAEIFSIGIGGNLNKQQ-----LEDCATDAQ 205 Query: 428 F 428 Sbjct: 206 H 206 >gi|260578579|ref|ZP_05846489.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|258603294|gb|EEW16561.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 646 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 42/115 (36%), Gaps = 17/115 (14%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + A++ A + + E + IVL++DGE+T + Sbjct: 125 KVEASGHTPMGPALRQAAEELPKDGE-------------RSIVLVSDGEDTCAPPPVCEV 171 Query: 390 CNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K +GI I T+ F V+ +++ A + +A T L + Sbjct: 172 AKDLKKEGIDLTINTVGFLVDSKARKELECIAE--AGGGEYMDAKDTVSLADSMK 224 >gi|113971716|ref|YP_735509.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113886400|gb|ABI40452.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 335 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 69/199 (34%), Gaps = 26/199 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S + +++ + + + ++ L + I + D R G F D Sbjct: 103 DLSGSMDEADFTTADGSTLTRLNAAKNVLKTFIAK-RSGD------RFGLILFGDAAFIQ 155 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + + + M T + DA+ + + Sbjct: 156 TPFT----ADQQVWLSLLEEAQTGMAGQSTHLGDAIGLGIKVFEQNPQPSE--------- 202 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA-- 423 ++ +++LTDG +T E + A ++GI+I IA +Q + + Sbjct: 203 QQVMIVLTDGNDTGSFVEPVDAAKIAAARGIKIYIIAMGDPTHVGEQPMDMEVVQRVSQL 262 Query: 424 SPNSFFEANSTHELNKIFR 442 + F A EL+K ++ Sbjct: 263 TQARAFIAIDQAELDKAYQ 281 >gi|332208765|ref|XP_003253479.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-4-like [Nomascus leucogenys] Length = 448 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 >gi|193786651|dbj|BAG51974.1| unnamed protein product [Homo sapiens] Length = 581 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 362 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 416 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 417 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 470 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 471 RAKEEGIVMYAVGVGKAVEAE------LREIAS 497 >gi|163748339|ref|ZP_02155613.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] gi|161378385|gb|EDQ02880.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] Length = 405 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 31/76 (40%), Gaps = 6/76 (7%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + IC A+ GI I I V+ + + L CAS + + E+ F Sbjct: 336 DRRLRNICAAAQRAGIVIYAIGMDVDS---QNSLDLLKECASTEAHYFDVDGLEIQTAF- 391 Query: 443 DRIGNEIFERVIRITK 458 D I I ++R+TK Sbjct: 392 DMIAASIS--MLRLTK 405 >gi|13699836|ref|NP_085095.1| matrilin-4 isoform 3 precursor [Homo sapiens] Length = 499 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 280 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 334 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 335 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 388 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 389 RAKEEGIVMYAVGVGKAVEAE------LREIAS 415 >gi|119596273|gb|EAW75867.1| matrilin 4, isoform CRA_b [Homo sapiens] Length = 620 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 401 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 455 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 456 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 509 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 510 RAKEEGIVMYAVGVGKAVEAE------LREIAS 536 >gi|13699830|ref|NP_003824.2| matrilin-4 isoform 1 precursor [Homo sapiens] gi|4499937|emb|CAB39280.1| matrilin 4 [Homo sapiens] Length = 581 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 362 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 416 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 417 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 470 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 471 RAKEEGIVMYAVGVGKAVEAE------LREIAS 497 >gi|13699834|ref|NP_085080.1| matrilin-4 isoform 2 precursor [Homo sapiens] gi|119596274|gb|EAW75868.1| matrilin 4, isoform CRA_c [Homo sapiens] Length = 540 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 321 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 375 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 376 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 429 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 430 RAKEEGIVMYAVGVGKAVEAE------LREIAS 456 >gi|73920229|sp|O95460|MATN4_HUMAN RecName: Full=Matrilin-4; Flags: Precursor gi|5419632|emb|CAB46380.1| matrilin 4 [Homo sapiens] Length = 622 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 403 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 457 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 458 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 511 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 512 RAKEEGIVMYAVGVGKAVEAE------LREIAS 538 >gi|117921591|ref|YP_870783.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117613923|gb|ABK49377.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 613 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 45/393 (11%), Positives = 108/393 (27%), Gaps = 28/393 (7%) Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++A + + A + +Q+ E R + + + + Sbjct: 29 LSACSDKAAEQQTPAELAAQAKLAAEQQAERQANRQRDAAIAMHEQASSAKLRTMSAESR 88 Query: 122 MNPRKSAYQVVLSSRYDL----LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + + + + + P ++ + T + Sbjct: 89 AYIAQPTASISAAPALNGDWPGAVPPERNRFEKQVQNGIMVAGEIPVSTFAIDVDTGSYT 148 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 +++ ++ + + LN F K+ + + + + M+ Sbjct: 149 TLRRML-KEGRLPQKDTLRVEEMLNYFSYDYPLPGKNDAPFSVTTELAPSPYNDDMMLLR 207 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 L + + +D S K L++ AL + + + D V+ V Sbjct: 208 IGLKGYEQSKAELGASNLVFLLDVSGSMASPDKLPLLQTALKMLTQQLDAQDKVSIVVYA 267 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 GA G + + GST +Q AY + Sbjct: 268 GAAGVVLD---------GAAGNDTQTLNYALEQLSAGGSTNGAQGIQLAYQL------AQ 312 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEK 414 H ++ + ++L TDG+ + + K QGI + T+ F + Sbjct: 313 KHFVEGGINR---VILATDGDFNVGTTNLDELIDLVSARKQQGIGLTTLGFGMGDYNDHL 369 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +S +E K+ + + Sbjct: 370 MEQLADK--GNGQYAYIDSINEARKVLVEHLSA 400 >gi|170743327|ref|YP_001771982.1| hypothetical protein M446_5224 [Methylobacterium sp. 4-46] gi|168197601|gb|ACA19548.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 478 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 73/493 (14%), Positives = 141/493 (28%), Gaps = 91/493 (18%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G I+ AL + +LG+ G+ VD + + L AA TA + A V + +A Sbjct: 23 GSVNILFALSLIPVLGLVGLAVDYGLAAADKTTLDHAADTAALAAVVTAKSYIAANQGQA 82 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + + + N + T +S + + Y Sbjct: 83 NLTANAIAAGLAQAANVFAVNA--------GSVPFAQVTLQPPQLVRSGQTLTATVSYGA 134 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 + + F + +G + L+ A SY + ++D S SM Sbjct: 135 TIQ--NSFGKLLGTPTTLLGNSVTASADLPSY-----LDFYLLVDVSGSMGLPATPGGMT 187 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 L ++ + S Q + + + ++ K SL + Sbjct: 188 QLAS----VNKDMWSDYQQGCQFACHFPGFTGWGLAAGKIQLRSDAVNAAVCSLIQRAST 243 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + + +A+++ + ++N + + + G +L Sbjct: 244 -PAVPNQYRVGIYPFINQMATLVGITGSVASLNAAAQCAL---SWPLAFTNLLDTGTTQL 299 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-- 377 G T M I + + ++ K ++ L+TDG Sbjct: 300 FAYGD---PTTGTASGGTHFEVVMPQLQAAIKAFGDGS-----SSTSPKPFVFLITDGMQ 351 Query: 378 ------------------------------ENTQDNEEGIAICNKAKSQGIRI------M 401 + +Q ++ +C KS G I Sbjct: 352 NGQHYGAPANGTYAYPGNPSSFWGYADAWWDGSQPSQIDPTVCAGLKSAGATISILSIPY 411 Query: 402 TIAFSVNKTQ------------QEKARYFLSNCASPNSFFEANSTHE----LNKIFRDRI 445 + VN L CASP F AN+ + LN +F + Sbjct: 412 NLITFVNNGGGVAWENNRVSGFSPTLATPLKACASPGFFATANTPADITASLNAMFDQAL 471 Query: 446 GNEIFERVIRITK 458 RV R+T+ Sbjct: 472 ------RVARLTR 478 >gi|291398583|ref|XP_002715574.1| PREDICTED: chloride channel accessory 1-like [Oryctolagus cuniculus] Length = 911 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 47/150 (31%), Gaps = 26/150 (17%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I ++ A+ I + Sbjct: 345 GMVTFDSAAQVRSELRQIKSGTDREALTKS-LPTVPSGGTSICSGLRVAFSVIKKKYPTD 403 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNK----TQQ 412 IVLLTDGE+ I +C ++ + G I T+A + + Sbjct: 404 GSE----------IVLLTDGEDN-----TIKVCFDEVRQSGAIIHTVALGPSAALELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFR 442 K L AS + + + L F Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFG 473 >gi|154687789|ref|YP_001422950.1| YwmC [Bacillus amyloliquefaciens FZB42] gi|154353640|gb|ABS75719.1| YwmC [Bacillus amyloliquefaciens FZB42] Length = 228 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/183 (20%), Positives = 67/183 (36%), Gaps = 15/183 (8%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF--FNDRVISDPSFSWGVHKL 319 + + K ++ ++ + IK V TV G+ N + GV+ Sbjct: 49 AKRIDGVSKYNMAKEEIVRFADQIKSKSQVRMTV-FGSEGNNKNSGKVQSCESIRGVYGF 107 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGE 378 R ++F N +G T I + ED + K + LLTDGE Sbjct: 108 QRFDRQSFLNSLNGIGPTGWTP--------IAKALEDAKASFTGLHKLGSKSVFLLTDGE 159 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 T + + + + Q I++ I F + + A ++EA+S ++N Sbjct: 160 ETCGG-DPVKTAKELRKQHIKVNVIGFDFKEGFNGQLHEIAK--AGGGKYYEAHSQKDMN 216 Query: 439 KIF 441 +IF Sbjct: 217 RIF 219 >gi|301770509|ref|XP_002920678.1| PREDICTED: integrin alpha-11-like [Ailuropoda melanoleuca] Length = 1203 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ +R +V A + G T A + Sbjct: 214 IQVGVVQYGEDVVHEFHL--NDYRSVRDVV-EAASHIEQRGGTETRTAFGIEFAR----- 265 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK +V++TDGE + D+ + + +++ + +A + Sbjct: 266 -SEAFQKGGRKGAKKVMVVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 323 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 324 NPEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 366 >gi|281343950|gb|EFB19534.1| hypothetical protein PANDA_009430 [Ailuropoda melanoleuca] Length = 1112 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ +R +V A + G T A + Sbjct: 183 IQVGVVQYGEDVVHEFHL--NDYRSVRDVV-EAASHIEQRGGTETRTAFGIEFAR----- 234 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK +V++TDGE + D+ + + +++ + +A + Sbjct: 235 -SEAFQKGGRKGAKKVMVVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 292 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 293 NPEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 335 >gi|148680076|gb|EDL12023.1| mCG3350, isoform CRA_c [Mus musculus] Length = 907 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 96/309 (31%), Gaps = 37/309 (11%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 187 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 246 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 247 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 306 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 307 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 366 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 367 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 417 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 LTDGE+ GI C +A G I TIA + + LS+ F+ Sbjct: 418 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSA---ARELETLSDMTGGLRFYANK 469 Query: 433 STHELNKIF 441 + L F Sbjct: 470 DLNSLIDAF 478 >gi|120403735|ref|YP_953564.1| hypothetical protein Mvan_2751 [Mycobacterium vanbaalenii PYR-1] gi|166988604|sp|A1T8Q8|Y2751_MYCVP RecName: Full=UPF0353 protein Mvan_2751 gi|119956553|gb|ABM13558.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 335 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 61/183 (33%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + I K D TA + + TA Sbjct: 130 ADQLTPGINLGLIAYAGTATVLVSPTTNREATKAAIDKLQLADR-----TATGEGIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 + + + IVL++DG+ N + + AK QG+ I T Sbjct: 185 QAVATVG---AVIGGGDEPPPARIVLMSDGKETVPSNPDNPKGAYTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 ++F E L A S F A+S +L ++F +++IG E Sbjct: 242 VSFGTPYGYVEINDQRQPVPVDDEMLKKIADLSGGDAFTASSLEQLKQVFTNLQEQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|256820365|ref|YP_003141644.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256581948|gb|ACU93083.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 347 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 48/139 (34%), Gaps = 18/139 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A +I ++ R+G + + + H + ++ D Sbjct: 112 LEKAKRIAFETISQLKGD----RVGIVAYAASAYPQLALTTD-HSAAKMFLQDMNTDMLS 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI +A++ A + N + + +LTDGE+ + I +A Sbjct: 167 SQGTAIQEAIRMASNY-----------FDENTPTARLLFILTDGEDHE--MGATEIATEA 213 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 + +G+ I TI K Sbjct: 214 QEKGVHIYTIGIGTEKGAP 232 >gi|322419943|ref|YP_004199166.1| hypothetical protein GM18_2437 [Geobacter sp. M18] gi|320126330|gb|ADW13890.1| Protein of unknown function DUF2134, membrane [Geobacter sp. M18] Length = 351 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 44/127 (34%), Gaps = 1/127 (0%) Query: 11 SKKLIKSCTGHFFIIT-ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 +++ K F I+ AL++ V++ + VD+ + L+ AA + L Sbjct: 4 FRRIRKLNEKGFAIVYIALMLVVLVAFVSLAVDMGYMFVAKGQLQNAADAGALAGVAKLS 63 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ S + + N N+ + + R + + K A Sbjct: 64 DTVTARQSAKLFAERNKAAGESVKVALNETNSADGDIVVGYWDKVTRTMSATVPTGKVAN 123 Query: 130 QVVLSSR 136 V + +R Sbjct: 124 AVKVVAR 130 >gi|114557513|ref|XP_001143250.1| PREDICTED: calcium-activated chloride channel regulator 1 [Pan troglodytes] Length = 914 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 54/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I ++ A+ I Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRLAFTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|301626998|ref|XP_002942667.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Xenopus (Silurana) tropicalis] Length = 4207 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/267 (13%), Positives = 76/267 (28%), Gaps = 29/267 (10%) Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 S +L GI + L + + + + ++ + + M + L + Sbjct: 496 SQWLAYNGIATALRKRNSAHTITIGASVVANTIAEE---NCKGLMSKLWT-TYSHQLPSY 551 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS-S 263 + + V+ S+ + E + K L L L SS Sbjct: 552 RSSSSQIVQFTDSRKQQSSETTENKVEKLGQYFKKNIRRLREKSLSLDLVFLVDESSSVG 611 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH-----K 318 + + + V+ ++ + R+ F+ + + Sbjct: 612 HSNFVNELRFVKK----LLSDFPVVP---SATRVAIITFSSKTNVQTRVDYISSSEPHQH 664 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + + G T A Q A + S + K I L+TDG Sbjct: 665 KCSLLNREIPAITYKGGGTFTKGAFQQAAQILRYSRSNS----------TKVIFLITDGY 714 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAF 405 + + IA + G+ I T+ Sbjct: 715 SNGGDPRPIAA--NLRDLGVEIFTVGI 739 >gi|219518504|gb|AAI45058.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 96/309 (31%), Gaps = 37/309 (11%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 182 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 241 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 242 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 302 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 361 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 362 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 412 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 LTDGE+ GI C +A G I TIA + + LS+ F+ Sbjct: 413 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSA---ARELETLSDMTGGLRFYANK 464 Query: 433 STHELNKIF 441 + L F Sbjct: 465 DLNSLIDAF 473 >gi|26328325|dbj|BAC27903.1| unnamed protein product [Mus musculus] Length = 902 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 96/309 (31%), Gaps = 37/309 (11%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 182 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 241 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 242 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 302 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 361 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 362 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 412 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 LTDGE+ GI C +A G I TIA + + LS+ F+ Sbjct: 413 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSA---ARELETLSDMTGGLRFYANK 464 Query: 433 STHELNKIF 441 + L F Sbjct: 465 DLNSLIDAF 473 >gi|327543524|gb|EGF29943.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 274 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 68/193 (35%), Gaps = 26/193 (13%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L S+++E F + + + + + V++ L + +K D R+G F D Sbjct: 100 LSGSMAQEDFQNDAGKK--VSRLDAVKEVLDGFLAK-RKGD------RVGLVVFGDAAYL 150 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F+ + L + ++ + TA DA+ + E Sbjct: 151 QAPFTTDLQ-LSQELLGECEVGM-AGPRTAFGDAIGLGVNLFDEDTERA----------- 197 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCASP- 425 K I+ LTDG +T+ + A + I+I T+A + + L + AS Sbjct: 198 KTIIALTDGNDTKSKVPPVEAARVATQRDIKIYTVAIGDPTTVGEDKLDEQSLKDVASET 257 Query: 426 -NSFFEANSTHEL 437 +F L Sbjct: 258 GGKYFFRRGPRAL 270 >gi|126314401|ref|XP_001377042.1| PREDICTED: similar to Procollagen, type VI, alpha 2, partial [Monodelphis domestica] Length = 762 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 69/198 (34%), Gaps = 20/198 (10%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 330 LTECDVMTYVRETCGCCDCERHCGALDIVFVIDSSESIGYTNFSLEKNFVINVVNRLGSI 389 Query: 289 --DNVNDT-VRMGATFFND-RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D +DT R+G ++ G + + E G T A+ Sbjct: 390 TKDPNSDTGTRIGVVQYSHDGTFEAIKLDDGRIGSLAQFKEEVKKLEWIAGGTWTPSALN 449 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 Y+ +I + + R+ + V++TDG + + + A+C+ K + + + Sbjct: 450 YTYNELIKGSRRKKTRV--------FAVVITDGRHDPRDNEQSLKALCD--KVENVVVTA 499 Query: 403 IAFSVNKTQQEKARYFLS 420 I +Q ++ S Sbjct: 500 IGIGDMFNEQHESESLTS 517 >gi|119889916|ref|XP_001252289.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|297473018|ref|XP_002686328.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|296489229|gb|DAA31342.1| Epithelial chloride channel protein-like [Bos taurus] Length = 903 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 60/163 (36%), Gaps = 23/163 (14%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ + + + T + + G T+I ++ + II S + Sbjct: 347 GMVTFDSVAEIRNNLT-KITDDNVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQST 405 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 I+LLTDGE+ + I C + K G+ I TIA + K Sbjct: 406 SGSE---------IILLTDGEDNE-----IHSCIEEVKQSGVIIHTIALGPSA---AKEL 448 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNE---IFERVIRI 456 LS+ + F+ + L F RI + I ++ I++ Sbjct: 449 ETLSDMTGGHRFYANKDINGLTNAF-SRISSRSGNITQQTIQL 490 >gi|330806846|ref|YP_004351308.1| lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327374954|gb|AEA66304.1| Putative lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 557 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 47/321 (14%), Positives = 99/321 (30%), Gaps = 34/321 (10%) Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + ++ + G ++ + + E VS V + S + +R Sbjct: 70 MPAPSVANDALAGGYRTEPREQYEKLPDNPIHSVAETPVSTFSVDVDTGSYANVRRLLNQ 129 Query: 199 QPLNCFGQPADRTVKSY-----------SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 L G + +Y S + +P+ + Sbjct: 130 GSLPPEGAVRLEEMVNYFPYSYALPTDGSPFGVTTEVAPSPWNPHTRLLRIGIKASDRAV 189 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + VD S + LV+ L ++ ++ D V+ V G RV+ Sbjct: 190 ADLAPANLVFLVDVSGSMDRREGLPLVKSTLKLLVDQLRDQDRVSLVVYAG----ESRVV 245 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 P + ++ + GSTA ++ AY + E + + N Sbjct: 246 LKP-----TSGRDKVTIRNAIDQLDAGGSTAGASGIELAYQM---ARESFIDKGINR--- 294 Query: 368 KKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 I+L TDG+ D + + + G+ + T+ F V+ + A Sbjct: 295 ---ILLATDGDFNVGVSDFDSLKQMAVDQRKSGVSLTTLGFGVDNYNEHLMEQLAD--AG 349 Query: 425 PNSFFEANSTHELNKIFRDRI 445 ++ ++ E K+ D++ Sbjct: 350 DGNYAYIDNLLEARKVLVDQL 370 >gi|301756400|ref|XP_002914037.1| PREDICTED: matrilin-2-like isoform 3 [Ailuropoda melanoleuca] Length = 957 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 83/247 (33%), Gaps = 23/247 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + + Y+ C++ Sbjct: 595 EGFRLAEDGKRCRRKDVCKSTYHGCEHICVNRGNSYICKCSEGFVLAEDGRRCKRCTEGP 654 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ + R+G ++ +V ++ + Sbjct: 655 VDLVFVIDGSKSLGEENFEIVKQFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTLR 711 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 712 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPLSTRVPRVAIVF 767 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KA++ GI + + ++ L AS F Sbjct: 768 -TDG---RAQDDVSEWASKAQANGITMYAVGVGKAIEEE------LQEIASEPTDKHLFY 817 Query: 431 ANSTHEL 437 A + Sbjct: 818 AEDFSTM 824 >gi|301756398|ref|XP_002914036.1| PREDICTED: matrilin-2-like isoform 2 [Ailuropoda melanoleuca] Length = 938 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 83/247 (33%), Gaps = 23/247 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + + Y+ C++ Sbjct: 595 EGFRLAEDGKRCRRKDVCKSTYHGCEHICVNRGNSYICKCSEGFVLAEDGRRCKRCTEGP 654 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ + R+G ++ +V ++ + Sbjct: 655 VDLVFVIDGSKSLGEENFEIVKQFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTLR 711 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 712 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPLSTRVPRVAIVF 767 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KA++ GI + + ++ L AS F Sbjct: 768 -TDG---RAQDDVSEWASKAQANGITMYAVGVGKAIEEE------LQEIASEPTDKHLFY 817 Query: 431 ANSTHEL 437 A + Sbjct: 818 AEDFSTM 824 >gi|301756396|ref|XP_002914035.1| PREDICTED: matrilin-2-like isoform 1 [Ailuropoda melanoleuca] Length = 957 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 83/247 (33%), Gaps = 23/247 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + + Y+ C++ Sbjct: 595 EGFRLAEDGKRCRRKDVCKSTYHGCEHICVNRGNSYICKCSEGFVLAEDGRRCKRCTEGP 654 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ + R+G ++ +V ++ + Sbjct: 655 VDLVFVIDGSKSLGEENFEIVKQFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTLR 711 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 712 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPLSTRVPRVAIVF 767 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KA++ GI + + ++ L AS F Sbjct: 768 -TDG---RAQDDVSEWASKAQANGITMYAVGVGKAIEEE------LQEIASEPTDKHLFY 817 Query: 431 ANSTHEL 437 A + Sbjct: 818 AEDFSTM 824 >gi|296268803|ref|YP_003651435.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296091590|gb|ADG87542.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 607 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 65/186 (34%), Gaps = 37/186 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP------SFSWGVHKLIRT 322 K L + A I ++ ++ + G F+ R Sbjct: 439 TKLELAKQAA---INALPQLGPNDQV---GLWMFSTNQDGGKDYRELVPMGRN----NRD 488 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++K G T + D + AY T++ + ++V +V+LTDG+N D Sbjct: 489 LLKKRIQGLIPGGGTGLYDTTRAAYRTVLERHSNDVINA---------VVVLTDGKNEDD 539 Query: 383 NEEGIA--ICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 N + + G +R+ TIA+ + L + + + +++ Sbjct: 540 NSISLEDLLAELRTETGQETVRVFTIAY-----GNDADLEVLRQISQVTDAAAYDSREPG 594 Query: 436 ELNKIF 441 ++++F Sbjct: 595 SIDQVF 600 >gi|281350435|gb|EFB26019.1| hypothetical protein PANDA_001886 [Ailuropoda melanoleuca] Length = 942 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 83/247 (33%), Gaps = 23/247 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + + Y+ C++ Sbjct: 595 EGFRLAEDGKRCRRKDVCKSTYHGCEHICVNRGNSYICKCSEGFVLAEDGRRCKRCTEGP 654 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ + +I S+ + R+G ++ +V ++ + Sbjct: 655 VDLVFVIDGSKSLGEENFEIVKQFVTGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTLR 711 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 712 N--FNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPLSTRVPRVAIVF 767 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KA++ GI + + ++ L AS F Sbjct: 768 -TDG---RAQDDVSEWASKAQANGITMYAVGVGKAIEEE------LQEIASEPTDKHLFY 817 Query: 431 ANSTHEL 437 A + Sbjct: 818 AEDFSTM 824 >gi|167590268|ref|ZP_02382656.1| putative transmembrane protein [Burkholderia ubonensis Bu] Length = 377 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/329 (11%), Positives = 83/329 (25%), Gaps = 27/329 (8%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 I ++ + K G I AL M ++LG + +D+ + L+ +A A + + Sbjct: 3 IMHTGRSGKKQEGAVAITVALCMVILLGFAALAIDIGNLLIARNELQNSADAAAMAGAGC 62 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 LI+ ++ A + N V+ T+V+ + + Sbjct: 63 LIRRTACSNTSASQPDWPTADATASAFSTSATTNQ------------VQGTSVQTSTVAT 110 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y + Y L P + + I+ + + + + Sbjct: 111 GYWNTTGTPYGLESLPFTPGANDLPAVQVTIRKDGSNANGAVPIFLGRIFGARILKASAV 170 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + P F + + + + + S + + P Sbjct: 171 ATAVLSTPGSVGPGGLFPLAISQCMYTNYWDSSSQSPKIAPNSGVVPGFSWPNQIAGQPY 230 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 S + SS + ++ + + N I Sbjct: 231 IFQIGSSYHYGTCSSGQ---WTTFDVNDNSAGYAKTMLTNGNP------------NTLTI 275 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 +W T+ A Sbjct: 276 GASPGTWIQTGTENTLFNNTATCSAAGNG 304 >gi|326921803|ref|XP_003207144.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1054 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/278 (13%), Positives = 79/278 (28%), Gaps = 31/278 (11%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS-CNKSLYYMLY 245 D G P K + G+ G+ E++ + C + + Sbjct: 627 GDYGDKGDPGSKGAKGEIGDPGSPGPKGTWGRKGEPGLSREEVIRLINEFCGCGIRCRIT 686 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P L + +S + I K ++ + V+ + R+G F+ + Sbjct: 687 PLEL---IFVIDSSESVGPDNFISTKTFMKTVIDEVLA-------NHAKTRIGVINFSHK 736 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V S + ++ T A++ A + ++ Sbjct: 737 VELVSSLE--KYTTKESLKSAVDKMLYLGEGTYTASAIKKAINLFQAARPA--------- 785 Query: 366 EAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-QQEKARYFLSNCA 423 +K V++TDG + +D + +A + I I I + + A Sbjct: 786 -VRKVAVVVTDGQADARDEVHLDMVVREAHAANIEIFVIGIVQETDPHYDNFLKEMHLIA 844 Query: 424 SP---NSFFEANS---THELNKIFRDRIGNEIFERVIR 455 + F+ L +I + E R Sbjct: 845 TDPDEEHFYRIEDFKTLSALTDKLITKICDNASEIYSR 882 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 68/187 (36%), Gaps = 22/187 (11%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRV 306 D L + F+ SS + L ++ + S+ +I ++ V V++ F+ V Sbjct: 5 DLCLIDIVFIVDSSESAKNQLFGLQKNFVLSLTDNIFQMKPVKSQKYDVKLAGMQFSSTV 64 Query: 307 ISDPS-FSW-GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 D +W V I I + T A+ A ++ + + Sbjct: 65 SVDHPFIAWKNVQNFKEKIRALVYIGQ----GTYSYYAISNA---------TQLFKTEGR 111 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 + K L+TDG + ++ I A+S GI TI S ++EK L + Sbjct: 112 EGSIKVAFLMTDGVDHPNSPSVEGIATAARSLGIHFFTIGLSKKNVKEEK----LRLISG 167 Query: 425 PNSFFEA 431 +SF Sbjct: 168 DSSFKHV 174 >gi|291402773|ref|XP_002718214.1| PREDICTED: integrin, alpha 11 [Oryctolagus cuniculus] Length = 1188 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 59/164 (35%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + + + +K ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDAVHEFHL--NDYKSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLKKVIRQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|226504618|ref|NP_001148048.1| protein binding protein [Zea mays] gi|195615516|gb|ACG29588.1| protein binding protein [Zea mays] Length = 696 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/114 (22%), Positives = 43/114 (37%), Gaps = 16/114 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ H + ++T Sbjct: 272 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHLQRMSHSGRQQALQT-V 324 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G T I DA++ A I ED H I+LL+DG++T + Sbjct: 325 NSLVASGGTNIADALKKAAKVI----EDRSH-----QNPVCSIILLSDGQDTYN 369 >gi|55378019|ref|YP_135869.1| hypothetical protein rrnAC1219 [Haloarcula marismortui ATCC 43049] gi|55230744|gb|AAV46163.1| unknown [Haloarcula marismortui ATCC 43049] Length = 788 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 56/163 (34%), Gaps = 25/163 (15%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ +D + D ++G FN R G+T I + Sbjct: 398 ALDVLDQLGDENQVGVVAFNQNAYRVSEMQ--ALGQNRAETADKIRRLESGGATDIAVGL 455 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A D ++ E I+LL+DG++ A+ N+ +G R++++ Sbjct: 456 QGA-DELLDDREGT-------------IILLSDGQDRLGP--PAAVANQLGREGTRVVSV 499 Query: 404 AFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDR 444 + + AS S+F A+ T L +F Sbjct: 500 GV-----GKRVGVATMRQIASESGGSYFAADETERLRLLFGGS 537 >gi|89069885|ref|ZP_01157219.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] gi|89044561|gb|EAR50680.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] Length = 536 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 56/384 (14%), Positives = 110/384 (28%), Gaps = 70/384 (18%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G I + ++L GGM VD +R L+ A++ A+ Sbjct: 11 LRRDERGGMIIFGLFVFLLLLLAGGMAVDFMRTETARGRLQATLDGAVLAAA-------- 62 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 ++ P + + +Y+ + L D +V +I V Sbjct: 63 DLDQDKD-----PVEVVRDYVAK---AGLDPFLIDVDVTEI-----------AGQRIVTA 103 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S++ D+ ++ F++ +GI ++ A + + + V+D S SM + Sbjct: 104 SAKSDVTMH----FMKMVGIDFLPAPARSTASEAVSN------LDVSLVLDMSGSMEGDK 153 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 D F T +G L+ + + L Sbjct: 154 LDQLQAAAKNFVGIVYDT----------MGAEKILLNVVPYATQVAAPAGLLDMLGAFLR 203 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDAL---ASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + S + ++ A +R F Sbjct: 204 EHSYSNCVSFSAADFTETSILEAAALPQGGHFDPFYTWGP----LRYDDVTFVCNPDPST 259 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN------ 364 + ++ + G+T+I+ M+ I ++ N Sbjct: 260 EVL--TLASTQREIEDYIDGLVAEGNTSIDVGMKWGAALIDPDLGSTLNEFANGPSAAGI 317 Query: 365 --------LEAKKYIVLLTDGENT 380 K IVL+TDG+NT Sbjct: 318 NPVALWGDRSTDKVIVLMTDGKNT 341 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 40/80 (50%), Gaps = 8/80 (10%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELN 438 + N AIC AK+QG+++ T+ F V ++ + +CAS FF + +L Sbjct: 464 SAKNARLEAICTAAKNQGVQVFTVGFEV----EDDEAIIMEDCASSRAHFFRVSGGGDLT 519 Query: 439 KIFRDRIGNEIFERVIRITK 458 F + I +I E +R+T+ Sbjct: 520 TAF-ESIARQITE--LRLTE 536 >gi|194666191|ref|XP_604080.4| PREDICTED: collagen, type XXVIII-like [Bos taurus] Length = 1147 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 65/180 (36%), Gaps = 16/180 (8%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT- 294 +K+ + F+ SS I +D + S+ I ++ V Sbjct: 28 GSKTNLLARKNLQDSTCFIDLVFIVDSSESSKIFLFDKQKDFVGSLSDKIFQLTPVGSLK 87 Query: 295 --VRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 +++ A F+ V DP FS W + + VK+ T A+ A + Sbjct: 88 YDIKLAALQFSSSVQIDPPFSSWKDLQTFKQRVKSM---NFIGQGTFSYYAIANATRLL- 143 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + + K VL+TDG + N + +I A++ GI +TI S + Sbjct: 144 --------KREGRKDGMKVAVLMTDGIDHPKNPDVQSISEDARTAGILFITIGLSAVVNE 195 >gi|223974345|gb|ACN31360.1| unknown [Zea mays] Length = 516 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 78/270 (28%), Gaps = 28/270 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S D + P S+N + +E ++ Sbjct: 2 SFNDDEPIIPSPNSGPRPTPIVPGRVQLVSKNNNMAPLEENTQKVLLELTGG------DS 55 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 D S + V S +K ++ A+ V++ + ID R+ F D Sbjct: 56 TSDRSGLDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSSID------RLSIVTFLDTAN 109 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++K G+T I+D +QT + ++ + Sbjct: 110 RICPLRQVTEDSQPQLLKLI-DALQPGGNTNISDGLQTGLKVLAD------RKLSSGRVV 162 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L++DG+ + K + + T F + N + + Sbjct: 163 G--VMLMSDGQQNRG-----EPAANVKIGNVPVYTFGFGAD-YDPTVLNAVARN-SMGGT 213 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F N + L+ F + + V +T Sbjct: 214 FSVVNDVNLLSMAFSQCLAGLLTVVVQDLT 243 >gi|73958316|ref|XP_848776.1| PREDICTED: similar to integrin, alpha D precursor [Canis familiaris] Length = 1166 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 46/300 (15%), Positives = 97/300 (32%), Gaps = 46/300 (15%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + T +A +S S W++ +M ++ C + Sbjct: 87 IPLHTPPDAVNMSLGLSLSAAASRPWLLACGPTMHRACGENMYAEGFCLLLDSHLQTIWT 146 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-EEHFVDSSSLRHVIKKKHLV 274 E +++ + S+Y + D + HF +++L +I+ HL+ Sbjct: 147 VPAALPECPSQEMDIVFLIDGSGSIYESSFKQMKDFVRALMGHFEGTNTLFSLIQYSHLL 206 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + F + SW L+ IV+ Sbjct: 207 K------------------------IHFTFTQFQN---SWNPLSLVDPIVQL-------K 232 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + S K K ++++TDG+ +D E + +A+ Sbjct: 233 GLTYTATGIRKVVEELFHSKNGARKSAK------KILIVITDGQKYKDPLEYSDVIPQAE 286 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 GI I + + A+ L N S + F ++ L+ I ++++ +IF Sbjct: 287 RAGIIRYAIGVG-DAFWKPSAKQELDNIGSEPAQDHVFRVDNFAALSSI-QEQLQEKIFA 344 >gi|68535223|ref|YP_249928.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] gi|68262822|emb|CAI36310.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] Length = 646 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 42/115 (36%), Gaps = 17/115 (14%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + A++ A + + E + IVL++DGE+T + Sbjct: 125 KVEASGHTPMGPALRQAAEELPKDGE-------------RSIVLVSDGEDTCAPPPVCEV 171 Query: 390 CNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K +GI I T+ F V+ +++ A + +A T L + Sbjct: 172 AKDLKKEGIDLTINTVGFLVDSKARKELECIAE--AGGGEYMDAKDTVSLADSMK 224 >gi|2292988|emb|CAA72155.1| Inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] Length = 932 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 79/224 (35%), Gaps = 24/224 (10%) Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 Y+ N + P L FV S KK R+AL +++ + Sbjct: 247 TGGYLQIENGYFVHHFAPEDLPTMAKNVLFVIDKSGSMAGKKIQQTREALIKILKDLSTQ 306 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D N V G + + +++ + +A G T IN A+ +A + Sbjct: 307 DQFNIIVFSGEAN-QWEQLLVQATEENLNRAV-----DYASKIPAQGGTNINKAVLSAVE 360 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAF 405 + SN+ E+ K+ I+LLTDGE T I +A + + + F Sbjct: 361 LLDKSNQAELLPSKSVSL----IILLTDGEPTVGETNPKIIQKNTQEAINGRYSLFCLGF 416 Query: 406 SVNKTQQEKARYFLSNCASPNS------FFEANSTHELNKIFRD 443 + FL A N + +++S +L +++ Sbjct: 417 GFDVNYP-----FLEKLALDNGGLARRIYEDSDSALQLQDFYQE 455 >gi|332879903|ref|ZP_08447588.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682114|gb|EGJ55026.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 345 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 50/148 (33%), Gaps = 18/148 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A +I ++ R+G + + + H + +++ + Sbjct: 112 LEKAKRIAFETINQLKGD----RVGIVAYAASAYPQLALTTD-HSAAKMFLQSMNTNMLS 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI +A++ A + + + + +++DGE+ + I +A Sbjct: 167 SQGTAIQEAIRMATNYFDDKS-----------TTSRLLFIISDGEDHE--MGATEIAAEA 213 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + +GI I TI K R Sbjct: 214 QEKGIHIYTIGVGTEKGSPIPMRELGEQ 241 >gi|315444579|ref|YP_004077458.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] gi|315262882|gb|ADT99623.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] Length = 335 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 64/201 (31%), Gaps = 33/201 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + N +G + S + I K Sbjct: 118 RLTAAQEAAKQFADQLTPGIN------LGLIAYAGTATVLVSPTTNRESTKTAIDKLQLA 171 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNE 384 D TA + + TA I + + +VL++DG+ N + + Sbjct: 172 DR-----TATGEGIFTALQAIATVG---AVIGGGDEPPPARVVLMSDGKETVPSNPDNPK 223 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANS 433 AK QG+ I T++F E L A S F A+S Sbjct: 224 GAYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPVDDEMLKKIADLSGGEAFTASS 283 Query: 434 THELNKIF---RDRIGNEIFE 451 +L ++F +++IG E + Sbjct: 284 LEQLKQVFTNLQEQIGYETIK 304 >gi|303251581|ref|ZP_07337755.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252106|ref|ZP_07534005.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302649579|gb|EFL79761.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306860406|gb|EFM92420.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 538 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 46/397 (11%), Positives = 123/397 (30%), Gaps = 26/397 (6%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 72 LEEVSSRAKNSFTF----------PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ + S ++ + + + L + D+ + T Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKTVNN 129 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG-IKSWLIQTKAEAETVSRSYHK-EHGVSI 179 + + ++ + S F +G ++ Q +++ + + + + Sbjct: 130 TSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIPIDL 189 Query: 180 QWVIDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE-KLSPYMVSCN 237 V D S SM D + + N +K + ++ +E Sbjct: 190 MVVADLSGSMKDGIKGEKLNGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITAFAM 249 Query: 238 KSLYYMLYPGPLD-PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + L + H + S ++ + H V + +D Sbjct: 250 GAEHPKENKCVLPFVLQNNLHEMSKSKIKQYLTSSHKSLRRTEFVDNFVALLDTEATLNS 309 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTI--------VKTFAIDENEMGSTAINDAMQTAYD 348 +G ++ G+ K + + G T + + TA + Sbjct: 310 IGKPNYDIIFPKSSICLEGLKKASQFWYTKEEKEKFRNRVDSLKANGGTLASSGLLTASN 369 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 ++S N E K+ I++L+DG + N Sbjct: 370 QMLSEKSRSEEL---NQETKRVILVLSDGNDDMSNLN 403 >gi|238011090|gb|ACR36580.1| unknown [Zea mays] Length = 516 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 78/270 (28%), Gaps = 28/270 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S D + P S+N + +E ++ Sbjct: 2 SFSDDEPIIPSPNSGPRPTPIVPGRVQLVSKNNNMAPLEENTQKVLLELTGG------DS 55 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 D S + V S +K ++ A+ V++ + ID R+ F D Sbjct: 56 TSDRSGLDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSSID------RLSIVTFLDTAN 109 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++K G+T I+D +QT + ++ + Sbjct: 110 RICPLRQVTEDSQPQLLKLI-DALQPGGNTNISDGLQTGLKVLAD------RKLSSGRVV 162 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L++DG+ + K + + T F + N + + Sbjct: 163 G--VMLMSDGQQNRG-----EPAANVKIGNVPVYTFGFGAD-YDPTVLNAVARN-SMGGT 213 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F N + L+ F + + V +T Sbjct: 214 FSVVNDVNLLSMAFSQCLAGLLTVVVQDLT 243 >gi|256420242|ref|YP_003120895.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035150|gb|ACU58694.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 639 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 56/354 (15%), Positives = 111/354 (31%), Gaps = 35/354 (9%) Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 D+V + + K+ + + Y +P R+ + E + + Sbjct: 123 QEDVVVNAMIVPEAPKTVAGSPVVNAYMKSASPAFYGSRAPQFNTEDYSPVNENRFHTVA 182 Query: 171 YHKEHGVSIQWVIDF-SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 SI +D S S + + P + + + G + Sbjct: 183 SDPLSTFSID--VDRASYSNVRRFLNEGNMPPVDAVRVEEMINYFDYKYSNPTGNTPVAV 240 Query: 230 SPYMVSCNKSLYYMLYPGPLD---------PSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 M C + + L L P + +D S KK LV+ A Sbjct: 241 RTDMAICPWNTAHQLVRIALKGKDVAKDNLPPSNLVFLIDVSGSMSDAKKLPLVKQAFKL 300 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D V V GA ++ +T + GSTA Sbjct: 301 LVNQLRPVDRVAIVVYAGAAG----LVLPS-----TSGDHKTAILDALDKLEAGGSTAGG 351 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEG--IAICNKAKSQG 397 + +Q AY T + +K+ +++ TDG+ N + +G I K + +G Sbjct: 352 EGVQLAYKTAT------EYLLKSGNNR---VIIATDGDFNVGPSSDGELQRIIEKKREKG 402 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I + + F + + K ++ ++ E + F G +F Sbjct: 403 IFLSVLGFGMGNYKDNKLELLADK--GNGNYAYIDNFEEARRTFATEFGGTLFT 454 >gi|253584082|ref|ZP_04861280.1| batA protein [Fusobacterium varium ATCC 27725] gi|251834654|gb|EES63217.1| batA protein [Fusobacterium varium ATCC 27725] Length = 325 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 63/213 (29%), Gaps = 50/213 (23%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L ++++ +K R+G F+D + I Sbjct: 101 YPNRLEAAKRTLENLLQGLKGD-------RIGFIPFSDSAYIQMPLTDDYSIGKNYINAL 153 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T + A++ A + N D K I++L+DG +E+ Sbjct: 154 DTN-LISGGGTELYQALELAEKSFKEINSDN-----------KTIIILSDG--GDFDEKS 199 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ-----------------------EKARYFLSNCA 423 + K + + +I + + FL + Sbjct: 200 LKF---VKDNKMNVFSIGIGTEEGTIIPEYVNGKKVGFIKDQNGSAVISKLNSDFLKKLS 256 Query: 424 --SPNSFFEANST-HELNKIFRDRIGNEIFERV 453 S ++E N+ + + F+D I E + Sbjct: 257 SESDGKYYEVNNLKDDSSNFFKDTINLERKNQR 289 >gi|84498078|ref|ZP_00996875.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381578|gb|EAP97461.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 659 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 59/183 (32%), Gaps = 32/183 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + AL V+ ++ DT ++G + V G Sbjct: 57 TKIEAAKKALTGVVGALP------DTAQVGLRVYGATVDGK-----GKPTPAACADTQLI 105 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNED-EVHRMKN-----NLEAKKYIVLLTDGENTQD 382 T A+ +A I + E H + K+ IVL++DGE + Sbjct: 106 TPI----GTIDKPALTSAISAINALGETPIAHSLTEALKDLGPTGKRNIVLVSDGEESCT 161 Query: 383 NEEGIAICNKAKS--QG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + C K G ++I T+ F VN + + + A ++++A L Sbjct: 162 PDP----CPAVKKLTAGGVDLQIDTVGFGVNAKARSQLQCIAD--AGKGTYYDAKDAPAL 215 Query: 438 NKI 440 Sbjct: 216 AAS 218 >gi|311268548|ref|XP_003132103.1| PREDICTED: collagen alpha-5(VI) chain-like [Sus scrofa] Length = 2519 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 61/159 (38%), Gaps = 17/159 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D + V+ GA +++ + ++ + A G T A++ Sbjct: 839 VKKSDVGPNRVQFGALRYSNEPDIIFYLNSNRSAIMEYLRSLSAKG----GDTYTAKALE 894 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + E H + K+ ++++TDG+ + D+ +K +++GI I + Sbjct: 895 RA-NILF----TEEHGSRIKQNVKQMLIIITDGK-SHDHIHLSDKASKLRAKGIIIYAVG 948 Query: 405 FSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFR 442 E + L A + ++ L +++ Sbjct: 949 VG------EANQEELETMAGNKHYTIHVSNFDSLKDVYQ 981 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 93/268 (34%), Gaps = 30/268 (11%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS------PYMV 234 V+ SM D +E A ++ +Q ++ ++E++S Sbjct: 546 IVLTDGMSMDDVLEPAEKLRAENIAVHAIGIGEANRTQLQQIAGKEERVSFGQNFDSLKN 605 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 N+ L+ + + ++ F+ SS + ++ + +++ I+ D Sbjct: 606 IKNEVLHSICTEKGCEDMKADIMFLVDSSGSIGPENFRTMKSFMKNLLAKIQI---GLDK 662 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F+D + + + I +T A+ + S Sbjct: 663 TRIGVVQFSDVTKEEFKL--DTYFTQKEISDAIERMSPIEQNTLTGKALTSIEPYFTESK 720 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQE 413 +K+++L+TDGE D + + +G+ I + F N+TQ Sbjct: 721 GAR-------SMVRKFLILITDGEAQDDVRNPAKV---LRDKGVVIFAVGVFRANRTQ-- 768 Query: 414 KARYFLSNCASPNS-FFEANSTHELNKI 440 L + +S F+ S +L +I Sbjct: 769 -----LEEISGDSSLVFQVESFSDLQEI 791 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 60/162 (37%), Gaps = 14/162 (8%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 G +D + +F+ S ++ ++ I+ D D VR+GA ++D Sbjct: 430 KTGCVDTKEVDIYFLIDGSGSIRSDHFEQIKK---FMLEVIENFDIGPDKVRVGAVQYSD 486 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + + T+ K + G T +A+ I E ++++ Sbjct: 487 TREKEFDIT--DYTTDETLRKAISNIRQLGGGTYTGEALDFILQIIKKGREQRINKVPC- 543 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 Y+++LTDG ++ + K +++ I + I Sbjct: 544 -----YLIVLTDG---MSMDDVLEPAEKLRAENIAVHAIGIG 577 >gi|291527684|emb|CBK93270.1| Mg-chelatase subunit ChlD [Eubacterium rectale M104/1] Length = 1237 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 69/166 (41%), Gaps = 18/166 (10%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 IDN+ + ++G F + + ++ + T G T++ A+ + Sbjct: 636 NLIDNLPENSKVGVVKFTSSTTKLTTSLTSDKETAKSYLTTSYFR--SSGGTSMYTAINS 693 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 ++ +++++ + K +++L+DG T ++ A + G++I T+ Sbjct: 694 SFSMFEATDDNIL----------KMMIVLSDGA-TSYTYLHSSVVTTANNNGVKIYTVGL 742 Query: 406 SVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEI 449 + + +L A+ +F+ A+ +L I++D I +I Sbjct: 743 GSSSSSY--FTQYLKPLANNTGGAFYLASDASQLEDIYKD-INKKI 785 >gi|319425442|gb|ADV53516.1| lipoprotein with VWA and DUF3520 domains [Shewanella putrefaciens 200] Length = 638 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 85/264 (32%), Gaps = 23/264 (8%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S+ + + LN F K+ + + + + M+ L Sbjct: 182 KEGSLPEKGTIRIEEMLNYFTYDYPLPNKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYE 241 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + +D S K L++ AL + + + D V+ V GA Sbjct: 242 LTKSELGASNLVFLLDVSGSMASADKLPLLQTALKMLTQQLSAQDKVSIVVYAGAAG--- 298 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 V+ D G + GST + + AY + H ++ Sbjct: 299 -VVLD-----GASGDDIQALTYALEQLRAGGSTNGSQGILQAYQL------AQKHFIQGG 346 Query: 365 LEAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + ++L TDG+ ++ I++ K K +GI + T+ F + + Sbjct: 347 INR---VILATDGDFNVGVTNFDQLISLIEKEKQRGIGLTTLGFGMGNYNDQLMEQLADK 403 Query: 422 CASPNSFFEANSTHELNKIFRDRI 445 + ++ +E K+ D + Sbjct: 404 --GNGHYAYIDTLNEARKVLVDEL 425 >gi|327271798|ref|XP_003220674.1| PREDICTED: matrilin-4-like [Anolis carolinensis] Length = 592 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 77/219 (35%), Gaps = 30/219 (13%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 S GPLD FV SS + +R +I I +D + Sbjct: 32 GTGSQQSKCKTGPLD-----IVFVIDSSRSVRPFEFETMR---RFMIDIIHNLDIGPNAT 83 Query: 296 RMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G ++ +V + S + ++ + I + + M AI M A+ T + Sbjct: 84 RVGVIQYSSQVQNVFSLKSFFTRAEMEKAINNIVPLAQGTMTGLAIQYVMNVAFTTQEGA 143 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + + + V++TDG + + + +A++ GI I + Q Sbjct: 144 RPL-------HKKIPRVAVIVTDG---RPQDRVTEVSAQARAAGIEIYAVGV------QR 187 Query: 414 KARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 L ASP F S +L + F + +++ Sbjct: 188 ADMNSLRAMASPALEEHVFLVESF-DLIQQFGKQFQDKL 225 >gi|327260888|ref|XP_003215265.1| PREDICTED: collagen alpha-1(VI) chain-like [Anolis carolinensis] Length = 1026 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 65/223 (29%), Gaps = 23/223 (10%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 C + + GP D +L + S KK + R A + D V Sbjct: 817 CPDYTCPISFSGPTDITLVVDSSTSVGSRNFNTTKKFVKRLAERFL----SAAKPTEDAV 872 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ ++ R + + + I + + +T +N A+ S+ Sbjct: 873 RVSVVQYSGRTQQKLEVPFEQNYTV--IADSVDKMQFINDATDVNAALNYVTSLFRRSSR 930 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K+ ++ +DG + + I +A+ GI I + + Sbjct: 931 SGAK--------KRMLIF-SDGNSQGITQSAIERAVQEARQAGIEIYVLVVGTQANEPNV 981 Query: 415 ARYFLSNCASPN------SFFEANSTHELNK-IFRDRIGNEIF 450 A + F L + +F + +I Sbjct: 982 RVLVTGKTAEYDVAFGERHLFRVPDYESLLRGVFYQTVSRKIS 1024 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 29/136 (21%), Positives = 58/136 (42%), Gaps = 20/136 (14%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNED 356 GA ++D V + + R+ +K +G T + A++ + ++ Sbjct: 102 GALHYSDEVQLISGLT--SMRTGRSGLKDQVSKVVSIGKGTYTDCAIKRGIEELLIGGS- 158 Query: 357 EVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIA-ICNKAKSQGIRIMTIAFSVNKTQQ 412 H +N KY++++TDG E ++ G+ N+AK QGI++ ++A S Sbjct: 159 --HHKEN-----KYMIVVTDGHPLEGYKEPCGGLEDAANEAKHQGIKVFSVAIS-----P 206 Query: 413 EKARYFLSNCASPNSF 428 LS A+ ++ Sbjct: 207 NHLESRLSVIATDQAY 222 >gi|126334034|ref|XP_001370526.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 1247 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 55/398 (13%), Positives = 115/398 (28%), Gaps = 33/398 (8%) Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + ++ K S K+K +E F NN + + +V V+ S Sbjct: 9 IAEIQQDTYDSDKGSIQTEKEKTKEQHDNWFNNNRHEPYVWIKVGKKEYKALVDT-ALAS 67 Query: 128 AYQVVLSSRYDL-----------LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + L ++ + + L F +G + Y Sbjct: 68 CHGFNLDIKHPIIFQKDAAGFGQSVVQLEGFRLVVGAPLVKVSDSQTGRLYECKYDTGRC 127 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V I I + P + TV +N + Sbjct: 128 VPIPLQIPQEAVNMSLGLSLATGPNSSQLLACGPTVHQVCGKNIYMK---GFCFVLDSKL 184 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + ++ FV S ++ + +VI K+ + + Sbjct: 185 QQQKTIPAALQECPKQENDIVFVIDGSTSIDSHDFQQMKSFVRAVINQFKETNTL----- 239 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +++R+ F++ + G T A++ + S Sbjct: 240 FSLMQYSNRL--KIHFTFADFQRSTNWGNLVNPILQLRGLTYTATAIRKVVTELFQSRNG 297 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 K ++++TDGE QD + + +A+ GI I + + A+ Sbjct: 298 ARKNAT------KILIVITDGEKYQDKLQYKDVIPEAEQAGIIRYAIGVG-DAFEYASAQ 350 Query: 417 YFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 L+ AS F+ ++ L I ++ + +IF Sbjct: 351 EELNIIASQPAKEHVFQVDNFSALKTI-QEDLQEKIFS 387 >gi|22760140|dbj|BAC11083.1| unnamed protein product [Homo sapiens] Length = 540 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGL------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 321 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 375 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 376 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 429 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 430 RAKEEGIVMYAVGVGKAVEAE------LREIAS 456 >gi|149773083|emb|CAO01891.1| collagen typeVI alpha 5 [Mus musculus] Length = 1212 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 56/157 (35%), Gaps = 13/157 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 873 VKKADVGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 930 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + + + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 931 HS-NVLF----TEEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 984 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + ++ +L I+ Sbjct: 985 V-----GNANQDELETMAGKKENTVHVDNFDKLRDIY 1016 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 78/239 (32%), Gaps = 20/239 (8%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + S +G + VS + G +D ++ +F+ S K+ Sbjct: 430 SYSHLESYSGNFLKKIRNEIWTQVSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRKKE 489 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ ++SVI N VR+G ++ + + S + + K Sbjct: 490 FEQIQIFMSSVIDMFPIGPNK---VRVGVVQYSHKNEVEFPVSRYTDGI--DLKKAVFNI 544 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T A+ I + R Y+++LTDG + N+ + Sbjct: 545 KQLKGLTFTGKALDFILPLIKKGKTERTDRAPC------YLIVLTDG---KSNDSVLEPA 595 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 N+ +++ I I I Q L A + + K ++ I + I Sbjct: 596 NRLRAEQITIHAIGIGEANKTQ------LRQIAGKDERVNFGQNFDSLKSIKNEIVHRI 648 >gi|297664534|ref|XP_002810694.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 2 [Pongo abelii] Length = 914 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 55/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSAFTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLRTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|297664532|ref|XP_002810693.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 1 [Pongo abelii] Length = 914 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 55/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSAFTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLRTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|326932831|ref|XP_003212516.1| PREDICTED: cartilage matrix protein-like [Meleagris gallopavo] Length = 493 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 65/194 (33%), Gaps = 23/194 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ SS ++ V+ L+ VI + N + R+G + V ++ S Sbjct: 41 VFIIDSSRSVRPQEFEKVKVFLSRVIEGLDVGPN---STRVGVINYASAVKNEFSLK--T 95 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H+ +++ E T A+Q A S E R N K +++TD Sbjct: 96 HQTKAGLLQAVRRIEPLSTGTMTGLAIQFAISRAFSDAEGARLRSSN---INKVAIVVTD 152 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G ++ A +A+ GI I I L AS + S Sbjct: 153 GRPQDGVQDVSA---RARQAGIEIFAIGVGRVDMHT------LRQIASEPLDDHVDYVES 203 Query: 434 ---THELNKIFRDR 444 +L F++ Sbjct: 204 YSVIEKLTHKFQEA 217 >gi|281338025|gb|EFB13609.1| hypothetical protein PANDA_007564 [Ailuropoda melanoleuca] Length = 901 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 67/216 (31%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 225 NGYFVHYFAPEGLPTIPKNVIFVIDKSGSMSGRKMQQTREALIKILDDLSPKDQFN---- 280 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ +++A G T IN+A+ A + S+ + Sbjct: 281 --LISFSGDAAQWKPLLVPASAENVNQARSYAAGIQAHGGTDINEAVLMAVQLLNSAKQK 338 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 E+ I+LLTDG+ T I K + + F + + Sbjct: 339 ELMPEGTVSL----IILLTDGDPTMGETNPARIQRNVKEAIDGQYSLFCLGFGFDVS--- 391 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N + +++S +L + + Sbjct: 392 --YAFLEKLALDNGGLARRIYEDSDSALQLQDFYEE 425 >gi|281356510|ref|ZP_06243002.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281317202|gb|EFB01224.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 783 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 47/149 (31%), Gaps = 19/149 (12%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++R + + + R G F + + + I D +G T Sbjct: 115 AKFLLRQLVESAPND---RFGLVAFAGKAYLACPLTSDSLAFTQYID-ELNTDTVPLGGT 170 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 + A++ A K + I+L TDG+ N A+ ++ + + Sbjct: 171 NLEAALRVA-----------EQAFKAAAGGNRGILLFTDGDELAGNSA--ALVDELRKRQ 217 Query: 398 IRIMTIAFSVNKTQQ--EKARYFLSNCAS 424 I + + + +A L AS Sbjct: 218 IPLFIVGLGDPEVGAPVPEADGTLKRDAS 246 >gi|4009460|gb|AAC95429.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 55/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSAFTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|4585469|gb|AAD25487.1|AF127036_1 calcium-activated chloride channel protein 1 [Homo sapiens] gi|119593592|gb|EAW73186.1| chloride channel, calcium activated, family member 1 [Homo sapiens] gi|189067292|dbj|BAG37002.1| unnamed protein product [Homo sapiens] Length = 914 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 55/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSAFTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|148226222|ref|NP_001089834.1| hypothetical protein LOC734900 [Xenopus laevis] gi|80477144|gb|AAI08519.1| MGC130922 protein [Xenopus laevis] Length = 840 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 60/168 (35%), Gaps = 17/168 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV S +V+ + ++ S++ R+G ++ V ++ + + Sbjct: 573 VFVIDGSKSLGEDNFEIVKQFVKGILDSLEIS---QKAARVGLIQYSTHVRTEFTMA--Q 627 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + K + + + A++ ++ S E + R + + IV TD Sbjct: 628 YSSAKDVKKAVSQIKYMGRGSMTGLALKLMHEKSFS--EAQGARARPMRVPRVAIVF-TD 684 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 G + +E KAK GI I I ++ L AS Sbjct: 685 G---RAQDEVSEYAEKAKQSGITIYAIGIGKAIDEE------LQEIAS 723 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 58/171 (33%), Gaps = 20/171 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I +K +D D R+G + V ++ S +K I + + T Sbjct: 75 LITMLKFLDIGPDNTRVGLLQYGSTVKNEFSLK--TYKRKPDIERAVKRMMHLATGTMTG 132 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E N + +++TDG + + I KA++ GI I Sbjct: 133 LAIQYAMNIAFSEAEGARPL---NQYVPRIAMIVTDG---RPQDPVAEIAAKARNSGILI 186 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 I L S F + L +F++++ Sbjct: 187 FAIGVGRVDMST------LKTIGSQPHSEHVFLVANFSQIETLTSVFQNKL 231 >gi|4009458|gb|AAC95428.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 55/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSAFTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|15239414|ref|NP_200879.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|9759335|dbj|BAB09844.1| retroelement pol polyprotein-like [Arabidopsis thaliana] gi|332009986|gb|AED97369.1| C3H4 type zinc finger protein [Arabidopsis thaliana] Length = 704 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 60/189 (31%), Gaps = 32/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + ++ Sbjct: 266 TKLALLKRAMGFVIQNLGPFD------RLSVISFSSTARRNFPLRLMTETGKQEALQA-V 318 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT------QD 382 G T I + ++ +I R KN + + IVLL+DG++T Sbjct: 319 NSLVSNGGTNIAEGLKKGARVLID------RRFKNPVSS---IVLLSDGQDTYTMTSPNG 369 Query: 383 NEEGIAICNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + K I + F + + + A S +F S + Sbjct: 370 SRGTDYKALLPKEINGNRIPVHAFGFGADHDA-----SLMHSIAENSGGTFSFIESETVI 424 Query: 438 NKIFRDRIG 446 F IG Sbjct: 425 QDAFAQCIG 433 >gi|8567336|ref|NP_059502.1| calcium-activated chloride channel regulator 1 precursor [Mus musculus] gi|81881572|sp|Q9D7Z6|CLCA1_MOUSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 3; Short=mCLCA3; AltName: Full=Protein gob-5; Flags: Precursor gi|3721912|dbj|BAA33743.1| gob-5 [Mus musculus] gi|15919901|dbj|BAB25815.2| unnamed protein product [Mus musculus] gi|74201990|dbj|BAE22995.1| unnamed protein product [Mus musculus] gi|109731429|gb|AAI16320.1| Chloride channel calcium activated 3 [Mus musculus] gi|109732845|gb|AAI16319.1| Chloride channel calcium activated 3 [Mus musculus] gi|148680065|gb|EDL12012.1| chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 49/161 (30%), Gaps = 27/161 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R ++ + G T+I ++TA+ I + Sbjct: 346 GMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSA-GGTSICSGLRTAFTVIKKKYPTD 404 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 IVLLTDGE+ I+ C + K G I T+A A+ Sbjct: 405 GSE----------IVLLTDGEDN-----TISSCFDLVKQSGAIIHTVAL-----GPAAAK 444 Query: 417 YF--LSNCASPNSFFEANSTHE--LNKIFRD-RIGNEIFER 452 LS + ++ L F GN + Sbjct: 445 ELEQLSKMTGGLQTYSSDQVQNNGLVDAFAALSSGNAAIAQ 485 >gi|59939916|ref|NP_001012385.1| matrilin 3b [Danio rerio] gi|56797875|emb|CAG30518.1| matrilin-3b precursor [Danio rerio] gi|220675929|emb|CAX12088.1| matrilin 3b [Danio rerio] Length = 478 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 68/209 (32%), Gaps = 26/209 (12%) Query: 246 PGPLDPSLSEE---HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P P +P S F+ SS + V+ + + +D +D R+ + Sbjct: 192 PAPAEPCKSRPLDLVFIIDSSRSVRPAEFEKVKI---FLSEMVNSLDIGSDATRVALVNY 248 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 V + K + T A++TA + + + N K Sbjct: 249 ASTVNIEFHLKKYFSKAEVKQAFSRIDPL--STGTMTGMAIKTAMEQVFTENAGARPLKK 306 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K +++TDG + ++ + A++ GI I + + + L Sbjct: 307 G---IGKVAIIVTDG---RPQDKVEEVSAAARASGIEIYAVGVDRAEMRS------LKQM 354 Query: 423 AS---PNSFFEANS---THELNKIFRDRI 445 AS + F + +L FR+ + Sbjct: 355 ASQPLDDHVFYVETYGVIEKLTSKFRETL 383 >gi|242042269|ref|XP_002468529.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] gi|241922383|gb|EER95527.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] Length = 698 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 83/276 (30%), Gaps = 46/276 (16%) Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 ID M D G +S + ++ + KS + Sbjct: 182 IDLQSDMNDEHNAITGSVKIKAYSEFPAIEQSVT--------KEIFAILIHLRAPKSSHS 233 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 PLD + S K L+++A++ VI+++ D R+ F Sbjct: 234 ASSRAPLDLVTVLDV-----SGSMAGTKIALLKNAMSFVIQTLGPND------RLSVIAF 282 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + ++ G T I D ++ I R+K Sbjct: 283 SSTARRLFPLRRMTLAGRQQALQA-VSSLVASGGTNIADGLKKGAKVIED------RRLK 335 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK----------SQGIRIMTIAFSVNKTQQ 412 N + + I+LL+DG++T + + + ++I T F + Sbjct: 336 NPVCS---IILLSDGQDTYTLPSDRNLLDYSALVPPSILPGTGHHVQIHTFGFGSDHDSA 392 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 + A S +F ++ + F IG Sbjct: 393 A-----MHAIAEISSGTFSFIDAEGSIQDGFAQCIG 423 >gi|170742065|ref|YP_001770720.1| hypothetical protein M446_3920 [Methylobacterium sp. 4-46] gi|168196339|gb|ACA18286.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 418 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 67/227 (29%), Gaps = 16/227 (7%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G ++ +LL PV LGV + +D+ + LK A A + A L + Sbjct: 2 RHFFRDRAGQITVLASLLSPVGLGVAALAIDLSTLQMVKQRLKVTADAASLAAVAVLPDT 61 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 +S + + ++ T A QV Sbjct: 62 GTALSRALAIAADNAGAGAGSV--TTAADVQFGSYDSATRTFTAGATPAN------AVQV 113 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS-ML 190 S ++ F R++G + + A A S +Y V+D S S L Sbjct: 114 TASRSEARGNPVITGFARALGWATPDLSASAVAVRFSPAYC-------FLVLDPSASGAL 166 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 NC Q + + ++ N S + Sbjct: 167 SVSGTGRLSVPNCGVQVNSTSASAATTGNNSTAQARSFCITGGYSGS 213 >gi|146327011|gb|AAI41812.1| Matrilin 4 [Homo sapiens] Length = 540 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 60/189 (31%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A++ GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARASGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 321 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 375 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 376 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 429 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 430 RAKEEGIVMYAVGVGKAVEAE------LREIAS 456 >gi|3927992|emb|CAA07569.1| matrilin-4 [Homo sapiens] gi|124297520|gb|AAI31764.1| Matrilin 4 [Homo sapiens] gi|153217472|gb|AAI51220.1| Matrilin 4 [Homo sapiens] Length = 581 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 60/189 (31%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A++ GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARASGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 362 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 416 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 417 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 470 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 471 RAKEEGIVMYAVGVGKAVEAE------LREIAS 497 >gi|86143680|ref|ZP_01062056.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] gi|85829723|gb|EAQ48185.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] Length = 349 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/117 (19%), Positives = 37/117 (31%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + D TAI DA++ A + Sbjct: 131 RVGIIAYAGSAYPQLPITTDYSSAKMFLNAMN-TDMLSSQGTAIRDAIELAKTYYNDEEQ 189 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + +V+++DGE+ IA A QGIRI TI + Sbjct: 190 TN-----------RVLVIISDGEDHAGEVASIA--ESATEQGIRIFTIGVGSEAGDR 233 >gi|293396639|ref|ZP_06640915.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] gi|291420903|gb|EFE94156.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] Length = 325 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/158 (20%), Positives = 56/158 (35%), Gaps = 20/158 (12%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + V+++++ + + + R+G F ++ S L I + Sbjct: 117 ITRLQAVKNSVSKFVAA-------RQSDRIGLVIFANQAWPFAPVSEDKQALQTRITQLS 169 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A + SS N +A K +LLTDG +T Sbjct: 170 --PGMVGEQTAIGDALGVAVKLLDSSA---------NQDASKLAILLTDGNDTASQLAPP 218 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA 423 A + +++ TIAF + + L A Sbjct: 219 LAAQLAAAHHVQVHTIAFGDSNSAGSDHVDLTQLQEIA 256 >gi|38348336|ref|NP_940912.1| inter-alpha-trypsin inhibitor heavy chain H5-like protein precursor [Homo sapiens] gi|74762375|sp|Q6UXX5|ITH5L_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5-like protein; Short=Inter-alpha inhibitor H5-like protein; Flags: Precursor gi|37181448|gb|AAQ88537.1| ITI-like protein [Homo sapiens] gi|57208182|emb|CAI42356.1| inter-alpha (globulin) inhibitor H5-like [Homo sapiens] gi|57209935|emb|CAI42344.1| inter-alpha (globulin) inhibitor H5-like [Homo sapiens] gi|119613592|gb|EAW93186.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_a [Homo sapiens] gi|162318154|gb|AAI57043.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct] gi|162318494|gb|AAI56206.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct] Length = 1313 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 57/178 (32%), Gaps = 22/178 (12%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + P L P FV S K + A+ ++ ++ D Sbjct: 269 IHYFAPRGLPPMEKNVVFVIDVSSSMFGTKMEQTKTAMNVILSDLQAND----------- 317 Query: 301 FFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +FN SD W I+ + K + G T +N A+ A + SN Sbjct: 318 YFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLHCMEADGWTDVNSALLAAASVLNHSN 377 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNK 409 ++ I+ LTDGE T I + + + + ++AF + Sbjct: 378 QEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDA 433 >gi|73542573|ref|YP_297093.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72119986|gb|AAZ62249.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 340 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 68/198 (34%), Gaps = 24/198 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 LD S S + +I + VR+ ++S + + R+G F D Sbjct: 99 ALDLSQSMDTRDFRDPSGALIPRVQAVREVVSSFVAR----RPGD---RIGLIVFGDAPY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ H L++T+++ + M + + + + Sbjct: 152 PLAPFTLD-HALVQTMIR------------DLLPGMAGPSTALGDAVGLGIKMFDQSPAP 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA-- 423 +K +++LTDG +T + AK + + + TI + E+ L A Sbjct: 199 EKVLIVLTDGNDTASKMPPERAADIAKQRHVTVHTIGIGDPSAEGEQRVDLGVLQRMAAQ 258 Query: 424 SPNSFFEANSTHELNKIF 441 + +F + L I+ Sbjct: 259 TGGRYFFGADQNSLESIY 276 >gi|21539497|gb|AAM53301.1| putative protein [Arabidopsis thaliana] gi|23198328|gb|AAN15691.1| putative protein [Arabidopsis thaliana] Length = 704 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 60/189 (31%), Gaps = 32/189 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + ++ Sbjct: 266 TKLALLKRAMGFVIQNLGPFD------RLSVISFSSTARRNFPLRLMTETGKQEALQA-V 318 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT------QD 382 G T I + ++ +I R KN + + IVLL+DG++T Sbjct: 319 NSLVSNGGTNIAEGLKKGARVLID------RRFKNPVSS---IVLLSDGQDTYTMTSPNG 369 Query: 383 NEEGIAICNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + K I + F + + + A S +F S + Sbjct: 370 SRGTDYKALLPKEINGNRIPVHAFGFGADHDA-----SLMHSIAENSGGTFSFIESETVI 424 Query: 438 NKIFRDRIG 446 F IG Sbjct: 425 QDAFAQCIG 433 >gi|260837294|ref|XP_002613640.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] gi|229299026|gb|EEN69649.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] Length = 216 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 61/181 (33%), Gaps = 19/181 (10%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 F+ S + ++ L +++ G ++ + + + H Sbjct: 7 FLVEGSRSVSALEFEKMKTFLNNIVGQFDIGPTATQV---GVVQYSWFIRQECAL--NAH 61 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + ++ + + T A+ A +T +++ + + + K +V++TDG Sbjct: 62 SSLASLQQAISNITVLGLGTHTGAALTFARNTALTA----ANGARPGVP--KIVVVMTDG 115 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHE 436 + D ++ G+ I+ S + L + A SP+ F A Sbjct: 116 ASEDDVTLPSQ---NLRNDGVITFAISVSWSLPN----DRLLQDIAGSPDRIFAATDFDA 168 Query: 437 L 437 L Sbjct: 169 L 169 >gi|260797291|ref|XP_002593637.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] gi|229278863|gb|EEN49648.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] Length = 371 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 69/232 (29%), Gaps = 30/232 (12%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + D +E + D S S V + + + CN + P Sbjct: 145 GQLTDIASTNETLGVVDNFNLLDDVRNSLLSSVCSVLNDNFLTTLTIQDCNSDHISITMP 204 Query: 247 --------GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 P + + FV S + V+ + ++ D+ R+G Sbjct: 205 CYTLLEKVTPPCNNPVDIVFVLDGSGSVGRRNFEKVQAGVKKIVGDFNIA---LDSTRVG 261 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA----YDTIISSN 354 ++ V + + ++ + G T AM+ A + + + Sbjct: 262 VVQYSSIVRQEFAL--DTFSNLQGLESGIQSIPYMAGGTRTGAAMEYAIQNSFTSANGAR 319 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 D H IVL+TDG + ++ KAK GI + + Sbjct: 320 PDVGH----------VIVLVTDG---RSYDDVSQASQKAKQAGIVVFAVGIG 358 >gi|85708696|ref|ZP_01039762.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] gi|85690230|gb|EAQ30233.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] Length = 640 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 60/323 (18%), Positives = 100/323 (30%), Gaps = 56/323 (17%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 Y V V + + +Q + D V S + + Sbjct: 339 YESRGDVFFDIVEERTGESETFQNWTYKPVTYNLASLYDDNVISLPIGSNGSNT-NVTWD 397 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV----RDALASVIRSIK 286 + N P P D + + S+ + D+ +V+ I Sbjct: 398 GCIEEANTVATDTFDPFPQDAHDLKINLTPSNVNEYWKPVLRNATWKREDSSGNVLGHIT 457 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + N N R G S P+ ++ + + RT ++T+ +T + M Sbjct: 458 QTGNEN---RPG--------YSCPAAAFKLTDISRTDLETYVDGLTPRSNTYHDFGMIWG 506 Query: 347 YDTIISSNEDEVHRMK--NNLEAKKYIVLLTDG-------------------ENTQDNEE 385 I + N ++IV +TDG T D Sbjct: 507 ARFISPNGIFAASNATAPNGDAISRHIVFMTDGLLVPNQEIYSMYGIEWWDRRITNDGSG 566 Query: 386 GIA----------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 G A C A+ + I + IAF TQ L +CA+P F+AN T Sbjct: 567 GQARDRHATRFQVACRAARQENISVWVIAFGTTLTQN------LIDCATPGRAFQANDTA 620 Query: 436 ELNKIFRDRIGNEIFERVIRITK 458 L F ++I EI +R+T+ Sbjct: 621 ALETRF-EQIAQEIAA--LRLTQ 640 >gi|311245368|ref|XP_003121804.1| PREDICTED: integrin alpha-11-like [Sus scrofa] Length = 1055 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 205 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 256 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 257 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSEKDNVTRYAVAVLGYYNRRGI 314 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 315 NPEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 357 >gi|262073024|ref|NP_001159971.1| integrin alpha-2 [Bos taurus] gi|296475809|gb|DAA17924.1| integrin alpha-2 [Bos taurus] Length = 1179 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 45/298 (15%), Positives = 87/298 (29%), Gaps = 32/298 (10%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + T + + S+ V + +M + F + SQ G Sbjct: 86 STTTCEKLNLQTSTSMSNVTEMKTNMSLGLTLTRNVGTGGFLTCGPLWAQQCGSQYYTTG 145 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + + + + + + P +D V S V++ L ++ Sbjct: 146 VCSDVSPDFQLRTSFAPAVQTCPSFIDV-----VVVCDESNSIY--PWDAVKNFLEKFVQ 198 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS-TAINDA 342 + MG + + + K ++K + G T A Sbjct: 199 GLDIGPTKTQ---MGLIQYANNPRVVFNL--NTFKSKDEMIKATSQTFQYGGDLTNTFKA 253 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEA--KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 +Q A D + K +V++TDGE + D + A+ ++ I Sbjct: 254 IQYA--------RDTAYSTAAGGRPGATKVMVVVTDGE-SHDGSKLKAVIDQCNKDNILR 304 Query: 401 MTIAFSV----NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA N + + AS FF + +L + IG +IF Sbjct: 305 FGIAVLGYLNRNALDTKNLIKEIKAIASIPTERHFFNVSDEADLLEK-AGTIGEQIFS 361 >gi|315223476|ref|ZP_07865333.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] gi|314946649|gb|EFS98640.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] Length = 347 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 48/139 (34%), Gaps = 18/139 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A +I ++ R+G + + + H + ++ D Sbjct: 112 LEKAKRIAFETISQLKGD----RVGIVAYAASAYPQLALTTD-HSAAKMFLQGMNTDMLS 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI +A++ A + N + + +LTDGE+ + I +A Sbjct: 167 SQGTAIQEAIRMASNY-----------FDENTPTARLLFILTDGEDHE--MGATEIATEA 213 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 + +G+ I TI K Sbjct: 214 QEKGVHIYTIGIGTEKGAP 232 >gi|171058998|ref|YP_001791347.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] gi|170776443|gb|ACB34582.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] Length = 350 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 63/193 (32%), Gaps = 30/193 (15%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS- 336 +AS + + + ++ VR+G F + + + I + +GS Sbjct: 116 VASQVAAKNFVKDLPRHVRVGVVSFAGTAAVVQAPTHSRDDVFAAIDRFQLQRGTAIGSG 175 Query: 337 ----------------TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + I + S +D+ + ++LLTDG+ T Sbjct: 176 IVLSLATLFPEAGIDLSDITGERRMPKGIGESDKQDDFKPVAPGSYGSAAVILLTDGQRT 235 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------EKARYFLSNCA--SPNSF 428 + + A +G+++ T+ F + L N A + + Sbjct: 236 TGP-DPMDAAKMAADRGVKVYTVGFGTTSGEIIGFEGWSMRVRLDEATLKNIANLTQAEY 294 Query: 429 FEANSTHELNKIF 441 F A S +L K++ Sbjct: 295 FYAGSATDLQKVY 307 >gi|159896782|ref|YP_001543029.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889821|gb|ABX02901.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 950 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 46/151 (30%), Gaps = 21/151 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F+D + V G T + M AY+ + S+ Sbjct: 460 KLGVVTFDDSAHWTIELD---KVPSQDDVVAALAPVPPSGQTNVVSGMNAAYEQLRQSDA 516 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 H +LLTDG + IA GI + +A + Sbjct: 517 KIKHA-----------ILLTDGWGHATDIGSIA--ENMNKDGITLSVVAAGNGSDNALQR 563 Query: 416 RYFLSNCASPNSFFEANSTHELNKIF-RDRI 445 L ++ A E+ +IF ++ I Sbjct: 564 YAELGG----GRYYPARVMEEVPQIFLQETI 590 >gi|268611865|ref|ZP_06145592.1| von Willebrand factor type A [Ruminococcus flavefaciens FD-1] Length = 550 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 60/189 (31%), Gaps = 21/189 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +DSS + K LV+ A + + + K D R+ + S Sbjct: 187 PPSNLVFLIDSSGSMNSYDKLPLVQSAFSMLAEQLDKND------RISIVTY---AGSSA 237 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 G + GST ++TAY+ E H +K Sbjct: 238 VLLDGEKGSNTDEILEQLYSITASGSTNGEGGIKTAYEL------AEEHFIKGGNNR--- 288 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ EE + + GI + + F + + N S Sbjct: 289 VILATDGDLNVGASSEEELTRLIETKRDNGIYLSVLGFGEGNYKDARMEALADNGNGNFS 348 Query: 428 FFEANSTHE 436 + ++ E Sbjct: 349 YIDSEDEAE 357 >gi|226496057|ref|NP_001151334.1| LOC100284967 [Zea mays] gi|195645892|gb|ACG42414.1| protein binding protein [Zea mays] Length = 516 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 78/270 (28%), Gaps = 28/270 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S D + P S+N + +E ++ Sbjct: 2 SFSDDEPIIPSPNSGPRPTPIVPGRVQLVSKNNNMAPLEENTQKVLLELTGG------DS 55 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 D S + V S +K ++ A+ V++ + ID R+ F D Sbjct: 56 TSDRSGLDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSSID------RLSIVTFLDTAN 109 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++K G+T I+D +QT + ++ + Sbjct: 110 RICPLQQVTEDSQPQLLKLI-DALQPGGNTNISDGLQTGLKVLAD------RKLSSGRVV 162 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L++DG+ + K + + T F + N + + Sbjct: 163 G--VMLMSDGQQNRG-----EPAANVKIGNVPVYTFGFGAD-YDPTVLNAVARN-SMGGT 213 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F N + L+ F + + V +T Sbjct: 214 FSVVNDVNLLSMAFSQCLAGLLTVVVQDLT 243 >gi|74136383|ref|NP_001028084.1| calcium-activated chloride channel regulator 1 precursor [Macaca mulatta] gi|75043731|sp|Q6PT52|CLCA1_MACMU RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Flags: Precursor gi|46371863|gb|AAS90562.1| calcium-activated chloride channel family member 1 [Macaca mulatta] Length = 913 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/150 (20%), Positives = 48/150 (32%), Gaps = 26/150 (17%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I ++ A+ I + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLTKR-LPTAASGGTSICSGLRLAFTVIKKKYPTD 403 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 IVLLTDGE+ I+ C N+ K G I T+A + ++ Sbjct: 404 GSE----------IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAARELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFR 442 K L AS + + + L F Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFG 473 >gi|308472879|ref|XP_003098666.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] gi|308268266|gb|EFP12219.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] Length = 382 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 77/207 (37%), Gaps = 19/207 (9%) Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 S +S LD L D+S + ++++++++ + +N Sbjct: 13 SNVRSADEECGGTVLDLWLDIVIVADNSQRVNQNNSVVDIQNSISNIFEIVPI--PIN-- 68 Query: 295 VRMGATFFNDRVISDPSF----SWGVHKLIRTIVKTFAIDENEMGSTA-INDAMQTAYDT 349 R+G +N + SWG L + + ++ +T+ I + TA + Sbjct: 69 -RVGFVTYNSLATINADLNKFKSWG--DLSQGVNDSYNNMNLSSENTSFIGTGLITAGEL 125 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + K I++ N + +++ N KS GI I+T+A + Sbjct: 126 LQVQGS-----AIGRVYYPKVIIVYASAFNGTGLLDPLSVANTLKSAGITIITVAVDTDN 180 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHE 436 + + L++ ASP S F + + Sbjct: 181 NGVIQKQ--LASIASPGSAFSLDPDDD 205 >gi|156409371|ref|XP_001642143.1| predicted protein [Nematostella vectensis] gi|156229284|gb|EDO50080.1| predicted protein [Nematostella vectensis] Length = 332 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/109 (22%), Positives = 44/109 (40%), Gaps = 13/109 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G ++ R + K R I+ G T A++ A S Sbjct: 237 TRVGMILYSTRSYPIFRLNQYTSK--RAIMGKIRNVRYPAGGTRTGQALRYARRYFFSGR 294 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + + K+ ++LLTDG +QD+ +G A+ + ++ G + TI Sbjct: 295 KPKGR--------KRVLILLTDGI-SQDSVKGPAL--QLRNAGAEVFTI 332 >gi|119901955|ref|XP_602058.3| PREDICTED: integrin, alpha 11 [Bos taurus] gi|297479009|ref|XP_002690571.1| PREDICTED: integrin alpha 11 subunit-like [Bos taurus] gi|296483745|gb|DAA25860.1| integrin alpha 11 subunit-like [Bos taurus] Length = 1194 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 205 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 256 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 257 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 314 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 315 NPEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 357 >gi|74000923|ref|XP_535527.2| PREDICTED: similar to integrin, alpha 11 precursor [Canis familiaris] Length = 1183 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 194 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 245 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 246 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 303 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 304 NPEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 346 >gi|301064778|ref|ZP_07205158.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441153|gb|EFK05538.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 625 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/116 (18%), Positives = 40/116 (34%), Gaps = 15/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ G T I A+ Y+ + N Sbjct: 130 RVGLLPFAGTAFLMCPLTLDYDAFRNSLEALDTN-IIPQGGTDIASAI---YEAEAAFNN 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 D H K +VL++DGE+ + E ++ AK + + I T+ + Sbjct: 186 DANH---------KILVLVSDGEDLEG--EALSAAQAAKERDLTIYTVGVGTPSGE 230 >gi|218129580|ref|ZP_03458384.1| hypothetical protein BACEGG_01157 [Bacteroides eggerthii DSM 20697] gi|217988310|gb|EEC54633.1| hypothetical protein BACEGG_01157 [Bacteroides eggerthii DSM 20697] Length = 212 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 62/188 (32%), Gaps = 21/188 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + + +S + V++ + ++I +++ +T + FN + Sbjct: 7 YLLLDTSGSMYGEPIEAVKNGVQTLISTLRGDPYALETAYISIITFNSVAQQVTPLT--- 63 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + G TA+ +A+ + + +K + + + L+TD Sbjct: 64 -----ELSAFQQPQIEASGCTALGEALTLLAQKVDTEIVKTTQEVKGDWKP--LVFLMTD 116 Query: 377 GENTQDNEEGIAICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 GE T D ++G+ ++ K + G+ + Q L + + Sbjct: 117 GEPTDDLQKGL---DEFKKRKFGMVV------ACAAGQGANTNTLKKITENVVQLDTADS 167 Query: 435 HELNKIFR 442 + F+ Sbjct: 168 ATIKAFFK 175 >gi|89889806|ref|ZP_01201317.1| BatB [Flavobacteria bacterium BBFL7] gi|89518079|gb|EAS20735.1| BatB [Flavobacteria bacterium BBFL7] Length = 343 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/127 (20%), Positives = 44/127 (34%), Gaps = 14/127 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + +I N + R+G + + + + D TAI +A+Q Sbjct: 119 VTQIINNLASDRIGLIAYAGSAVPQLPITTDYSSAKMFLQSMN-TDLVSSQGTAIAEAIQ 177 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E K +V+++DGE+ + E + A GIRI+TI Sbjct: 178 LAESYYSEDTEAS-----------KVLVIISDGEDHEG--EALDYAEAAAENGIRIITIG 224 Query: 405 FSVNKTQ 411 K Sbjct: 225 VGTEKGG 231 >gi|255066322|ref|ZP_05318177.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] gi|255049532|gb|EET44996.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] Length = 538 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 65/204 (31%), Gaps = 23/204 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S K LV+ L + + ++ D V + Sbjct: 170 PPANLVFLVDVSGSMDEENKLPLVQKTLRILTQQLRPQDKVT------LITYASGEDLVL 223 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + G+T A+Q AY+ + + Sbjct: 224 PP---TSGADKETILSAIDKLRAGGATDGESALQMAYEQ--------AQKAFVPNGINR- 271 Query: 371 IVLLTDGENTQDNEEG---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + ++ + + G+ + T+ F + ++ A + Sbjct: 272 ILLATDGDFNVGVSDTETLKSMVAEKRKSGVSLSTLGFGMGNYNEDMMEQIAD--AGDGN 329 Query: 428 FFEANSTHELNKIFRDRIGNEIFE 451 + ++ E K+ + ++ + + Sbjct: 330 YSYIDNEKEAKKVLQQQLTSTLAT 353 >gi|90410254|ref|ZP_01218271.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] gi|90329607|gb|EAS45864.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] Length = 370 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/183 (20%), Positives = 67/183 (36%), Gaps = 17/183 (9%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 DA+ V+ + + + R+G F D F+ + + +T Sbjct: 123 DAVKQVLAQFSQHRDGD---RLGLILFGDAAYLQAPFTADHETWLALLDETQV--GMAGQ 177 Query: 336 STAINDAMQTAYDTIISSNEDEVH-------RMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ST + DA+ A + + +K +++LTDG +T I Sbjct: 178 STHLGDAIGLAIKVFNDQADAAKQAATQQGSAVVTRPVKEKVVIVLTDGNDTDSLVPPID 237 Query: 389 ICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDR 444 A S+ IRI IA + +Q + AS F+A S EL ++++ + Sbjct: 238 AAKVAASRDIRIHMIAMGDPRTVGEQALDMEVIEQVASLTGGQSFQALSPAELTRVYK-K 296 Query: 445 IGN 447 IG Sbjct: 297 IGE 299 >gi|73541336|ref|YP_295856.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72118749|gb|AAZ61012.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 354 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 69/227 (30%), Gaps = 49/227 (21%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A +I + VR+G F I + +++ I + Sbjct: 106 TRIGAAQQAARDLIVGL---PPG---VRLGIVSFAATAIVVLPPTDNRLRMLDAIDRFEL 159 Query: 329 IDENEMG--------------STAINDAM--------QTAYDTIISSNEDEVHRMKN--- 363 + G + + T ++ + + R ++ Sbjct: 160 QNGTATGSGLIQSLAVLFPDDGIDLEGILFGGESLAPGTGGRSLTEAAAADAVRKRDLEQ 219 Query: 364 -NLEAKKY----IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 Y ++LL+DG T + + A +G+R+ T+ F + + Sbjct: 220 PGAAPGSYRHGAVILLSDGRRTTGP-DPLDAARMAAQRGLRVYTVGFGAVQDAGTEGSSL 278 Query: 419 ----------LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L A + +F+A S +L +++R G ER Sbjct: 279 SYEMQVDEPTLRQIATLTDGEYFQAGSAADLTRVYRQLSGRFALERS 325 >gi|302143246|emb|CBI20541.3| unnamed protein product [Vitis vinifera] Length = 630 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 64/202 (31%), Gaps = 34/202 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ +I+++ D R+ F+ + R Sbjct: 220 SKLSLLKRAVCFLIQNLGPSD------RLSIVSFSSTARRIFPLRR-MSDNGREAAGLAI 272 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NE 384 G T I + ++ + +E I+LL+DG++T + N Sbjct: 273 NSLTSSGGTNIVEGLKKGVRVLEERSE---------QNPVASIILLSDGKDTYNCDNVNR 323 Query: 385 EGIAICNKA-----KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + C + + I + T F + + + S +F S + Sbjct: 324 RQTSHCASSNPRQGRQAIIPVHTFGFGSDHDSTA-----MHAISDESGGTFSFIESVATV 378 Query: 438 NKIFRDRIGN--EIFERVIRIT 457 F IG + + +R+T Sbjct: 379 QDAFAMCIGGLLSVVAQELRLT 400 >gi|308466921|ref|XP_003095711.1| hypothetical protein CRE_10578 [Caenorhabditis remanei] gi|308244476|gb|EFO88428.1| hypothetical protein CRE_10578 [Caenorhabditis remanei] Length = 637 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 53/134 (39%), Gaps = 9/134 (6%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 +I + + VR+G ++ + H +++ + E G+T DA Sbjct: 482 SAISSLPISQEAVRVGLISYSGPGRTHVRVYLDKHNEKEKLIEEMFLMERHGGTTRTADA 541 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 ++ A E H + N+ KK +V+ TDG + A+++G++++ Sbjct: 542 IRYATKIF----EGMAHPARRNV--KKVLVVFTDG---YSQDSPRDAARVARAKGLQLIA 592 Query: 403 IAFSVNKTQQEKAR 416 +A ++ + Sbjct: 593 VAVKDRLAPPDEEQ 606 >gi|1708567|sp|P53710|ITA2_BOVIN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|439696|gb|AAB59255.1| integrin alpha 2 subunit [Bos taurus] Length = 1170 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 45/298 (15%), Positives = 87/298 (29%), Gaps = 32/298 (10%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + T + + S+ V + +M + F + SQ G Sbjct: 77 STTTCEKLNLQTSTSMSNVTEMKTNMSLGLTLTRNVGTGGFLTCGPLWAQQCGSQYYTTG 136 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + + + + + + P +D V S V++ L ++ Sbjct: 137 VCSDVSPDFQLRTSFAPAVQTCPSFIDV-----VVVCDESNSIY--PWDAVKNFLEKFVQ 189 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS-TAINDA 342 + MG + + + K ++K + G T A Sbjct: 190 GLDIGPTKTQ---MGLIQYANNPRVVFNL--NTFKSKDEMIKATSQTFQYGGDLTNTFKA 244 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEA--KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 +Q A D + K +V++TDGE + D + A+ ++ I Sbjct: 245 IQYA--------RDTAYSTAAGGRPGATKVMVVVTDGE-SHDGSKLKAVIDQCNKDNILR 295 Query: 401 MTIAFSV----NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA N + + AS FF + +L + IG +IF Sbjct: 296 FGIAVLGYLNRNALDTKNLIKEIKAIASIPTERHFFNVSDEADLLEK-AGTIGEQIFS 352 >gi|242065788|ref|XP_002454183.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] gi|241934014|gb|EES07159.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] Length = 703 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 42/114 (36%), Gaps = 16/114 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ H + ++ Sbjct: 276 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQA-V 328 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G T I DA++ A I ED H I+LL+DG++T + Sbjct: 329 NSLGASGGTNIADALKKAAKVI----EDRSH-----QNPVCSIILLSDGQDTYN 373 >gi|242091866|ref|XP_002436423.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] gi|241914646|gb|EER87790.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] Length = 636 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 45/121 (37%), Gaps = 16/121 (13%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIR 321 R + L++ A ++ ++ D R+ FNDR + R Sbjct: 103 RRPTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFNDRPVKELSSGLLYMSADGRR 156 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 +K+ G TA+ A + A + D +R+ +IVLLTDGE+T Sbjct: 157 KAMKS-VDQLEARGGTALVPAFEEAVKVLDGRVGDGRNRLG-------FIVLLTDGEDTS 208 Query: 382 D 382 Sbjct: 209 G 209 >gi|33321021|gb|AAQ06268.1| unknown [Sorghum bicolor] Length = 610 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 45/121 (37%), Gaps = 16/121 (13%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIR 321 R + L++ A ++ ++ D R+ FNDR + R Sbjct: 95 RRPTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFNDRPVKELSSGLLYMSADGRR 148 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 +K+ G TA+ A + A + D +R+ +IVLLTDGE+T Sbjct: 149 KAMKS-VDQLEARGGTALVPAFEEAVKVLDGRVGDGRNRLG-------FIVLLTDGEDTS 200 Query: 382 D 382 Sbjct: 201 G 201 >gi|307245403|ref|ZP_07527491.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853744|gb|EFM85961.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 538 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 46/397 (11%), Positives = 123/397 (30%), Gaps = 26/397 (6%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 72 LEEVSSRAKNSFTF----------PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ + S ++ + + + L + D+ + T Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKTVNN 129 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG-IKSWLIQTKAEAETVSRSYHK-EHGVSI 179 + + ++ + S F +G ++ Q +++ + + + + Sbjct: 130 TSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIPIDL 189 Query: 180 QWVIDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE-KLSPYMVSCN 237 V D S SM D + + N +K + ++ +E Sbjct: 190 MVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITAFAM 249 Query: 238 KSLYYMLYPGPLD-PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + L + H + S ++ + H V + +D Sbjct: 250 GAEHPKENKCVLPFVLQNNLHEMSKSKIKQYLTSSHKSLRRTEFVDNFVALLDTEATLNS 309 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTI--------VKTFAIDENEMGSTAINDAMQTAYD 348 +G ++ G+ K + + G T + + TA + Sbjct: 310 IGKPNYDIIFPKSSICLEGLKKASQFWYTKEEKEKFRNRVDSLKANGGTLASSGLLTASN 369 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 ++S N E K+ I++L+DG + N Sbjct: 370 QMLSEKSRSEEL---NQETKRVILVLSDGNDDMSNLN 403 >gi|327542784|gb|EGF29248.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 264 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/263 (14%), Positives = 85/263 (32%), Gaps = 21/263 (7%) Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY--PGPLDPSL 253 + QP+ P T Q V + ++ + S+ Y P P Sbjct: 20 NSPQPVFSPLFPTMGTNLEIRPQRVAVSTQSTMDVALVIDRSGSMAYASDETPDPYVNPA 79 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S + + ++ + + ++ ++ D + Sbjct: 80 SAPPGWTYGDPVPPNSRWLDLVASVNAFNGFLVDSP---QYEKLCLATYSSTASRDCDLT 136 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ + + + G T++ ++ + D H K + + +VL Sbjct: 137 HTYAEISNELDAISY--QFDGGGTSVGYGLEHGLAVLT----DATHARKFAV---RVMVL 187 Query: 374 LTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 +TDG NT + E + ++ G+ + TI FS + Q + + C F A Sbjct: 188 MTDGHHNTGKSPESMMY--HLQNHGVTLFTITFS-DDADQSRMSNLANACG--GENFHAT 242 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 +L F+ +I ++ + + Sbjct: 243 DASQLQNAFQ-KIAKKLPSLMTQ 264 >gi|321475774|gb|EFX86736.1| hypothetical protein DAPPUDRAFT_221972 [Daphnia pulex] Length = 891 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 70/465 (15%), Positives = 143/465 (30%), Gaps = 74/465 (15%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 +++P + G L+++ Y ++ ++V QS +V++ A+N+ F Sbjct: 94 VVLPETAFISGFLMEIDGKEYKASVKEKEKAQEEYQSAVDAGQSAAQVTANARNANQFTV 153 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 E + F N + R + V V ++ L +N +++ Sbjct: 154 SVNIEPEQKIFFNLTYEELLSRRKGIYEQAIHVTPGSVVPKMSVRVNIFETLPINKITVP 213 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + ++ VS+ + + + + +GQ + + Sbjct: 214 QLRGNDVEQDHNSTENEIAQIIRTNETTVVSVVYEPAEAEQIKMSKDGLQGQFVVQY--- 270 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 D S + G++ + D + V P L FV +S Sbjct: 271 -DVDRSSIEKKGGEIHVVDGYFVHFFV-----------PADLPTLPKHVIFVLDTSGSMA 318 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVR--MGATFFN--------DRVISDPS------ 311 + + A+ S++ ++K +++ V G T ++ D + Sbjct: 319 GTRIEQTKQAMNSILDQLRKDEDIFSVVEFSSGVTEWDLRKPYKGPDHYYFNSPPEETTE 378 Query: 312 ---------------------FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 ++ V + K F + ST INDA+ A Sbjct: 379 DATAVPQNNESEVKFGPYDDILAYPVTEQSVKRAKEFVAAMDVTSSTNINDALLLALK-- 436 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS----QGIRIMTIAFS 406 +++ + I+ LTDGE T + I + + I +AF Sbjct: 437 ------NSQSVQSRVRLTPIIIFLTDGEPTASVTDTTEILKNVRKGNSDDVVSIFCLAF- 489 Query: 407 VNKTQQEKARYFLSNCASPN-----SFFEANSTHELNKIFRDRIG 446 FL+ +S N +EA K F D + Sbjct: 490 ----GTGTDYQFLTKISSQNRGFARKIYEAADATLQLKGFFDEVA 530 >gi|281182610|ref|NP_001162037.1| matrilin-4 [Pongo abelii] gi|134093133|gb|ABO52993.1| matrilin 4 isoform 1 precursor [Pongo abelii] Length = 581 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRSEDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Query: 434 THELNKIFR 442 +L + F Sbjct: 199 F-DLIQEFG 206 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 362 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 416 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 417 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 470 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 471 RAKEEGIVMYAVGVGKAVEAE------LREIAS 497 >gi|73974062|ref|XP_548552.2| PREDICTED: similar to matrilin 2 isoform a precursor [Canis familiaris] Length = 978 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 84/247 (34%), Gaps = 23/247 (9%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + Y+ C++ Sbjct: 630 EGFRLAEDGKRCRRKDVCKSTYHGCEHICVNHGNSYICKCSEGFVLAEDGKRCKRCTEGP 689 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S + +V+ +A +I S+ + R+G ++ +V ++ + Sbjct: 690 LDLVFVIDGSKSLGEENFEIVKQFVAGIIDSL-AVSPKA--ARVGLLQYSTQVRTEFTL- 745 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G R + K A + + A++ ++ + E + + + ++ Sbjct: 746 -GDFSSARDMKKAVAHMKYMGKGSMTGLALKHMFERSFNPVE-GARPVSPGV--SRVAIV 801 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ A +A++ GI + + ++ L AS F Sbjct: 802 FTDG---RAQDDVSAWARRARAGGITMYAVGVGKAIEEE------LQEIASEPTDKHLFY 852 Query: 431 ANSTHEL 437 A + Sbjct: 853 AEDFSTM 859 >gi|301616677|ref|XP_002937788.1| PREDICTED: collagen alpha-6(VI) chain-like, partial [Xenopus (Silurana) tropicalis] Length = 1529 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 61/166 (36%), Gaps = 13/166 (7%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++ ++ + K D V+ GA ++D + +K+ I+ D + Sbjct: 211 KEMQNFMVSLVNKSAVGPDNVQFGALKYSDYNTELFYLNRYTNKV--DIINHINKDTTQG 268 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T A++ + + E H + + ++++TDG+ + D ++ + + Sbjct: 269 GNTYTAGAVRFSKEFFT-----EKHGSRKARGVPQIVMVITDGD-SHDKDKLNETARQLE 322 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +GI I I Q + + +F + L I Sbjct: 323 QEGIIIYAIGI-----DQANTNELETLAGTEGKWFMVANFSGLQDI 363 >gi|262196282|ref|YP_003267491.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079629|gb|ACY15598.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 340 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 87/237 (36%), Gaps = 33/237 (13%) Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 G P + +G Y + G PY +S+ Y L E+ Sbjct: 133 GPPADAYGPGDPCAAPDYQPEPPYEGDPCAPFEPYGPGDPRSI--GGYDERPAGELGPEY 190 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 D + V +K + R A ++ +I++++ RM FFN+R+ + + G+ Sbjct: 191 PADEPAAPLVPRKIDVAR---AELVHAIERLEPG---TRMNVLFFNNRLEAMAATIVGLD 244 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + R + F + G TA+ A++TA ++ ++ +VLL+DG Sbjct: 245 EPGRDALVHFVLSTEPYGPTALAPALRTAL----------------SMNPRR-LVLLSDG 287 Query: 378 ENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEA 431 D + +A G+RI TI Q + L A ++A Sbjct: 288 LGNVGGDASAVLRDAREAMLGGVRIDTIGI-----GQAQDDRLLRALAEESGGLYQA 339 >gi|156409367|ref|XP_001642141.1| predicted protein [Nematostella vectensis] gi|156229282|gb|EDO50078.1| predicted protein [Nematostella vectensis] Length = 193 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 50/143 (34%), Gaps = 18/143 (12%) Query: 296 RMGATFFNDRVISDPSFSW--GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G ++ R F + +++ I + I G T AM+ A + S Sbjct: 42 RIGLIVYSSRSYLVGGFRRYRNLRSVLQAIKRIRYIR----GGTYTGKAMKYALRKLFSR 97 Query: 354 NEDEVHR-----MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 H + A K +V++TDG + + K G+ I ++ Sbjct: 98 RAGYHHARVRLFRSSRKGAAKILVMITDGI---SQDRVTTPALRLKKMGVVIFSVGV--- 151 Query: 409 KTQQEKARYFLSNCASPNSFFEA 431 ++ + + + + P F A Sbjct: 152 -GKRYRLKQLMQIASRPRLVFTA 173 >gi|20306196|gb|AAH28343.1| Chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 37/110 (33%), Gaps = 17/110 (15%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R ++ + G T+I ++TA+ I + Sbjct: 346 GMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSA-GGTSICSGLRTAFTVIKKKYPTD 404 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFS 406 IVLLTDGE+ I+ C + K G I T+A Sbjct: 405 GSE----------IVLLTDGEDN-----TISSCFDLVKQSGAIIHTVALG 439 >gi|313139523|ref|ZP_07801716.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313132033|gb|EFR49650.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 835 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 41/272 (15%), Positives = 77/272 (28%), Gaps = 60/272 (22%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + K V+ +S P D + + S+ + + + Sbjct: 258 YRKYIKANNDGTYDLSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMGHNSRLEVAK 317 Query: 276 DALASVIRSIKKIDN--VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 A+ S+ + + +N + +RM F+ V + +F+ ++ + Sbjct: 318 TAVNSMAQHLLTSENQGKDSNIRMALVPFSTTVGNVSNFTDNAMDIVSAV-----NGLRA 372 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI------ 387 G T + KKYIV ++DG+ T Sbjct: 373 DGGTNWE----------AALKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWW 422 Query: 388 --------------------------------AICNKAKSQGIRIMTIAFSVNKTQQEKA 415 A+ + + ++ S + K Sbjct: 423 GRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAEANRRGDATLFSVGVS---SDPTKM 479 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 R F + S++ A ST ELNK F D IG Sbjct: 480 RGFADQ--TKGSYYSATSTDELNKAFADIIGQ 509 >gi|91783676|ref|YP_558882.1| hypothetical protein Bxe_A2138 [Burkholderia xenovorans LB400] gi|91687630|gb|ABE30830.1| Conserved hypothetical protein containing von Willebrand factor type A domain [Burkholderia xenovorans LB400] Length = 337 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 56/179 (31%), Gaps = 22/179 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + + V+ +A+ + +K D R+G F D Sbjct: 91 AIDLSGSMATRDFVDPAGERMDRLSAVKRVVANFVAK-RKGD------RIGLVVFGDAAY 143 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + + + TAI DA+ + + H Sbjct: 144 PQAPLTLDHDSVRILLDQMQIGMAGPR--TAIGDAIG-----LTVKLMADSHAQ------ 190 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCAS 424 +K ++LLTDG +T AK + + T+ T E L+ AS Sbjct: 191 EKVLILLTDGNDTSSAIPPERAAEIAKQHKLVVHTVGIGDPGTTGEDRVDLEALARIAS 249 >gi|115555|sp|P05099|MATN1_CHICK RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|833607|emb|CAA30915.1| cartilage matrix protein [Gallus gallus] Length = 493 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 65/194 (33%), Gaps = 23/194 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ SS ++ V+ L+ VI + N + R+G + V ++ S Sbjct: 41 VFIIDSSRSVRPQEFEKVKVFLSRVIEGLDVGPN---STRVGVINYASAVKNEFSLK--T 95 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H+ +++ E T A+Q A S E R N K +++TD Sbjct: 96 HQTKAELLQAVQRIEPLSTGTMTGLAIQFAISRAFSDTEGARLRSPN---INKVAIVVTD 152 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G ++ A +A+ GI I I L AS + S Sbjct: 153 GRPQDGVQDVSA---RARQAGIEIFAIGVGRVDMHT------LRQIASEPLDDHVDYVES 203 Query: 434 ---THELNKIFRDR 444 +L F++ Sbjct: 204 YSVIEKLTHKFQEA 217 >gi|332221819|ref|XP_003260062.1| PREDICTED: calcium-activated chloride channel regulator 1 [Nomascus leucogenys] Length = 914 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 52/163 (31%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICRGLRSAFTVIKKKYPTD 403 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 404 GSE----------IVLLTDGEDN-----TISECFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|126330546|ref|XP_001381755.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Monodelphis domestica] Length = 495 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 64/170 (37%), Gaps = 20/170 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + I+ +D +T R+G + V + H+ ++++ E T Sbjct: 63 LSQVIESLDVGPNTTRVGVINYASAVKHEFPLK--AHRSKASLLQAVRKIEPLSTGTMTG 120 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E R+K +K IV +TDG ++ A +AK GI + Sbjct: 121 LAIQFAINKAFSEVEGS--RLKFPEISKVAIV-VTDGRPQDGVKDVSA---RAKQSGIEL 174 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 I ++ L AS + S +L+K F++ Sbjct: 175 FAIGVG------RVDKHTLRQIASEPLDDHVDYVESYSVIEKLSKKFQEA 218 >gi|147898761|ref|NP_001080437.1| collagen alpha-1(VI) chain precursor [Xenopus laevis] gi|82210072|sp|Q801S8|CO6A1_XENLA RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|28703819|gb|AAH47255.1| Col6a1 protein [Xenopus laevis] Length = 1045 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 34/149 (22%), Positives = 62/149 (41%), Gaps = 24/149 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 GA ++D VI S + + L + I + T + A++ + ++ Sbjct: 117 GALHYSDEVILINSLTRDMKTLRDNVETVEYIGK----GTHTDCAIKRGIEEVLIGG--- 169 Query: 358 VHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIA-ICNKAKSQGIRIMTIAFSVNKTQQE 413 ++ + KY++++TDG E ++ G+ N+AK GI++ ++A S N + Sbjct: 170 -----SHQKENKYLIVVTDGHPLEGYKEPCGGLEDAANEAKHLGIKVFSVAISPNHLEPR 224 Query: 414 KARYFLSNCASPNSF---FEANSTHELNK 439 LS AS S F A S L Sbjct: 225 -----LSVIASDASHRRNFTATSAVGLTD 248 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 79/228 (34%), Gaps = 25/228 (10%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 C + Y GP D ++ + + +H K V+ +K + + Sbjct: 835 CPDYTCPITYEGPADITMLVDSSTRVGN-QHFQTSKSFVKLLAERF---LKAKPPPSGSA 890 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ ++ + + + + + + G+T + A++ A + E Sbjct: 891 RVSVVQYSGQNQQIVEAQFLTNYTVLEV--PVDNMQFINGATNVVSALR-AVTELY--RE 945 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 D + + KK +V +DG NTQ+ + + + A+S GI I +A ++ Sbjct: 946 DSLAGVN-----KKLLVF-SDG-NTQEEKGLLKVVQDAQSAGIEIYVLAVG-SRLNYPNL 997 Query: 416 RYFLSNCASP-------NSFFEANSTHELNKIFR-DRIGNEIFERVIR 455 + L+ A+ F L + R I I + + Sbjct: 998 QVMLTGSAADIAGPFPEERLFRVPDYTSLLQGVRYQSISRRIALKSSQ 1045 >gi|307941490|ref|ZP_07656845.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307775098|gb|EFO34304.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 611 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 40/270 (14%), Positives = 88/270 (32%), Gaps = 34/270 (12%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + M++Y + P + + + N + L Y V Sbjct: 176 RAEEMINYFQYDYKVPDSREAPFSTNVSVVETPWNSDTKLLHIGLKGYTV---------- 225 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P P + +D+S K L++ A ++ ++++ D + G Sbjct: 226 -PLDDLPPQNLVFLIDTSGSMSDENKLPLLQQAFRLLLSTLREDDTIAIVTYAGNAG--- 281 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 ++ + S ++ + GSTA + ++ AY + +D+ Sbjct: 282 VLLEPTALS------DKSKIAEAIAALTSGGSTAGHAGLKEAYRLAETMQDDDT------ 329 Query: 365 LEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 K I+L TDG+ ++ + + GI + + F E + N Sbjct: 330 ---KSRIILATDGDFNVGLSSADDMKRFVKEKRDSGITLSVLGFGRGNYNDELMQALAQN 386 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ ++ E K+ D+I + I Sbjct: 387 GNGVAAY--IDTLSEARKVLVDQIVSSIST 414 >gi|118387578|ref|XP_001026893.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89308660|gb|EAS06648.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 1074 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 30/143 (20%), Positives = 53/143 (37%), Gaps = 10/143 (6%) Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 + S+QN + P M+S N + P +S S+ +K Sbjct: 324 INIQSTQNYIQLFEQSQQIPVMISLNTKGNFDAKAYQRPPIDLICVMDNSGSMHG--EKI 381 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +++++ L +I + + D R+G FN V P S ++ +K + D Sbjct: 382 NMLKETLLYLIDQLDEKD------RLGLVLFNSEVTFRPMKSMDTTNKLK--LKQYISDI 433 Query: 332 NEMGSTAINDAMQTAYDTIISSN 354 G T IN M A+ I + Sbjct: 434 RAQGGTDINLGMTEAFKFIKTRK 456 >gi|77552603|gb|ABA95400.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 574 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 18/124 (14%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 +H+ + L++ A+ +I+ ++ D R+ FN V+S+ + + + Sbjct: 125 KHMSSRLDLLKIAMKYIIKLVRDAD------RLAIVSFNHAVVSEYGLTRNSADSRKKLE 178 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN-------NLEAKK----YIVL 373 G+T A++ A + + N N E KK +I+L Sbjct: 179 NL-VDKLKASGNTDFRPALKKAVEDMNIQNIKNSSAYNNFQILDGRGKEEKKKRVGFILL 237 Query: 374 LTDG 377 L+DG Sbjct: 238 LSDG 241 >gi|83951473|ref|ZP_00960205.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] gi|83836479|gb|EAP75776.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] Length = 550 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 50/376 (13%), Positives = 112/376 (29%), Gaps = 57/376 (15%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 + + +M+ GG+ VD++R+ ++ +A++ ++ S Sbjct: 1 MALVFFLIMIAAGGIAVDMMRYEMKRAQIQSTLDSAVLASA-----GAPYGSDHRAIIED 55 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + + N TD + + V +N +++ D+ ++ Sbjct: 56 YFR---------------VANMTDYLAAEKEGEIVVTVNSAS------VTANADMTMD-- 92 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 + ++ GIK + S + K + + V+D S SM + + Sbjct: 93 TYLMKLSGIK------ELRTTGGSTAVRKVPKLEVVLVLDVSGSMGSNSKLVNLKKAAKE 146 Query: 205 -------GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 G TV S + V V K + + Sbjct: 147 FVTSLLNGSEPGNTVISIVPFSWSVSPSVATFEALAVD-RKHEFSTCIRFKANDHSHASL 205 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 +S ++ AL ++ +D + + + + FS Sbjct: 206 ATGNSGFSSGQPLDQMIYTALYGNFDEFSGSESSSDY----RSCYANDYMEILPFSVSET 261 Query: 318 KLIRTIVKTFAIDENEM------GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK--- 368 +L I A G+ ++ + + D +I++ E + Sbjct: 262 ELHAKIDSLQASGNTSGNQGMIWGAALLDPSFRQITDDLIAAGEVASSQAAIPSNYGTAE 321 Query: 369 --KYIVLLTDGENTQD 382 K V++ DG+NT Sbjct: 322 TLKVAVVMGDGQNTTS 337 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 37/81 (45%), Gaps = 5/81 (6%) Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + ++ + A C K++G+ + +I F +++ + L NCAS + + + Sbjct: 475 DGSEKDTRMKASCTATKNEGVVVFSIGFEIDQGGTAE--QVLKNCASSENHYFRAEGINI 532 Query: 438 NKIFRDRIGNEIFERVIRITK 458 N F I + + +R+T+ Sbjct: 533 NDAF-SAIASNVVN--LRLTQ 550 >gi|22760136|dbj|BAC11081.1| unnamed protein product [Homo sapiens] Length = 488 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 54/165 (32%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ ++ + R+G ++ +V S + + T Sbjct: 5 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRDLVPLAQGTMTG 62 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A++ GI I Sbjct: 63 LAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARASGIEI 116 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 117 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 154 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 269 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 323 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 324 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 377 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 378 RAKEEGIVMYAVGVGKAVEAE------LREIAS 404 >gi|301768026|ref|XP_002919432.1| PREDICTED: calcium-activated chloride channel regulator 1-like [Ailuropoda melanoleuca] Length = 913 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 59/163 (36%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ ++ T ++ G T+I +++A+ I Sbjct: 345 GMVTFDSAAHVQSELV-QINGATERDALTKSLPTVASGGTSICSGLRSAFAVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + + IVLLTDGE+ I+ C N+ K G I T+A + ++ Sbjct: 397 --RKKFSTDGSE-IVLLTDGEDN-----TISSCFNEVKQSGAVIHTVALGPSAAKELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS +A + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QAQN-NGLIDAFGALSSGNGAASQRA 486 >gi|134093103|gb|ABO52963.1| matrilin 4 isoform 1 precursor [Lemur catta] Length = 583 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 60/189 (31%), Gaps = 21/189 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + VR L ++ + N R+G ++ +V S G Sbjct: 38 VFVIDSSRSVRPFEFETVRQFLVGLLHGLNVGPNA---TRVGVIQYSSQVQSVFPL--GA 92 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 93 FSRPEDMERAIRALVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 149 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G + + + +A+++GI I + Q L AS F S Sbjct: 150 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASHPLDEHVFLVES 200 Query: 434 THELNKIFR 442 +L + F Sbjct: 201 F-DLIQEFG 208 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 63/200 (31%), Gaps = 22/200 (11%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD-----SSSLRHVIKKKHLVRDALASVIRS 284 + C L P + E VD S + LV+ + ++ Sbjct: 317 EGFSYRCLCPEGRQLQPDGKSCNRCREGHVDLVLLVDGSKSVRPQNFELVKRFVNQIVDF 376 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + R+G F+ RV ++ G + + + E T A++ Sbjct: 377 LDVSPDG---TRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 431 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + S + R N + +V TDG + ++ +AK +GI + + Sbjct: 432 HMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAARAKEEGIVMYAVG 485 Query: 405 FSVNKTQQEKARYFLSNCAS 424 ++ L AS Sbjct: 486 VGKAVEEE------LREIAS 499 >gi|219804724|ref|NP_001137337.1| collagen alpha-1(VI) chain [Bos taurus] gi|296490826|gb|DAA32939.1| collagen, type VI, alpha 1 [Bos taurus] Length = 1027 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 45/223 (20%), Positives = 87/223 (39%), Gaps = 40/223 (17%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ND R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLNDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIRGLTRMPSG--RDELKSSVDAVKYFGKGTYTDCAIKKGLEELLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY+V++TDG E ++ G+ N+AK GI++ ++A + + + LS Sbjct: 149 ENKYLVVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEA------NSTHELNKIFRDRIGNEIFERVIRI 456 A+ +++ F A E+ D I + I V ++ Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEVISQTIDTITDMIKNNVEQV 246 >gi|224282379|ref|ZP_03645701.1| hypothetical protein BbifN4_00972 [Bifidobacterium bifidum NCIMB 41171] Length = 1153 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 41/272 (15%), Positives = 77/272 (28%), Gaps = 60/272 (22%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + K V+ +S P D + + S+ + + + Sbjct: 576 YRKYIKANNDGTYDLSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMGHNSRLEVAK 635 Query: 276 DALASVIRSIKKIDN--VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 A+ S+ + + +N + +RM F+ V + +F+ ++ + Sbjct: 636 TAVNSMAQHLLTSENQGKDSNIRMALVPFSTTVGNVSNFTDNAMDIVSAV-----NGLRA 690 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI------ 387 G T + KKYIV ++DG+ T Sbjct: 691 DGGTNWE----------AALKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWW 740 Query: 388 --------------------------------AICNKAKSQGIRIMTIAFSVNKTQQEKA 415 A+ + + ++ S + K Sbjct: 741 GRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAEANRRGDATLFSVGVS---SDPTKM 797 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 R F + S++ A ST ELNK F D IG Sbjct: 798 RGFADQ--TKGSYYSATSTDELNKAFADIIGQ 827 >gi|307254358|ref|ZP_07536196.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307258816|ref|ZP_07540548.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306862657|gb|EFM94613.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306867167|gb|EFM99023.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 538 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 46/397 (11%), Positives = 123/397 (30%), Gaps = 26/397 (6%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 72 LEEVSSRAKNSFTF----------PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ + S ++ + + + L + D+ + T Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKTVNN 129 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG-IKSWLIQTKAEAETVSRSYHK-EHGVSI 179 + + ++ + S F +G ++ Q +++ + + + + Sbjct: 130 TSGKGHTSSSEVTCTVSGTIKHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIPIDL 189 Query: 180 QWVIDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE-KLSPYMVSCN 237 V D S SM D + + N +K + ++ +E Sbjct: 190 MVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITAFAM 249 Query: 238 KSLYYMLYPGPLD-PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + L + H + S ++ + H V + +D Sbjct: 250 GAEHPKENKCVLPFVLQNNLHEMSKSKIKQYLTSSHKSLRRTEFVDNFVALLDTEATLNS 309 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTI--------VKTFAIDENEMGSTAINDAMQTAYD 348 +G ++ G+ K + + G T + + TA + Sbjct: 310 IGKPNYDIIFPKSSICLEGLKKASQFWYTKEEKEKFRNRVDSLKANGGTLASSGLLTASN 369 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 ++S N E K+ I++L+DG + N Sbjct: 370 QMLSEKSRSEEL---NQETKRVILVLSDGNDDMSNLN 403 >gi|323493925|ref|ZP_08099042.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] gi|323311866|gb|EGA65013.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] Length = 427 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 57/375 (15%), Positives = 122/375 (32%), Gaps = 51/375 (13%) Query: 23 FIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNS 82 I+ L+P+M+ + + + + + +AA+ A + LI S +E Sbjct: 3 GILFIGLLPIMVILMAFSMQMSQQMLAHARVLEAAEVASLA----LIASPKESEDENVKY 58 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV-------VLSS 135 DR V D + D VE+ RK Y+ ++ Sbjct: 59 AR--------------------QLVDRYVVDNINDVDVEVYTRKCEYKDGCVQESGEVAP 98 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAE--AETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 D +++ + + + ++ + E ++V+R Y + V + ++ DFS SM + Sbjct: 99 FSDFVVSAKAEHKSWIAYEKVDLKPEFEVAGKSVTRKYLPQ-PVDVYFIGDFSGSM-NGH 156 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPGPLDP 251 L+ Q +R V+ + N + R L +P V ++Y Y Sbjct: 157 WKGGKTKLDVVKQTIERVVEDIENFNTEEKSRVALLGYNPLHVKQTGTVYLNSYGYRRSW 216 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID--NVNDTVRMGATFFNDRVISD 309 + + + K + V ++ + +V + V F + D Sbjct: 217 PKKVAYDYARGTTAQTVAKMFDPPSVYSRVQEYVRGMSRHDVKNLVVNNDRFVDYYKFYD 276 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + G T+ + + A + N+ ++ Sbjct: 277 IPL---TEDYTNFKSRLANASLGAEGGTSSWNGIIAAAQEANRA---------TNINPEQ 324 Query: 370 YIVLLTDGENTQDNE 384 ++L+DG + N Sbjct: 325 VFIVLSDGADNDTNY 339 >gi|332844134|ref|XP_510503.3| PREDICTED: integrin alpha-11 [Pan troglodytes] Length = 1188 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|332236004|ref|XP_003267196.1| PREDICTED: integrin alpha-11 [Nomascus leucogenys] Length = 1188 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|297296749|ref|XP_001083531.2| PREDICTED: integrin alpha-11 [Macaca mulatta] Length = 1149 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|296213559|ref|XP_002753319.1| PREDICTED: integrin alpha-11 [Callithrix jacchus] Length = 1188 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|281352224|gb|EFB27808.1| hypothetical protein PANDA_008060 [Ailuropoda melanoleuca] Length = 911 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 59/163 (36%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ ++ T ++ G T+I +++A+ I Sbjct: 345 GMVTFDSAAHVQSELV-QINGATERDALTKSLPTVASGGTSICSGLRSAFAVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + + IVLLTDGE+ I+ C N+ K G I T+A + ++ Sbjct: 397 --RKKFSTDGSE-IVLLTDGEDN-----TISSCFNEVKQSGAVIHTVALGPSAAKELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS +A + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QAQN-NGLIDAFGALSSGNGAASQRA 486 >gi|194388296|dbj|BAG65532.1| unnamed protein product [Homo sapiens] Length = 650 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|193788254|dbj|BAG53148.1| unnamed protein product [Homo sapiens] Length = 437 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|158258322|dbj|BAF85134.1| unnamed protein product [Homo sapiens] Length = 1188 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|149692293|ref|XP_001495918.1| PREDICTED: integrin, alpha 11 [Equus caballus] Length = 1188 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|119598226|gb|EAW77820.1| integrin, alpha 11, isoform CRA_a [Homo sapiens] Length = 1189 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|5915662|gb|AAD51919.2|AF137378_1 integrin alpha 11 subunit precursor [Homo sapiens] gi|119598227|gb|EAW77821.1| integrin, alpha 11, isoform CRA_b [Homo sapiens] Length = 1188 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|6013141|gb|AAF01258.1|AF109681_1 integrin alpha-11 subunit precursor [Homo sapiens] Length = 1189 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|52485853|ref|NP_001004439.1| integrin alpha-11 precursor [Homo sapiens] gi|313104119|sp|Q9UKX5|ITA11_HUMAN RecName: Full=Integrin alpha-11; Flags: Precursor gi|189442879|gb|AAI67840.1| Integrin, alpha 11 [synthetic construct] Length = 1188 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ ++ +V A + G T A + Sbjct: 199 IQVGVVQYGEDVVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|224048789|ref|XP_002188138.1| PREDICTED: matrilin 3 [Taeniopygia guttata] Length = 284 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 61/214 (28%), Gaps = 28/214 (13%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 +Y PLD + K + I +D T R Sbjct: 20 AGHVYAACKNQPLDLVFIVDSSRSVRPEEFEKVKI--------FLSEMIDTLDVGERTTR 71 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + + V + K + + T A+QTA + + + Sbjct: 72 VAVMNYASTVKVEFPLRTYFDKASMKEAVSRIEPLSA--GTMTGLAIQTAMEEVFTEEMG 129 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 N + ++++TDG + ++ + A++ GI I T+ Q Sbjct: 130 TRPAAFN---IPRVVIVVTDG---RPQDQVQDVAASARTAGIEIYTVGVGRADMQA---- 179 Query: 417 YFLSNCAS---PNSFFEANS---THELNKIFRDR 444 L AS F + +L FR+ Sbjct: 180 --LRIMASEPLDEHVFYVETYGVIEKLTSRFRET 211 >gi|115447343|ref|NP_001047451.1| Os02g0619600 [Oryza sativa Japonica Group] gi|47847560|dbj|BAD21612.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|47847788|dbj|BAD21564.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113536982|dbj|BAF09365.1| Os02g0619600 [Oryza sativa Japonica Group] gi|215701433|dbj|BAG92857.1| unnamed protein product [Oryza sativa Japonica Group] gi|222623257|gb|EEE57389.1| hypothetical protein OsJ_07557 [Oryza sativa Japonica Group] Length = 709 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 45/316 (14%), Positives = 92/316 (29%), Gaps = 32/316 (10%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + S + + ++ + + + D + + + P + Sbjct: 169 VPSHHREASGSHTSEPVDFNDDEPLQLIESGDSRDARCSRAVEIKTYPEFSAIPQSSSED 228 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + K + + S+ Y P+D + S K Sbjct: 229 DFAVLIHLKAPCANPEQVTGRPFNATSIGYPTSRAPVDLVTVLDV-----SGSMAGTKLA 283 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 L++ A+ VI+ + D R+ F+ H + ++ Sbjct: 284 LLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQA-VNLLG 336 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIAICN 391 G T I DA++ A I +N I+LL+DG++T + + Sbjct: 337 AGGGTNIADALKKAAKVIED---------RNYKNPVCSIILLSDGQDTYNISSNVRGTRP 387 Query: 392 KAKS---QGI---RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +S I I T+ + L + A S +F + F Sbjct: 388 DYRSLVPSSILNHTICTVPVHGFGFGADHDSDALHSIAESSGGTFSFIEDESVIQDAFAQ 447 Query: 444 RIGN--EIFERVIRIT 457 IG + + +R+T Sbjct: 448 CIGGLLSVVVQDMRLT 463 >gi|317483048|ref|ZP_07942050.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915549|gb|EFV36969.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 813 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/271 (13%), Positives = 65/271 (23%), Gaps = 69/271 (25%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + + + + I V+D S SM D Sbjct: 104 LNVTGAKSAGTGAIVTNQPLDIVLVLDVSGSMAD-------------------------- 137 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 N G + V+ D S + LV+ Sbjct: 138 -NLSGGPKKIDALKTAVNGFIDATADENAKITDQSQRN--------------RIALVK-- 180 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 +K ND R G + +N + ++ V L T+ + G+T Sbjct: 181 ----FAGTEKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLTSTV-----NGLSASGAT 230 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-------EEGIAIC 390 + + A A + KK ++ TDGE + + Sbjct: 231 SADYAFNRAQAALTYQPRANA---------KKVVIFFTDGEPNHGSGFDPTVAATAVNKA 281 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 K G I +I + + Sbjct: 282 KSLKDAGTTIYSIGVVSGANPGDTSSNLNKY 312 >gi|114682165|ref|XP_001154021.1| PREDICTED: matrilin 4 isoform 4 [Pan troglodytes] gi|114682167|ref|XP_001154256.1| PREDICTED: matrilin 4 isoform 7 [Pan troglodytes] gi|114682175|ref|XP_514674.2| PREDICTED: matrilin 4 isoform 9 [Pan troglodytes] gi|114682177|ref|XP_001154315.1| PREDICTED: matrilin-4 isoform 8 [Pan troglodytes] Length = 581 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 54/165 (32%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + ++ + R+G ++ +V S + + T Sbjct: 57 LMGLLPGLNVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRDLVPLAQGTMTG 114 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 115 LAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGIEI 168 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 169 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 362 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 416 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 417 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 470 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 471 RAKEEGIVVYAVGVGKAVEAE------LREIAS 497 >gi|114682171|ref|XP_001153893.1| PREDICTED: matrilin 4 isoform 2 [Pan troglodytes] gi|114682181|ref|XP_001153957.1| PREDICTED: matrilin-4 isoform 3 [Pan troglodytes] Length = 499 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 54/165 (32%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + ++ + R+G ++ +V S + + T Sbjct: 57 LMGLLPGLNVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRDLVPLAQGTMTG 114 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 115 LAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGIEI 168 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 169 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 280 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 334 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 335 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 388 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 389 RAKEEGIVVYAVGVGKAVEAE------LREIAS 415 >gi|114682173|ref|XP_001153832.1| PREDICTED: matrilin 4 isoform 1 [Pan troglodytes] Length = 488 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 54/165 (32%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + ++ + R+G ++ +V S + + T Sbjct: 5 LMGLLPGLNVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRDLVPLAQGTMTG 62 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 63 LAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGIEI 116 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 117 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 154 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 269 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 323 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 324 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 377 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 378 RAKEEGIVVYAVGVGKAVEAE------LREIAS 404 >gi|114682169|ref|XP_001154082.1| PREDICTED: matrilin 4 isoform 5 [Pan troglodytes] gi|114682179|ref|XP_001154207.1| PREDICTED: matrilin-4 isoform 6 [Pan troglodytes] Length = 540 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 54/165 (32%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + ++ + R+G ++ +V S + + T Sbjct: 57 LMGLLPGLNVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRDLVPLAQGTMTG 114 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + V++TDG + + + +A+++GI I Sbjct: 115 LAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGIEI 168 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 169 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 321 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 375 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 376 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 429 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 430 RAKEEGIVVYAVGVGKAVEAE------LREIAS 456 >gi|310286822|ref|YP_003938080.1| von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|309250758|gb|ADO52506.1| conserved hypothetical protein containing von Willebrand factor type A domain [Bifidobacterium bifidum S17] Length = 1156 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/272 (14%), Positives = 76/272 (27%), Gaps = 60/272 (22%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + K V+ +S P D + + S+ + + + Sbjct: 579 YRKYIKANNDGTYDLSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMGHNSRLEVAK 638 Query: 276 DALASVIRSIKKIDN--VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 A+ S+ + + +N + +RM F+ + +F+ ++ + Sbjct: 639 TAVNSMAQHLLTSENQGKDSNIRMALVPFSTTAGNVSNFTDNAMDIVSAV-----NGLGA 693 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI------ 387 G T + KKYIV ++DG+ T Sbjct: 694 DGGTNWE----------AALKAANAKLTSGRKGVKKYIVFMSDGDPTYRTSSVRTGTDWL 743 Query: 388 --------------------------------AICNKAKSQGIRIMTIAFSVNKTQQEKA 415 A+ + + ++ S + K Sbjct: 744 GRPIYDADDGWGLPAGVHGSGLSDRYGANLSSAVAEANRRGDATLFSVGVS---SDPTKM 800 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 R F + S++ A ST ELNK F D IG Sbjct: 801 RGFADQ--TKGSYYSATSTDELNKAFADIIGQ 830 >gi|282896313|ref|ZP_06304335.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281198809|gb|EFA73688.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 336 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 58/176 (32%), Gaps = 28/176 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A ++ ++ D R+ FN R + + R +K Sbjct: 59 VENVKRAAWLLVDKLRDQD------RLSIVVFNHRAEV---LLSNQNVVDRDHIKQQINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-IAI 389 + G T+I++ ++ + + D + + LLTDGEN + + Sbjct: 110 LSANGGTSIDEGLRLGIEELAKGRRDTISQAF----------LLTDGENEHGDNNRCLKF 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQ------QEKARYFLSNCASPNSFFEANSTHELNK 439 A + + T+ F N Q + LS+ P+ + + L Sbjct: 160 AQLAADYNLTVNTLGFGNNWNQHILEKISDAGLGSLSHIEHPDQ--AVDKFNSLLM 213 >gi|90411204|ref|ZP_01219217.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] gi|90328050|gb|EAS44371.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] Length = 436 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 56/174 (32%), Gaps = 14/174 (8%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ G I L M V++G G + +DV + L+ +A ++A+ + ++ Sbjct: 12 RAQKGVVAIFATLAMVVLIGAGALALDVGNLVLSKGKLQNIVDSAALSAAKAIDLGGDQA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA------- 128 + + + +N ++ + T +P Sbjct: 72 EAIVAGNEAINNNLTLDGFGSMTIDNTDIHYEFSDSLPFDSSTNTATSPYVRVRIEDVDV 131 Query: 129 -------YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 + + +S+R + P S + + I +E+ T Y + Sbjct: 132 ADYLVAIFNIDMSARSSAVAGPSSSITTTCNVVPLSICEGSESSTTLSGYSEGS 185 >gi|327274978|ref|XP_003222251.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2025 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 52/145 (35%), Gaps = 18/145 (12%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+D S + K I++ +T A++ + + Sbjct: 672 FSDTSKEVFSLNKNTSK-KSDIIQAVEDMSLIGSTTNTGGALRFVSKYFKLAKQARP--- 727 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLS 420 K +VL+TDGE ++E A + ++ GI I ++ F+ NKTQ E+ Sbjct: 728 ----SVNKILVLITDGE---ASDEVTAPATELRNDGIIIYSVGVFNANKTQLEEISG--- 777 Query: 421 NCASPNSFFEANSTHELNKIFRDRI 445 P F + L I D I Sbjct: 778 ---KPEKVFYVENFDILEDIKGDII 799 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 50/128 (39%), Gaps = 10/128 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 VI + K D N+ V+ GA ++ P + ++ I+ D +T Sbjct: 839 DFVIDIVNKSDVGNNRVQFGAVKYSAY----PQILFNLNGNKADIIDKIKGDTLLNDTTY 894 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A++ A + S R + +V +TDG + D ++ A+ + ++ GI Sbjct: 895 TAEALRHAENLFTESKGSRKRRG----VPQLLMV-ITDG-TSHDKDKLDAVSTRIRNDGI 948 Query: 399 RIMTIAFS 406 I I Sbjct: 949 TIYAIGIK 956 >gi|196250158|ref|ZP_03148852.1| von Willebrand factor type A [Geobacillus sp. G11MC16] gi|196210342|gb|EDY05107.1| von Willebrand factor type A [Geobacillus sp. G11MC16] Length = 668 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 74/243 (30%), Gaps = 28/243 (11%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQ-PLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + + V+D R + + Q + P + + + E Sbjct: 109 SAPGTTATYEIVVDAYRVLGNGQEEVYFSFPQPPYEYTRQTETSTAKLDFSLSFSQPEYA 168 Query: 230 SPYMVSCNKSLYYML-----YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 P L L P P+ P + +D S + K + AL + + Sbjct: 169 KPPDGDAQGRLDVTLIPQGGVPAPVRPPIDVVFVMDVSGSMTTM-KLQSAKSALQAAVNY 227 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF---AIDENEMGSTAIND 341 K + ND R F+D V + +G + + + G T + Sbjct: 228 FKTNYHPND--RFALIPFSDDVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSA 285 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE----NTQDNEEGIAICNKAKSQG 397 A+ A N+ E KKYI+ LTDG NT + I K G Sbjct: 286 ALSLA------------QSYFNDPERKKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDG 333 Query: 398 IRI 400 +I Sbjct: 334 EKI 336 >gi|296159241|ref|ZP_06842067.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295890500|gb|EFG70292.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 345 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 55/179 (30%), Gaps = 22/179 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S S + + V+ +A + +K D R+G F D Sbjct: 99 AIDLSGSMATRDFVDPAGERMDRLSAVKRVVADFVAK-RKGD------RIGLVVFGDAAY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + + + TAI DA+ + + H Sbjct: 152 PQAPLTLDHDSVRILLDQMQIGMAGPR--TAIGDAIG-----LTVKLMADSHAQ------ 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCAS 424 +K ++LLTDG +T AK + + TI T E L+ AS Sbjct: 199 EKVLILLTDGNDTSSAIPPERAAEIAKQHKLVVHTIGIGDPGTTGEDRVDLEALARIAS 257 >gi|218961689|ref|YP_001741464.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] gi|167730346|emb|CAO81258.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] Length = 331 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 63/190 (33%), Gaps = 36/190 (18%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + S + +K R+G F + ++ +R ++ + E+ T Sbjct: 118 IGSFLEQVKTD-------RIGIIAFAGTATLQCPLT-DDYEAVRIVLNGLNSNTVEIPGT 169 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 I A++ A + ++ K +VL++DGE+ Q + + K++G Sbjct: 170 DIGSALRLAENAFPEGSK------------SKTLVLISDGEDLQHSA--LREARILKTKG 215 Query: 398 IRIMTIAFSVNKT----QQEKARYF--------LSNCA--SPNSFFEANSTHELNKIFRD 443 IR+ T+ + E L A + ++ E ++ Sbjct: 216 IRVYTMGVGSPEGTIIRHPETGEEVKSKLDEATLQEIARITEGEYYRVTPGGEEIQLILK 275 Query: 444 RIGNEIFERV 453 RI R Sbjct: 276 RIYESESTRR 285 >gi|47205231|emb|CAG06181.1| unnamed protein product [Tetraodon nigroviridis] Length = 427 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 68/193 (35%), Gaps = 19/193 (9%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ SS + ++ L ++ S++ D+ R+G + V + Sbjct: 56 VFIVDSSRSVRPTEFEKAKEFLQDLVDSLEV---GLDSTRVGLVNYASTVRMEFPLK--- 109 Query: 317 HKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +K + T A++TA + ++ E N+ + + V++T Sbjct: 110 AHFSKPALKGALARVEPLASGTMTGLAIRTAVEKAFAA---EAGARLNSTKVARVAVVVT 166 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS-- 433 DG + +E + A+ GI I + + +++ + F + Sbjct: 167 DG---RPQDEVERVSAAARESGIEIYAVGV---DRADRTSLRLMASQPHEDHVFYVETYG 220 Query: 434 -THELNKIFRDRI 445 +L FR+ + Sbjct: 221 VIEKLTSRFRETL 233 >gi|300783401|ref|YP_003763692.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299792915|gb|ADJ43290.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 535 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 61/451 (13%), Positives = 131/451 (29%), Gaps = 52/451 (11%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 I+++ PV+LG+ +V + W+ + A + + S+ Sbjct: 111 IMSS---PVVLGLATPVVQRLGWA---GKPVSWGEIAAQAGKKAFSYGMTDPSASNSGFS 164 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 N + + ++ + LS Y Sbjct: 165 ALVGVASALAGAGNAIDARQIASVTPQLTQFFSAQTLSAGSSGW-----LSDAYVRRATG 219 Query: 144 LSLFLRSMGIKSWLIQTKAEAE---TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 + +S L+ A + ++ Y + V+ + + R + + Sbjct: 220 PDAVDGLINYESVLLSANASGKLPQPLTLVYPSDGVVTADYPLTLLADAGSDARSAHQRL 279 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 + PA + ++Q + + L P + +L +F Sbjct: 280 ADHLRTPAVQKRIMETTQ-RRPVVPGVALGPQFAQRDLVELPFPATQQAVDALLSAYFDK 338 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKID---NVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + I S++ DT G + Sbjct: 339 LRRPSRTLYVLDTSGSMAGARIDSLRSALVGLTGADTSLTGRFRRFRSREEVTMLPFNTG 398 Query: 318 --------------KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 +KTFA G TAI D++ AY + + R + Sbjct: 399 PGAPRTFTVPEENPAAELAQIKTFAEGLVARGGTAIYDSLSRAYQVLEPLMAADPDRFTS 458 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFL 419 IVL+TDGEN + + + A + + + T+ F + + L Sbjct: 459 -------IVLMTDGENANGSSLPDFLTSLASLPPAMKQVPVFTVLFGEGSSDE------L 505 Query: 420 SNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + A + F+A + +L+++F++ G + Sbjct: 506 TQVATRTGGKVFDARNV-QLSRVFQEIRGYQ 535 >gi|289607418|emb|CBI60804.1| unnamed protein product [Sordaria macrospora] Length = 814 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/173 (21%), Positives = 59/173 (34%), Gaps = 52/173 (30%) Query: 326 TFAIDENEMGST-------------AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ G T + ++A N+ + KKY++ Sbjct: 473 SYLNGLVARGGTYHDIGMIWGARFLSTTGLFKSATPETNDVNDPDNPAKIRGFSVKKYMI 532 Query: 373 LLTDGEN-----------------------TQDNEEGIA--------ICNKAKSQGIRIM 401 +TDG+ T DN +A CN AK++GI I Sbjct: 533 FMTDGDMSPTWSDYSAYGIEYLDGRVMGSPTTDNTALLARHLQRFRMACNAAKAKGIDIW 592 Query: 402 TIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERV 453 IAFS T ++NCAS P +S L F++ IG++I + Sbjct: 593 VIAFSTTLTAD------MTNCASKPEQAAGLSSNAALIAKFKE-IGSKIATVL 638 >gi|118150796|ref|NP_001071310.1| cochlin precursor [Bos taurus] gi|75057908|sp|Q5EA64|COCH_BOVIN RecName: Full=Cochlin; Flags: Precursor gi|59857775|gb|AAX08722.1| coagulation factor C homolog, cochlin precursor [Bos taurus] Length = 550 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 66/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + +++ ++++ ++ + D ++ A F Sbjct: 356 MMCSKTCYNSVNIAFLIDGSSSVGESNFRLMLKF-VSNIAKTFEISDIGA---KIAAVQF 411 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + D Sbjct: 412 T--YDQRTEFSFTDYSTKENVLAVIRNISYMSGGTATGDAISFTVRNVFGPVRDS----- 464 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D A A GI I ++ + + + S Sbjct: 465 ---PNKNFLVIVTDGQSYDDVRGPAA---AAHDAGITIFSVGVAWAPL--DDLKDMASKP 516 Query: 423 ASPNSFF 429 ++FF Sbjct: 517 KESHAFF 523 >gi|114567231|ref|YP_754385.1| chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338166|gb|ABI69014.1| conserved putative chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 951 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 72/230 (31%), Gaps = 38/230 (16%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVD----SSSLRHVIKKKHLVRDAL---ASVIRSIK 286 V +Y L PSL +D S K L ++A S++ + Sbjct: 388 VEKALPVYMDLRGKKEIPSLGLVLVIDKSGSMSEGSGGYSKVELAKEAAIQATSILGPLD 447 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 G F+D F + ++ G T+I A+ A Sbjct: 448 MA---------GVVAFDDTAQWVVEFQ---AVKDKDAIQDDIATIRADGGTSIYPALALA 495 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 Y + ++ H I+LLTDG+ + + + + GI + T+A Sbjct: 496 YTALKDAHTKFKH-----------IILLTDGQ-SATTGDYYFLSRRMARAGITMSTVAV- 542 Query: 407 VNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L A+ ++ ++ + +IF I ++ Sbjct: 543 ----GEGADTLLLEQLAAWGQGRYYFSDEISNIPRIFTKETMKAIKSYLV 588 >gi|156741949|ref|YP_001432078.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233277|gb|ABU58060.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 847 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 63/181 (34%), Gaps = 27/181 (14%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVH 317 S + K + ++A S++ D R+G F+ V+ G Sbjct: 403 SMGPETGLSKFTMAKEAAIMATESLRAED------RIGVLAFDVSTRWVVDFQPV--GTG 454 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + I + G T I +A+QT + H VLLTDG Sbjct: 455 LSLADIQR-RISTLPLGGGTDIYNALQTGLPELARQPGRVRHA-----------VLLTDG 502 Query: 378 EN-TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + T D + A+ +A+S+ I + TIA + + + A ++ A + Sbjct: 503 RSFTDDRQAYQALIEEARSRNITLSTIAIGTDA-DIDLLQTLARWGA--GRYYFAAEPGD 559 Query: 437 L 437 + Sbjct: 560 I 560 >gi|331697176|ref|YP_004333415.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326951865|gb|AEA25562.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 327 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 64/182 (35%), Gaps = 27/182 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G F S + RT VK STA +A+ A Sbjct: 119 ADQLTPGINLGLVSFAGTAAVLVSPT-----TDRTAVKQAVDGLKLSESTATGEAIFAAL 173 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-------QDNEEGIAICNKAKSQGIRI 400 +I S + + IVL++DG+ T D +A ++ I + Sbjct: 174 QSIDSFSRTVAASGTEGPPPAR-IVLMSDGKQTVPGPDGENDPRGSFTAAKQAAAEKIPV 232 Query: 401 MTIAFSVNKT--QQEKAR-------YFLSNCA--SPNSFFEANSTHELNKIF---RDRIG 446 TI+F + E R + A S FF A S +L +++ ++IG Sbjct: 233 STISFGTDYGTIDIEGGRTRVAVDDASMQQIASLSGGQFFTAASESQLRQVYSELGEQIG 292 Query: 447 NE 448 E Sbjct: 293 YE 294 >gi|301792481|ref|XP_002931207.1| PREDICTED: epithelial chloride channel protein-like [Ailuropoda melanoleuca] Length = 904 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 77/256 (30%), Gaps = 26/256 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL-SPYMVSCNKSLYYMLYP 246 S+ + N + S S + D+ + M + + Sbjct: 243 SIDSVTEFCTAKTHNTEAPNLQNKMCSSRSTWDVIMDSDDFQNASPMKGTDPPPHPTFSL 302 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + S S+ + + + A +I+ I+K V G F Sbjct: 303 LKSKQRVVCLVLDKSGSMSSEDRLLRMNQAAELYLIQIIEKGSLV------GMVTFESLA 356 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + T + G T+I ++ + II SN+ Sbjct: 357 TILNYLT-DIIGDNAYEKITANLPREASGGTSICSGLRAGFQAIIHSNQSTSGSE----- 410 Query: 367 AKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 I+LLTDGE ++ I+ C + K G I TIA + + LSN Sbjct: 411 ----IILLTDGE-----DDQISSCFEEVKQSGAVIHTIALGPSA---ARELETLSNMTGG 458 Query: 426 NSFFEANSTHELNKIF 441 F+ + L F Sbjct: 459 YRFYANKDINGLTDAF 474 >gi|301606773|ref|XP_002932992.1| PREDICTED: integrin alpha-11-like [Xenopus (Silurana) tropicalis] Length = 1188 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 58/164 (35%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + V+ + ++ + +V A + G T A+ I Sbjct: 202 IQVGVLQYGETVVHEFYL--NNYRSVTDVV-EAAKRIEQRGGTETRTALG------IEKA 252 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + ++ I +A + Sbjct: 253 VTEAFQRGGRKGAKKVMIVITDGE-SHDSPDLQRVIESSEKDNITRYAVAVLGYYNRRGI 311 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G IF Sbjct: 312 NPEAFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGERIFS 354 >gi|19031199|gb|AAL17973.1| proximal thread matrix protein 1b [Mytilus edulis] Length = 444 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 47/389 (12%), Positives = 115/389 (29%), Gaps = 50/389 (12%) Query: 45 RWSYYEHALKQAAQT---AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENN 101 + ++ +K+ + + + + S K +F K K +E + + ++ Sbjct: 63 NYQMMKNFVKELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPLNKFKTKEDIKKGIQDM 122 Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 + +N E+ ++ + ++ ++ + G K Sbjct: 123 VPRNGGQTEIGTGLQHVRENSFSGAEG---GGNPDKQKIVILMTDGKSNAGAPPQHEAHK 179 Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 +AE V+ ++ ++ +M +Y + Sbjct: 180 LKAEGVTVIAIGIGQGFVKTELEQIATMKNYVLTTNSFSELSTLLKL------------- 226 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + D +V C +++++ + L++D + + Sbjct: 227 --VIDLACEVCVVDCAGHADIAFVFDASSS-------INANNPNNY----GLMKDFMKDI 273 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM--GSTAI 339 + K + F DR + +K G TAI Sbjct: 274 VDRFNKTGPDG--TQFAVVTFADRATKQFGLK---DYSSKAEIKGAIDKVTPSIIGQTAI 328 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 D ++ A + + E +K ++LLTDG+N ++ + + +G+ Sbjct: 329 GDGLENARLEVFPN-----RNGGGREEVQKVVILLTDGQNN-GHKSPEHESSLLRKEGVV 382 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSF 428 I+ I + L N AS + Sbjct: 383 IVAIGV-----GTGFLKSELINIASSEEY 406 >gi|329117975|ref|ZP_08246688.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] gi|327465863|gb|EGF12135.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] Length = 562 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 71/204 (34%), Gaps = 23/204 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S K LV+ L + ++ D V + Sbjct: 194 PPANLVFLVDVSGSMTDPDKLPLVKKTLRILTEQLRPQDKVT------LITYASGEQLVL 247 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + G K TI++ + G+T+ A++ AY+ + +KN + Sbjct: 248 PPTSGKDK--DTILRAL-NALHAGGATSGERALRMAYEQAEKA------YVKNGINR--- 295 Query: 371 IVLLTDGENTQDNEEG---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + ++ + + GI + T+ + + A + Sbjct: 296 IILATDGDFNVGVSDTETLKSLVAEKRKSGISLSTLGYGTGNYNEAMMEQIAD--AGDGN 353 Query: 428 FFEANSTHELNKIFRDRIGNEIFE 451 + +S E K+ R ++ + + Sbjct: 354 YSYIDSEKEARKVLRHQLTSTLAT 377 >gi|242097078|ref|XP_002439029.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] gi|241917252|gb|EER90396.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] Length = 607 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/281 (12%), Positives = 83/281 (29%), Gaps = 35/281 (12%) Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 ++ I S + +A + SR + + D + + G+ + F Sbjct: 12 NLQIPSCPLLIRAASAGRSRKPGTKSSAPNKMFNDDEEPIAPAS--NAGKQVRGFSDVGK 69 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 +VK Y + +G ++ + S + + +L +D S Sbjct: 70 ASVKPYYPKEAPLGASTVRVLLDVSSSSST--------AGRAALDLVVVLDVSGSMRDFG 121 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ A+ +I+ + +D R+ FN + + + ++ Sbjct: 122 RLDKLKSAMRFIIKKLAPMD------RLSVVTFNGGATRECPL-RAMSEDAVPVLTDIVD 174 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I ++ + ++L++DGE + + Sbjct: 175 GLVARGGTNIEAGLKMGLQVLDGRRYTGARTAG--------VILMSDGEQNSGDATRVR- 225 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 Q + T++F N L A + Sbjct: 226 ----NPQNYPVYTLSFGSNAD-----MNLLQKLAGGGGTYN 257 >gi|153010351|ref|YP_001371565.1| hypothetical protein Oant_3028 [Ochrobactrum anthropi ATCC 49188] gi|151562239|gb|ABS15736.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 605 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 82/228 (35%), Gaps = 23/228 (10%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + K G+F +I AL++ +L G + VD + ++ + A + Sbjct: 4 RFAKDERGNFAMIMALVLVPLLLAGMVAVDSANLMRVRNNVQASLDAAALAVGRRFSTGE 63 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + F N T + D A + + + Q+ Sbjct: 64 SQTVVQVYGAQVFT-----------------ANLTALSADAVNFDVAFPKD-KTTDQQIQ 105 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++ + SLF + T+S S ++ + + V+D SRSM + Sbjct: 106 ATAGFTYK----SLFGVIASRLTGD-DWDQNQYTLSSSVRLKNTIEVALVLDNSRSMDET 160 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S + ++ + A + V++ ++Q+ + + + +V S+ Sbjct: 161 RSGSTKKRIDLLKEAASQLVETMAAQSTLITHVENPVQFSLVPFAGSV 208 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 47/126 (37%), Gaps = 8/126 (6%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S R KK+ +RD+L + R + D G ++ + + Sbjct: 353 SPRQSDLKKYYLRDSLDKIYRGGRSND--------GGPNYSCTSSPLTPLTDVTTEQGMK 404 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++T G T + +AM + TI+ R K +++LTDG NT Sbjct: 405 TIQTAIKAMVPSGGTNVPEAMAWGWRTIVRGAPFTEARPSTERGNDKVVIVLTDGANTYY 464 Query: 383 NEEGIA 388 +G+A Sbjct: 465 KYDGLA 470 >gi|126306100|ref|XP_001362237.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 959 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/259 (15%), Positives = 85/259 (32%), Gaps = 32/259 (12%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S+ + N + + S + ++ + ++ K + + Sbjct: 241 SIDSVVEFCTEKNHNTLAPNPQNKMCNSRSTWEVIQNSEDYKNSIPMTEAKPPHPIFSLK 300 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + S S+ + L + + +++ I+K G F+ Sbjct: 301 QIRERVLILVLDKSGSMAGGDRLNRLNQASQLFLLQIIEKGS------WTGMVTFDSSAT 354 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + R + + + G T+I ++TA+ I + + Sbjct: 355 IQSALIQIETDAQRNSLISR-LPTAAGGGTSICSGLRTAFTVIKNKFSTDGSE------- 406 Query: 368 KKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAF--SVNKTQQEKARYF--LSNC 422 IVLLTDGE + I+ C ++ K G I T+A S + +E A+ + Sbjct: 407 ---IVLLTDGE-----DSTISSCFDEVKQSGAIIHTVALGPSADPGLEELAKMTGGMKTS 458 Query: 423 ASPNSFFEANSTHELNKIF 441 A+ N+ + L F Sbjct: 459 ATDNA-----QNNGLIDAF 472 >gi|153012136|ref|YP_001373346.1| cell wall anchor domain-containing protein [Ochrobactrum anthropi ATCC 49188] gi|151564024|gb|ABS17517.1| LPXTG-motif cell wall anchor domain protein [Ochrobactrum anthropi ATCC 49188] Length = 750 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 63/224 (28%), Gaps = 22/224 (9%) Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 D D+ + ADR S + ++ + L Y+ P Sbjct: 287 DTTNDATREITLDGAAAADRDFVLEWSAVASDAPQVGLFREHIGKDDYVLAYVTPPALAS 346 Query: 251 PSL--SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 P E FV +S + +L + ++ D R F+D + Sbjct: 347 PKKVQREVIFVIDNSGSMGGTSIEQAKASLDYALSQLQPGD------RFNVIRFDDTLTK 400 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 S ++ + F G T + A+ A D N Sbjct: 401 FFEDSVDANQENIASARRFVTSLEAQGGTEMLPALHAALDDSNQGNGLRQ---------- 450 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 IV LTDGE NE+ + A+ RI + Sbjct: 451 --IVFLTDGE--ISNEQQLLDAVAARRGRSRIFMVGIGSAPNSY 490 >gi|156523168|ref|NP_001095998.1| matrilin-2 [Bos taurus] gi|146186873|gb|AAI40517.1| MATN2 protein [Bos taurus] gi|296480475|gb|DAA22590.1| matrilin 2 [Bos taurus] Length = 958 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 86/261 (32%), Gaps = 28/261 (10%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-- 254 EG L G+ R S+ +G I + + Y+ C+K Sbjct: 596 EGFRLAEDGKRCRRKDVCKSTHHGCEHICVNRGNSYICKCSKGFILAEDGRRCKRCTEGP 655 Query: 255 -EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + FV S +V+ + +I S+ I R+G ++ V ++ + Sbjct: 656 VDLVFVIDGSKSLGEDNFEIVKQFVTGIIDSL-AISPKA--ARVGLLQYSTLVRTEFTLR 712 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + K A + + A++ ++ + E R + + IV Sbjct: 713 N--FSSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVEGA--RPLSARVPRVAIVF 768 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFE 430 TDG + ++ +KA++ GI + + ++ L AS F Sbjct: 769 -TDG---RAQDDVSEWASKAQASGITMYAVGVGKAIEEE------LQEIASEPTEKHLFY 818 Query: 431 ANSTHELNKIFRDRIGNEIFE 451 A + I +++ + Sbjct: 819 AEDFSTM-----GEISDKLQK 834 >gi|138896202|ref|YP_001126655.1| hypothetical protein GTNG_2565 [Geobacillus thermodenitrificans NG80-2] gi|134267715|gb|ABO67910.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] Length = 668 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 74/243 (30%), Gaps = 28/243 (11%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQ-PLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + + V+D R + + Q + P + + + E Sbjct: 109 SAPGTTATYEIVVDAYRVLGNGQEEVYFSFPQPPYEYTRQTETSTAKLDFSLSFSQPEYA 168 Query: 230 SPYMVSCNKSLYYML-----YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 P L L P P+ P + +D S + K + AL + + Sbjct: 169 KPPDGDAQGRLDVTLIPQGGVPAPVRPPIDVVFVMDVSGSMTTM-KLQSAKSALQAAVNY 227 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF---AIDENEMGSTAIND 341 K + ND R F+D V + +G + + + G T + Sbjct: 228 FKTNYHPND--RFALIPFSDDVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSA 285 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE----NTQDNEEGIAICNKAKSQG 397 A+ A N+ E KKYI+ LTDG NT + I K G Sbjct: 286 ALSLA------------QSYFNDPERKKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDG 333 Query: 398 IRI 400 +I Sbjct: 334 EKI 336 >gi|257058175|ref|YP_003136063.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] gi|256588341|gb|ACU99227.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] Length = 418 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/156 (20%), Positives = 48/156 (30%), Gaps = 25/156 (16%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++A ++ + D R+ FN VK Sbjct: 59 METVKEAANYLVDGLGPDD------RLSVITFNHHAEVILP---NQSVEDLQGVKNKINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAI 389 G T I++ M+ E+ V + I LLTDGEN DNE + + Sbjct: 110 LTASGGTCIDEGMKLGIKEAALGKENRVSQ----------IFLLTDGENEHGDNERCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 A I + T+ F N + L A Sbjct: 160 AKVAAEYNITLNTLGFGSNWN-----QDILEQIADS 190 >gi|301767168|ref|XP_002919014.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Ailuropoda melanoleuca] Length = 849 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 67/216 (31%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 258 NGYFVHYFAPEGLPTIPKNVIFVIDKSGSMSGRKMQQTREALIKILDDLSPKDQFN---- 313 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ +++A G T IN+A+ A + S+ + Sbjct: 314 --LISFSGDAAQWKPLLVPASAENVNQARSYAAGIQAHGGTDINEAVLMAVQLLNSAKQK 371 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 E+ I+LLTDG+ T I K + + F + + Sbjct: 372 ELMPEGTVSL----IILLTDGDPTMGETNPARIQRNVKEAIDGQYSLFCLGFGFDVS--- 424 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N + +++S +L + + Sbjct: 425 --YAFLEKLALDNGGLARRIYEDSDSALQLQDFYEE 458 >gi|313232459|emb|CBY24127.1| unnamed protein product [Oikopleura dioica] Length = 1632 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 67/180 (37%), Gaps = 23/180 (12%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S SL K ++ + ++ I + V++G T F++ + + + + Sbjct: 1409 SGSLTSAPNKDQVLMNFTNNLANMYDTI----NQVKIGLTSFSESSVLEMPLDFYNQLEL 1464 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + V + T I ++TA + + + ++L+TDG + Sbjct: 1465 QDGVS---NMTWQGSFTNITSGVETALNDMD-----------TSDAVDDVMILITDGFQS 1510 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + ++AK+ G+R++ + F + Y + N + A + EL I Sbjct: 1511 TNTTLMFQMIDQAKADGVRLIALGFFGDFAFYSPNLYLM-----TNEVYHAANYAELLAI 1565 >gi|157823041|ref|NP_001101626.1| integrin alpha-11 [Rattus norvegicus] gi|149041917|gb|EDL95758.1| integrin, alpha 11 (predicted) [Rattus norvegicus] Length = 1171 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 82/276 (29%), Gaps = 30/276 (10%) Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM--LYPGPLDPS 252 ++ G T + + N SP S YY + Sbjct: 63 ETNGHQKTGDVYKCPVTQGNCTKLNLGRVTLSNACSPLWSHECGSSYYTTGMCSRVNSNF 122 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALAS----------VIRSIKKIDNVNDTVRMGATFF 302 + + +V D S +I + K +++G + Sbjct: 123 RFSKTVAPALQRCQTYMDIVIVLDGSNSIYPWVEVQHFLIEILTKFYIGPGQIQVGIVQY 182 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + +K ++ +V A + G T A + E + Sbjct: 183 GEDAVHEFHL--NDYKSVKDVV-EAASHIEQRGGTETRTAFGIEFAR------SEAFQKG 233 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ----QEKARYF 418 AKK ++++TDGE + D+ + + +++ + +A + E Sbjct: 234 GRKGAKKVMIVITDGE-SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNE 292 Query: 419 LSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 + AS FF L I D +G+ IF Sbjct: 293 IKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 327 >gi|78484443|ref|YP_390368.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362729|gb|ABB40694.1| Hypothetical protein; predicted integral membrane protein with a von Willebrand factor type A domain [Thiomicrospira crunogena XCL-2] Length = 363 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 47/160 (29%), Gaps = 23/160 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R G + D + + R ++K A +A+ A S + Sbjct: 161 RFGFILYADDAYTLMPLTSD-ATTARLMLKRLKPYLAGRTDEATGEALGLALQQAEKSTD 219 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +VL++DG N A+ I I TI N +K Sbjct: 220 STE---------NRIVVLISDGSTRDSRLPIAEAINYAQGLNIPIYTIGVGANSKDADKR 270 Query: 416 R-----------YFLSNCA--SPNSFFEANSTHELNKIFR 442 L A + +++ S +L K+ + Sbjct: 271 EFRGLLYEALESSSLKQIADQTQGRYYQIGSGQDLQKVLQ 310 >gi|254787807|ref|YP_003075236.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237686979|gb|ACR14243.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 689 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 61/179 (34%), Gaps = 21/179 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P + +D S K LV+ ++ ++ ++ D V+ V GA Sbjct: 320 PLAHPPKANLVFLLDVSGSMGSPDKLPLVKQSMELLLSGLQPTDTVSIVVYAGAAG---T 376 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V+ + + I+ N GSTA ++ AY ++ + + Sbjct: 377 VLEPTPVA-----EQQKILAAL-DRLNAGGSTAGAQGIELAYQLAEANYQRDAVNR---- 426 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 I+L TDG+ D E+ + ++ GI + + F + N Sbjct: 427 -----IILATDGDFNVGIADPEQLKGYVERKRANGIELSILGFGSGNYNDALMQQLAQN 480 >gi|218245149|ref|YP_002370520.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] gi|218165627|gb|ACK64364.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] Length = 418 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/156 (20%), Positives = 48/156 (30%), Gaps = 25/156 (16%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++A ++ + D R+ FN VK Sbjct: 59 METVKEAANYLVDGLGPDD------RLSVITFNHHAEVILP---NQSVEDLQGVKNKINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAI 389 G T I++ M+ E+ V + I LLTDGEN DNE + + Sbjct: 110 LTASGGTCIDEGMKLGIKEAALGKENRVSQ----------IFLLTDGENEHGDNERCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 A I + T+ F N + L A Sbjct: 160 AKVAAEYNITLNTLGFGSNWN-----QDILEQIADS 190 >gi|57524519|ref|NP_001004007.1| matrilin 3a [Danio rerio] gi|51330145|gb|AAH80220.1| Matrilin 3a [Danio rerio] gi|123233072|emb|CAM15633.1| novel protein similar to vertebrate matrilin 3 (MATN3) (zgc:101120) [Danio rerio] gi|158254363|gb|AAI54374.1| Matn3a protein [Danio rerio] Length = 337 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 71/221 (32%), Gaps = 27/221 (12%) Query: 235 SCNKSLYYMLYPGPLDPSLSEE----HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 N + L P D F+ SS + V+ LA ++ ++ Sbjct: 40 RNNGLPHRTLNPAATDSQCRSRPLDLVFIIDSSRSVRPGEFEKVKIFLADMVDTLDVGP- 98 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDT 349 D R+ + V + L + +K + T A++ A D Sbjct: 99 --DATRVAVVNYASTVKIEFLLK---SHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDE 153 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + KN K +++TDG + ++ + A++ GI I + Sbjct: 154 AFTEKSGARPKSKN---ISKVAIIVTDG---RPQDQVEEVSAAARASGIEIYAVGV---- 203 Query: 410 TQQEKARYFLSNCASP--NSFFEANS---THELNKIFRDRI 445 + R ++P + F + +L FR+ + Sbjct: 204 -DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETL 243 >gi|221128149|ref|XP_002161198.1| PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial [Hydra magnipapillata] Length = 464 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 60/191 (31%), Gaps = 34/191 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K LV+ L V+ + + D R+ F+ V D + + + + Sbjct: 67 KLALVKKTLEFVVSQLNEKD------RLCLITFDTSVYLDFKLT-PMTPMNKYQTLKIIK 119 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D + T + + +I D KN + + ++L TDG + I Sbjct: 120 DISPGSMTNLCGGLMKGLCEVI----DRADEEKNEVAS---VLLFTDGFANKGGLTNI-Y 171 Query: 390 CNKA------------KSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTH 435 C+ + K+ I T F N Q L A ++ + Sbjct: 172 CSSSQTAKYTIGIVGPKTADASIYTFGFGSNHNAQ-----MLKEISDAGSGMYYYIENVD 226 Query: 436 ELNKIFRDRIG 446 + + F +G Sbjct: 227 MIAEAFGQCLG 237 >gi|45361321|ref|NP_989238.1| matrilin 2 [Xenopus (Silurana) tropicalis] gi|39645939|gb|AAH63920.1| matrilin 2 [Xenopus (Silurana) tropicalis] Length = 839 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 60/171 (35%), Gaps = 20/171 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I +K +D DT R+G + V ++ S +K I + + T Sbjct: 75 LITMLKFLDIGPDTTRVGLLQYGSTVKNEFSLK--TYKKKMDIERAVKRMMHLATGTMTG 132 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S +E N + +++TDG + + I KA++ GI I Sbjct: 133 LAIQYAMNIAFSESEGARPL---NQHVPRIAMIVTDG---RPQDPVAEIAAKARNSGILI 186 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L S F + L +F++++ Sbjct: 187 FAVGVGRVDMST------LKTIGSEPHTEHVFLVANFSQIETLTTVFQNKL 231 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 58/168 (34%), Gaps = 17/168 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV S +V+ + ++ S+ +I V G ++ + ++ + + Sbjct: 573 VFVIDGSKSLGENNFEIVKQFVNGILDSL-EISQKAAHV--GLIQYSTHIRTEFTMA--Q 627 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + K + + + A++ ++ S + R + IV TD Sbjct: 628 YSSAKDVKKAVSQIKYMGRGSMTGLALKLMHEKSFSEVQGARPRAMG--VPRVAIVF-TD 684 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 G + +E KAK GI I I ++ L AS Sbjct: 685 G---RAQDEVSEYAKKAKQSGITIYAIGVGKAIDEE------LQEIAS 723 >gi|1706571|sp|P54281|ECLC_BOVIN RecName: Full=Epithelial chloride channel protein; AltName: Full=Calcium-activated chloride channel gi|1184066|gb|AAC48511.1| calcium-activated chloride channel [Bos taurus] Length = 903 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 51/145 (35%), Gaps = 19/145 (13%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ + + + T + + G T+I ++ + II S + Sbjct: 347 GMVTFDSVAEIRNNLT-KITDDNVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQST 405 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 I+LLTDGE+ + I C + K G+ I TIA + K Sbjct: 406 SGSE---------IILLTDGEDNE-----IHSCIEEVKQSGVIIHTIALGPSA---AKEL 448 Query: 417 YFLSNCASPNSFFEANSTHELNKIF 441 LS+ + F+ + L F Sbjct: 449 ETLSDMTGGHRFYANKDINGLTNAF 473 >gi|312110043|ref|YP_003988359.1| hypothetical protein GY4MC1_0934 [Geobacillus sp. Y4.1MC1] gi|311215144|gb|ADP73748.1| Ig domain protein group 2 domain protein [Geobacillus sp. Y4.1MC1] Length = 932 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/132 (27%), Positives = 47/132 (35%), Gaps = 18/132 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P F S S+ K + AL S + K N ND R F+D V SD Sbjct: 78 PPIDVVFVFDVSGSMVMPSLKLDSAKYALQSAVDYFKANANPND--RFALVPFSDGVQSD 135 Query: 310 P--SFSWGVHKLIRT--IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F G + + + + T A G T A+Q A N+ Sbjct: 136 KVVPFPSGTYDVKQHLNWIATVANSLRANGGTNYTQALQQA------------QSFFNDP 183 Query: 366 EAKKYIVLLTDG 377 KKYI+ LTDG Sbjct: 184 ARKKYIIFLTDG 195 >gi|225435355|ref|XP_002285271.1| PREDICTED: hypothetical protein isoform 2 [Vitis vinifera] Length = 670 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 62/199 (31%), Gaps = 17/199 (8%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + S S D +S G +S S N V I + + + Sbjct: 206 ERSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPLTSGRQNSGTNQTN 265 Query: 244 LYPGPLDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + P + V S K L++ A+ VI+S+ D R+ F Sbjct: 266 MQPTSQSCRAPVDLVTVLDVSGSMAGTKLALLKRAMGFVIQSLGPCD------RLSVISF 319 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + ++ G T I + ++ + + K Sbjct: 320 SSTARRLFPLRRMTDTGRQQALQA-VNSLVSNGGTNIAEGLRK------GAKVMLDRKWK 372 Query: 363 NNLEAKKYIVLLTDGENTQ 381 N + + I+LL+DG++T Sbjct: 373 NPVSS---IILLSDGQDTY 388 >gi|225435353|ref|XP_002285265.1| PREDICTED: hypothetical protein isoform 1 [Vitis vinifera] Length = 729 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 62/199 (31%), Gaps = 17/199 (8%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + S S D +S G +S S N V I + + + Sbjct: 206 ERSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPLTSGRQNSGTNQTN 265 Query: 244 LYPGPLDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + P + V S K L++ A+ VI+S+ D R+ F Sbjct: 266 MQPTSQSCRAPVDLVTVLDVSGSMAGTKLALLKRAMGFVIQSLGPCD------RLSVISF 319 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + ++ G T I + ++ + + K Sbjct: 320 SSTARRLFPLRRMTDTGRQQALQA-VNSLVSNGGTNIAEGLRK------GAKVMLDRKWK 372 Query: 363 NNLEAKKYIVLLTDGENTQ 381 N + + I+LL+DG++T Sbjct: 373 NPVSS---IILLSDGQDTY 388 >gi|326532158|dbj|BAK01455.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 674 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 59/196 (30%), Gaps = 43/196 (21%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ + R Sbjct: 277 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSSARRLFHLQR-MSHYGRQQALQAI 329 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A I ED ++ I+LL+DG++T + Sbjct: 330 NSLGAGGGTNIADALKKATKVI----EDRSYKNSVCS-----IILLSDGQDTYN------ 374 Query: 389 ICNKAK------------------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 IC+ + + + I F + + AS +F Sbjct: 375 ICSNVRGGSKDYSSLVPPSILSDTRRMLPIHAFGFGADH-DSDSLHSIAE--ASGGTFSF 431 Query: 431 ANSTHELNKIFRDRIG 446 + F IG Sbjct: 432 IEDEGVMQDAFAQCIG 447 >gi|326513050|dbj|BAK03432.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326519604|dbj|BAK00175.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326532408|dbj|BAK05133.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 700 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 59/196 (30%), Gaps = 43/196 (21%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ + R Sbjct: 277 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSSARRLFHLQR-MSHYGRQQALQAI 329 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A I ED ++ I+LL+DG++T + Sbjct: 330 NSLGAGGGTNIADALKKATKVI----EDRSYKNSVCS-----IILLSDGQDTYN------ 374 Query: 389 ICNKAK------------------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 IC+ + + + I F + + AS +F Sbjct: 375 ICSNVRGGSKDYSSLVPPSILSDTRRMLPIHAFGFGADH-DSDSLHSIAE--ASGGTFSF 431 Query: 431 ANSTHELNKIFRDRIG 446 + F IG Sbjct: 432 IEDEGVMQDAFAQCIG 447 >gi|170079352|ref|YP_001735990.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887021|gb|ACB00735.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7002] Length = 414 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 50/153 (32%), Gaps = 27/153 (17%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 V++A ++ +++ D R+ F+ + + +K Sbjct: 62 TVKEAAIQLVDQLREGD------RLSVIAFDHKAKVIVP---NQDVTDKAHIKAQIDRLE 112 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL-TDGENTQDNEEG-IAIC 390 G T I+D ++ + SS + V + TDGEN + + I Sbjct: 113 AAGGTCIDDGIKLGLQELASSPGKRAAQ-----------VFMLTDGENEHGDNGRCLEIA 161 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 A G+ + ++ F N + L A Sbjct: 162 AVAAEHGVTLNSLGFGENWN-----QDVLEKIA 189 >gi|19033105|gb|AAL83537.1|AF414454_1 proximal thread matrix protein 1 variant a [Mytilus edulis] Length = 441 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 47/389 (12%), Positives = 115/389 (29%), Gaps = 50/389 (12%) Query: 45 RWSYYEHALKQAAQT---AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENN 101 + ++ +K+ + + + + S K +F K K +E + + ++ Sbjct: 60 NYQMMKNFVKELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPLNKFKTKEDIKKGIQDM 119 Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 + +N E+ ++ + ++ ++ + G K Sbjct: 120 VPRNGGQTEIGTGLQHVRENSFSGAEG---GGNPDKQKIVILMTDGKSNAGAPPQHEAHK 176 Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 +AE V+ ++ ++ +M +Y + Sbjct: 177 LKAEGVTVIAIGIGQGFVKTELEQIATMKNYVLTTNSFSELSTLLKL------------- 223 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + D +V C +++++ + L++D + + Sbjct: 224 --VIDLACEVCVVDCAGHADIAFVFDASSS-------INANNPNNY----GLMKDFMKDI 270 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM--GSTAI 339 + K + F DR + +K G TAI Sbjct: 271 VDRFNKTGPDG--TQFAVVTFADRATKQFGLK---DYSSKAEIKGAIDKVTPSIIGQTAI 325 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 D ++ A + + E +K ++LLTDG+N ++ + + +G+ Sbjct: 326 GDGLENARLEVFPN-----RNGGGREEVQKVVILLTDGQNN-GHKSPEHESSLLRKEGVV 379 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSF 428 I+ I + L N AS + Sbjct: 380 IVAIGV-----GTGFLKSELINIASSEEY 403 >gi|73542340|ref|YP_296860.1| hypothetical protein Reut_A2655 [Ralstonia eutropha JMP134] gi|72119753|gb|AAZ62016.1| putative membrane protein [Ralstonia eutropha JMP134] Length = 412 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 79/248 (31%), Gaps = 17/248 (6%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K G I L + V+LG+ G+++D+ + L+ A + + A+ L + Sbjct: 7 KKERGVILPIVGLTLAVLLGMAGLVIDLGAMFVAKTELQSAVDSCALAAAQEL-DGAADA 65 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 +RA ++ N K + + + D+ S + V ++ Sbjct: 66 LTRATSAGLTAGNA-------NKVQYQKASASLIDTDVTFSDSLT--GAFSSTFTPVANA 116 Query: 136 RY----DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 RY L L+ ++ +G + T + + + + + D Sbjct: 117 RYAKCGHLTTGILAYLIQMVGGPTSNAVAAIGVATRT-HAQSTCPIPVGLLPRTGGTAPD 175 Query: 192 YQRDSEGQPLNCF--GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 Y + + A + + +G + K + CN + L Sbjct: 176 YGFAVGDWVTVLYDGTKTAGPGEMGWYNLDGSTNANETKNEMSVGYCNSKVNDTLRTPGA 235 Query: 250 DPSLSEEH 257 ++ ++ Sbjct: 236 KVAVDDQW 243 >gi|319638170|ref|ZP_07992933.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] gi|317400443|gb|EFV81101.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] Length = 530 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 72/204 (35%), Gaps = 23/204 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S K LV+ L + + ++ D V ++ Sbjct: 162 PPANLVFLVDVSGSMDEENKLPLVQKTLRILTQQLRPQDKVT------LITYSSGEELVL 215 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + G K TI+K GST+ A++ AY+ E + + Sbjct: 216 PPTSGSDK--ETILKAI-DKLKAEGSTSGESALRMAYE--------EAQKAFVPNGINR- 263 Query: 371 IVLLTDGENTQDNEEG---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + ++ + + GI + T+ F + ++ A + Sbjct: 264 ILLATDGDFNVGVSDTETLKSMVAEKRKTGISLSTLGFGTDNYNEDMMEQIAD--AGDGN 321 Query: 428 FFEANSTHELNKIFRDRIGNEIFE 451 + ++ E K+ + ++ + + Sbjct: 322 YSYIDNEKEAKKVLQQQLTSTLAT 345 >gi|218531748|ref|YP_002422564.1| hypothetical protein Mchl_3818 [Methylobacterium chloromethanicum CM4] gi|218524051|gb|ACK84636.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 473 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 57/164 (34%), Gaps = 24/164 (14%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ +L + G +I AL ++G+ G +D R++ L+ A ++ Sbjct: 15 RWRQPFARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAG- 73 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 +L+ V S +++ Q I+ D V V + + + Sbjct: 74 ---GNALKLVVSSSESIVGLTTQTIQAEAKAGA---------DAPVSIQVTVASDKTSVE 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 A QV+ F +G+ S I +A A V R Sbjct: 122 ARAEQVI-----------KLTFGAFVGMASIPISARARASVVGR 154 >gi|170767616|ref|ZP_02902069.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] gi|170123950|gb|EDS92881.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] Length = 586 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 63/166 (37%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S + ++ L++ +L +++ +++ DN++ G + R+ Sbjct: 224 PASNLVFLIDTSGSMYSDERLPLIQSSLKLLVKELREQDNISIVTYAG----DSRIALPS 279 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + N GST ++ AY +K + Sbjct: 280 -----TSGNHKDEINAAIDSLNARGSTNGGAGLEMAYQQAAKG------FIKGGVNR--- 325 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T+ + + Sbjct: 326 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGRDNYNEA 371 >gi|254562692|ref|YP_003069787.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] gi|254269970|emb|CAX25948.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] Length = 473 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 57/164 (34%), Gaps = 24/164 (14%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ +L + G +I AL ++G+ G +D R++ L+ A ++ Sbjct: 15 RWRQPFARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAG- 73 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 +L+ V S +++ Q I+ D V V + + + Sbjct: 74 ---GNALKLVVSSSESIVGLTTQTIQAEAKAGA---------DAPVSIQVTVASDKTSVE 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 A QV+ F +G+ S I +A A V R Sbjct: 122 ARAEQVI-----------KLTFGAFVGMASIPISARARASVVGR 154 >gi|156120445|ref|NP_001095368.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] gi|160332333|sp|P56652|ITIH3_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|154425846|gb|AAI51420.1| ITIH3 protein [Bos taurus] gi|296474789|gb|DAA16904.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] Length = 891 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 48/314 (15%), Positives = 91/314 (28%), Gaps = 45/314 (14%) Query: 147 FLRSMGIKSWLIQTK------AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 GI + + + K+ VS + +D RS Sbjct: 188 IFEPQGISTLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------C 236 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 C N + + ++ Y V + P L FV Sbjct: 237 PTCTDSLLKGDFIITYDVNRESPANVQIVNGYFV-------HFFAPQGLPVVPKSVVFVI 289 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF-FNDRVISDPSFSWGVHKL 319 S +K +DAL ++ +K+ D +N + G + D ++ Sbjct: 290 DVSGSMHGRKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLVPATP-------E 342 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTD 376 F +D + G T INDA+ + + E + ++ I++LTD Sbjct: 343 NIQEASKFVMDIQDRGMTNINDALLRGISMLNKARE-------EHTVPERSTSIIIMLTD 395 Query: 377 GENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 G+ I ++ + + F N L N +E + Sbjct: 396 GDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGLARRIYEDSD 455 Query: 434 THELNKIFRDRIGN 447 + + F + + N Sbjct: 456 ANLQLQGFYEEVAN 469 >gi|163852924|ref|YP_001640967.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] gi|163664529|gb|ABY31896.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] Length = 473 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 57/164 (34%), Gaps = 24/164 (14%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ +L + G +I AL ++G+ G +D R++ L+ A ++ Sbjct: 15 RWRQPFARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAG- 73 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 +L+ V S +++ Q I+ D V V + + + Sbjct: 74 ---GNALKLVVSSSESIVGLTTQTIQAEAKAGA---------DAPVSIQVTVASDKTSVE 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 A QV+ F +G+ S I +A A V R Sbjct: 122 ARAEQVI-----------KLTFGAFVGMASIPISARARASVVGR 154 >gi|147834997|emb|CAN61381.1| hypothetical protein VITISV_037547 [Vitis vinifera] Length = 1324 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 62/199 (31%), Gaps = 17/199 (8%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + S S D +S G +S S N V I + + + Sbjct: 206 ERSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPLTSGRQNSGTNQTN 265 Query: 244 LYPGPLDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + P + V S K L++ A+ VI+S+ D R+ F Sbjct: 266 MQPTSQSCRAPVDLVTVLDVSGSMAGTKLALLKRAMGFVIQSLGPCD------RLSVISF 319 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + ++ G T I + ++ + + K Sbjct: 320 SSTARRLFPLRRMTDTGRQQALQA-VNSLISNGGTNIAEGLRK------GAKVMLDRKWK 372 Query: 363 NNLEAKKYIVLLTDGENTQ 381 N + + I+LL+DG++T Sbjct: 373 NPVSS---IILLSDGQDTY 388 >gi|119613593|gb|EAW93187.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_b [Homo sapiens] Length = 523 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 57/178 (32%), Gaps = 22/178 (12%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + P L P FV S K + A+ ++ ++ D Sbjct: 159 IHYFAPRGLPPMEKNVVFVIDVSSSMFGTKMEQTKTAMNVILSDLQAND----------- 207 Query: 301 FFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +FN SD W I+ + K + G T +N A+ A + SN Sbjct: 208 YFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLHCMEADGWTDVNSALLAAASVLNHSN 267 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNK 409 ++ I+ LTDGE T I + + + + ++AF + Sbjct: 268 QEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDA 323 >gi|110331845|gb|ABG67028.1| inter-alpha (globulin) inhibitor H3 [Bos taurus] Length = 889 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 48/314 (15%), Positives = 91/314 (28%), Gaps = 45/314 (14%) Query: 147 FLRSMGIKSWLIQTK------AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 GI + + + K+ VS + +D RS Sbjct: 186 IFEPQGISTLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------C 234 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 C N + + ++ Y V + P L FV Sbjct: 235 PTCTDSLLKGDFIITYDVNRESPANVQIVNGYFV-------HFFAPQGLPVVPKSVVFVI 287 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF-FNDRVISDPSFSWGVHKL 319 S +K +DAL ++ +K+ D +N + G + D ++ Sbjct: 288 DVSGSMHGRKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLVPATP-------E 340 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTD 376 F +D + G T INDA+ + + E + ++ I++LTD Sbjct: 341 NIQEASKFVMDIQDRGMTNINDALLRGISMLNKARE-------EHTVPERSTSIIIMLTD 393 Query: 377 GENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 G+ I ++ + + F N L N +E + Sbjct: 394 GDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGLARRIYEDSD 453 Query: 434 THELNKIFRDRIGN 447 + + F + + N Sbjct: 454 ANLQLQGFYEEVAN 467 >gi|62001368|gb|AAX58386.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 98/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L + A Q + Sbjct: 1357 SNNRVRLANSAGATAYARAQIN-LGHTNQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V+ + +L L + T+ Sbjct: 1416 FGPNATITASIDSRTTKRTKFQTKEGVVMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|327270786|ref|XP_003220169.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 975 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 106/359 (29%), Gaps = 43/359 (11%) Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 F + T V + ++ +Y N + ++ L + + Sbjct: 165 FDEYNNDAPFYTTGVN-QAEATRCSAGITGQYIFRSNTGAARKCKFEYRTQLYEPGCQ-- 221 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + + +S+ + N ++ S + Sbjct: 222 FIPDKKQTASASIMYM-----QSLSSAMQFCNKSNHNIKATNMQNKQCNFRSTWEVIMNS 276 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + S + ++ S + D +D S + ++ A + I Sbjct: 277 SDYASSFPINSPPSAPAISLLQTHD--RVVCLVLDVSGSMTTYDRIARLKQAAELFLLQI 334 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + +G FN +R +K + + + G T I ++ Sbjct: 335 IETGSW-----VGIVTFNSYATRQIGLRQITSDSVRESLKNY-LPTSAGGGTIICSGVRQ 388 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIA 404 + + + IVLLTDGE+ ++ C + +S G I TIA Sbjct: 389 GFQVKQKKYQTSTKGCE--------IVLLTDGEDN-----SVSSCFAEVQSSGSIIHTIA 435 Query: 405 FSVNKTQQ----EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN---EIFERVIRI 456 N ++ L A+ + ++ L F RI + +I ++ I++ Sbjct: 436 LGPNAAKELEMLADMTGGLKFSATDS-----LDSNGLIDAF-SRISSESGDISQQSIQL 488 >gi|238755460|ref|ZP_04616800.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] gi|238706301|gb|EEP98678.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] Length = 465 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 61/461 (13%), Positives = 132/461 (28%), Gaps = 61/461 (13%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +++ G I + +P + + +L D + + L A + + + + + Sbjct: 25 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALALTAE--NNAK 82 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + NF + T + V Sbjct: 83 NDTRNNELISAYI----------NFYLGHRHQLTQY------NNITVNYQQNPDRLYHTQ 126 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSMLDY 192 S+Y + N L S LI S + K+ + + +V DFS SM Sbjct: 127 LSQYHIDANIEQPTL--FPFTSLLIDHDNFIIGGSAAAIKDVPAMDVVFVTDFSGSMEGD 184 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + + R + KS P Sbjct: 185 FHNPDDPEVLSKLDELKRIFFKIADDIYTANKDSTISFSPFSWGTKSADNKKCSLHFMPK 244 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALA-----SVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + S+ + H + +A + +I+ I N+ + + +D + Sbjct: 245 EKNKIYPIPSNEIERNTEAHQEKYMIAITENIDYLATIENIGTNNEKIVIPLDHVHDELC 304 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S ++ + NE GST ++ + + + + Sbjct: 305 LYSSNAYPISLAKDIDDLNVIKTMNEGGSTLVSSGIMQG-----------ADLLMDGVNH 353 Query: 368 KKYIVLLTDGE---------------------NTQDNEEGIA--ICNKAKSQGIRIMTIA 404 K +++L+DG N + + + +C K + RI+ I Sbjct: 354 NKLMIILSDGHDYPTTAVVHDKTITSAKEVRVNVDISRQLVQHGMCKKIRETVGRIVFIG 413 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 N + + +C +F+ A +T EL R + Sbjct: 414 IGYNPSANHYINW-AEDCVDTENFYLAMNTKELEDSIRSAL 453 >gi|284036687|ref|YP_003386617.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283815980|gb|ADB37818.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 316 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 60/199 (30%), Gaps = 24/199 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 PG L + S+ + + L ++ D+ G T F Sbjct: 86 KPGGYSAMLLLDQTGSISTTDPYNLRIEASKIFLNNL-----GTDD-----YTGLTSFTS 135 Query: 305 RVISDPSFSWG-VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S G +K + + N G T + S+ + + + Sbjct: 136 SYTSVVKLHSGFTNKTEQMKKSLDTLALNVSGGTPLY----------TSTIQSVTYTAQK 185 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 A K +++ TDGEN KA Q I + T+ S + A+ Sbjct: 186 GPTANKAVIVFTDGENNVTTNTLEDATAKAIQQKIPLFTVGLSTDVNVNVLAQMANE--- 242 Query: 424 SPNSFFEANSTHELNKIFR 442 + +FF A +L F Sbjct: 243 TGGAFFYAKDAGQLISTFG 261 >gi|218191186|gb|EEC73613.1| hypothetical protein OsI_08104 [Oryza sativa Indica Group] Length = 709 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 45/316 (14%), Positives = 92/316 (29%), Gaps = 32/316 (10%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + S + + ++ + + + D + + + P + Sbjct: 169 VPSHHREASGSHTSEPVDFNDDEPLQLIESGDSRDARCSRVVEIKTYPEFSAIPQSSSED 228 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + K + + S+ Y P+D + S K Sbjct: 229 DFAVLIHLKAPCANPEQVTGRPFNATSIGYPTSRAPVDLVTVLDV-----SGSMAGTKLA 283 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 L++ A+ VI+ + D R+ F+ H + ++ Sbjct: 284 LLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQA-VNLLG 336 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIAICN 391 G T I DA++ A I +N I+LL+DG++T + + Sbjct: 337 AGGGTNIADALKKAAKVIED---------RNYKNPVCSIILLSDGQDTYNISSNVRGTRP 387 Query: 392 KAKS---QGI---RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +S I I T+ + L + A S +F + F Sbjct: 388 DYRSLVPSSILNHTICTVPVHGFGFGADHDSDALHSIAESSGGTFSFIEDESVIQDAFAQ 447 Query: 444 RIGN--EIFERVIRIT 457 IG + + +R+T Sbjct: 448 CIGGLLSVVVQDMRLT 463 >gi|194374787|dbj|BAG62508.1| unnamed protein product [Homo sapiens] Length = 677 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 55/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I Sbjct: 108 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSAFTVI------- 159 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 160 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 211 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 212 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 249 >gi|168998772|ref|YP_001688040.1| TerY1 [Klebsiella pneumoniae NTUH-K2044] gi|238549793|dbj|BAH66144.1| tellurite resistance protein [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] Length = 212 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 57/172 (33%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 21 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTARQAVPLT--------DLLSFNLPS 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+TA+ +A+ + I + + K + + L+TDG T D +G+ Sbjct: 73 FSASGTTALGEALSLTANRIDAEVQKTTAETKGDWRP--LVFLMTDGGPTDDWRKGVNEF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 AK G+ + + L + + + F+ Sbjct: 131 KAAKK-GVVV------ACAAGHDADTAVLKEITEIVLQLDTADSSSIKAFFK 175 >gi|126334038|ref|XP_001370553.1| PREDICTED: similar to Integrin alpha-X precursor (Leukocyte adhesion glycoprotein p150,95 alpha chain) (Leukocyte adhesion receptor p150,95) (Leu M5) (CD11c antigen) [Monodelphis domestica] Length = 1224 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/282 (12%), Positives = 78/282 (27%), Gaps = 27/282 (9%) Query: 175 HGVSIQW---VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + +D S + Q L C + + Sbjct: 133 EPIPMHIPPEAVDMSLGLSLTASTKPSQILACGPTIHQACGVNTYMKG--------FCFV 184 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + + Y ++ F+ S +++ + ++I +K Sbjct: 185 LNSNLQQRQKYPAALQKCPKQDTDIAFLIDGSGSISSSDFQKMKNFVKAMISQFEK--PS 242 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 M N ++ H R + + G T ++ + + Sbjct: 243 TQFSLMQFAS-NFKIHFTFEKFKNSHDPRRLVDEI----TQLSGVTKTASGIKKVINELF 297 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 A K ++++TDGE D E + A+ GI I + Sbjct: 298 QKTRGAR------QYATKILIVITDGEKYDDPLEYSQVIPTAEKAGIIRYAIGVGEAFER 351 Query: 412 QEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFE 451 + + P + F ++ L+ I ++++ +IF Sbjct: 352 PSSRQELEEIASEPSKDHIFWVDNFGALSNI-QNQLKEKIFA 392 >gi|38639545|ref|NP_943314.1| TerY [Klebsiella pneumoniae] gi|38016643|gb|AAR07664.1| TerY [Klebsiella pneumoniae] Length = 212 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 57/172 (33%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 21 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTARQAVPLT--------DLLSFNLPS 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+TA+ +A+ + I + + K + + L+TDG T D +G+ Sbjct: 73 FSASGTTALGEALSLTANRIDAEVQKTTAETKGDWRP--LVFLMTDGGPTDDWRKGLNEF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 AK G+ + + L + + + F+ Sbjct: 131 KAAKK-GVVV------ACAAGHDADTAVLKEITEIVLQLDTADSSSIKAFFK 175 >gi|73990549|ref|XP_542778.2| PREDICTED: similar to alpha 3 type VI collagen isoform 4 precursor [Canis familiaris] Length = 1320 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 65/217 (29%), Gaps = 11/217 (5%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + +L + + K S + C+ Sbjct: 157 ALKRAGVLLYTVGVKDAVLAELKEIASSPAEKFTSFVPNFPDLGSHAQKLRQQVCDTLAK 216 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGA 299 + P+ E D L + + I +D +D VR+G Sbjct: 217 AAQPVDSVFPACREAALADIVFLVDSSTSIGPQNFQKVKNFLYSVILGLDISSDQVRVGL 276 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +ND + + L +++ G T A++ + + E Sbjct: 277 AQYNDNIYPAFQL--NQYPLKSVVLEQIQNLPYRTGDTNTGSALE--FIRMHYLTEAAGS 332 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 R K+++ + ++L+TDGE N+E NK K Sbjct: 333 RAKDSVP--QIVILVTDGE---SNDEVQEAANKLKED 364 >gi|327270778|ref|XP_003220165.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 925 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 61/188 (32%), Gaps = 30/188 (15%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S + ++ A + I N+ D +G FN+ V Sbjct: 308 LDVSGSMSGNNRIARLKQAAETFILQ-----NIEDGSWVGIVTFNNAATIQTGLQQVVSD 362 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 +R + + + + G T I +Q + S IVLLTDGE Sbjct: 363 TVRKTLNGY-LPISANGGTNICAGVQKGFQVFSSKYASTEGCE---------IVLLTDGE 412 Query: 379 NTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ----EKARYFLSNCASPNSFFEANS 433 + G++ C + + G I TIA + ++ L A+ + Sbjct: 413 -----DSGLSSCFAEVQRSGSVIHTIALGPSAAKELEMLADMTGGLKFSATDS-----VD 462 Query: 434 THELNKIF 441 + L F Sbjct: 463 SSSLEDAF 470 >gi|62001364|gb|AAX58384.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 98/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L + A Q + Sbjct: 1357 SNNRVRLANSAGATAYARAQIN-LGHTNQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V+ + +L L + T+ Sbjct: 1416 FGPNATITASIDSRTTKRTKFQTKEGVVMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|298491708|ref|YP_003721885.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233626|gb|ADI64762.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 418 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 52/159 (32%), Gaps = 25/159 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ A++ ++ + D R+ F+ R R +K Sbjct: 57 RALETVKKAVSLLVDQLSSED------RLSIVVFDHRAKILVP---NQIISDRNQIKQQI 107 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-I 387 G TAI++ ++ + + +D + + LLTDGEN + + Sbjct: 108 NRLTADGGTAIDEGLRLGIEELAKGKKDTISQAF----------LLTDGENEHGDNNRCL 157 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 A S + + T+ F N + L A Sbjct: 158 KFAQLAASYNLTLNTLGFGDNWN-----QDILEKIADAG 191 >gi|6465945|gb|AAF12731.1|AF108501_1 Ca(2+)-sensitive chloride channel 2 [Mus musculus] Length = 902 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 74/224 (33%), Gaps = 35/224 (15%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 CN+ + + D S + R + V+ +D + + Sbjct: 267 CNRRSTWDVIKASADFQNSPPMRGTEAPPPPTFSLLKSRRRVVCLVLDKSGSMDKEDRLI 326 Query: 296 RM--GATFFNDRVISDPSF----SWGVH-KLIRTIVK----------TFAIDENEMGSTA 338 RM A + +++ S ++ + ++K T + + G T+ Sbjct: 327 RMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQATGGTS 386 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQG 397 I +Q + I SS++ IVLLTDGE+ GI+ C +A G Sbjct: 387 ICHGLQAGFQAITSSDQSTSGSE---------IVLLTDGEDN-----GISSCFEAVSRSG 432 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I TIA + + LS+ F+ L F Sbjct: 433 AIIHTIALGPSA---ARELETLSDMTGGLRFYANKHVSSLIDAF 473 >gi|313207256|ref|YP_004046433.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446572|gb|ADQ82927.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023480|gb|EFT36486.1| BatB [Riemerella anatipestifer RA-YM] gi|325335297|gb|ADZ11571.1| von Willebrand factor type A [Riemerella anatipestifer RA-GD] Length = 335 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 51/141 (36%), Gaps = 18/141 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A VI S++K+ N R+G F S + + V Sbjct: 110 DRLSLAKNIVISSMQKMTND----RVGLAVFAGEAFSVMPLTTDYL-AAESFVSGLETSV 164 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T AMQ A + ++ IVL++DGE+ + NE A Sbjct: 165 VSTQGTDFYKAMQVAVSKFKAVSKGSGR-----------IVLISDGEDNEGNEA--AAIK 211 Query: 392 KAKSQGIRIMTIAFSVNKTQQ 412 +A+S GI+++T+ + Sbjct: 212 EAQSNGIQVITVGVGTEEGAP 232 >gi|319761860|ref|YP_004125797.1| von willebrand factor type a [Alicycliphilus denitrificans BC] gi|330826288|ref|YP_004389591.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] gi|317116421|gb|ADU98909.1| von Willebrand factor type A [Alicycliphilus denitrificans BC] gi|329311660|gb|AEB86075.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] Length = 348 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 63/212 (29%), Gaps = 51/212 (24%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + I+ + + VR+G F + L++ I Sbjct: 107 RLTAAQNAAKAFIQDLPR------HVRVGVVAFAGTAQLAQLPTQSHEDLLKAIDSFQLQ 160 Query: 330 DENEMGSTAINDAMQTAYDTIISSNE-----------------------DEVHRMKNNLE 366 TA + + A T+ D + + Sbjct: 161 R-----GTATGNGIMMALATLFPDAGIDIAALGGRQSMRVRPIDEVGRADPAKKPFTPVA 215 Query: 367 AKKY----IVLLTDGENTQDNEEGIAICNKAKSQGIRIMT----------IAFSVNKTQQ 412 Y I++LTDG+ T + + A +G+R+ T I F + Sbjct: 216 PGSYRSAAIIMLTDGQRTTG-VDPLEAAQWAADRGVRVYTVGVGTVQGELIGFEGWSMRV 274 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 L A + +F A + +L K++ Sbjct: 275 RLDEDTLKAVALRTNAEYFHAATAQDLRKVYE 306 >gi|149031330|gb|EDL86328.1| rCG38899 [Rattus norvegicus] Length = 1029 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 67/239 (28%), Gaps = 29/239 (12%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC-------NKS 239 + Y R + +S S +G + M Sbjct: 172 GDLPSYTRIQREETCVRITFSPTLQEQSAFSNSGIMADFTVHYDVVMQDIIGDVQVYGGY 231 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + P L P FV S K + A+ ++ ++ D Sbjct: 232 FIHYFAPRGLQPVEKNVVFVIDVSGSMFGTKLQQTKKAMDKILSDLQTSD---------- 281 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISS 353 FN SD W I+ V K + G T IN A+ A + S Sbjct: 282 -SFNIITFSDTVNIWKAEGSIQATVQNIHNAKNYVSRMEANGWTDINAALLAAASVLNHS 340 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNK 409 N++ I+ LTDGE T I + + + + ++AF + Sbjct: 341 NQEPGKGRGVGQIP--LIMFLTDGEPTAGETTPSVILSNVRQALAHRVSLFSLAFGDDA 397 >gi|301618735|ref|XP_002938765.1| PREDICTED: hypothetical protein LOC100488728 [Xenopus (Silurana) tropicalis] Length = 672 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 51/137 (37%), Gaps = 18/137 (13%) Query: 275 RDALASVIRS---IKKIDNVNDTVRMGATFFNDRVISDPSF-SW-GVHKLIRTIVKTFAI 329 +D + + +K +M F+ V + SF W GV R + I Sbjct: 76 KDFVLNFTDQISHLKLAKPWKTKTKMAIIQFSSSVRIEQSFNEWTGVENFKRIVNSMTYI 135 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + T A+ A + + H+ N+ K +L+TDG + + + Sbjct: 136 GQ----GTYTYYAIMNATNIF------KAHKSAGNV---KVAILMTDGIDHPKSPDARQA 182 Query: 390 CNKAKSQGIRIMTIAFS 406 + A++ GI ++I S Sbjct: 183 SDFARAAGINFISIGLS 199 >gi|326669364|ref|XP_695742.5| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Danio rerio] Length = 3651 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 38/134 (28%), Gaps = 22/134 (16%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFN------DRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + ++ + R+ F+ R + K Sbjct: 100 VRKMLSDFPVAP---EATRVALVTFSSKSHVVTRADYVSAPK-AHQHKCSLFSKEIPSIT 155 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A Q A + S E+ K I L+TDG + + +A Sbjct: 156 YRGGGTYTRGAFQRAAQILRQSRENA----------TKVIFLITDGYSNGGDPRPVAAA- 204 Query: 392 KAKSQGIRIMTIAF 405 + +G+ I T+ Sbjct: 205 -LRERGVEIFTLGI 217 >gi|62001346|gb|AAX58375.1| AvrE [Pseudomonas viridiflava] gi|62001350|gb|AAX58377.1| AvrE [Pseudomonas viridiflava] gi|62001352|gb|AAX58378.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 98/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L + A Q + Sbjct: 1357 SNNRVRLANSAGATAYARAQIN-LGHTNQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V+ + +L L + T+ Sbjct: 1416 FGPNATITASIDSRTTKRTKFQTKEGVVMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|158891|gb|AAA29076.1| em100 gene is homologous the Eimeria tenella gene et100 (accession number M73495) encoding the microneme protein Etp100 [Eimeria maxima] Length = 724 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 57/150 (38%), Gaps = 10/150 (6%) Query: 278 LASVIRSIKKIDNVN-DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + S I + ++ D VR+G F ++ S + + GS Sbjct: 66 VRSFISNFAGTMPLSPDDVRVGLVTFGTSAVTRWDLSDSRAQNADLLAAAAKKLPYAAGS 125 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + + A + + S + K I+++TDG +++ ++ ++ K +++ Sbjct: 126 TYTHLGLAKAEEILFSFQKGGRDNAP------KMILVMTDGASSRRSQ-TLSAAEKLRNR 178 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 G+ I+ + E ++ C + + Sbjct: 179 GVIIVVLGVGTGVNSAECRS--IAGCDTSD 206 >gi|11498366|ref|NP_069594.1| hypothetical protein AF0760 [Archaeoglobus fulgidus DSM 4304] gi|2649856|gb|AAB90485.1| predicted coding region AF_0760 [Archaeoglobus fulgidus DSM 4304] Length = 959 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 73/249 (29%), Gaps = 57/249 (22%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSS---SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 S +++ P P + S + V+K+ + A + + + D V Sbjct: 576 SDFFVPNPTPGTYKIIIVPLTKESIVIEPKVVMKRMDAAKLAAITFNNMLGEGDFV---- 631 Query: 296 RMGATFFNDRVIS----DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 G F + +R V +T DA+ Sbjct: 632 --GLATFTTYAERISVNQTPLKYMTKDKLR--VNNEIEGLYAKLATDHADALYWGVKVFP 687 Query: 352 ---SSNEDEVHRMKNNLEAKKYIVLLTDGENT-----------------------QDNEE 385 ++ NN ++LLTDGE T ++ Sbjct: 688 IWPDETQNNCTECINNTRP--LMILLTDGETTTCDKNEDYFNNTCKNKCVRDNGHYGAQQ 745 Query: 386 GIAICNKAKSQ------GIRIMTIAFSVNKTQQEKARYFLSNCASP------NSFFEANS 433 + + + K I I TI F + + FL + ASP +F A + Sbjct: 746 ALCVADYIKRNIKVNGFNIPICTIGFGTDIGS--DGQAFLRDIASPRPDNGEACYFFATT 803 Query: 434 THELNKIFR 442 + EL + ++ Sbjct: 804 SEELIEAYK 812 >gi|295106190|emb|CBL03733.1| hypothetical protein [Gordonibacter pamelaeae 7-10-1-b] Length = 929 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 55/366 (15%), Positives = 103/366 (28%), Gaps = 86/366 (23%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYS 216 I +A A + I V+D S SM D + L+ + + Sbjct: 142 ITLEAYASGSMSIVEQGIPCDIVLVLDQSGSMDDRFGSQGSYHALSGYSNKRLGDLAENG 201 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE----------HFVDSSSLRH 266 + + G VS SL Y L+ L+ E S Sbjct: 202 NLYVRGGDGSYVAVDVDVSGFISLKYSYTWEGLEAPLTSEGRYTVPQFEGVTFYSFQTDQ 261 Query: 267 VIKKKHLVRDALASVIRSIK--------------------------KIDNVNDTVRMGAT 300 + + ++DA S ++S++ DN V + Sbjct: 262 TVTRIAALKDAANSFVQSVRSNSLGEDGIAGTVDDVPHRIAVIGFASGDNTELFVGSASY 321 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAI--DENEMGSTAINDAMQTAYDTIISSNEDEV 358 + S ++ A + G T ++D + A + + S+N Sbjct: 322 PYGQSAQSQYGSAFQDMTTEMGFDNALASIGQLSADGGTLVDDGLDMA-NGVFSANP--- 377 Query: 359 HRMKNNLEAKKYIVLLTDGE------NTQDNEEGIAICNKAKSQGIRIMTIAFS------ 406 + + V+LTDG + E I+ ++ K+ G + +I Sbjct: 378 --LVTGELRNRVTVVLTDGAPGLYGNDRGVANEAISQASELKTAGSTVFSIGIFPGADAS 435 Query: 407 -----------------------------VNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + AR+ +SP+ + A+ + L Sbjct: 436 GDLPDQSAMGWGDNDSNRFMHLLSSNYPDASSMGSPGARFVDEEGSSPDYYLSASDSAGL 495 Query: 438 NKIFRD 443 N IF+ Sbjct: 496 NSIFQS 501 >gi|171921010|gb|ACB59193.1| TadG [Actinobacillus suis ATCC 33415] Length = 554 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 68/538 (12%), Positives = 153/538 (28%), Gaps = 104/538 (19%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+ I+ +G + +I LL ++ + + ++ + L + + A+++ S Sbjct: 10 KRFIQDESGVYAVIGGLLALPIVALMFVSLESAGIIQDKARLSDSLEQAVLSLSAENNSG 69 Query: 72 LEE-----VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR----DTAVEM 122 + ++ A+N P KI E + ++ + + I T + Sbjct: 70 RKSNDYKLSNTDAENGHFNPNSKIGERDLEISKSFVTTYLPQTDPNKIKLQPVCTTTDKK 129 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH---GVSI 179 N + + S F +G + +I T+ + S++ K + + Sbjct: 130 NRQGHTASTETICTVAGTIEHKSWFPLKVG-STEVIPTEVNIASNSKAIKKNTISIPIDL 188 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV----------KSYSSQNGKVGIRDEKL 229 D S SM + + V S+ N ++ + L Sbjct: 189 MVAADLSGSMRYDLENRYEPKDGTSKIDILKAVLTELSSNSLFSQESNDNNRIAVSPFAL 248 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD------ALASVIR 283 + +L + L + ++ + ++ I K +L + ++ Sbjct: 249 GAEYSTTECTLPFALKNNNRTINYTKSLGIPTTENVQDIIKNYLTKSGSSNSQLSRAIFT 308 Query: 284 S--IKKIDNVNDTVRMGAT----------FFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + +ID N +G+ + + W + TF Sbjct: 309 QSLVTQIDVTNTLSSIGSLDKVGLKFPKNAYCLGDKNRNQHQWFTREEQDKF-STFVNSL 367 Query: 332 NEMGSTAINDAMQTAYDT--------------------IISSNEDEVHRMKNNLEAKKYI 371 +GST + A D ++S DE+ + Y Sbjct: 368 EAIGSTFAGSGLLAAADKMLKETSRTQKLGEETKRVLLVLSDGNDELRADDTGVPFTNYS 427 Query: 372 VLLTD--------------------------------------GENTQDNEEGIAICNKA 393 L D G + + +CN Sbjct: 428 RLTEDLILGYQEEIFTSPSEQKSFHDITYYGRRIYSGKSDIILGNRRTPLSKDLQMCNII 487 Query: 394 KSQGIRIM----TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + ++ T V + K+ +C +++ A L F+ IG+ Sbjct: 488 RDKLNKLNDDKNTSIVFVEFGYKSKSADAWKHCVGDGNYYSAKDKESLLNSFKQAIGH 545 >gi|62001320|gb|AAX58362.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/364 (10%), Positives = 98/364 (26%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ LN L + Sbjct: 1414 FGPNATITASIDSRTTKRTKFQTKEGVAMTAPELTKLN-KQLQSAFKDKPTQDKLKGLAD 1472 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 K IQ +D ++ + + + + T K ++ + + Sbjct: 1473 AKNPDHADKTPKEKIQAHLDGLNTLFEGRSANNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + V G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKVDQGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|62001416|gb|AAX58410.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/364 (10%), Positives = 98/364 (26%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ LN L + Sbjct: 1414 FGPNATITASIDSRTTKRTKFQTKEGVAMTAPELTKLN-KQLQSAFKDKPTQDKLKGLAD 1472 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 K IQ +D ++ + + + + T K ++ + + Sbjct: 1473 AKNPDHADKTPKEKIQAHLDGLNTLFEGRSANNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + V G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKVDQGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|283850951|ref|ZP_06368236.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283573597|gb|EFC21572.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 330 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 18/147 (12%), Positives = 42/147 (28%), Gaps = 15/147 (10%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R + S L + + TA+ D + A + + Sbjct: 130 RIGLVAFGSRAYAVMPPSADRAALTGALARLAVGA--AGKRTAMGDGLGLAVKRLSDAPG 187 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + V+ DG + A +G+ + ++ ++ Sbjct: 188 -----------LSRLAVVFGDGRSNAGEVSPEDAAKAASERGVTVYSVGVGGDEPAPFLV 236 Query: 416 RY--FLSNCASPNSFFEANSTHELNKI 440 + S + + +A + + K Sbjct: 237 THPLLGSQIVTEKAAVDATTLAAMAKA 263 >gi|62001398|gb|AAX58401.1| AvrE [Pseudomonas viridiflava] gi|62001400|gb|AAX58402.1| AvrE [Pseudomonas viridiflava] gi|62001406|gb|AAX58405.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/364 (10%), Positives = 98/364 (26%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ LN L + Sbjct: 1414 FGPNATITASIDSRTTKRTKFQTKEGVAMTAPELTKLN-KQLQSAFKDKPTQDKLKGLAD 1472 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 K IQ +D ++ + + + + T K ++ + + Sbjct: 1473 AKNPDHADKTPKEKIQAHLDGLNTLFEGRSANNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + V G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKVDQGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|62001454|gb|AAX58429.1| AvrE [Pseudomonas viridiflava] gi|62001462|gb|AAX58433.1| AvrE [Pseudomonas viridiflava] gi|62001464|gb|AAX58434.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/364 (10%), Positives = 98/364 (26%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ LN L + Sbjct: 1414 FGPNATITASIDSRTTKRTKFQTKEGVAMTAPELTKLN-KQLQSAFKDKPTQDKLKGLAD 1472 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 K IQ +D ++ + + + + T K ++ + + Sbjct: 1473 AKNPDHADKTPKEKIQAHLDGLNTLFEGRSANNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + V G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKVDQGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|134093165|gb|ABO53025.1| matrilin 4 isoform 1 precursor, 5 prime [Chlorocebus aethiops] Length = 214 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 53/165 (32%), Gaps = 18/165 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ + + R+G ++ +V S + + T Sbjct: 57 LVGLLRGLKVGANATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRDLVPLAQGTMTG 114 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q + S E + V++TDG + + + +A+++GI I Sbjct: 115 LAIQYVMNVAFSVAEGAR---PPEERVPRVAVIVTDG---RPQDRVAEVAAQARARGIEI 168 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + Q L ASP F S +L + F Sbjct: 169 YAVGV------QRADVGSLRAMASPPLDEHVFLVESF-DLIQEFG 206 >gi|320352592|ref|YP_004193931.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121094|gb|ADW16640.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 798 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 54/184 (29%), Gaps = 26/184 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP---SFSWGVHKLIRTIVK 325 K+ ++ A + + +N + +G + + R Sbjct: 345 KRIERLKVAAKNFVS---LAENGTE---LGIVSYASDAAVASGRTEVAIAPLGANRAAWN 398 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T I +Q A D I ++ + A YIVL++DG N + + Sbjct: 399 NAIDGLGPSTRTNIGAGLQKARDLITAA---------GGVTANTYIVLMSDGLNNEPAPQ 449 Query: 386 GIAICNKAKS------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 A + GI + + + + + + ++ + L + Sbjct: 450 ANADADLNGKIAMLLADGIPVYVTCTGSDLGLASQCSEIGTG--TGGHYVDSADSARLPE 507 Query: 440 IFRD 443 F D Sbjct: 508 AFAD 511 >gi|62001362|gb|AAX58383.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/364 (10%), Positives = 98/364 (26%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ LN L + Sbjct: 1414 FGPNATITASIDSRTTKRTKFQTKEGVAMTAPELTKLN-KQLQSAFKDKPTQDKLKGLAD 1472 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 K IQ +D ++ + + + + T K ++ + + Sbjct: 1473 AKNPDHADKTPKEKIQAHLDGLNTLFEGRSANNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + V G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKVDQGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|126722991|ref|NP_062242.2| inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] gi|59808174|gb|AAH89806.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Rattus norvegicus] gi|149034208|gb|EDL88978.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_a [Rattus norvegicus] Length = 933 Score = 47.7 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 77/216 (35%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S KK R+AL +++ + D N V Sbjct: 256 NGYFVHHFAPEDLPTMAKNVLFVIDKSGSMAGKKIQQTREALIKILKDLSTQDQFNIIVF 315 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G + + +++ + +A G T IN A+ +A + + SN+ Sbjct: 316 SGEAN-QWEQLLVQATEENLNRAV-----DYASKIPAQGGTNINKAVLSAVELLDKSNQA 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQE 413 E+ K+ I+LLTDGE T I +A + + + F + Sbjct: 370 ELLPSKSVSL----IILLTDGEPTVGETNPKIIQKNTQEAINGRYSLFCLGFGFDVNYP- 424 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N + +++S +L +++ Sbjct: 425 ----FLEKLALDNGGLARRIYEDSDSALQLQDFYQE 456 >gi|47229694|emb|CAG06890.1| unnamed protein product [Tetraodon nigroviridis] Length = 707 Score = 47.7 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 73/461 (15%), Positives = 139/461 (30%), Gaps = 52/461 (11%) Query: 32 VMLGVGGMLVDVVRWSYYEHALKQAAQTA-IITASVPLIQSLEEVSSRAKNSFTFPKQKI 90 ++L V L W Y + + + IT + + E S + PK Sbjct: 13 LLLAVVTALPVRENWDIYSYHINSTVTSRYAITVITSRVANRMEESKEIEFHVRIPKNAF 72 Query: 91 EEYLIRNFENNLKKNFTDREVRD---------------IVRDTAVEMNPRKSAYQVVLSS 135 + + + IV + K++ V Sbjct: 73 ITKFKMVIDGQEYDGIVKEKEKAQQQYSQAVSRGQSAGIVSSVGRTLEEFKTSVTVAAHK 132 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAET----VSRSYHKEHGVSIQWV---IDFSRS 188 + L L R +G I + V ++ G++ V + Sbjct: 133 KVTFELTYEELLKRRLGKYELQIHARPMQPVKDFKVDVYINENAGINSFMVKGGLSTKEL 192 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL-------- 240 + P KS S + K G+ + + Y V+ N SL Sbjct: 193 ANAVTTTQAEKEAWVHFYPTVDQQKSCDSCDDK-GMNGDLIVVYDVNRNTSLGDIQRSPG 251 Query: 241 YYMLYPGPLDPSLSEEH--FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 Y++ + P + ++ F+ S +K R AL ++ + + D G Sbjct: 252 YFVHHFAPSNLQRIPKNVVFIIDQSGSMHGRKIEQTRTALIHILNDLAEDD------FFG 305 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ + K KTFA + G+T IN A+ + S+ Sbjct: 306 LLTFDSNIFQWKRELVQATKANLESAKTFARNIRANGATNINAAV------LKGSSMLNA 359 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR---IMTIAFSVNKTQQEKA 415 H + + ++LLTDG+ T AI + ++ + + F + + Sbjct: 360 HPREGSAS---ILILLTDGDPTTGETNPEAIQSNVRNAIAEKFPLYCLGFGFDVNFEFLE 416 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + L N +E + K F + + + V+ I Sbjct: 417 KMSLQNNGVARRIYEDSDADLQLKGFYEEVATPLLTNVMMI 457 >gi|13447394|ref|NP_085104.1| chloride channel calcium activated 2 [Mus musculus] gi|12043705|gb|AAG47626.1|AF115852_1 endothelial chloride channel [Mus musculus] gi|14198178|gb|AAH08147.1| Chloride channel calcium activated 2 [Mus musculus] gi|74208910|dbj|BAE21205.1| unnamed protein product [Mus musculus] gi|148680073|gb|EDL12020.1| mCG120735 [Mus musculus] Length = 902 Score = 47.7 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 74/224 (33%), Gaps = 35/224 (15%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 CN+ + + D S + R + V+ +D + + Sbjct: 267 CNRRSTWDVIKASADFQNSPPMRGTEAPPPPTFSLLKSRRRVVCLVLDKSGSMDKEDRLI 326 Query: 296 RM--GATFFNDRVISDPSF----SWGVH-KLIRTIVK----------TFAIDENEMGSTA 338 RM A + +++ S ++ + ++K T + + G T+ Sbjct: 327 RMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQATGGTS 386 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQG 397 I +Q + I SS++ IVLLTDGE+ GI+ C +A G Sbjct: 387 ICHGLQAGFQAITSSDQSTSGSE---------IVLLTDGEDN-----GISSCFEAVSRSG 432 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I TIA + + LS+ F+ L F Sbjct: 433 AIIHTIALGPSA---ARELETLSDMTGGLRFYANKHVSSLIDAF 473 >gi|188583113|ref|YP_001926558.1| hypothetical protein Mpop_3896 [Methylobacterium populi BJ001] gi|179346611|gb|ACB82023.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 473 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 47/157 (29%), Gaps = 24/157 (15%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L G +I L ++G+ G ++ R L+ A ++ L + Sbjct: 22 RLRHESDGAVAVIFGLAASTLIGLVGGGIEYARVLSARTQLQSAVDAGVMAGGNALKLVV 81 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 S + + + + D+ V + P K++ + Sbjct: 82 SSTDSIVGLTTQTIQTEAK------------------APADVPVTVQVTVAPDKTSVEAR 123 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 L +G+ S I +A+A V R Sbjct: 124 AEQVIKLTFGAF------VGMASMPISARAKASVVGR 154 >gi|84498072|ref|ZP_00996869.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] gi|84381572|gb|EAP97455.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] Length = 656 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 63/180 (35%), Gaps = 32/180 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS----------DPSFSWGVHK 318 K + AL V+ ++ DT ++G + +V D + Sbjct: 58 TKIEAAKRALTGVVGALP------DTAQVGLRVYGAKVDGKGKPTPAACADTQLVHPIAT 111 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 L + + + +G T I ++ A + K+ IVL++DGE Sbjct: 112 LDKPKLTSTIAAIKALGETPIAHSLTEALKDL-------------GTSGKRNIVLVSDGE 158 Query: 379 NTQDNEEGIAICN-KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + AI A ++I T+ F VN + + + A ++++A L Sbjct: 159 ESCVPDPCPAITKLTAAGVDLQIDTVGFGVNTKARAQLQCI--AAAGKGTYYDAKDASAL 216 >gi|261414506|ref|YP_003248189.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261370962|gb|ACX73707.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] Length = 227 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 28/81 (34%), Gaps = 6/81 (7%) Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NEEGIA 388 G T + +AM A D + ++ + +IVL+TDG + Sbjct: 91 ADGGTPMGEAMNMALDMLEKRK--SEYKASGVDYYQPWIVLMTDGMPNGSQAELSRSIQR 148 Query: 389 ICNKAKSQGIRIMTIAFSVNK 409 C+ + + I I + Sbjct: 149 TCDMINDRKLTIFPIGIGEDA 169 >gi|66576258|gb|AAY51689.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 350 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 65/180 (36%), Gaps = 18/180 (10%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 S K + S G I+ AL++ V++G+ + VD+ R + L+ AA A + + Sbjct: 3 RLRSIKRLHSQRGVVTILFALVLMVLVGLIALAVDLTRLHLVKAELQNAADAAALAGAGS 62 Query: 68 LIQSLEEV---------SSRAKNSFTFPKQKIEEY-LIRNFENNLKKNFTDREVRDIVRD 117 LI + + + + + + I ++ ++ ++ + + + Sbjct: 63 LIDTSLQTFNWSAATAKAQEFADVNSADGKTIGQHRQEQDVNVAIQPGYWNLITPSFTSN 122 Query: 118 TAVEMNPRKS---AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 T + + A QV ++ + F +GI +Q A A + Sbjct: 123 TGLVTHTGDGNIPAVQVTITLSHLKFF-----FAPILGIPEGTVQATAIAAVSPPTGGTG 177 >gi|62001324|gb|AAX58364.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/364 (10%), Positives = 98/364 (26%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ LN L + Sbjct: 1414 FGPNATITASIDSRTTKRTKFQTKEGVAMTAPELTKLN-KQLQSAFKDKPTQDKLKGLAD 1472 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 K IQ +D ++ + + + + T K ++ + + Sbjct: 1473 AKNPDHADKTPKDKIQAHLDGLNTLFEGRSANNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + V G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKVDQGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|156409361|ref|XP_001642138.1| predicted protein [Nematostella vectensis] gi|156229279|gb|EDO50075.1| predicted protein [Nematostella vectensis] Length = 989 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 70/228 (30%), Gaps = 25/228 (10%) Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQ------PADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 +D + +D +E + + G P + + Y ++ V P + Sbjct: 270 LDIAGDTVDTDAPAEEETTDEEGSGQDYWGPEEDSAYDYPPRSSCVPCPPGPELPISNTA 329 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + L + V+ S + + + RD +S + + + R Sbjct: 330 VPAKTAASANVDLVILIDGSRSVERSGVGNFRRAIDFARDLTSSFV-----VSPRH--TR 382 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + R F R A+++AY T N+ Sbjct: 383 VGLMVYGKRAYKVFGF-NDYRDNNRLFTGFNKPIRYPRERAQTATALRSAYRTFFGRNKR 441 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 +K +VL+TDG+ + + K +G++I + Sbjct: 442 SA---------QKVLVLVTDGK--IRDAKAKRQSQSIKRRGVKIYVVG 478 >gi|209524446|ref|ZP_03272995.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209495237|gb|EDZ95543.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 541 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 51/299 (17%), Positives = 96/299 (32%), Gaps = 44/299 (14%) Query: 124 PRKSAYQVVLSSRYDL-LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 P Y ++ + + L NPLS F S + T A V R + Sbjct: 70 PNTETYDLIAENNFQLVAANPLSTF-------SIDVDT-ASYSNVRRFIN---------- 111 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 R +D R E + P + + +S + + Sbjct: 112 -QRQRPPIDAVRIEELINYFSYDYPQPQAEEPFSITTEVSSAPWSPQHQLVHIGLQGKTL 170 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + P + +D S + + L+++ ++ + + D V V GA Sbjct: 171 AIEELPPS---NLVFLLDVSGSMNQPNRLPLLKEGFKLLVDQLTEQDTVAIAVYAGAAG- 226 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 V+ P+ K+I I GSTA + ++ AY+ + K Sbjct: 227 ---VVLPPTPGNEKQKIIAAID-----GLQAQGSTAGGEGIKLAYELATRMLSEG----K 274 Query: 363 NNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 NN ++L TDG+ + E + + + +GI + + F + + K Sbjct: 275 NNR-----VILATDGDFNVGVSSDAELVRLIESYRDRGIYLTVLGFGMGNYKDSKMEKL 328 >gi|206575582|ref|YP_002235851.1| tellurite resistance protein TerY [Klebsiella pneumoniae 342] gi|206570426|gb|ACI12072.1| tellurite resistance protein TerY [Klebsiella pneumoniae 342] Length = 212 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 57/172 (33%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 21 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTARQTVPLT--------DLLNFNLPS 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+TA+ +A+ + I + + K + + L+TDG T D +G+ Sbjct: 73 FSASGTTALGEALSLTANRIDAEVQKTTAETKGDWRP--LVFLMTDGGPTDDWRKGLNEF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 AK G+ + + L + + + F+ Sbjct: 131 KAAKK-GVVV------ACAAGHDADTGVLKEITEIVLQLDTADSSSIKAFFK 175 >gi|218528924|ref|YP_002419740.1| hypothetical protein Mchl_0894 [Methylobacterium chloromethanicum CM4] gi|218521227|gb|ACK81812.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 477 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 75/477 (15%), Positives = 135/477 (28%), Gaps = 49/477 (10%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G ++ AL + +G+ G+ VD + L AA A + V + + Sbjct: 18 LASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIA 77 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + ++ K + L N K F + + + + +Y + Sbjct: 78 ANAQQSDVMTAGIKAGEYQALKAFNVNASKVPFATVSLSQLEIVRSGQTLDATVSYTATV 137 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S F R G+ + + A Y + ++D S SM Sbjct: 138 QST----------FGRLFGLSVTTLTNRVNASADIAGY-----LDFYLMVDVSGSMGLPT 182 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK-LSPYMVSCNKSLYYMLYPGPLDPS 252 DS+ L + + G + L P Sbjct: 183 TDSDAALLASKSVEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASKPI 242 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID----------NVNDTVRMGATFF 302 + ++ + + + + D S+ D N+ DT Sbjct: 243 VPNQYRIGIYPFINRLATLAPLSDTTTSLASLKTTADCGKAWPLAFTNLLDTGSTQLFTN 302 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 ND S + + T N S + D + +S + K Sbjct: 303 NDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDGMQNSQTYSSWKDK 362 Query: 363 NNLE--AKKYIVL-LTDGENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARY 417 K+ D + +Q + A C K G I I+ I ++ K+ + Sbjct: 363 KTYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIPYNYVKSYTNEGTI 422 Query: 418 F----------------LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L CASP FF ANS ++ + +V RIT+ Sbjct: 423 VWENNRVNGFSPTLADPLRQCASPGLFFTANSAKDITASLGAMFDQAL--KVARITQ 477 >gi|254781007|ref|YP_003065420.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] gi|254040684|gb|ACT57480.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] Length = 411 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 69/471 (14%), Positives = 137/471 (29%), Gaps = 83/471 (17%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + F+ +L K G F +ITA+L+ + + ++VD V L++ Sbjct: 1 MFQNKNFLLGVLRLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTAWLQEVLDHV 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 I S + L E F + +IE+ L N N+ + T + + Sbjct: 61 IYRTSPKNLYDLREAGRD-----NFIRHQIEKAL--NTYNSRDLSNTGSIESIVKDAVIL 113 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 N Q + + L+ + + + Y + G+ + Sbjct: 114 TKNVNSLPLQFTVDIALSTTVQLRGSLLQMFSQSKGKVDISRRKKVM---YKQNIGL-MI 169 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + L + ++ + PY C Sbjct: 170 MPFAWDGYWLASRGKVADSKVHPPKYLEYSHYYQQYLNRNTLVKNFLSQIPYKNFCMAPY 229 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 +Y ++ A+ ++ S+ + Sbjct: 230 HYS----------------------------SILYWAVGTLTYSVDNKTTTREY------ 255 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + D + +W H I F + N+ G + ++N H Sbjct: 256 -YKDPYYA----TWD-HFPYSFIKNVFDMTSNQFGDG----------QVLTNTN----HC 295 Query: 361 MKNNLEAKKYIVLLTDGENTQDNE-------EGIAICNKAKS------QGIRIMTIAFSV 407 + KY+++L G + + + C+ I I ++ FS Sbjct: 296 FPHGASQNKYMLMLAIGNQLSRSSVEKEKIEKVLQDCHYMHKRHRTGRDAITIFSVGFSP 355 Query: 408 NKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + + RY L CAS P+ ++E NS + I + N I + T Sbjct: 356 D----QDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVITNWFSQFT 402 >gi|311063719|ref|YP_003970444.1| cell surface protein [Bifidobacterium bifidum PRL2010] gi|310866038|gb|ADP35407.1| Cell surface protein with gram positive anchor domain [Bifidobacterium bifidum PRL2010] Length = 1176 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 40/272 (14%), Positives = 76/272 (27%), Gaps = 60/272 (22%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + K V+ +S P D + + S+ + + + Sbjct: 599 YRKYIKANNDGTYDLSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMGHNSRLEVAK 658 Query: 276 DALASVIRSIKKIDN--VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 A+ S+ + + +N + +RM F+ + +F+ ++ + Sbjct: 659 TAVNSMAQHLLTSENQGKDSNIRMALVPFSTTAGNVSNFTDNAMDIVSAV-----NGLGA 713 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI------ 387 G T + KKYIV ++DG+ T Sbjct: 714 DGGTNWE----------AALKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWW 763 Query: 388 --------------------------------AICNKAKSQGIRIMTIAFSVNKTQQEKA 415 A+ + + ++ S + K Sbjct: 764 GRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAEANRRGDATLFSVGVS---SDPTKM 820 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 R F + S++ A ST ELNK F D IG Sbjct: 821 RGFADQ--TKGSYYSATSTDELNKAFADIIGQ 850 >gi|219127465|ref|XP_002183955.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217404678|gb|EEC44624.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 582 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 50/351 (14%), Positives = 96/351 (27%), Gaps = 38/351 (10%) Query: 35 GVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 G +L+ + + A + I + + I+ +V + + Sbjct: 5 GFASVLIHLSWGLIFTLATEADFDALIASMTADAIELARQVELLYQKRCNEISLRQCARG 64 Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK 154 N +L N T D+ + + +S+ Sbjct: 65 SYNECTSLYPNQTCPGGEDLNVAQCGDGVSCSGLWDYSISN------------------T 106 Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 A+ + + +D + + Q Sbjct: 107 RLHQNLVDSADGNPSDPNVIETICFTQQLD-----EFFVQKRAEQKPYWDSLGLRTPQMY 161 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + SQNG I + S Y + +D+S + L+ Sbjct: 162 FGSQNGAFRIYPARQSE-TCGVYDPRLRPWYIAASSGPKNVVLVLDTSGSMTDGNRLSLL 220 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRV---ISDPSFSWGVHKLIRTIVKTFAIDE 331 + A VI ++ D R+ F+ + D F + + + ++ T Sbjct: 221 KQAAKQVIETLTVGD------RVAIVEFSSQAKLFAQDNKFLFTATQKNKELLATHIDSF 274 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G+T DA A+ + S + E H I+ LTDGE TQ Sbjct: 275 TAAGATNFLDAFTAAFAVLNDSIDQEYHVGCTTA-----ILFLTDGEMTQP 320 >gi|62001396|gb|AAX58400.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/364 (10%), Positives = 98/364 (26%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ LN L + Sbjct: 1414 FGPNATITASIDSRTTKRTKFQTKEGVAMTAPELTKLN-KQLQSAFKDKPTQDKLKGLAD 1472 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 K IQ +D ++ + + + + T K ++ + + Sbjct: 1473 AKNPDHADKTPKEKIQAHLDGLNTLFEGRSANNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + V G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKVDQGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|311033467|sp|A8K7I4|CLCA1_HUMAN RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=hCLCA1; AltName: Full=Calcium-activated chloride channel protein 1; Short=CaCC-1; Short=hCaCC-1; Flags: Precursor gi|56203695|emb|CAI22169.1| CLCA family member 1, chloride channel regulator [Homo sapiens] Length = 914 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 55/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I Sbjct: 345 GMVTFDSAAHVQNELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSAFTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|325570952|ref|ZP_08146571.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] gi|325156278|gb|EGC68462.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] Length = 1176 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 65/396 (16%), Positives = 134/396 (33%), Gaps = 38/396 (9%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIR---NFENNLKKNFTDREVRDIVR 116 A +AS ++ S A+ +++ + N E + T Sbjct: 153 ATESASSDRVEVSSTDSQAAETGTESTSEQVSNEDSKGVSNEETQEETKETGTTTSSSED 212 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 +T +E ++ YQ S+ + P +S S ++ ++ +K+ G Sbjct: 213 ETLIEPFKQERLYQKTTVSQAFASIEPDYTTDQSGTYPSANWTIDEKSNVLNHQGNKDAG 272 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + V ++ + + G AD ++ ++ + G+ D V Sbjct: 273 ETWDGVTTWNGDPENL--TNSYIEYGGVGDEADFALRKFAKETNTPGLFD-------VYL 323 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N P+D L VD S + + + V+ + + I+ + D+V Sbjct: 324 NVRGNVQRQIDPIDVVL----VVDWSGSMNEMGRITEVKKGVDRFLNQIEG-SGIQDSVY 378 Query: 297 MGATFFND--RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 MG ++ + + G ++ ++T E G T ++ A D + + N Sbjct: 379 MGYVGYSSDGNNYQNKTCQLGKFSEVKETIRTM-TPETAAGGTFTQRGLRQAGDMLSTQN 437 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 KK IVLLTDG T ++ G AFS+++ Q Sbjct: 438 GH-----------KKVIVLLTDGVPTYSYHVSKV---HTQADG-SYYGTAFSLSQDQPMN 482 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + + + + + ++ +N F IG + Sbjct: 483 TSHLYNGYFASDQY---GNSKWINNTFVATIGEAMA 515 >gi|254786708|ref|YP_003074137.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687355|gb|ACR14619.1| von Willebrand factor, type A [Teredinibacter turnerae T7901] Length = 767 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 51/263 (19%), Positives = 82/263 (31%), Gaps = 31/263 (11%) Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 DR S Q+ I + C SL +D S Sbjct: 2 WRLDRCYFSAEQQHTMPNILARFRGFLSLLCVFSLLAGGAQAQAPAPADVRLVIDVSGSM 61 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +L + A+ +++ + + R G F V V R + Sbjct: 62 KRNDPNNLRQPAVDLLVQLLPEGS------RAGVWTFGKWVNMLVP-HRDVTDPWRATAQ 114 Query: 326 TFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG------E 378 A + N +G T I +A++ A E +K I+LLTDG Sbjct: 115 AKASEINSVGLFTNIGEALEKA----------TFEGADGGAEFRKSIILLTDGMVDIDKS 164 Query: 379 NTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 Q+ E I ++ K G+ + TIA S N + L A+ A+S Sbjct: 165 PEQNKREWRRIADEVIPRLKEAGVTVHTIALSANADTNLLNKISL---ATGGMAEVAHSA 221 Query: 435 HELNKIFRDRIGNEIFERVIRIT 457 +L +IF + +T Sbjct: 222 DDLMRIFLKAFDVAAPAEQVPLT 244 >gi|167647386|ref|YP_001685049.1| von Willebrand factor type A [Caulobacter sp. K31] gi|167349816|gb|ABZ72551.1| von Willebrand factor type A [Caulobacter sp. K31] Length = 592 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 71/219 (32%), Gaps = 26/219 (11%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P P L+ +D+S + L + AL +I ++ D R+ + Sbjct: 220 YATPRAGQPPLNLVFLIDTSGSMSGPDRLPLAKKALNVLIDQLRPQD------RVSMVAY 273 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + S + G KL ++ GSTA ++ AY + N D + Sbjct: 274 AGSAGAVLSPTDGKSKLK---MRCALTALRSGGSTAGGQGLELAY-ALARQNLDPKAVNR 329 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICN---KAKSQGIRIMTIAFSVNKTQQEKARYFL 419 ++L+TDG+ + + + + G+ + F + Sbjct: 330 --------VILMTDGDFNVGIADPTRLKDFVADQRKSGVYLSVYGFGRGNYNDTMMQALA 381 Query: 420 SNCASPNSFFEA-NSTHELN-KIFRDR---IGNEIFERV 453 N ++ + +L F I +++ +V Sbjct: 382 QNGNGTAAYVDGLQEARKLLRDDFDSALFPIADDVKIQV 420 >gi|110611231|ref|NP_001276.2| calcium-activated chloride channel regulator 1 precursor [Homo sapiens] gi|146327635|gb|AAI41452.1| Chloride channel accessory 1 [synthetic construct] gi|162318850|gb|AAI56806.1| Chloride channel accessory 1 [synthetic construct] Length = 914 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 55/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I Sbjct: 345 GMVTFDSAAHVQNELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSAFTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|126306106|ref|XP_001362488.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 911 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 39/269 (14%), Positives = 86/269 (31%), Gaps = 25/269 (9%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S+ + N + + S + ++ + ++ K + + Sbjct: 241 SIDSVVEFCTEKNHNTLAPNPQNKMCNSRSTWEVIQNSEDYKNSIPMTEAKPPHPIFSLK 300 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + S S+ + L + + +++ I+K G F+ Sbjct: 301 QIRERVLILVLDKSGSMAGEDRLNRLNQASQLFLLQIIEKGS------WTGMVTFDSSAT 354 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + R + + + G T+I ++TA+ I + H + Sbjct: 355 IQSALIQIETDAQRNSLISR-LPTAAGGGTSICSGLRTAFTVI--KKKFSTHGSE----- 406 Query: 368 KKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 IVLLTDGE + I+ C ++ K G I T+A + + ++ + Sbjct: 407 ---IVLLTDGE-----DSTISSCFDEVKQSGAIIHTVALGPSADPGLEKLAEMTGGMKTS 458 Query: 427 SFFEANSTHELNKIFRD-RIGNEIFERVI 454 + A + + L F GN + Sbjct: 459 ATDNAQN-NGLIDAFSALSSGNGAITQRS 486 >gi|56675026|gb|AAW19655.1| matrilin-1 [Cervus elaphus] Length = 230 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 64/194 (32%), Gaps = 23/194 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + V+ L+ VI S+ N R+G + V + Sbjct: 5 VFVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNA---TRVGLVNYASSVKQEFPLR--A 59 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H +++ + T A+ A I + D + + K ++++TD Sbjct: 60 HSSKAELLQAVRRIQPLSTGTMTGLAIHFA---ITKALSDAEGGRPRSPDISKVVIVVTD 116 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G + A +A++ GI + I + L AS S Sbjct: 117 GRPQDSVRDVSA---RARASGIELFAIGVG------RVDKATLRQIASEPQDEHVDYVES 167 Query: 434 ---THELNKIFRDR 444 +L+K F++ Sbjct: 168 YRVIEKLSKKFQEA 181 >gi|145593798|ref|YP_001158095.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303135|gb|ABP53717.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 319 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 64/197 (32%), Gaps = 25/197 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ + + D +G F L I + Sbjct: 107 RLTAAKESARRFVDGLP------DEFNVGLVAFAGSAAVLVPPDTDREALDEGIDRL-VE 159 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI +A+ T+ + + + + IVLL+DG NT + + Sbjct: 160 GATGVQGTAIGEAINTSLGAVKALDGEAAK-----DPPPARIVLLSDGANTSG-MDPMEA 213 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHELN 438 A + + TIAF ++ L A + F EA S EL Sbjct: 214 AADAVEMEVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDAVARETGGQFHEAVSVEELR 273 Query: 439 KIFRDRIGNEIFERVIR 455 ++ D IG+ + R R Sbjct: 274 AVY-DDIGSSVGYRTKR 289 >gi|110626529|gb|ABG79013.1| TadG [Yersinia ruckeri] Length = 478 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 61/461 (13%), Positives = 132/461 (28%), Gaps = 61/461 (13%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +++ G I + +P + + +L D + + L A + + + + + Sbjct: 38 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALALTAE--NNAK 95 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + NF + T + V Sbjct: 96 NDTRNNELISAYI----------NFYLGHRHQLTQY------NNITVNYQQNPDRLYHTQ 139 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSMLDY 192 S+Y + N L S LI S + K+ + + +V DFS SM Sbjct: 140 LSQYHIDANIEQPTL--FPFTSLLIDHDNFIIGGSAAAIKDVPAMDVVFVTDFSGSMEGD 197 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + + R + KS P Sbjct: 198 FHNPDDPEVLSKLDELKRIFFKIADDIYTANKDSTISFSPFSWGTKSADNKKCSLHFMPK 257 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALA-----SVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + S+ + H + +A + +I+ I N+ + + +D + Sbjct: 258 EKNKIYPIPSNEIERNTEAHQEKYMIAITENIDYLATIENIGTNNEKIVIPLDHVHDELC 317 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S ++ + NE GST ++ + + + + Sbjct: 318 LYSSNAYPISLAKDIDDLNVIKTMNEGGSTLVSSGIMQG-----------ADLLMDGVNH 366 Query: 368 KKYIVLLTDGE---------------------NTQDNEEGIA--ICNKAKSQGIRIMTIA 404 K +++L+DG N + + + +C K + RI+ I Sbjct: 367 NKLMIILSDGHDYPTTAVVHDKTITSAKEVRVNVDISRQLVQHGMCKKIRETVGRIVFIG 426 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 N + + +C +F+ A +T EL R + Sbjct: 427 IGYNPSANHYINW-AEDCVDTENFYLAMNTKELEDSIRSAL 466 >gi|154486447|ref|ZP_02027854.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] gi|154084310|gb|EDN83355.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] Length = 882 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/297 (11%), Positives = 88/297 (29%), Gaps = 54/297 (18%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + ++ + + NG + + + + + + L Sbjct: 138 ANRAQSVAQNETGSQLGAPEKHKRIKKNDNGSYTVNVDVKGAVNSTTVTTTQPIDFTLVL 197 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN--DTVRMGATFF----- 302 D S S + D S ++ +++A+ + + + + V +G F Sbjct: 198 DVSGSMD---DPMSKTDRTRRLDALKEAVKAFLDEAANTNTEAGSELVHVGLVKFAGDKT 254 Query: 303 ---NDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISS 353 D + +++ +++ + K G+T ++ A + S+ Sbjct: 255 DKIGDDMYRSGGYTYNYSQIVSNLTADMNGLKNKVSKLKAAGATRADNGFNRAVKVMGSA 314 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDN-------EEGIAICNKAKSQGIRIMTIAFS 406 + +AKK ++ DG T + + + + K G + +I Sbjct: 315 S--------ARTDAKKVVIFFADGSPTSSSGFEGKVANKAVEAAKELKDGGAAVYSIGIF 366 Query: 407 V---NKTQQEKARYFLSNCASP-----------------NSFFEANSTHELNKIFRD 443 + F+ +S + A + ELN IF + Sbjct: 367 ASANPSSLSSNENQFMHAVSSNFPKATKYNQLGEGNIEAGYYKSATNASELNTIFDE 423 >gi|159897645|ref|YP_001543892.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159890684|gb|ABX03764.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 562 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 54/169 (31%), Gaps = 26/169 (15%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D+S + + AL I DNV T+ F+ S Sbjct: 390 IDTSGSMRQENRLREAKTALGDFIDIFADQDNVQVTI------FSTNATELSDLS--PIG 441 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 R + T G T + + Y I E + R +V+LTDGE Sbjct: 442 PKRADLHTRIDGLVADGETRLYSTIGEVYTDIQQQTEVQRIRA---------LVVLTDGE 492 Query: 379 NTQDNEEGIAICNKAKSQ----GIRIMTIAFSVNKTQQEKARYFLSNCA 423 +T + + + + I+I TIA+ + + L A Sbjct: 493 DTASSLSLEQLNEQIRQDESGTSIKIFTIAYGSDAN-----QEVLQRIA 536 >gi|223973011|gb|ACN30693.1| unknown [Zea mays] Length = 481 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 60/188 (31%), Gaps = 22/188 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ V++ + ID R+ F D ++K Sbjct: 43 KIEKMKTAMKFVVKKLSSID------RLSIVTFLDTANRICPLRQVTEDSQPQLLKLI-D 95 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+T I+D +QT + ++ + ++L++DG+ + Sbjct: 96 ALQPGGNTNISDGLQTGLKVLAD------RKLSSGRVVG--VMLMSDGQQNRG-----EP 142 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K + + T F + N + +F N + L+ F + + Sbjct: 143 AANVKIGNVPVYTFGFGAD-YDPTVLNAVARN-SMGGTFSVVNDVNLLSMAFSQCLAGLL 200 Query: 450 FERVIRIT 457 V +T Sbjct: 201 TVVVQDLT 208 >gi|225010242|ref|ZP_03700714.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005721|gb|EEG43671.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 351 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 48/143 (33%), Gaps = 20/143 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + ++ +I + R+G + + + + ++ Sbjct: 111 RMEKAKRLVSEIINELASD-------RIGIIAYAAQAYPQLPITTDF-GAAKMFLQGMNT 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D TAI+DA++ A + + + + +++DGE+ + Sbjct: 163 DMLSSQGTAISDAIELATTYYNDAAQTN-----------RVLFIVSDGED-HSEGGAVNA 210 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ 412 +KA GI+I TI K Sbjct: 211 VSKATEAGIKIFTIGVGTEKGAP 233 >gi|222616155|gb|EEE52287.1| hypothetical protein OsJ_34277 [Oryza sativa Japonica Group] Length = 367 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 66/179 (36%), Gaps = 16/179 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTF 327 + +++DA+ +IR ++ D R+ FNDR V + + R I + Sbjct: 75 SRLDVLKDAMKFIIRKLEDGD------RLSIVAFNDRPVKEYSTGLLDISGNGRRIAEKK 128 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TA+ A++ A + D +R+ +I+LLTDG++T Sbjct: 129 VDWLEGRGGTALMPALEEAIRVLDCRPGDSRNRVG-------FILLLTDGDDTSGFRWSR 181 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + N A + + T + + Y SF + + ++ +G Sbjct: 182 DVINGAVGK-YPVHTFGLGAAHSSEA-LLYIAQESRGTYSFVDDENMDKIAGALAVCLG 238 >gi|71061058|dbj|BAE16255.1| calcium activated chloride channel [Rattus norvegicus] Length = 902 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 75/256 (29%), Gaps = 28/256 (10%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + N + + S + + + ++ + Sbjct: 243 LNSVVEFCTEKTHNTEAPNLQNKICNGRSTWDVIKESADFQQAPPMRGTEAPPPPTFSLL 302 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D S + + A + I + +++ +G F+ Sbjct: 303 KSRQRVICLVLDKSGSMDTEDRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSTAQI 357 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + T + + +G T+I ++ + I SS++ Sbjct: 358 QNYLI-KITNTGDYKKITGNLPQQAVGGTSICRGLEAGFQAITSSDQSTSGSE------- 409 Query: 369 KYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCASP 425 IVLLTDGE ++ I+ C K G I TIA + AR LS+ Sbjct: 410 --IVLLTDGE-----DDLISSCFEVVKHSGAVIHTIAL-----GPKAARELETLSDMTGG 457 Query: 426 NSFFEANSTHELNKIF 441 F+ + L F Sbjct: 458 LRFYANKDVNSLMDAF 473 >gi|163850366|ref|YP_001638409.1| hypothetical protein Mext_0933 [Methylobacterium extorquens PA1] gi|163661971|gb|ABY29338.1| conserved hypothetical protein [Methylobacterium extorquens PA1] Length = 473 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 75/477 (15%), Positives = 136/477 (28%), Gaps = 49/477 (10%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G ++ AL + +G+ G+ VD + L AA A + V + + Sbjct: 14 LASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIA 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + ++ K + L N K F + + + + +Y + Sbjct: 74 ANAQQSDVMTAGIKAGEYQALKAFNVNASKVPFATVSLSQLEIVRSGQTLDATVSYTATV 133 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S F R G+ + + + A Y + ++D S SM Sbjct: 134 QST----------FGRLFGLSATTLTNRVNASADIAGY-----LDFYLMVDVSGSMGLPT 178 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK-LSPYMVSCNKSLYYMLYPGPLDPS 252 DS+ L + + G + L P Sbjct: 179 TDSDAALLASKSVEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASKPI 238 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID----------NVNDTVRMGATFF 302 + ++ + + + + D S+ D N+ DT Sbjct: 239 VPNQYRIGIYPFINRLATLAPLSDTTTSLASLKTTADCGKAWPLAFTNLLDTGSTQLFTN 298 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 ND S + + T N S + D + +S + K Sbjct: 299 NDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDGMQNSQTYSSWKDK 358 Query: 363 NNLE--AKKYIVL-LTDGENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARY 417 K+ D + +Q + A C K G I I+ I ++ K+ + Sbjct: 359 KTYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIPYNYVKSYTNEGTI 418 Query: 418 F----------------LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L CASP FF ANS ++ + +V RIT+ Sbjct: 419 VWENNRVNGFSPTLADPLRQCASPGLFFTANSAKDITASLGAMFDQAL--KVARITQ 473 >gi|139948509|ref|NP_001077327.1| complement factor B [Danio rerio] gi|125858059|gb|AAI29235.1| Zgc:158446 protein [Danio rerio] Length = 751 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 72/196 (36%), Gaps = 11/196 (5%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +D + ++I I + + + DR+IS F G K + I++ Sbjct: 268 ERAKDVIKTLIEKISYYEVSPNYEILLFATDTDRIISMREFKNGQGKDLLKIIQKLQDYA 327 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL-TDGENTQDNEEGIAIC 390 + + AY+ I S E+ K + +A ++IV++ TDG+ + Sbjct: 328 YDKKGQRSGTNIAQAYNKIYESMTIELMTNKEDFKATQHIVIMFTDGQANMGG-SPKPLV 386 Query: 391 NKAKS--------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 +K KS + + + N E ++ A+ FF+ S +L + F Sbjct: 387 DKIKSLVRQNSVEEKLELYVFGLG-NDVHAEDINDLKTDRANEKFFFKLKSLDDLKETFD 445 Query: 443 DRIGNEIFERVIRITK 458 + I + + K Sbjct: 446 NMIDEGNSVELCGLYK 461 >gi|118085865|ref|XP_418677.2| PREDICTED: similar to collagen, type XXVIII [Gallus gallus] Length = 1144 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 71/196 (36%), Gaps = 18/196 (9%) Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 ++ K Y ++ D L + F+ SS + L ++ + S+ +I Sbjct: 18 NQIADGQTKKKGKKSYSVVQSDEDDLCLIDIVFIVDSSESAKNQLFDLQKNFVLSLTDNI 77 Query: 286 KKIDNVNDT---VRMGATFFNDRVISDPSFS-W-GVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ V V++ F+ V D F+ W V I I + T Sbjct: 78 FQMKPVKSQNYNVKLAGMQFSSTVSIDHPFTAWKNVQNFKEKIRALVYIGQ----GTYSY 133 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ A ++ + + + K L+TDG + ++ I A+S GI Sbjct: 134 YAISNA---------TQLFKTEGRERSIKVAFLMTDGVDHPNSPNVQGIATAARSLGIHF 184 Query: 401 MTIAFSVNKTQQEKAR 416 TI S ++EK R Sbjct: 185 FTIGLSKKNVKEEKLR 200 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 57/179 (31%), Gaps = 22/179 (12%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I ++ + T R+G F+ +V S + ++ T A++ Sbjct: 802 IDEVSANHATTRIGIINFSHKVELVSSLE--TYTTKESLKSAVDKMLYLGEGTYTASAIK 859 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTI 403 A ++ +K +++TDG + +D + +A + I I I Sbjct: 860 KAISLFQAARPA----------VRKVALVVTDGQADNRDKVHLDLVVKEAHAANIEIFVI 909 Query: 404 AFSVNKTQQEKARYFLSN---CASP---NSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 FL A+ F++ L D++ +I E V I Sbjct: 910 GIVQ--KTDPHYHNFLKEMHLIATDPDEEHFYQIEDFKTL-SALADKLITKICENVSEI 965 >gi|300776752|ref|ZP_07086610.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] gi|300502262|gb|EFK33402.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] Length = 335 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 54/147 (36%), Gaps = 18/147 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + +A ++ ++KK++N +G F +S + + I Sbjct: 112 LTEAKNLMMATMKKMNNDK----IGIVIFAGNAMSIMPLTTDYNSAETYISGIETSSMQI 167 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T MQ A + + KN + + +VLL+DGE+ + N+ A A Sbjct: 168 QG-TDFLKGMQAAVE-----------KFKNVSKGSRKVVLLSDGEDNEGNDN--AAIRLA 213 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLS 420 +G+ I ++ ++ F Sbjct: 214 NKEGVSITSVGIGTDEGAPVPEYVFGQ 240 >gi|198426249|ref|XP_002120426.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1937 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 77/450 (17%), Positives = 144/450 (32%), Gaps = 55/450 (12%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEH---ALKQAAQTAIITASVPLIQSLEEVSSRAK 80 ++ ++ + + + + W + + A + V + + V R++ Sbjct: 1476 LMHSVFDIIFIMDSSTSIGLQNWVLMKSFVTDMISAFDVSADGTRVAVFRYNRGVDRRSQ 1535 Query: 81 NSFT--FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS--- 135 F + + N N + R ++ N + V+++ Sbjct: 1536 ILFNRFINDKSGLITAVENIPYNGSGTWIGRALQYAQDHVLRYRNGNRLVRDVIITITDG 1595 Query: 136 -RYDLLLNPLSLFLRSMGIKSWLIQ-TKAEAETVSRSYHKEHGVSIQWVI---------- 183 YD + P+S LRS G ++ + T S S E S+ V+ Sbjct: 1596 RSYD-SVMPVSDQLRSQGALTYALGITPGNGLGPSYSLLLEIAGSMDRVVFSTPALLGIS 1654 Query: 184 ---------DFSRSMLDYQRDSEGQPLN----CFGQPADRTVKSYSSQNGKV--GIRDEK 228 D Q P N C + +V S++ + G V G + Sbjct: 1655 DIFKQKLTNDLCGHPCASQCPRLLPPTNGLISCTNSNVEDSVCSFTCRPGFVLSGEPQTR 1714 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 S N ++ P L + + V SS +K ++ + S+I S Sbjct: 1715 CVDGPNSSNDGVWTSPAPMCLAQCRNYGVIDLVVVLDSSRSIGLKSWTDMKAFVRSLIGS 1774 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 N +R+ A +N RV +D + AI N G T A+Q Sbjct: 1775 FVIGPNA---MRVSAFRYNRRVDTDTQILLNSYSDNSFFAAYDAIPYNGAG-TRTALALQ 1830 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVL-LTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 D I++ R A + +VL +TDG + + I + ++ G+ + Sbjct: 1831 HVRDVILT-------RANGEWPAARNVVLTVTDG---RSVSSIVPISEQLRANGVLTYAL 1880 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANS 433 L+ SP+ F AN+ Sbjct: 1881 GVPPIIGNGMNTVNLLALAGSPSRRFLANN 1910 >gi|262202333|ref|YP_003273541.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262085680|gb|ACY21648.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 325 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 70/199 (35%), Gaps = 33/199 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A D++ + + +G F + S + + + K Sbjct: 108 SRIRAAQSAAKKF------ADDLTEGINLGLISFAGTPSTLVSPTPDHTATKKAVDKLVL 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN----- 383 D+ TA + + A D I + N +IVLL+DG+ T + Sbjct: 162 ADK-----TATGEGIFAALDQIRTLN---AVLGGPEAAPPAHIVLLSDGKQTVPDEPTDP 213 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNCA--SPNSFFEAN 432 KAK +GI + TI+F E L A S +FF A+ Sbjct: 214 RGAFTAARKAKEEGIPVSTISFGTAYGTVELDGDRVPVPVDDPSLKQIANLSGGNFFTAS 273 Query: 433 STHELNKIF---RDRIGNE 448 S ELN+++ + IG E Sbjct: 274 SLDELNEVYEKLQSEIGYE 292 >gi|166033217|ref|ZP_02236046.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] gi|166027574|gb|EDR46331.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] Length = 1465 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 45/314 (14%), Positives = 96/314 (30%), Gaps = 51/314 (16%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S + R + ++ K + K G + L + + Sbjct: 57 ALDD--STKNVGRIWTDKSVSAGDVTLTSREKESGTATIKKGADSDFLVGLSALSSTAKI 114 Query: 242 YMLYPGPLDPSLSEE---HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV--- 295 PLD L + D K+ ++ A+ S I K+++ V Sbjct: 115 TGQTTVPLDIVLVLDVSGSMDDPMGSGDNTKRIDALKAAVNSFIDGSAKVNDQRADVNKQ 174 Query: 296 -RMGATFF--------NDRVISDPSFSWGVHKLI----------RTIVKTFAIDENEMGS 336 R+ F + S + + +++ ++ +T G Sbjct: 175 NRIAVVKFAGNKTDKIGNDQYSQNRYWYNYTQVVSGYKAYTSGNKSEWETTVNALKPAGC 234 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-------EEGIAI 389 TA + AM ++ ++ + + + K+ ++ TDGE + + I Sbjct: 235 TAADYAMDLTKT-LVDQSKTDANNNADRKNVKRVVIFFTDGEPNHQSGFDDDVANDAITS 293 Query: 390 CNKAKSQGIRIMTIAFSVNK-------------TQQEKARYFLSNCASPNSFFEANSTHE 436 K+ I TI + +EK F+ +S ++ +A + Sbjct: 294 AKTIKTD-ADIYTIGIFSGADVSITGHSGSGSWSAKEKFNAFMHGLSS--NYPDAERYKK 350 Query: 437 LNKIFRDRIGNEIF 450 L +D G + Sbjct: 351 LGTRAKDSKGQDAT 364 >gi|332291973|ref|YP_004430582.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170059|gb|AEE19314.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 344 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 39/117 (33%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + + D TAI +A++ A + Sbjct: 130 RIGIIAYAGSAYPQLPITTDYSSA-KLFLSQMNTDMLSSQGTAIGEAIELAKTYYNDEEQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +++DGE+ E I +A +GIRI TI ++ Sbjct: 189 TN-----------RVLFIISDGEDHVG--ESSNIAEQANDEGIRIFTIGVGKSEGGP 232 >gi|330830099|ref|YP_004393051.1| RTX toxin-like protein [Aeromonas veronii B565] gi|328805235|gb|AEB50434.1| RTX toxin-like protein [Aeromonas veronii B565] Length = 1553 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 66/176 (37%), Gaps = 27/176 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + L++DA+ +++ I + V +G N V+ G + + + + Sbjct: 1168 TEFQLMKDAINNLLAKFSGISQLQ--VEIGTFADNSNVV-------GTYSSVTAAQQAVS 1218 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 G T A+ T +T+++ + H KY+ LTDGE T + Sbjct: 1219 NLTRSGGGTNYQAALTT-LNTMMTVDPVADH---------KYVYFLTDGEPTVGSWTNST 1268 Query: 385 ---EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 G+A N + G+ I + V + + +P+++ ++ +L Sbjct: 1269 QIANGMAALNALTAPGVVINAVGIGVP-SGASFGNNLNAIDNTPDNYLAVDNFDDL 1323 >gi|72162079|ref|YP_289736.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71915811|gb|AAZ55713.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 315 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 74/196 (37%), Gaps = 29/196 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +++ S I S+ + G F+ S + + +I Sbjct: 107 RLTSAKESAQSFIESL---PPRFNV---GLVAFSSVATVVASPTQDHQAVADSIAN---- 156 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 TAI + + + I S +E + IVLL+DGENT A Sbjct: 157 -LTISSGTAIGEGVFASLQAIRSFDEKA-----TDDPPPAAIVLLSDGENTSG-RPVAAA 209 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNCA--SPNSFFEANSTHELN 438 ++A++ G+ + TIAF + E + Y L A + F+EA ST EL Sbjct: 210 ADEARAAGVPVSTIAFGTGVSIIEIEGHYVPANIDKETLKELAMTTGGRFYEAESTGELK 269 Query: 439 KIFRDRIGNEIFERVI 454 ++ D IG+ + + Sbjct: 270 DVYAD-IGSSLGTETV 284 >gi|317133828|ref|YP_004089739.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315450290|gb|ADU23853.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1061 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 54/304 (17%), Positives = 98/304 (32%), Gaps = 72/304 (23%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N + L+ + + T ++E S EH S V+D + ++ Sbjct: 500 NLMFLWYDEKKKEFVELDTFYDSENSIVSIETEH-FSKYMVVDKTEWFNNW--------- 549 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + SY++ + V C+ S+ S +++ F D Sbjct: 550 -------RKIYNSYAAIFSAIPSYTAIC----VDCSGSM-----------STNDKSFKDD 587 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + + A+ + + S+ DN + F + + L Sbjct: 588 NGVLTCYRNI-----AVQNYVESMFVFDNAS------IITFESSASEECEMTNNKRTLSG 636 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 G T N A+ A D + N++ KK I+LL+DG+ Sbjct: 637 K------ASFYNRGGTNANSAIDIAIDEL------------NHVYGKKNIILLSDGDVNV 678 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNK 439 ++ I C K+ GIRI T+A Q L A+ + A + L K Sbjct: 679 SDDN-IKYC---KNNGIRIHTVALGSGANSQ-----LLKQYANDTGGTPLTATTAEGLTK 729 Query: 440 IFRD 443 I+ Sbjct: 730 IYES 733 >gi|310694574|gb|ADP05359.1| collagen type VI alpha 1 protein [Bubalus bubalis] Length = 1027 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 86/223 (38%), Gaps = 40/223 (17%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLKDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIRGLTRMPSG--RDELKSSVDAVKYFGKGTYTDCAIKKGLEELLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY+V++TDG E ++ G+ N+AK GI++ ++A + + + LS Sbjct: 149 ENKYLVVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEA------NSTHELNKIFRDRIGNEIFERVIRI 456 A+ +++ F A E+ + I + I V ++ Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEVISQTIETITDMIKNNVEQV 246 >gi|90412167|ref|ZP_01220173.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] gi|90326891|gb|EAS43276.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] Length = 504 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 67/494 (13%), Positives = 144/494 (29%), Gaps = 87/494 (17%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G I TAL + + G+ ++ R+ ++ L A + A + + Sbjct: 15 RHQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAITTANQ------ 68 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 I+ Y+ RN + N I + N + +Q +++ Sbjct: 69 -DDKTYENQLATGYIQAYI-RNITS--INNIKIERSEGIDNYPTPDGNEEREYFQYRVTA 124 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ-- 193 + + ++ LS + + + +A A Y + + I +V DFS SM + Sbjct: 125 KTN-HISWLSSDIIPSFAPTETVANRALARNY-PIYLGDKDIDIVFVSDFSGSMKGNKIR 182 Query: 194 ----------------RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 RD E + N N + I P N Sbjct: 183 ALKDAIQAIANEILVPRDGEVEVTNRIAFVPYNMRVQEKRSNTRWCITQLDYRPNFNGGN 242 Query: 238 KSLY-------YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL------------ 278 S Y + + S ++ + R + + + +A Sbjct: 243 YSSYEDIDWSTWSTWTRNQVRDCSNGYYSCTGKKRRDARTVYAILNASKSETGSGWYFPD 302 Query: 279 --------ASVIRSIKKIDNVNDTVRMGATFFNDRVIS----DPSFSWGVHKLIRTIVKT 326 SV ++ N ++ + S + +T + Sbjct: 303 PYSYINFPGSVAKTFTAKANNLQFQSTNQKLYSGGMCSGNFWTIPLT-----SEKTALSP 357 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGE-----N 379 + + G T++ + + + + I++L+DG+ + Sbjct: 358 IQNNMSPDGGTSVYQGLIRGAQILEQGRPTSPSTETSAAYNSRIKMILMLSDGQEMPYVS 417 Query: 380 TQDNEEGIAICNKAKSQG--------IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 T + +CN K+Q + ++ I F + + NC N+ Sbjct: 418 TFNQLVNQGLCNTIKAQFNDSDQPLYMGVLGIEFDA------QGQQGFKNCVGQNNITNV 471 Query: 432 NSTHELNKIFRDRI 445 + +L K + I Sbjct: 472 DDVDDLIKEILEMI 485 >gi|307719357|ref|YP_003874889.1| batA protein [Spirochaeta thermophila DSM 6192] gi|306533082|gb|ADN02616.1| putative batA protein [Spirochaeta thermophila DSM 6192] Length = 332 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 59/179 (32%), Gaps = 36/179 (20%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNED 356 G F + + + + + A+ +G TA+ + T+ Sbjct: 133 GLVLFGKEALLEVPP---TIDVEYFLERLEAVRLFSLGDGTALGMGVGTS---------- 179 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA------FSVNKT 410 +H + N + +V+LTDG+NT A+ GI + T+ S++ Sbjct: 180 LLHLSRVNASF-RAVVILTDGKNTTGEILPETAAEMARELGIPLFTVGVGSDRPVSLDVI 238 Query: 411 QQEKARYF------------LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + L A S FF + L++IF+ IG V R Sbjct: 239 DPSTGTRYAGVLEEGYDEETLRRIAEISGGQFFSGYTPTSLHRIFQ-YIGATATADVRR 296 >gi|296119163|ref|ZP_06837733.1| putative secreted protein [Corynebacterium ammoniagenes DSM 20306] gi|295967789|gb|EFG81044.1| putative secreted protein [Corynebacterium ammoniagenes DSM 20306] Length = 585 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 45/121 (37%), Gaps = 17/121 (14%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 ++ D G T + A++ A + + + E + IVL++DG +T Sbjct: 35 IRGEVDDIEASGYTPMGSALRQAAEELPAEGE-------------RSIVLVSDGIDTCAP 81 Query: 384 EEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + QGI I T+ F V+ + + A + +A+ L + Sbjct: 82 PPVCEVAEELHDQGIDLIINTVGFLVDDEARSELECIAD--AGGGQYLDADDAESLAESM 139 Query: 442 R 442 + Sbjct: 140 K 140 >gi|66472570|ref|NP_001018424.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|63100652|gb|AAH95235.1| Zgc:110377 [Danio rerio] Length = 868 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 72/235 (30%), Gaps = 26/235 (11%) Query: 182 VIDFSRSMLDYQR-DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 V D + D + + C G D N I D ++ N Sbjct: 188 VTDKKAHVSFSPTLDQQRKCTECDGTLIDGDFFITYDVNRPHDIGDIQIV------NGYF 241 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + P L FV +S K ++AL +++ + +D + Sbjct: 242 VHFFAPANLPRVPKMVVFVIDNSYSMYGNKMAQTKEALGTILGEL----PEDDYFAI--I 295 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ + + + + + +G T ++DA + + ++ Sbjct: 296 VFSTTFVVWRPYLSKATEENVKEAQEYVKTIEVIGGTELHDATIHGVEMLYAAQ------ 349 Query: 361 MKNNLEAKKYI---VLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNK 409 +N K + +LLTDG+ Q I + I + +AF + Sbjct: 350 -RNGTAPKNMVLMMILLTDGQPNQYPRSLPEIQESIRKAIDGNITLFGLAFGNDA 403 >gi|325297740|ref|YP_004257657.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317293|gb|ADY35184.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 341 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 45/144 (31%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ ++ + ND V M F + + + ++T Sbjct: 110 SRLEKSKKLISRLVETF-----NNDKVAM--IVFAGEAFTQLPITSDYISA-KMFLETIT 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T I A+ A + N + IVL+TDGEN + + Sbjct: 162 PSLISTQGTDIRGAIDLAMKSFT-----------PNEGVGRAIVLITDGENHEG--GAVE 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 +A +G+R+ + Sbjct: 209 AAQEAAKKGVRVFVLGVGSPDGSP 232 >gi|296139788|ref|YP_003647031.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296027922|gb|ADG78692.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 327 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 73/201 (36%), Gaps = 33/201 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + + + V +G F S + + + K Sbjct: 111 RLTAAKEAAKKFVTELP------NGVNLGIVSFAGTASLLVSPTPDRTLALNAVDKLELA 164 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-----QDNE 384 TA + + T+ +I N +V ++N + I+L +DG+ T D Sbjct: 165 QR-----TATGEGIYTSIQSI--KNIRDVLGGEDNAPPAR-IILESDGKQTVPTDLDDPR 216 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANS 433 G KAK +GI I TI+F L A S FF A+S Sbjct: 217 GGFTAARKAKEEGIPISTISFGTTSGSVNIGGQNIPVPVDDASLKRIAELSGGQFFAASS 276 Query: 434 THELNKIF---RDRIGNEIFE 451 ++LN+ + RD IG E+ + Sbjct: 277 LNDLNEAYGSLRDEIGWEMQK 297 >gi|116620210|ref|YP_822366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116223372|gb|ABJ82081.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 311 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 65/190 (34%), Gaps = 28/190 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K R A + ++ F+ + + + + Sbjct: 104 KLQRSRMAAREFF---HISNPEDEFF---LVEFDSSPRLVVPLTSDTGTIEDHLTFSR-- 155 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 GSTA+ DA+ A +H MK++ + KK +++++DG + + Sbjct: 156 ---SHGSTALLDAIFLA-----------LHEMKHSKKNKKALLIISDGGDNHSRYSEKEV 201 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQE--KARYFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 + K + I +I E LS + + FEA S EL I + +I Sbjct: 202 SSVVKESDVLIYSIGVFGGGGSPEEAGGPGLLSKVSEQTGGRLFEA-SAVELPDIAK-KI 259 Query: 446 GNEIFERVIR 455 G E+ R I Sbjct: 260 GIELRNRYIL 269 >gi|288919019|ref|ZP_06413360.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349559|gb|EFC83795.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 319 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 59/194 (30%), Gaps = 28/194 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + + + + +G F R V++ Sbjct: 107 TRLEAAKQGAQAFVDQL---PPRIN---LGLVSFAGSAAVLVP-----ASTDRESVRSGI 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +TA+ + + + I ++ E + IVLL+DGE T+ Sbjct: 156 RGLQLGPATAVGEGIFASLQAITTAGERMSD--EGQPPPPAAIVLLSDGETTRGRPNT-Q 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHEL 437 A+ + + TIA+ + + L A + S+ A + EL Sbjct: 213 AATAARDAEVPVDTIAYGTSDGTLDVGGQQIPVPVNEDALRELAEQTGGSYHRATTGDEL 272 Query: 438 NKIFR---DRIGNE 448 ++R IG Sbjct: 273 QSVYRGLGSSIGYR 286 >gi|163848161|ref|YP_001636205.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526064|ref|YP_002570535.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669450|gb|ABY35816.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449943|gb|ACM54209.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 905 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 80/249 (32%), Gaps = 32/249 (12%) Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK-SLYYMLYPGPLDPSLSEE 256 P N RT+ +Y G + + + SL L+P LDP EE Sbjct: 349 DTPANAVSLALQRTLTTYVRDLGGGLLFIGGPQSFGAGGWRRSLLESLFPVSLDPPSREE 408 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA--------TFFNDRVIS 308 F +L VI + + + + + + +G F+ Sbjct: 409 RFD--LALTLVIDRSGSMSELVDGLRTQLDLAREAAFQASLGLSRQDQLSIIAFDSVADV 466 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 ++ + G T I M A +TI +++ H Sbjct: 467 ILPLQ---PLPDLATIEDALSRLSAGGGTNIRSGMALAAETIATADARIRH--------- 514 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 ++LLTDG + + + +A ++QG+ + T+A +N + + + + Sbjct: 515 --VILLTDGVSETEYADLVA---NLRAQGVTVSTVAIGLNTDPELERVAQIGG----GKY 565 Query: 429 FEANSTHEL 437 + L Sbjct: 566 YVVRQAEAL 574 >gi|283455087|ref|YP_003359651.1| fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] gi|283101721|gb|ADB08827.1| Fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] Length = 1256 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 69/195 (35%), Gaps = 35/195 (17%) Query: 268 IKKKHLVRDALASVIRSIKKID--NVNDTVRMGATFF--------NDRVISDPSFSWGVH 317 IKK ++ A+ + + +I+ + ++ +R+G F + ++ F + Sbjct: 614 IKKMTALKSAVNNFLGEAAEINEQSGSELIRVGLVKFAGKESSKVGNETYTEGRFVYNYS 673 Query: 318 KLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 +++ + K G+T + + A + + D K+ + Sbjct: 674 QIVSPLTADMSDLKNKVSALRHNGATRADLGFKHASTVMSGARTDA----------KRVV 723 Query: 372 VLLTDGENTQDNE-------EGIAICNKAKSQGIRIMTIAFS--VNKTQQEKARYFLSNC 422 + TDG T+ ++ + K G + +I N + E+ + Sbjct: 724 IFFTDGTPTKVSDFDKDVANSAVTYAKSLKDSGATVYSIGVFDGANPSSIEENQKNQFMN 783 Query: 423 ASPNSFFEANSTHEL 437 A +++ A + +L Sbjct: 784 AVSSNYPHATAYDKL 798 >gi|254820233|ref|ZP_05225234.1| hypothetical protein MintA_09911 [Mycobacterium intracellulare ATCC 13950] Length = 339 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 58/201 (28%), Gaps = 33/201 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + N +G F + R VK+ Sbjct: 122 RLAAAKEAGKQFADQLTPAIN------LGLVEFAANATLLVPPT-----TNRGAVKSGID 170 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-----QDNE 384 TA + + TA I + V + IVL +DG + Sbjct: 171 SLQPAPKTATGEGIFTALQAIATVG--SVMGGGEGPPPAR-IVLESDGAENVPLDPNAPQ 227 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQ----------QEKARYFLSNC-ASPNSFFEANS 433 AK QG++I TI+F + C + F A+S Sbjct: 228 GAFTAARAAKGQGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGEAFHADS 287 Query: 434 THELNKIF---RDRIGNEIFE 451 L ++ + +IG E + Sbjct: 288 LDSLKNVYTTLQRQIGYETVK 308 >gi|61557272|ref|NP_001013220.1| chloride channel calcium activated 2 [Rattus norvegicus] gi|38175219|dbj|BAD01114.1| Ca(2+)-activated chloride channel [Rattus norvegicus] Length = 903 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 75/256 (29%), Gaps = 28/256 (10%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + N + + S + + + ++ + Sbjct: 243 LNSVVEFCTEKTHNTEAPNLQNKICNGRSTWDVIKESADFQQAPPMRGTEAPPPPTFSLL 302 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D S + + A + I + +++ +G F+ Sbjct: 303 KSRQRVICLVLDKSGSMDTEDRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSTAQI 357 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + T + + +G T+I ++ + I SS++ Sbjct: 358 QNYLI-KITNTGDYKKITGNLPQQAVGGTSICRGLEAGFQAITSSDQSTSGSE------- 409 Query: 369 KYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCASP 425 IVLLTDGE ++ I+ C K G I TIA + AR LS+ Sbjct: 410 --IVLLTDGE-----DDLISSCFEVVKHSGAVIHTIAL-----GPKAARELETLSDMTGG 457 Query: 426 NSFFEANSTHELNKIF 441 F+ + L F Sbjct: 458 LRFYANKDVNSLMDAF 473 >gi|325268974|ref|ZP_08135595.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] gi|324988595|gb|EGC20557.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] Length = 330 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 41/111 (36%), Gaps = 14/111 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + T I A+Q A ++ Sbjct: 131 KIGLIVFAGEAFVQLPITSDYVSAKMFLDNINP-SLIGTQGTDIGKALQLAANSFT---- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 N +A K I+L+TDGE+ + E A+ +A+S+GI++ + Sbjct: 186 -------PNSKAGKAIILITDGEDNEGGAE--AMAKQARSKGIKVFILGIG 227 >gi|195999034|ref|XP_002109385.1| hypothetical protein TRIADDRAFT_53363 [Trichoplax adhaerens] gi|190587509|gb|EDV27551.1| hypothetical protein TRIADDRAFT_53363 [Trichoplax adhaerens] Length = 356 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 47/299 (15%), Positives = 95/299 (31%), Gaps = 40/299 (13%) Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL-FLRSMGIKSWLIQTKAE 163 N + ++ V+ + S+ Q S+ Y L NP++L F + + K L Sbjct: 86 NASLADIIKFVQSAKSNIESGYSSKQAQSSTSYQLCCNPINLRFNKELNDKINLSSPCIT 145 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + + + + + + +F+ ++ + G + P T + Y N Sbjct: 146 FPVETTQSYIPNSLKLAYRQNFADNLSVKWQYFAGADNIFYQYP---TTQRYCKTN---- 198 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 Y +Y+ P L S K A+ Sbjct: 199 --------YTTETKFKQWYVNAASPSSKRLVLVLDRSGSMSGDRFLKVKEAATAV----- 245 Query: 284 SIKKIDNVNDTVRMGATFFNDRVI-----SDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 + + ++ +G F+D + + S + I + GST Sbjct: 246 -LDSLGPNDE---IGVIAFDDEIRIHGGCKVTTVSPATPQSIIFLKDFINNKIQPEFGST 301 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 A++ A+D + ++ + N IV LTDG +E I + K++ Sbjct: 302 GYVPALKHAFDMLSTNMTSKAKTKTN------LIVFLTDG---HPDEPESQILDVIKNR 351 >gi|295399398|ref|ZP_06809380.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294978864|gb|EFG54460.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 929 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 45/132 (34%), Gaps = 18/132 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P F S S+ K + AL S + K N ND R F+D V Sbjct: 78 PPIDVVFVFDVSGSMVMPSLKLDSAKYALQSAVDYFKANANPND--RFALVPFSDDVQYS 135 Query: 310 P--SFSWGVHKLIRT--IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F G + + + + T A G T A+Q A N+ Sbjct: 136 KVVPFPSGTYDVKQHLNWIATVANSLRANGGTNYTQALQQA------------QSFFNDP 183 Query: 366 EAKKYIVLLTDG 377 KKYI+ LTDG Sbjct: 184 ARKKYIIFLTDG 195 >gi|291523143|emb|CBK81436.1| fibro-slime domain [Coprococcus catus GD/7] Length = 1745 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 49/412 (11%), Positives = 122/412 (29%), Gaps = 59/412 (14%) Query: 40 LVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLI--RN 97 V ++ A + A + S + ++ + + FP + Sbjct: 641 GVLMLDIGGIHDAHSGSINFATGAITYDSAVSGTTIKAQFQAAGVFPDGSAWDDSKVSEY 700 Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 F N K T + + + K + + + Y++ N ++ ++ + Sbjct: 701 FNGNTLKYITTHNFKMFYMERGAGASNLKMKFNLEVIPTYEVNFNKVTTQGAALAGAEFE 760 Query: 158 IQTKAEAETV-SRSYHKEHGVSIQW---------------VIDFSRSMLDYQRDSEGQPL 201 I+ +++ V + + + VS++ + S + + Sbjct: 761 IRDDSDSSKVYNVTSDSDGRVSVRLHEGTYTMTETQTPTGYLAASGTWKITVNADGTYTI 820 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK-----SLYYMLYPGPLDPSLSEE 256 G+P D+ S K D + + K Y + G Sbjct: 821 TKNGRPIDKGSDSVYKIVNKGQHEDAEANLTTSKTVKVTDYNKREYEITLGASTSGREAG 880 Query: 257 HFVDSSSLRHVIKK--------KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 ++S+ V+ + + +A + I ++K + +FN Sbjct: 881 TEAKAASVVLVLDRSGSMGADGMTALVNAADTFIDTLKTASPDSQV---AVVYFNGTQDE 937 Query: 309 DPSFS-------WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 D + + + +++I + + G T + DA++ A + + Sbjct: 938 DDNTTTSKNFTKLNTDEKVKSIKDFLSNNGYSYGGTPMGDALEKAKGLLDADQTGN---- 993 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAF 405 +KY++ TDG +++ C I T+ + Sbjct: 994 ------QKYVLFFTDGLPGHSSDDAFNCMVANSAVNCATDIKANATIYTVGY 1039 >gi|271963053|ref|YP_003337249.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] gi|270506228|gb|ACZ84506.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] Length = 605 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/258 (13%), Positives = 82/258 (31%), Gaps = 37/258 (14%) Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML-YPGPLDPSLSEEHFVDSSS 263 G+ ++ + +P ++ S + L P + + + + Sbjct: 369 GKAGKLITEANGLLPAEPRTALSPPAPNVLDKVLSSWADLRKPANVLMVIDVSGSMGAGV 428 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP------SFSWGVH 317 K L + A + + D ++G F+ + + V Sbjct: 429 PDTGRSKLDLAKQAAINALPQFGPHD------KVGLWMFSTKRDGEKDHLELAPLD-TVD 481 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R ++T G T + D AY + + E ++ LTDG Sbjct: 482 AAQRKTLRTRLDGLTPDGGTGLYDTALAAYQHVRDRHSGEAINA---------VIFLTDG 532 Query: 378 ENTQDNEEGIAI------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFF 429 +N +N + A + +R+ TIA+ Q+ L + + + + Sbjct: 533 KNEDNNSLSLENLLPDLRAESA-EESVRMFTIAY-----GQDADLGVLKQISETTNAAAY 586 Query: 430 EANSTHELNKIFRDRIGN 447 ++ + ++++F + N Sbjct: 587 DSRESGSIDQVFTAVVSN 604 >gi|258543794|ref|ZP_05704028.1| conserved hypothetical protein [Cardiobacterium hominis ATCC 15826] gi|258520969|gb|EEV89828.1| conserved hypothetical protein [Cardiobacterium hominis ATCC 15826] Length = 1128 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 52/358 (14%), Positives = 98/358 (27%), Gaps = 85/358 (23%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + + ++E++T S K + + +D S SML+ + + + Sbjct: 40 FANRPLHLQSESKTTSAGGVKPN---VMLFLDDSGSMLENANTGSYRIPDLWNTYLGNCT 96 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + ++ + S + LYY + S + Sbjct: 97 YNQNNLLISRNLLPGNGQQTNPSMGRCLYYYNADIIYYRNKGLPPGPKSFLRPAAQVRMD 156 Query: 273 LVRDALASVIRSIKKIDNVNDTVRM-----------GATFFNDRVISDPSFSWGVHKLIR 321 +A+ V+ K D+VN + G F++ + S K + Sbjct: 157 ANINAINEVLT--KTGDSVNWHLLTLWGSEFRHLNGGFYKFDNIGTASVGLS--AEKARK 212 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + I A + + + H++K K YIV ++DGE Sbjct: 213 LVNNMSPIAGTP------------ATERYLKAANVLDHQIK-YRCQKNYIVFMSDGEANG 259 Query: 382 D--------------------------------------------------NEEGIAICN 391 + EG + + Sbjct: 260 GGFPWPGGVYGTQPQWWSQHDSSGPNSGDPGKGVSVFSSKLYNMDMRVGGTDVEGGSW-D 318 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGN 447 K I TI + R +L N A P ++F AN+ EL F + Sbjct: 319 DPKYPKQNIETITIGYGNGLTPQGRNYLKNAAQPSEGAYF-ANNASELADAFLKALAK 375 >gi|62001366|gb|AAX58385.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKALTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|88801582|ref|ZP_01117110.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] gi|88782240|gb|EAR13417.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] Length = 330 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 43/117 (36%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + H ++ D TAIN+A++ A + + Sbjct: 112 RVGIIVYAGNSYPLLPITTD-HAAANMFLQNANPDMVSSQGTAINEALELAKTYYNNDEQ 170 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 +++++L+DGE+ Q+ E + + G++I TI + Sbjct: 171 TN-----------RFLIILSDGEDHQE--ETKQVAQNLANNGVKIYTIGVGTARGGP 214 >gi|309266960|ref|XP_003086909.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] gi|309271570|ref|XP_003085348.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] Length = 1321 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 50/344 (14%), Positives = 96/344 (27%), Gaps = 28/344 (8%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + T I + NF + + + + ++ + + V L Sbjct: 108 EKAYQQGKTAAHVGIRDRESENFRISTMLAAGTKATFALAYEELLQRHQGRYQLVVGLRP 167 Query: 136 RYDL-LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + LN GI I ++ + + E + I + + Sbjct: 168 GQLVRKLNVEITVSERTGIAYVHIPPLRTSQVCTNNQTSEGDLPPSTRIQRGETCVHITF 227 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + F + + E + + + P L P Sbjct: 228 SPTLRDQSAFSSSGIMADFTIHYD-----VSMEDIIGDVQVYGGYFIHYFAPRGLQPLEK 282 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 FV S K + A+ +++ ++ D FN SD W Sbjct: 283 NVVFVIDVSGSMFGTKLQQTKKAMDTILSDLQASD-----------SFNIITFSDTVNIW 331 Query: 315 GVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 I+ V K + G T IN A+ A + SN++ Sbjct: 332 KAEGSIQATVQNIHSAKNYVSRMEADGWTDINAALLAAASVLNHSNQEPGKGRGVGQIP- 390 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNK 409 I+ LTDGE T I + + + + ++AF + Sbjct: 391 -LIMFLTDGEPTAGETTPSVILSNIRQALAHRVSLFSLAFGDDA 433 >gi|225703035|ref|NP_795896.4| integrin alpha-11 precursor [Mus musculus] Length = 1188 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 59/164 (35%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + + + ++ ++ +V A + G T A + Sbjct: 199 IQVGIVQYGEDAVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|148694080|gb|EDL26027.1| integrin, alpha 11 [Mus musculus] Length = 1172 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 59/164 (35%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + + + ++ ++ +V A + G T A + Sbjct: 176 IQVGIVQYGEDAVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 227 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 228 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGI 285 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 286 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 328 >gi|32394646|gb|AAM62130.1| a11 integrin [Mus musculus] Length = 1188 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 59/164 (35%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + + + ++ ++ +V A + G T A + Sbjct: 199 IQVGIVQYGEDAVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|48428278|sp|P61622|ITA11_MOUSE RecName: Full=Integrin alpha-11; Flags: Precursor gi|35193068|gb|AAH58716.1| Integrin alpha 11 [Mus musculus] Length = 1188 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 59/164 (35%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +++G + + + + ++ ++ +V A + G T A + Sbjct: 199 IQVGIVQYGEDAVHEFHL--NDYRSVKDVV-EAASHIEQRGGTETRTAFGIEFAR----- 250 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E + AKK ++++TDGE + D+ + + +++ + +A + Sbjct: 251 -SEAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGI 308 Query: 412 -QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 E + AS FF L I D +G+ IF Sbjct: 309 NPETFLNEIKYIASDPDDKHFFNVTDEAALKDI-VDALGDRIFS 351 >gi|134093078|gb|ABO52938.1| matrilin 4 isoform 1 precursor [Colobus guereza] Length = 581 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 57/180 (31%), Gaps = 20/180 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + +R L ++R + N R+G ++ +V S Sbjct: 36 VFVIDSSRSVRPFEFETMRQFLVGLVRGLNVGPNA---TRVGVIQYSSQVQSVFPLR--A 90 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + T A+Q A + S E + V++TD Sbjct: 91 FSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVPRVAVIVTD 147 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + +A+++GI I + Q L ASP F S Sbjct: 148 G---RPQDRVAEVAAQARARGIEIYAVGV------QRADVGSLRAMASPPLDEHVFLVES 198 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 60/197 (30%), Gaps = 20/197 (10%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLS---EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 Y C + D + + S + LV+ + ++ + Sbjct: 318 SYRCLCPEGRQLQADGKSCDRCREGHVDLVLLVDGSKSVRPQNFELVKRFVNQIVDFLDV 377 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + R+G F+ RV ++ G + + + E T A++ Sbjct: 378 SP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMV 432 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + S + R N + +V TDG + ++ +AK +GI + + Sbjct: 433 EHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAARAKEEGIVMYAVGVGK 486 Query: 408 NKTQQEKARYFLSNCAS 424 + L AS Sbjct: 487 AVEAE------LREIAS 497 >gi|332832625|ref|XP_520182.3| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Pan troglodytes] Length = 3571 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 47/152 (30%), Gaps = 22/152 (14%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS-----DPSFS 313 DSSS+ V + L + ++ + R+ F+ + S Sbjct: 89 DDSSSVGEVNFRSEL--MFVRKLLSDFPVVPTA---TRVAIVTFSSKNYVVPRVDYISTR 143 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +++ G T A Q A ++ + E+ K I L Sbjct: 144 RARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS----------TKVIFL 193 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 +TDG + + IA + G+ I T Sbjct: 194 ITDGYSNGGDPRPIAA--SLRDSGVEIFTFGI 223 >gi|158257430|dbj|BAF84688.1| unnamed protein product [Homo sapiens] Length = 914 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 54/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A I Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSASTVI------- 396 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 R K + + IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 397 --RKKYPTDGSE-IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 448 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 449 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 486 >gi|156120138|ref|NP_001095285.1| complement C2 [Sus scrofa] gi|148724911|emb|CAN87699.1| complement component 2 [Sus scrofa] Length = 752 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 45/139 (32%), Gaps = 10/139 (7%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V +++ ++ D T I +A+ + Y + + ++ E + I+ Sbjct: 314 SRDVTEVVHSLENINYKDHENGTGTNIYEALNSVYIMMNNQMQNLGMNTMAWQEIRHAII 373 Query: 373 LLTDGENTQDNEEGIAICNKAK---------SQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 LLTDG++ + K + I I + + S Sbjct: 374 LLTDGKSNMGG-SPKPAVDNIKEILNIKEKRKDYLDIYAIGVGNVEVDWRELNELGSKKD 432 Query: 424 SPNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 GERHAFILKDTEALSQVFE 451 >gi|38455778|gb|AAR20890.1| complement C2 [Sus scrofa] Length = 734 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 45/139 (32%), Gaps = 10/139 (7%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V +++ ++ D T I +A+ + Y + + ++ E + I+ Sbjct: 314 SRDVTEVVHSLENINYKDHENGTGTNIYEALNSVYIMMNNQMQNLGMNTMAWQEIRHAII 373 Query: 373 LLTDGENTQDNEEGIAICNKAK---------SQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 LLTDG++ + K + I I + + S Sbjct: 374 LLTDGKSNMGG-SPKPAVDNIKEILNIKEKRKDYLDIYAIGVGNVEVDWRELNELGSKKD 432 Query: 424 SPNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 GERHAFILKDTEALSQVFE 451 >gi|87310828|ref|ZP_01092954.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] gi|87286343|gb|EAQ78251.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] Length = 788 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 61/180 (33%), Gaps = 20/180 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + FV S +K ++A V+ ++ + D N ++ V S Sbjct: 291 KTKKTVIFVVDRSGSMSGEKIEQAKEAAKFVLNNLNEGDLFN------IIAYDSDVESFE 344 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + R F + GST I+ A+ A + K++ Y Sbjct: 345 PELQKLDDKTREKALGFVDNLYAGGSTNIDGALAKAMGML-----------KDDKRPS-Y 392 Query: 371 IVLLTDGENTQDNEEGIAICNKAK-SQGIRIMTIAFSVNKTQQEKARYFLSN-CASPNSF 428 ++ LTDG T + I + AK +R I+F V + LS C + + Sbjct: 393 MLFLTDGLPTHGEQNEAKIVDNAKQKNDVRARVISFGVGYDVNSRLLDRLSRECFGQSEY 452 >gi|71983551|ref|NP_509176.2| hypothetical protein C16E9.1 [Caenorhabditis elegans] gi|34366001|gb|AAC47957.2| Hypothetical protein C16E9.1 [Caenorhabditis elegans] Length = 565 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 53/390 (13%), Positives = 120/390 (30%), Gaps = 31/390 (7%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 AI+ +S ++ ++ + F + Sbjct: 199 AIVYSSEKKQRTKIKLGEHKDRGSLVKAVDELPFFSGITATGQALKFAANHTEGRRENFT 258 Query: 120 VE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + + +Y ++ S L P S + I + E E ++ + Sbjct: 259 LNYVILTDGYSYDLIESGARVLREVPNSAIY---AVSIGEIFLRKELEMITGNPDNVLTG 315 Query: 178 SI-QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 S+ + + D + + + G S Q+ + + + Sbjct: 316 SMSYGTLVKRLKLCDARIKAATLKDSNPGLVRPGEFLSDRFQHRSRLTANLEAKKHTEDF 375 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 K+ + S ++ K+ + ++ + DN R Sbjct: 376 VKTPEKRGPVKDCIYDIGIIFDSSGSLEKNFQKQLAFAKQ----LVEQMPISDNA---TR 428 Query: 297 MGATFF--NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +G F +V +FS +L I ++ G+T N A++ + + Sbjct: 429 VGIVQFAGKTKVRVLANFSQNKSQLKTIIDRS----PFYSGTTFTNQALKK-MAALYEES 483 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + ++K ++L TDG + +D EG KSQG+ + T+ S +K+ Sbjct: 484 KRPNAKLK--------LMLFTDGYSAEDTSEGE---EALKSQGVVVYTVGISTDKSAGLN 532 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + S +++++ +L K F Sbjct: 533 MKELRGMATSSEHYYDSSDFADLLKHFPSS 562 >gi|254881903|ref|ZP_05254613.1| von Willebrand factor [Bacteroides sp. 4_3_47FAA] gi|319641094|ref|ZP_07995798.1| von Willebrand factor [Bacteroides sp. 3_1_40A] gi|254834696|gb|EET15005.1| von Willebrand factor [Bacteroides sp. 4_3_47FAA] gi|317387338|gb|EFV68213.1| von Willebrand factor [Bacteroides sp. 3_1_40A] Length = 212 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 60/186 (32%), Gaps = 17/186 (9%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + + +S + V++ + ++I +++ +T + FN + Sbjct: 7 YLLLDTSGSMYGEPIEAVKNGVQTLISTLRSDPYALETAYISIITFNSSAQQVTPLT--- 63 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + G TA+ A++ I S +K + +I +TD Sbjct: 64 -----ELAAFQQPNIDASGCTALGGALELLSQKIDSEITKTTAEVKGDWRPLIFI--MTD 116 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 G T D G+A K + G+ + Q + L + + Sbjct: 117 GVPTDDISRGLAEFKK-RKCGMVV------ACAAGQGASTDTLKQITENVVQLDTADSAT 169 Query: 437 LNKIFR 442 + F+ Sbjct: 170 IKAFFK 175 >gi|118468162|ref|YP_887464.1| hypothetical protein MSMEG_3149 [Mycobacterium smegmatis str. MC2 155] gi|118169449|gb|ABK70345.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length = 327 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 58/183 (31%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + + + K D TA + + TA Sbjct: 122 ADQLTPGINLGLIAYAGTATVLVQPTTNREATKNGLDKLQLADR-----TATGEGIFTAL 176 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL++DG+ N + + AK QG+ I T Sbjct: 177 QAIATVG---AVIGGGDEPPPARIVLMSDGKETVPSNPDNPKGAFTAARTAKDQGVPIST 233 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 ++F E L A S F A+S +L +F + +IG E Sbjct: 234 VSFGTPYGYVEINDQRQPVPVDDEMLEKIAQLSGGDAFTASSLEQLKAVFTSLQQQIGYE 293 Query: 449 IFE 451 + Sbjct: 294 TIK 296 >gi|108763557|ref|YP_631764.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108467437|gb|ABF92622.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 592 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 66/192 (34%), Gaps = 24/192 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 VD+S H K L R+A+ ++++ + D V G N R + P+ + Sbjct: 249 LVDTSGSMHSEDKLPLAREAIKVAVKNLNENDTVAIVTYAG----NTRDVLPPTPATDAK 304 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + G TA+ M+ AY + V LTDG Sbjct: 305 SIHAALDS-----LTAGGGTAMGSGMELAYRHAVKKASGSVVSRVVV---------LTDG 350 Query: 378 ENTQD-NEEGIAICNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + N A+ + +G+ + T+ F + + + + F +S Sbjct: 351 DANIGRNVSANAMLDSIHKYTAEGVTLTTVGFGMGNYRDDLMEKLADK--GNGNCFYVDS 408 Query: 434 THELNKIFRDRI 445 E K+F ++ Sbjct: 409 LREAKKVFETQL 420 >gi|126723120|ref|NP_001075478.1| inter-alpha-trypsin inhibitor heavy chain4 [Oryctolagus cuniculus] gi|11041722|dbj|BAB17303.1| inter-alpha-trypsin inhibitor heavy chain4 [Oryctolagus cuniculus] Length = 951 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 72/221 (32%), Gaps = 36/221 (16%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L F+ S + +K R+AL ++ + D R Sbjct: 257 NGYFVHHFAPEGLPTMPKTVIFIVDQSGSMLGRKIQQTREALLKILDDLNPRD------R 310 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTI-----VKTFAIDENEMGSTAINDAMQTAYDTII 351 F+ + W + ++ +++A G T IN+A+ A + Sbjct: 311 FNLILFSS-----SATPWKTSLVQASLETVSEARSYAGAIQAAGGTDINEALLLAVSLLD 365 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 E+ + ++LLTDGE TQ I + + + F N Sbjct: 366 HEQEELRAGSVS------LLILLTDGEPTQGKTNPTEIQRNVREAIGGRYSLFCLGFGFN 419 Query: 409 KTQQEKARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N + ++++ +L +++ Sbjct: 420 VNYP-----FLEKLALDNGGLARRVYEDSDAALQLQDFYQE 455 >gi|171741586|ref|ZP_02917393.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] gi|171277200|gb|EDT44861.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] Length = 1256 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 69/195 (35%), Gaps = 35/195 (17%) Query: 268 IKKKHLVRDALASVIRSIKKID--NVNDTVRMGATFF--------NDRVISDPSFSWGVH 317 IKK ++ A+ + + +I+ + ++ +R+G F + ++ F + Sbjct: 614 IKKMTALKSAVNNFLGEAAEINEQSGSELIRVGLVKFAGKESSKVGNETYTEGRFVYNYS 673 Query: 318 KLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 +++ + K G+T + + A + + D K+ + Sbjct: 674 QIVSPLTADMSDLKNKVSALRHNGATRADLGFKHASTVMSGARTDA----------KRVV 723 Query: 372 VLLTDGENTQDNE-------EGIAICNKAKSQGIRIMTIAFS--VNKTQQEKARYFLSNC 422 + TDG T+ ++ + K G + +I N + E+ + Sbjct: 724 IFFTDGTPTKVSDFDKDVANSAVTYAKSLKDSGATVYSIGVFDGANPSSIEEDQKNQFMN 783 Query: 423 ASPNSFFEANSTHEL 437 A +++ A + +L Sbjct: 784 AVSSNYPHATAYDKL 798 >gi|73962941|ref|XP_547762.2| PREDICTED: similar to coagulation factor C homolog, cochlin precursor [Canis familiaris] Length = 847 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 67/191 (35%), Gaps = 19/191 (9%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S M+ S++ +D SS HL+ + ++++ ++ + D ++ Sbjct: 649 SHEQMMCSKTCYNSVNIAFLIDGSSSVGD-SNFHLMLEFVSNIAKTFEISDIGA---KIA 704 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 A F FS+ + ++ G TA DA+ + D Sbjct: 705 AVQFT--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDS- 761 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K ++V++TDG++ D A A GI I ++ + + + Sbjct: 762 -------PNKNFLVIVTDGQSYDDVRGPAA---AAHDAGITIFSVGVAWAPL--DDLKDM 809 Query: 419 LSNCASPNSFF 429 S ++FF Sbjct: 810 ASKPKESHAFF 820 >gi|116622066|ref|YP_824222.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225228|gb|ABJ83937.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 309 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 63/193 (32%), Gaps = 32/193 (16%) Query: 270 KKHLVRDALASVIRSIKKID-----NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 K V A + I+S D N N+ V +G + ++ + Sbjct: 93 KMASVIAAARTFIQSSSPEDQMFVVNFNEDVTLGLSTEIPFTNRPEDLTYAISH------ 146 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 G TA+ DA+ A + + + D KK +V+++DG + Sbjct: 147 ------SPPTGKTALYDAVWKAREWVARGSRD-----------KKVLVVVSDGGDNASTH 189 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFR 442 I A I++ TI + +K L A+ F + E+ I Sbjct: 190 TLSEILEAANKSNIQVFTIGIF-DPDDPDKNPGVLRQLARATGGEAFVPDELSEVVAI-C 247 Query: 443 DRIGNEIFERVIR 455 + I +I + Sbjct: 248 ESIAKDIRSQYTL 260 >gi|328954590|ref|YP_004371924.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] gi|328454914|gb|AEB10743.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] Length = 543 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 67/184 (36%), Gaps = 24/184 (13%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 R A A + ++K D+V F V F+ H++ + A ++ Sbjct: 107 ARKAAAIFLDELEKDDHVA------LITFGQGVYHLSDFAAKKHEVREQLQHLEAKEQWT 160 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 A +AM+ A + ++LLTDG++ A+ ++ Sbjct: 161 WLYQATLEAMERAVQAPTTRAA---------------VILLTDGKDEGSPVSEEAVLDRI 205 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 K + I + F +K Q + + S AS +F +L +++ + + + Sbjct: 206 KGAQVPIYAMGFG-SKAQVDYLQKVAS--ASQGAFLSTPQAADLTNLYQTVLDYLKNQYI 262 Query: 454 IRIT 457 +R T Sbjct: 263 LRFT 266 >gi|327270782|ref|XP_003220167.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 904 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 40/276 (14%), Positives = 84/276 (30%), Gaps = 34/276 (12%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S+ + + N + +Y S + + S S + Sbjct: 238 SLSSVTQFCDESNHNIKAPNMQNKMCNYRSTWDVIMDSTDFASSSPRSAPPPRPTISLLQ 297 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 D +D S + + + +R A + I + + G FN + + Sbjct: 298 TQD--RVLCLVLDISGSMNGFDRIYRLRQAGEQFLLQILETGSWA-----GIVVFNSQAL 350 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + +R + + + G T I ++ + + Sbjct: 351 TKTYLKQITGDSVRQTLSAY-LPTAAGGGTNICSGIREGFQVFLKKYPSTEGCE------ 403 Query: 368 KKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ----EKARYFLSNC 422 IVLLTDGE + G++ C + + G I TIA + ++ L Sbjct: 404 ---IVLLTDGE-----DAGVSSCFAEVQRSGSIIHTIALGPSAAKELEMLADMTGGLKFS 455 Query: 423 ASPNSFFEANSTHELNKIFR--DRIGNEIFERVIRI 456 A+ + ++ L F +I ++ I++ Sbjct: 456 ATDS-----LDSNGLIDAFSGISSGSGDISQQSIQL 486 >gi|313235671|emb|CBY11123.1| unnamed protein product [Oikopleura dioica] Length = 362 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 38/282 (13%), Positives = 82/282 (29%), Gaps = 29/282 (10%) Query: 185 FSRSMLDYQRDSEGQPLN--CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 FSR + D + P N F +T + V Y++ + Sbjct: 28 FSRHLDDVTEYCDNNPSNQKTFHNRDAKTRHNEVCDTKSVWEVIRDHPDYLIGAGEQPVD 87 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL------------VRDALASVIRSIKKIDN 290 D + V + + + V+ A + ++ + + Sbjct: 88 DFDSVNADLWSKTQFTVLRQNNTKTVLAIDISTSMGIGKRWVNVKKAA---TQYLEAVPH 144 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 G F+ + +K T+I A++ + + + Sbjct: 145 DAHV---GIVLFHRVAYVKVPLKQIENADSSDFLKNRLEKVELGVGTSIASALKVSMNVL 201 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +S + N I+LL+DG+ + D + + + + TIAF + + Sbjct: 202 QTSGPKDSRDSGGN------IILLSDGQESHDPRIDDELIQELIDNKVTVNTIAFGTDAS 255 Query: 411 QQEKARYFLSNCASPNSFFEANSTHE-LNKIFRDRIGNEIFE 451 Q K S + F + + + L F + + Sbjct: 256 Q--KLEEVSSRTKGSSYFSDPENPNSILQDAFIAELQKDGSN 295 >gi|326789712|ref|YP_004307533.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540476|gb|ADZ82335.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 404 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 63/163 (38%), Gaps = 18/163 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID--ENEMGSTAINDAM 343 IDN+ R+ F+D I F K + +VK +N+ G T + D + Sbjct: 140 SLIDNLEGNRRIAFMTFDDSPILQFDFMEATTKEQKEVVKAKIASYQQNDDGQTGVRDMI 199 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-AICNKAKSQGIRIMT 402 AY+ I +++++ H +++++DG + D+ I A+ + I I T Sbjct: 200 NEAYELIQNNSKN--HSGS--------LIMISDGAPSDDSASNIPALVSNYVQNNIPIYT 249 Query: 403 IAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 I A +L + A + + + T + F Sbjct: 250 IGMM---YGDNSAEQYLIDIANLTGGQHYSTSDTTMIAGAFGQ 289 >gi|284052693|ref|ZP_06382903.1| von Willebrand factor type A domain-containing protein [Arthrospira platensis str. Paraca] gi|291571888|dbj|BAI94160.1| von Willebrand factor type A [Arthrospira platensis NIES-39] Length = 541 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 51/299 (17%), Positives = 96/299 (32%), Gaps = 44/299 (14%) Query: 124 PRKSAYQVVLSSRYDL-LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 P Y ++ + + L NPLS F S + T A V R + Sbjct: 70 PNTETYDLIDENNFQLVAANPLSTF-------SIDVDT-ASYSNVRRFIN---------- 111 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 R +D R E + P + + +S + + Sbjct: 112 -QRQRPPIDAVRIEELINYFSYDYPQPQGEEPFSVTTEVSSAPWNPQHQLVHIGLQGKTL 170 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + P + +D S + + L+++ ++ + + D V V GA Sbjct: 171 AIEELPPS---NLVFLLDVSGSMNQPNRLPLLKEGFKLLVDQLSEQDTVAIAVYAGAAG- 226 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 V+ P+ K+I I GSTA + ++ AY+ + K Sbjct: 227 ---VVLPPTPGNEKQKIIAAID-----GLQAQGSTAGGEGIKLAYELATRMLSEG----K 274 Query: 363 NNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 NN ++L TDG+ + E + + + +GI + + F + + K Sbjct: 275 NNR-----VILATDGDFNVGVSSDAELVRLIESYRDRGIYLTVLGFGMGNYKDSKMEKL 328 >gi|282900569|ref|ZP_06308511.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194369|gb|EFA69324.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 418 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 56/164 (34%), Gaps = 26/164 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A ++ ++ D R+ FN R + + R +K Sbjct: 59 VENVKRAAWLLVDKLRDQD------RLSIVVFNHRAEV---LLSNQNVVDRDHIKQQINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-IAI 389 + G T+I++ ++ + + +D + + LLTDGEN + + Sbjct: 110 LSANGGTSIDEGLRLGIEELAKGRKDTISQAF----------LLTDGENEHGDNNRCLKF 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQ------QEKARYFLSNCASPNS 427 A + + T+ F N Q + LS+ P+ Sbjct: 160 AQLAADYNLTVNTLGFGNNWNQHILEKISDAGLGSLSHIEHPDQ 203 >gi|114586163|ref|XP_526141.2| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Pan troglodytes] Length = 891 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 66/217 (30%), Gaps = 11/217 (5%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + +L + + S G+ + C+ Sbjct: 123 ALRRAGILLYAIGVRDAVWAELREIASSPQENFTSFVPNFSGLSNLAQKLRQELCDMLAK 182 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + P+ E D L + + I +D +D VR+G Sbjct: 183 AAPHVDHVSPACREAALADIVFLVDNSTSIGPQNFQKVKNFLYSVILGLDISSDRVRVGL 242 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +ND + H L I++ G T A++ + E+ Sbjct: 243 AQYNDNIYPAFQL--NQHPLKSMILEQIQNLPYRTGGTNTGSALE--FIRTNYLTEESGS 298 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 R K+ + + ++L+TD E N+E + ++ K Sbjct: 299 RAKDRVP--QIVILVTDRE---SNDEVQEVADRLKED 330 >gi|149178272|ref|ZP_01856865.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] gi|148842921|gb|EDL57291.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] Length = 169 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 44/124 (35%), Gaps = 15/124 (12%) Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK-YIVLLTDGENTQDNEEGIAICN 391 G T + + S E + + + +V+L+DG + + I Sbjct: 46 GGGGTNMAPG-------LFISREILERPIFPSQIYLRPVVVVLSDGLTSHP-AKTSEIAT 97 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + K I+T+AF + + Y +S S F+ + +L F +G + Sbjct: 98 QLKKD-ADIVTVAFGDDADEP----YLISLATSSEHFYHCRTGTDLRAFFAS-VGTTLSV 151 Query: 452 RVIR 455 + R Sbjct: 152 SLQR 155 >gi|332185455|ref|ZP_08387203.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] gi|332014433|gb|EGI56490.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] Length = 530 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 58/179 (32%), Gaps = 32/179 (17%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 ++ AL + ++ + G +D+ ++ A A++ + + + RA+ + Sbjct: 1 MLWALFLIPLVALVGSGIDLGTRYVTRKQMQIACDAAVLAGRRAMTNGIVDDGVRAEAT- 59 Query: 84 TFPKQKIEEYLIRNFENNLKKN--FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 ++ NF+ + + FT T V +N + Sbjct: 60 --------KFFNFNFQQGMFGSKPFTPSISSATTSKTTVVINAATTV------------- 98 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 + +R G + A + I +V+D + SM D S+ Q Sbjct: 99 --PTSLMRIFGSDELPVSVSCNASQDFVNTD------IVFVLDTTGSMRDKATSSDSQT 149 >gi|148675553|gb|EDL07500.1| mCG120277 [Mus musculus] Length = 1031 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 50/344 (14%), Positives = 96/344 (27%), Gaps = 28/344 (8%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + T I + NF + + + + ++ + + V L Sbjct: 74 EKAYQQGKTAAHVGIRDRESENFRISTMLAAGTKATFALAYEELLQRHQGRYQLVVGLRP 133 Query: 136 RYDL-LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + LN GI I ++ + + E + I + + Sbjct: 134 GQLVRKLNVEITVSERTGIAYVHIPPLRTSQVCTNNQTSEGDLPPSTRIQRGETCVHITF 193 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + F + + E + + + P L P Sbjct: 194 SPTLRDQSAFSSSGIMADFTIHYD-----VSMEDIIGDVQVYGGYFIHYFAPRGLQPLEK 248 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 FV S K + A+ +++ ++ D FN SD W Sbjct: 249 NVVFVIDVSGSMFGTKLQQTKKAMDTILSDLQASD-----------SFNIITFSDTVNIW 297 Query: 315 GVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 I+ V K + G T IN A+ A + SN++ Sbjct: 298 KAEGSIQATVQNIHSAKNYVSRMEADGWTDINAALLAAASVLNHSNQEPGKGRGVGQIP- 356 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNK 409 I+ LTDGE T I + + + + ++AF + Sbjct: 357 -LIMFLTDGEPTAGETTPSVILSNIRQALAHRVSLFSLAFGDDA 399 >gi|332823606|ref|XP_003311226.1| PREDICTED: complement C2 isoform 3 [Pan troglodytes] Length = 538 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 47/152 (30%), Gaps = 9/152 (5%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEV 358 ++ P+ ++ T ++E G T A+ + Y + + Sbjct: 85 YSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAALNSVYLMMNNQMRLLG 144 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKT 410 E + I+LLTDG++ A+ N+ ++ + I I Sbjct: 145 METMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDV 204 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + S F T L+++F Sbjct: 205 DWRELNELGSKKDGERHAFILQDTKALHQVFE 236 >gi|295849303|ref|NP_001171534.1| complement C2 isoform 3 [Homo sapiens] gi|194390502|dbj|BAG60565.1| unnamed protein product [Homo sapiens] Length = 538 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 47/152 (30%), Gaps = 9/152 (5%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEV 358 ++ P+ ++ T ++E G T A+ + Y + + Sbjct: 85 YSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAALNSVYLMMNNQMRLLG 144 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKT 410 E + I+LLTDG++ A+ N+ ++ + I I Sbjct: 145 METMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDV 204 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + S F T L+++F Sbjct: 205 DWRELNELGSKKDGERHAFILQDTKALHQVFE 236 >gi|194384366|dbj|BAG64956.1| unnamed protein product [Homo sapiens] Length = 1266 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 47/152 (30%), Gaps = 9/152 (5%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEV 358 ++ P+ ++ T ++E G T A+ + Y + + Sbjct: 146 YSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAALNSVYLMMNNQMRLLG 205 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKT 410 E + I+LLTDG++ A+ N+ ++ + I I Sbjct: 206 METMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDV 265 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + S F T L+++F Sbjct: 266 DWRELNELGSKKDGERHAFILQDTKALHQVFE 297 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 765 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 819 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T A+Q Y + ++ Sbjct: 820 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 876 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 877 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 934 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 935 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 971 >gi|123283203|emb|CAI17449.2| complement component 2 [Homo sapiens] gi|123857991|emb|CAM25861.1| complement component 2 [Homo sapiens] gi|168983783|emb|CAQ06834.1| complement component 2 [Homo sapiens] gi|168984349|emb|CAI41857.2| complement component 2 [Homo sapiens] gi|168984417|emb|CAQ09273.1| complement component 2 [Homo sapiens] gi|168985078|emb|CAQ07482.1| complement component 2 [Homo sapiens] gi|168985956|emb|CAQ07112.1| complement component 2 [Homo sapiens] Length = 525 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 47/152 (30%), Gaps = 9/152 (5%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEV 358 ++ P+ ++ T ++E G T A+ + Y + + Sbjct: 72 YSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAALNSVYLMMNNQMRLLG 131 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKT 410 E + I+LLTDG++ A+ N+ ++ + I I Sbjct: 132 METMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDV 191 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + S F T L+++F Sbjct: 192 DWRELNELGSKKDGERHAFILQDTKALHQVFE 223 >gi|94968893|ref|YP_590941.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94550943|gb|ABF40867.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 628 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 53/152 (34%), Gaps = 20/152 (13%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +DA ++ + V F++ ++ F+ ++ +I Sbjct: 418 QDAAGEFLQRVLTGPEDLGFV----VGFSNSILMAQDFTHDSKQIAHSIQ-----AFAPS 468 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G TA+ DA+ A + + S E K +++++DGE+ +A+ Sbjct: 469 GGTALWDAVNFAAEKLASHPE--------RQPVAKILIVISDGEDNSSATTAKQAIQRAQ 520 Query: 395 SQGIRIM---TIAFSVNKTQQEKARYFLSNCA 423 S+ + + T+ + + L A Sbjct: 521 SEEVAVYAINTLEITQRSEEPPVGVRALKTLA 552 >gi|56797851|emb|CAF33338.1| matrilin-3a [Danio rerio] Length = 295 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 71/221 (32%), Gaps = 27/221 (12%) Query: 235 SCNKSLYYMLYPGPLDPSLSEE----HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 N + L P D F+ SS + V+ LA ++ ++ Sbjct: 40 RNNGFPHRTLNPAATDSQCRSRPLDLVFIIDSSRSVRPGEFEKVKIFLADMVDTLDVGP- 98 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDT 349 D R+ + V + L + +K + T A++ A D Sbjct: 99 --DATRVAVVNYASTVKIEFLLK---SHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDE 153 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + KN K +++TDG + ++ + A++ GI I + Sbjct: 154 AFTEKSGARPKSKN---ISKVAIIVTDG---RPQDQVEEVSAAARASGIEIYAVGV---- 203 Query: 410 TQQEKARYFLSNCASP--NSFFEANS---THELNKIFRDRI 445 + R ++P + F + +L FR+ + Sbjct: 204 -DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETL 243 >gi|307726382|ref|YP_003909595.1| hypothetical protein BC1003_4370 [Burkholderia sp. CCGE1003] gi|307586907|gb|ADN60304.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1003] Length = 350 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 53/296 (17%), Positives = 92/296 (31%), Gaps = 23/296 (7%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K + G ++TAL + V++G + VD+ R + L+ AA A + + L + Sbjct: 8 KPALSRQRGAVAVMTALCLTVLIGTTALAVDLGRAWVVRNELQNAADAAALAGAGSLGPN 67 Query: 72 LEEV----SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + S + L+ N T K Sbjct: 68 YASPNWTQAQTKAQSAITLNRTEGTSLLTAQVQTGYWNVTGTPAGMQALPVPSPGAYDKP 127 Query: 128 AYQVVLSSRYDLLLNPLSLFLR-SMGIKSWLIQTKAEAETVSRSYHKEH-----GVSIQW 181 A QV +S PLSL L +GI + I A A + Y +S+ Sbjct: 128 AVQVTVSRAAGQNGGPLSLVLAPVLGISTMPISATAVAVISAPGYAGPGALFPTAISLCL 187 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK---SYSSQNGKVGIRDEKLSPYMV---S 235 Y S GQP+N + K +Y G+ + + + Sbjct: 188 -------YTAYWNASTGQPVNDPATGLPQVFKIGSTYQYPGCAAGVGSGQWTSFQYNVQD 240 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + PL ++ ++ ++ S + D L V+ S+ V Sbjct: 241 VTSIRNLIANGNPLPMNIGDKTWIQSGVKNTIYNSVPSPADVLLPVVNSLDTGSPV 296 >gi|226366409|ref|YP_002784192.1| hypothetical protein ROP_70000 [Rhodococcus opacus B4] gi|226244899|dbj|BAH55247.1| hypothetical membrane protein [Rhodococcus opacus B4] Length = 328 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 70/201 (34%), Gaps = 33/201 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +DA S D + + +G F S + R K Sbjct: 111 TRLAAAQDAAKSF------ADGLTPGINLGLVAFAGTASVLVSPT-----TNREATKVAI 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + TA +A+ + ++ S + ++ IVLL+DG EN D Sbjct: 160 DNLQLSERTATGEAI---FTSLQSIDTLAAVLGGSDQAPPARIVLLSDGKQTVPENPDDP 216 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEAN 432 G +AK + + I TI+F + + E L A S SFF A+ Sbjct: 217 RGGFTAARQAKDKDVPISTISFGTSYGKVEIEDERIPVPVDDPSLREIANLSGGSFFTAS 276 Query: 433 STHELNKIF---RDRIGNEIF 450 S EL ++ ++IG E Sbjct: 277 SLEELRDVYDTLEEQIGFETT 297 >gi|219804750|ref|NP_001137338.1| cartilage matrix protein [Bos taurus] gi|296490178|gb|DAA32291.1| matrilin 1, cartilage matrix protein [Bos taurus] Length = 497 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 65/194 (33%), Gaps = 23/194 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + V+ L+ VI S+ N R+G + V + Sbjct: 44 VFVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNA---TRVGLVNYASSVKQEFPLR--A 98 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H +++ + T A+Q A I + D + + K ++++TD Sbjct: 99 HSSKAELLQAVRRIQPLSTGTMTGLAIQFA---ITKALSDAEGGRPRSPDISKVVIVVTD 155 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G + A +A++ GI + I + L AS S Sbjct: 156 GRPQDSVRDVSA---RARAGGIELFAIGVG------RVDKATLQQIASEPQDEHVDYVES 206 Query: 434 ---THELNKIFRDR 444 +L+K F++ Sbjct: 207 YSVIEKLSKKFQEA 220 >gi|226315301|ref|YP_002775197.1| hypothetical protein BBR47_57160 [Brevibacillus brevis NBRC 100599] gi|226098251|dbj|BAH46693.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 597 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 65/171 (38%), Gaps = 24/171 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G + D++ + + + + +K F + T I + A + Sbjct: 77 KVGVVAYTDKIEREKALLEINSEEDKNDIKAFIDSLQKGAYTDIAVGVTEAVKIL----- 131 Query: 356 DEVHRMKNNLEAKKYIVLLTDGEN----------TQDNEEGIAICNKAKSQGIRIMTIAF 405 + R NN IVLL DG N + ++E +AK +G + TI Sbjct: 132 -DAGRNPNNAP---IIVLLADGNNFLNKASSRTQAKSDQELQQAVKEAKDKGYPVYTIGL 187 Query: 406 SVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + + R L A + FFE ++ +L +I + N + +V+ Sbjct: 188 NADG---QLNRTTLQQIAAETNGKFFETSTADKLPQILSEIFANHLKLKVV 235 >gi|62001354|gb|AAX58379.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L + A Q + Sbjct: 1357 SNNRVRLANSAGATAYARAQIN-LGHTNQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATITASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|296232325|ref|XP_002761549.1| PREDICTED: hypothetical protein LOC100408376 [Callithrix jacchus] Length = 912 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 81/210 (38%), Gaps = 35/210 (16%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIQGLTRMPGG--RDSLKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY++++TDG E ++ G+ N+AK GI++ ++A + + + LS Sbjct: 149 ENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEANSTHELNKIFRDRIGNEI 449 A+ +++ F A + + I I Sbjct: 204 ATDHTYRRNFTAADWGQSRDA-EEAISQTI 232 >gi|54302287|ref|YP_132280.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] gi|46915709|emb|CAG22480.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] Length = 436 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 13/79 (16%), Positives = 31/79 (39%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ G I L M V++G G + +DV + L+ +A ++A+ + + Sbjct: 12 RAQKGVVAIFATLAMVVLIGAGALALDVGNLILSKGKLQNLVDSAALSAAKAIDSGSDHA 71 Query: 76 SSRAKNSFTFPKQKIEEYL 94 ++ + I + Sbjct: 72 AAILAGNAAINNNLILDGF 90 >gi|218679029|ref|ZP_03526926.1| hypothetical protein RetlC8_09174 [Rhizobium etli CIAT 894] Length = 151 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 32/72 (44%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + + +LI + G II L+ +L G +D +R L+ A +A++ A Sbjct: 2 TRRLSFFSRLIDNRDGAVAIIVILVAVPLLLAVGASIDFIRAYNNRVDLQSAVDSAVLAA 61 Query: 65 SVPLIQSLEEVS 76 + + E S Sbjct: 62 AAKYKHGMPEAS 73 >gi|189524901|ref|XP_693183.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Danio rerio] Length = 963 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 69/218 (31%), Gaps = 26/218 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S + +K ++AL +++ + + D Sbjct: 318 NGYFVHFFAPEGLPQMPKNVVFVIDRSGSMMGEKMKQTQEALTTILSELHEDD------Y 371 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G F+D + S T K + N T IN + A D + S Sbjct: 372 FGLVTFDDVIESWRPSLSKATPENVTEAKEYVQTINARSMTDINKGILYAVDMLTSEKSA 431 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 M I+LLTDG+ + ++ I ++ + + + F + Sbjct: 432 SFPNMS-------MIILLTDGQPSSGEQDLSKIQENVRNAINGSMSLFCLGF-----GYD 479 Query: 414 KARYFLSNCASPN-----SFFEANSTHELNKIFRDRIG 446 L A N +EA+ + F + + Sbjct: 480 LDYILLDTLAKQNDGLARRVYEASDAALQLQGFYEEVA 517 >gi|332879551|ref|ZP_08447246.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682517|gb|EGJ55419.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 352 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 17/117 (14%), Positives = 39/117 (33%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G + + + + ++T T I A+ A + S Sbjct: 130 KIGLIVYAGEAYTQLPITSDYVSA-KIFLETINPSMITTQGTDIKQAIDLAMKSFTS--- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 N + K I ++TDGE+ + + + A +GI++ + + Sbjct: 186 --------NQDVSKAIFVITDGEDNEG--GAVEMAKAAAEKGIKVYVLGVGSPQGAP 232 >gi|159899681|ref|YP_001545928.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159892720|gb|ABX05800.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 550 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 42/352 (11%), Positives = 97/352 (27%), Gaps = 29/352 (8%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 ++ L L E+ + + + I + + F +L + +V ++ Sbjct: 189 STTALSTVLAELYAANGKTSDLTVEDINQEKSQQFLRDLAQGIKHYGSNTLVFSQNMQKY 248 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS---RSYHKEHGVSIQ 180 + + N + + + I + + + ++ S+ Sbjct: 249 GMAYISAFPMEEITLIDFNKQAPNVPLVAIYPKEGTFIHDNPFIVMSDATADQKAAASVF 308 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + + + +P N A + + P V Sbjct: 309 YDFLLTPESQNLAMQQGFRPANVDVALASPLTAQFGVDPNQPR-NSLATPPADVIVAAKN 367 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + P + L VDSS K + + + + DNV G Sbjct: 368 AWANNRKPANIML----VVDSSGSMRDDDKMDQAKLGVEVFLNRLPSKDNV------GMI 417 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ + ++ G+T++ DA+ A + + + Sbjct: 418 GFSSSPAVLVPLA--TRSENMANLQMQTQGLVPDGNTSLYDAIDLARQELENLKQ----- 470 Query: 361 MKNNLEAKKY--IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + IV+L+DG +T + I+I IA+ + Sbjct: 471 ------PDRINAIVVLSDGADTASQLSIDQMLGNFGESSIQIFPIAYGADAE 516 >gi|330447678|ref|ZP_08311326.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328491869|dbj|GAA05823.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 257 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 87/258 (33%), Gaps = 33/258 (12%) Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE-------- 255 F K S Q + + Y K+ + L S + Sbjct: 15 FFINKSMKDKPTSQQLVSAPSLPDVSTGYGFDALKNNWPTLNNTQTSSSDNWLAANYLLI 74 Query: 256 ---EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+++ + +K ++A+ + I I + NV G F++ +D S Sbjct: 75 FDGSGSMDNTNCGNGQRKIVAAKEAMQTFINDIPQDANV------GLYVFDN---ADSSL 125 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + R +K D G+T + ++ + Y + E ++ + N +V Sbjct: 126 RVPLGINNRATLKQAIYDVKAGGTTPLKSSLTSGYTALEKQAEKQLGYGEYN------VV 179 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 ++TDG+ + E +AI ++ + I TI F + A + AN Sbjct: 180 IVTDGDASVGEEPEVAISRIYQNSPVTIHTIGFCIGNRHALNAEGI-------TYYQSAN 232 Query: 433 STHELNKIFRDRIGNEIF 450 + +L + + Sbjct: 233 NPEKLLAGLQSVLAESAQ 250 >gi|293335787|ref|NP_001168683.1| hypothetical protein LOC100382472 [Zea mays] gi|223948855|gb|ACN28511.1| unknown [Zea mays] gi|223949305|gb|ACN28736.1| unknown [Zea mays] gi|223949981|gb|ACN29074.1| unknown [Zea mays] gi|223950189|gb|ACN29178.1| unknown [Zea mays] gi|224028553|gb|ACN33352.1| unknown [Zea mays] Length = 731 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 61/191 (31%), Gaps = 34/191 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + Sbjct: 302 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQSL-LAV 354 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 G T I +A++ S E + KN + + I+LL+DG++T Sbjct: 355 NSLTSNGGTNIAEALRK------GSKVIEERQAKNPVCS---IILLSDGQDTYTVSPTAG 405 Query: 388 ------AIC----NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 C + +Q + + F + L + + S +F + Sbjct: 406 VHKGAPEYCALLPSTNGNQQVPVHVFGFGADHDSVS-----LHSISQTSGGTFSFIETEA 460 Query: 436 ELNKIFRDRIG 446 + F IG Sbjct: 461 AIQDAFAQCIG 471 >gi|294674674|ref|YP_003575290.1| tellurium resistance protein [Prevotella ruminicola 23] gi|294471870|gb|ADE81259.1| putative tellurium resistance protein [Prevotella ruminicola 23] Length = 212 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 61/188 (32%), Gaps = 21/188 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + + +S + V++ + +++ +++ +T + FN + Sbjct: 7 YLLLDTSGSMYGEPIEAVKNGVQTLVSTLRSDPYALETAYISIITFNSSAQQIAPLT--- 63 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL-LT 375 + + G TA+ +A+ + + +K + + IV +T Sbjct: 64 -----ELASFQPPVIDASGCTALGEALNLLAQKVDTEIVKTTAEVKGDWKP---IVFIMT 115 Query: 376 DGENTQDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 DGE T D ++G+ K + G+ + Q L + + Sbjct: 116 DGEPTDDLQKGLN--EFRKRKFGMVV------ACAAGQGANTNTLKQITENVVQLDTADS 167 Query: 435 HELNKIFR 442 + F+ Sbjct: 168 ATIKAFFK 175 >gi|22299719|ref|NP_682966.1| hypothetical protein tlr2176 [Thermosynechococcus elongatus BP-1] gi|22295903|dbj|BAC09728.1| tlr2176 [Thermosynechococcus elongatus BP-1] Length = 415 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 56/154 (36%), Gaps = 29/154 (18%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVKTFAID 330 +V+ A AS++ + D R+ F+ + V+ W + I Sbjct: 58 MVKQAAASLVDRLLPSD------RLSVIAFDHKAKVLVPNQTVWDKEAIKAQI-----AT 106 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAI 389 G TAI++ M+ I + + + + I LLTDGEN DN+ + + Sbjct: 107 LEPGGGTAIDEGMKLGLKEIAAGKQGTISQ----------IFLLTDGENEHGDNQRCLEL 156 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 A I + + F V+ + L A Sbjct: 157 AKLAAEYNITLNALGFGVHWN-----QDVLEQIA 185 >gi|330995093|ref|ZP_08319010.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576669|gb|EGG58172.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 340 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 16/117 (13%), Positives = 36/117 (30%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G + + + + G T I A+ A + Sbjct: 130 KIGLIVYAGEAYTQLPITSDYVSAKMFLETINPSMITTQG-TDIKQAIDLAMKSFT---- 184 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 N + K I ++TDGE+ + + + A +GI++ + + Sbjct: 185 -------PNQDVSKAIFVITDGEDNEG--GVVEMAKAAAEKGIKVYVLGVGSPQGAP 232 >gi|198421751|ref|XP_002123463.1| PREDICTED: similar to cartilage matrix protein [Ciona intestinalis] Length = 272 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 71/204 (34%), Gaps = 19/204 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 P + F+ S K V+D + + + + +++G F V Sbjct: 28 CRPRPLDIVFMLDGSRSVRPKNFQTVKDYVKNFTDIFEAFGPND--MQVGVIQFGSGVRE 85 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK-NNLEA 367 + + +++ T A++ +++ H + +N Sbjct: 86 EILL--NQFYVRHELMEAIDNIRYMETGTMTGLALR----KLVTETLTVEHGARVDNPIV 139 Query: 368 KKYIVLLTDGENTQDNEEGI-AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +V++TDG++ + G+ +AK++G I I ++ + L + P Sbjct: 140 HTVVVIITDGKSQDYSRGGVTKWTKEAKARGFEIFAIGI----GRKANRKELLEMASEPK 195 Query: 427 --SFFEANSTHELNKI---FRDRI 445 F + + + ++ +DRI Sbjct: 196 ELHTFRVQNFNAIKRVDVNLKDRI 219 >gi|261418348|ref|YP_003252030.1| von Willebrand factor A [Geobacillus sp. Y412MC61] gi|319767693|ref|YP_004133194.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] gi|261374805|gb|ACX77548.1| von Willebrand factor type A [Geobacillus sp. Y412MC61] gi|317112559|gb|ADU95051.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] Length = 1077 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 59/215 (27%), Gaps = 22/215 (10%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQ-PLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 V+ Q +D R + + + D P + V + + E Sbjct: 110 SAPGTTVTYQITVDAYRVLGNGKEDVYFSFPQTPYQYTRQTGVSTAKLDFSLSFSQPEYA 169 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEE----HFVDSSSLRHVIKKKHLVRDALASVIRSI 285 P L L P + FV S K + AL + + Sbjct: 170 KPPNGDAQGRLDVTLVPQGAVSGIIRPPIDVVFVMDVSGSMTAMKLQSAKSALQAAVNYF 229 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF---AIDENEMGSTAINDA 342 K N N R F+D V +G + + + + G T + A Sbjct: 230 KS--NYNQNDRFALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAA 287 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A KKYI+ LTDG Sbjct: 288 LSLAKSYFTDPTR------------KKYIIFLTDG 310 >gi|148704833|gb|EDL36780.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_a [Mus musculus] Length = 608 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 414 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 469 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ A G TA DA+ + D Sbjct: 470 T--YDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVFGPIRDS----- 522 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D A A GI I ++ + + R S Sbjct: 523 ---PNKNFLVIVTDGQSYDDVRGPAA---AAHDAGITIFSVGVAWAPL--DDLRDMASKP 574 Query: 423 ASPNSFF 429 ++FF Sbjct: 575 KESHAFF 581 >gi|62001386|gb|AAX58395.1| AvrE [Pseudomonas viridiflava] gi|62001434|gb|AAX58419.1| AvrE [Pseudomonas viridiflava] gi|62001436|gb|AAX58420.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/364 (9%), Positives = 97/364 (26%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ LN L + Sbjct: 1414 FGPNATITASIDSRTTKRTKFQTKEGVAMTAPELTKLN-KQLQSAFKDKPTQDKLKGLAD 1472 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 K IQ +D ++ + + + + T K ++ + + Sbjct: 1473 AKNPDHADKTPKEKIQAHLDGLNTLFEGRSANNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + V G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKVDQGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|115361035|ref|YP_778172.1| hypothetical protein Bamb_6294 [Burkholderia ambifaria AMMD] gi|115286363|gb|ABI91838.1| conserved hypothetical protein [Burkholderia ambifaria AMMD] Length = 423 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 58/175 (33%), Gaps = 9/175 (5%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++ + G II L + +M+G G+ +D+ + L+ +A + ++A+ L Sbjct: 11 TRHSLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSCALSAARDLTS 70 Query: 71 S-----LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + E A ++ Q+ + + + T+ + T + Sbjct: 71 AISLQVAEADGIAAGHANYAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVATPANVKYV 130 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 K Q+ + + + + ++ Q A A + + + Sbjct: 131 KCTAQLSNIAHWFIEVLNTIPGVQVANAS----QVAASAIATVGAGQTTCAIPVF 181 >gi|170681089|ref|YP_001744470.1| von Willebrand factor type A domain-containing protein [Escherichia coli SMS-3-5] gi|218700745|ref|YP_002408374.1| hypothetical protein ECIAI39_2418 [Escherichia coli IAI39] gi|170518807|gb|ACB16985.1| von Willebrand factor type A domain protein [Escherichia coli SMS-3-5] gi|218370731|emb|CAR18544.1| conserved hypothetical protein [Escherichia coli IAI39] Length = 588 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 51/395 (12%), Positives = 124/395 (31%), Gaps = 38/395 (9%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS-----SRAKNS 82 +L+ L + G + + ++ A I+ E+ S + + Sbjct: 8 MLLMSSLILSGCGPESENKESLQQQPSTPTDQQVLAAQHAAIKEAEQRSVAAKATADAKA 67 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-VVLSSRYDLLL 141 +Q+ ++Y + + + + + NP + YQ + + Sbjct: 68 KALAQQEAQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQ 127 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 NPL+ F S + T + A H + ++ + + Sbjct: 128 NPLATF-------SLDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNYFPSDWVINDKSN 180 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 N PA + + +E+ + V + P+ + +D+ Sbjct: 181 NKEPVPASKPIPFAMRYELAPAPWNEQRTLLKVD----ILAKDRKSEELPASNLVFLIDT 236 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S ++ L++ +L +++ +++ DN+ G + R+ + + Sbjct: 237 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS-----ISGSHK 287 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + GST ++ AY +K + I+L TDG+ Sbjct: 288 AEINAAIDSLDADGSTNGGAGLELAYQQAAKG------FIKGGINR---ILLATDGDFNV 338 Query: 382 DNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 ++ +I + K Q G+ + T + + Sbjct: 339 GIDDPKSIESMVKKQRESGVSLSTFGVGDSNYNEA 373 >gi|323138937|ref|ZP_08073998.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] gi|322395783|gb|EFX98323.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] Length = 482 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 63/518 (12%), Positives = 147/518 (28%), Gaps = 125/518 (24%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G I+ ++P++L G+ D A K + A++ I++ + Sbjct: 14 DKGGVAIIMGLAVIPLVL-ASGLAADYA----IVQAAKSRLDASADAAALAAIKTAQTTI 68 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + + P+ + + E + R D++ A+++ + ++ Sbjct: 69 AELSATNPNPRPQAIAAAMSQAEKSFYAQAGKR-AADLLGKPAIDVQIKGQEVTANVAYS 127 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + N F R G+K A A+ + + ++D S SM + Sbjct: 128 AAMPSN----FGRIAGVKLMNYNGGAGAQLTMAKF-----LDFYLLLDVSGSMGLPSTPA 178 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 L + L+ Y C + ++ G Sbjct: 179 GEAAL--------------------AAKNPDDLAQYPTGCRFACHFAGSQG--------- 209 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + S ++ + V A+A ++ K + R+G F + + + Sbjct: 210 --YNVSRANNIQLRIDAVGAAVAQLMEKAKDTATLPKQYRVGVYPFVTHANAFVDLTDNL 267 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII---------------------SSNE 355 V++ + +T + D + + Sbjct: 268 RGDQ-YSVESAINYDPATRTTDFGRLLDAGKDWVFARDLNPNYKANPNIPADVTPMGAGG 326 Query: 356 DEVHRM--------------KNNLEAKKYIVLLTDGENTQDN------------------ 383 +H + + ++ ++DG + Sbjct: 327 SHIHNIFQDINAKIPSVGDGSGASSPQPFVFFVSDGMQNSQSFVSATGTWPGVTPYPTPP 386 Query: 384 EEGIAI-------CNKAKSQGIRIMTIAF----SVNKTQQEKARYF------------LS 420 + ++I CN K++GI + + N A+ F + Sbjct: 387 GQTVSIRAMDPTLCNVLKARGITVSVLEIPYPTFTNPKPFAAAQEFKANDAVPNLSGAMR 446 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 CASPN +F A++ + + + + R+T+ Sbjct: 447 ACASPNFYFMADTPEGIADAMKKMFEQAV--QSARLTQ 482 >gi|256419952|ref|YP_003120605.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256034860|gb|ACU58404.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 345 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 46/143 (32%), Gaps = 18/143 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A + + K+DN R+G F + + Sbjct: 109 KPDRLTRAKQLISKLADKLDND----RVGLVVFAGNAYLQMPLTIDYSAAKMYLTTVSP- 163 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D TAI A+Q A + + E H K +++++DGE +E I+ Sbjct: 164 DMIPTQGTAIGQAIQVA-NDAFNKKE-RKH---------KSLIIISDGE--DHDEAAISK 210 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ 412 A G+ I TI Sbjct: 211 ARAAFEDGVVINTIGIGSPTGSP 233 >gi|90418447|ref|ZP_01226359.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90338119|gb|EAS51770.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 636 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 34/87 (39%), Gaps = 18/87 (20%) Query: 383 NEEGIAICNKAKSQG--------IRIMTIAFSVNKTQQEKARYFLSNCASPNS------- 427 ++ +C K+ G I I TIAF + + + + +CAS Sbjct: 551 DQNVAKVCENVKADGRKPGGTDGILIFTIAFDLRDGEP--VKKLMEDCASNGLIDASEKL 608 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVI 454 +++A S EL F+ I +I I Sbjct: 609 YYDAQSQEELAAAFQS-ITEQISSLRI 634 Score = 40.0 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 59/209 (28%), Gaps = 33/209 (15%) Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNG------KVGIRDEKLSPYMVSCNKSLYYMLY 245 + R + G+PL F + KS G+ +P Sbjct: 298 WTRITTGEPLTRFYVYDNARYKSQFGTWRGCVEARPNGLAATDTAPVSSKPETLFVPTFA 357 Query: 246 PGPLDPS-LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P D S ++DS S K+ + A V G ++ Sbjct: 358 PDEYDDSDYGWNDYLDSGSGSPGSAKEAMAEQA------------KVAKYFDSG---YSI 402 Query: 305 RVISDPSFSWGVHKLIRTI-----------VKTFAIDENEMGSTAINDAMQTAYDTIISS 353 S WG + T V G+T I + + + + Sbjct: 403 TTPSSNRSDWGPNSTCATTPITPLTKTLKTVTDAIDVMGAQGATNIPHGLAWGWRLLTAR 462 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 R + + K +VL+TDG NT + Sbjct: 463 PPFTEGRSHDEPDNLKVLVLMTDGNNTYN 491 >gi|118350692|ref|XP_001008625.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89290392|gb|EAR88380.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 648 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/140 (22%), Positives = 59/140 (42%), Gaps = 17/140 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K LV++ L +I + +D R+ FN+ +F+ + +T++ Sbjct: 238 KIQLVKETLVKIINLMSSMD------RICIVCFNESGDRPLTFTRVTDENKQTLLNLIQ- 290 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I++ + A I + + KNN+ + I+LL+DG++T+ A Sbjct: 291 QIYAGGGTNISEGINHALKAIQN------RKFKNNVTS---ILLLSDGQDTKAYTRVKAY 341 Query: 390 CNKAK-SQGIRIMTIAFSVN 408 +K + I TI F + Sbjct: 342 IDKYQIKDAFNIETIGFGED 361 >gi|332254494|ref|XP_003276364.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Nomascus leucogenys] Length = 1313 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 57/179 (31%), Gaps = 24/179 (13%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + P L P FV S K + A+ ++ +K D Sbjct: 269 IHYFAPRGLPPMEKNVVFVIDVSGSMFGTKIEQTKKAMNVILSDLKAND----------- 317 Query: 301 FFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +FN SD W I+ + K + G T IN A+ A + SN Sbjct: 318 YFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLHCMEADGWTDINSALLAAASVLNHSN 377 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQ----DNEEGIAICNKAKSQGIRIMTIAFSVNK 409 ++ I+ L DGE T + +C + + + ++AF + Sbjct: 378 QEPGRGPSVGRIP--LIIFLMDGEPTAGVTTPSVILSNVCQALGHR-VSLFSLAFGDDA 433 >gi|291403690|ref|XP_002718170.1| PREDICTED: cochlin [Oryctolagus cuniculus] Length = 551 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 45/134 (33%), Gaps = 15/134 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++ A F FS+ + ++ G TA DA+ + Sbjct: 406 KIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVR 463 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 D K ++V++TDG++ D A A GI I ++ + + Sbjct: 464 DS--------PNKNFLVIITDGQSYDDVRGPAA---AAHDAGITIFSVGVAWAPL--DDL 510 Query: 416 RYFLSNCASPNSFF 429 + S ++FF Sbjct: 511 KDMASKPKESHAFF 524 >gi|312196063|ref|YP_004016124.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227399|gb|ADP80254.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 560 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/159 (20%), Positives = 62/159 (38%), Gaps = 23/159 (14%) Query: 300 TFFNDRVISDPSFSWG---VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 FND+V + F+ + + G+TAI A+ AY T + + Sbjct: 415 ITFNDKVTATRQFTVSDPTPGSADLKAISDYGAALRAGGNTAIYSALDAAYTTAAAGMKA 474 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS----QGIRIMTIAFSVNKTQQ 412 + + + IVL+TDGEN + + + +G+R + F Sbjct: 475 DPSALTS-------IVLMTDGENNRGLDSAGFLARYNTRPPDVRGVRTFAVDFG------ 521 Query: 413 EKARYFLSNCASP--NSFFEANSTH-ELNKIFRDRIGNE 448 + R L+ A+ + F+A + L+ +FR+ G + Sbjct: 522 DADRAALTQIATSTGGAVFDATAPGVSLSDVFREIRGYQ 560 >gi|114799760|ref|YP_759488.1| hypothetical protein HNE_0760 [Hyphomonas neptunium ATCC 15444] gi|114739934|gb|ABI78059.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 576 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 25/56 (44%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + + G+ I+ AL+ P+ + M +D+ + ++ A ITA+ L Sbjct: 13 RRAREQGGNVAILFALIAPIATLMMAMAIDLGMVNLQRRNMQSMTDLAAITAAGDL 68 >gi|257056239|ref|YP_003134071.1| hypothetical protein Svir_22360 [Saccharomonospora viridis DSM 43017] gi|256586111|gb|ACU97244.1| Mg-chelatase subunit ChlD [Saccharomonospora viridis DSM 43017] Length = 326 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 68/205 (33%), Gaps = 37/205 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +DA S + V +G F + + ++ I Sbjct: 109 TRLRAAQDAARSFAEGLTPG------VNLGLISFAGTATVLAAPTTEREGVVHAIENLKL 162 Query: 329 IDENEMGSTAINDAMQT--AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-- 384 G I A+Q ++ +I E + IVL+TDG+ T + Sbjct: 163 AQSTATGE-GIFAALQAIESFSAVIGGAE--------GPPPAR-IVLMTDGKQTVPQDEY 212 Query: 385 ---EGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNCA--SPNSFFE 430 AK +GI I TI+F + E + A S F++ Sbjct: 213 APRGAFTAAGVAKQKGIPITTISFGTSYGSVEIDGTRVPVEVDDASMREIARLSGGDFYK 272 Query: 431 ANSTHELNKIF---RDRIGNEIFER 452 A + EL +++ ++IG EI E Sbjct: 273 AATAEELKQVYDSLGEQIGYEIKET 297 >gi|7441761|pir||JC5576 inter-alpha-trypsin inhibitor heavy chain 3 - golden hamster Length = 889 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 48/301 (15%), Positives = 90/301 (29%), Gaps = 41/301 (13%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 S++ + K+ VS + +D RS C + Sbjct: 202 SFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------CPTCTDSLLNGDFTI 250 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 N + + ++ Y V + P L FV S +K Sbjct: 251 VYDVNRESPGNVQVVNGYFV-------HFFAPQGLPVVPKNIVFVIDISGSMAGRKIQQT 303 Query: 275 RDALASVIRSIKKIDNVN-DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 R AL ++ +K+ D +N G T + D ++ +TF ++ Sbjct: 304 RVALLKILDDMKQDDYLNFILFSTGVTTWKDSLVQATP-------ANLEEARTFVRSISD 356 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTDGENTQDNEEGIAIC 390 G T IND + + + E + ++ I++LTDG+ I Sbjct: 357 QGMTNINDGLLRGIRMLTDARE-------QHTVPERSTSIIIMLTDGDANTGESRPEKIQ 409 Query: 391 NKAKSQGIR----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + I + + F N L N +E + + + F + + Sbjct: 410 ENVRKA-IEGRFPLYNLGFGNNLNYNFLETMALENHGVARRIYEDSDANLQLQGFYEEVA 468 Query: 447 N 447 N Sbjct: 469 N 469 >gi|3024063|sp|P97280|ITIH3_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|1694692|dbj|BAA13940.1| inter-alpha-trypsin inhibitor heavy chain 3 [Mesocricetus auratus] Length = 886 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 48/301 (15%), Positives = 90/301 (29%), Gaps = 41/301 (13%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 S++ + K+ VS + +D RS C + Sbjct: 199 SFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------CPTCTDSLLNGDFTI 247 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 N + + ++ Y V + P L FV S +K Sbjct: 248 VYDVNRESPGNVQVVNGYFV-------HFFAPQGLPVVPKNIVFVIDISGSMAGRKIQQT 300 Query: 275 RDALASVIRSIKKIDNVN-DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 R AL ++ +K+ D +N G T + D ++ +TF ++ Sbjct: 301 RVALLKILDDMKQDDYLNFILFSTGVTTWKDSLVQATP-------ANLEEARTFVRSISD 353 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTDGENTQDNEEGIAIC 390 G T IND + + + E + ++ I++LTDG+ I Sbjct: 354 QGMTNINDGLLRGIRMLTDARE-------QHTVPERSTSIIIMLTDGDANTGESRPEKIQ 406 Query: 391 NKAKSQGIR----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + I + + F N L N +E + + + F + + Sbjct: 407 ENVRKA-IEGRFPLYNLGFGNNLNYNFLETMALENHGVARRIYEDSDANLQLQGFYEEVA 465 Query: 447 N 447 N Sbjct: 466 N 466 >gi|329851995|ref|ZP_08266676.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] gi|328839844|gb|EGF89417.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] Length = 399 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 27/77 (35%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + G+ +I+AL +L V +D +L+ AA + + L Sbjct: 5 FVQWAHCQGGNVAVISALCAAPLLYVITATIDHSAMVKDRFSLQAAADAGALMGAAKLAL 64 Query: 71 SLEEVSSRAKNSFTFPK 87 +E+ A + + Sbjct: 65 GSDELVFGAAEAAAHQQ 81 >gi|251791982|ref|YP_003006702.1| TadG [Aggregatibacter aphrophilus NJ8700] gi|247533369|gb|ACS96615.1| TadG [Aggregatibacter aphrophilus NJ8700] Length = 592 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 60/147 (40%), Gaps = 18/147 (12%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV-HRMKNNLEAKKYIVLL 374 + + + G T + + + +++ N+D ++K N ++ +++L Sbjct: 431 FSSKKSSELNKVMSGIHAGGWTLASAGVFVGTNLLMNINKDATPDKIKTNT--QRILLVL 488 Query: 375 TDGENTQDNEEGIAI-----CNKAKSQ---------GIRIMTIAFSVNKTQQE-KARYFL 419 +DG +T + CNK +++ I IAF +Q+ + R Sbjct: 489 SDGVDTALPTLTQELLKGGMCNKVRNKLDELQDKNYRILPTKIAFVAFGYEQDSELRKEW 548 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIG 446 NC P ++ +A + L ++F+ IG Sbjct: 549 ENCVGPGNYHQAKNEKALLEVFKQIIG 575 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 44/308 (14%), Positives = 97/308 (31%), Gaps = 42/308 (13%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + K+ G + ++TALL +L + VD L QA + A + + Sbjct: 9 SLFYMLKRFYHDEKGVYAVMTALLAFPLLFLIAFAVDGSGILLDRARLAQATEQAALLLT 68 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLK-KNFTDREVRDIVRDTAVEMNP 124 E + + ++ + I+N + + K + + + R+ Sbjct: 69 T-------ENNQYRADKSNLSNVQVTDEEIKNAKGSFKTAQDRKKGAQALKRN------- 114 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 Q ++ L L +S + AE T + + S+ +++ Sbjct: 115 -----QELVQGMVKLYLRSYDKEQKSSSPITIPKDFVAECRTQTSTRTNGESSSVACLVE 169 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + S N + KSY+ + + I + + + + S++ Sbjct: 170 GDVKRKFWLPWSYTLTSNNRNTVDINSGKSYAVKEKDILIPIDLM--LVNDISTSMFKPP 227 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR---DALASVIRSIKKIDNVNDTVRMGATF 301 P P KK ++ A+A+++ + N++ R+G T Sbjct: 228 KDDPQGP-----------------KKIDSLKTVVKAVANILIPDEPPKNISKYNRIGITS 270 Query: 302 FNDRVISD 309 F Sbjct: 271 FGLGAQQA 278 >gi|148704834|gb|EDL36781.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_b [Mus musculus] Length = 574 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 380 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 435 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ A G TA DA+ + D Sbjct: 436 T--YDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVFGPIRDS----- 488 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D A A GI I ++ + + R S Sbjct: 489 ---PNKNFLVIVTDGQSYDDVRGPAA---AAHDAGITIFSVGVAWAPL--DDLRDMASKP 540 Query: 423 ASPNSFF 429 ++FF Sbjct: 541 KESHAFF 547 >gi|167758708|ref|ZP_02430835.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] gi|167663904|gb|EDS08034.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] Length = 1865 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 43/263 (16%), Positives = 90/263 (34%), Gaps = 21/263 (7%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + DY S N G P Y+ ++ +D L+ + + Sbjct: 496 LTDYANGSVRTSDNSLGAPEHNKRIKYNEKD-----KDYTLTLDVTGKRGKKAGVDVLLV 550 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D S S + + + ++ + +++ +I + + + R+ A F+ + Sbjct: 551 IDKSGSMGLNDNGRTDSNYFNLMPTLKKTVPTLVDTI--LPDSDSVNRVAAISFSSDDYT 608 Query: 309 --DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 D S W V ++ G T AM+ A + E + Sbjct: 609 GNDISTDW-VDYNGKSGFNRKIEGLGTKGGTNWQLAMRNADKKLKPRAESQN-------- 659 Query: 367 AKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 KK +V L+DGE T E+ + ++ G + + A N + +L + A Sbjct: 660 -KKVVVFLSDGEPTYRYEKRSYYPYEEYETGGGQSYSSANLTNAVDEVAGSDYLKD-AEI 717 Query: 426 NSFFEANSTHELNKIFRDRIGNE 448 S + + T F +++ + Sbjct: 718 YSVYLTSQTSTRMTEFANKLTAK 740 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 49/295 (16%), Positives = 103/295 (34%), Gaps = 51/295 (17%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D +EGQ + + + + S + V +++ + + + S Sbjct: 1007 SDGTTLTEGQKIVNYTEKEEAVKNLESDKTASVVNEADRV--FQIELSASTKGRDEGIAA 1064 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV---RMGATFFNDRV 306 + S+S++ KK ++DA + + + K+ +++ G+ + + Sbjct: 1065 QAASIVLVLDASASMQENGKKLKDIQDAAKAFVNTTKEKSPISEIAVIWYQGSEGSSSTI 1124 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 ++ + I + F ++N G T + DA++ A + Sbjct: 1125 TDSGFYTLDTSDNVDAINR-FISNKNASGGTPMGDALEEANSILSGRPNSS--------- 1174 Query: 367 AKKYIVLLTDGENTQDNEEGIAIC-------NKAK--SQGIRIMTIAFSVNKT------- 410 KY +L TDG ++ C N+AK + ++ TI + ++ + Sbjct: 1175 --KYALLFTDGMPGYNSSNNSFNCMVANHANNEAKEIKEYAKLYTIGYKLSGSFKWEEGH 1232 Query: 411 ------------QQEKARYFLSN--CASPN----SFFEANSTHELNKIFRDRIGN 447 + KA FL N +SP + ++T L KIF D G Sbjct: 1233 SQDSTNNHGSHKTETKAADFLKNYLASSPEGDRTYAYTTDNTDGLTKIFEDIAGQ 1287 >gi|170742540|ref|YP_001771195.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] gi|168196814|gb|ACA18761.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] Length = 303 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 38/264 (14%), Positives = 76/264 (28%), Gaps = 16/264 (6%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G I+ A +MP +L G +D+ R Y ++ A A++ + Sbjct: 19 DARGTIGIMFAGMMPAVLLAIGCGIDLQRALAYRGKVQAALDGAVMAVVGNTSFDADFGR 78 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 K S +F + + FT + + M R + ++ Sbjct: 79 QAFKTSASFAYALDGAAPGSDPLTITRLTFTQNPDGTVTAEATATM--RTTVVSIIGVRS 136 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR------SYHKEHGVSIQWVI--DFSRS 188 DL + ++ I + + + + + +S V+ D++ Sbjct: 137 LDLSFRSTAKGTTTLKISTITFKVLSAQGAFDKDIYFFTRDGNGNLLSESLVLQYDYNYP 196 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSY------SSQNGKVGIRDEKLSPYMVSCNKSLYY 242 S P K+Y G+ Y S N S + Sbjct: 197 AGGTATKSYNPPPPQTFTQPVSNYKTYGFKMVVYKDPSYRGLHVNPSVYYSDSSNSSQWI 256 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRH 266 + DP S +++ D + Sbjct: 257 RIQGKCYDPGGSTQYWEDGGDFNY 280 >gi|156742542|ref|YP_001432671.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233870|gb|ABU58653.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 547 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 61/203 (30%), Gaps = 21/203 (10%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 L+ ++ V S K + L + + I D R+G Sbjct: 360 AAKNAWSLNRKRADILLVVDVSGSMEGDKLEAAKAGLGTFLSRILPED------RVGLVT 413 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + R + G TA+ DA+ + + S R+ Sbjct: 414 FSTESRLVVPPA--PLSDTRIRLDDAIAVMRAQGRTALYDALIDGKEALDSLPSTGDDRI 471 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + IVLL+DG + + + GI I +A+ + L Sbjct: 472 -------RAIVLLSDGLDNSSRATLEQVRLAFEESGISIFPVAYGADADTDA-----LQQ 519 Query: 422 CASPNSFFEAN-STHELNKIFRD 443 A+ + ++ +IF + Sbjct: 520 IATFSRTILVQGDAGDIGQIFEN 542 >gi|281348290|gb|EFB23874.1| hypothetical protein PANDA_022043 [Ailuropoda melanoleuca] Length = 426 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 48/145 (33%), Gaps = 19/145 (13%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F + + T + G T+I ++ + II SN+ Sbjct: 20 GMVTFESLATILNYLT-DIIGDNAYEKITANLPREASGGTSICSGLRAGFQAIIHSNQST 78 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 I+LLTDGE ++ I+ C + K G I TIA + + Sbjct: 79 SGSE---------IILLTDGE-----DDQISSCFEEVKQSGAVIHTIALGPSA---AREL 121 Query: 417 YFLSNCASPNSFFEANSTHELNKIF 441 LSN F+ + L F Sbjct: 122 ETLSNMTGGYRFYANKDINGLTDAF 146 >gi|268558414|ref|XP_002637197.1| Hypothetical protein CBG09720 [Caenorhabditis briggsae] Length = 630 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 59/157 (37%), Gaps = 10/157 (6%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 +I + + VR+G ++ + H +++ + E G+T DA Sbjct: 475 SAISSLPISQEAVRVGLISYSGPGRTHVRVYLDKHNDKEKLIEEMFLMERHGGTTRTADA 534 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 ++ A E H + N+ KK +V+ TDG + A+++G++++ Sbjct: 535 IRYATKIF----EGMAHPARKNV--KKVLVVFTDG---YSQDHPRDAARGARAKGLQLIA 585 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 +A ++ + F + + EL + Sbjct: 586 VAVK-DRLAPPDEEQLAEIGGHAKNVFISPNGRELRE 621 >gi|332977607|gb|EGK14375.1| von Willebrand factor type A domain protein [Psychrobacter sp. 1501(2011)] Length = 556 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 71/209 (33%), Gaps = 21/209 (10%) Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 N + A + + ++ + +V +L P+ + VD Sbjct: 148 NYDFKNAKKQGNAPFLVTTEMVKSPWHATNRIVKVGIKAEDVLAAKQNQPAANLVFLVDV 207 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + K L + +L + + ++ D + G N V+ + K++ Sbjct: 208 SGSMNSDDKLQLAKASLKMLTKQLRAQDTITLITYAG----NTEVVLPATSGNQTQKILN 263 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I + GST A++ AY + E+ + N I++LTDG+ Sbjct: 264 AIDN-----LSANGSTNGEAAIKLAYQQ---AEENFKKQGINR------ILMLTDGDFNV 309 Query: 382 DNEEGIAICNKAKSQ---GIRIMTIAFSV 407 + + ++ GI + T+ F Sbjct: 310 GVSNVKDMLDIIRNNRDKGISLSTLGFGQ 338 >gi|157412073|ref|YP_001481413.1| TerY1 [Escherichia coli APEC O1] gi|99867098|gb|ABF67743.1| TerY1 [Escherichia coli APEC O1] Length = 239 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 55/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 48 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLT--------DLLSFQMPA 99 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 100 LTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 157 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + + L + + + F+ Sbjct: 158 KAART-GVVV------ACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFK 202 >gi|3766289|emb|CAA06890.1| matrilin-4 precursor, alternate splice product [Mus musculus] Length = 434 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 56/153 (36%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 215 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 269 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S +E + R ++ + +V TDG + ++ Sbjct: 270 YMERGTMTGLALRHMVE--HSFSEAQGARPRDLNVPRVGLVF-TDG---RSQDDISVWAA 323 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + ++ L AS Sbjct: 324 RAKEEGIVMYAVGVGKAVEEE------LREIAS 350 >gi|116619435|ref|YP_821591.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116222597|gb|ABJ81306.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 377 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 46/152 (30%), Gaps = 21/152 (13%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG-ATFFNDRVISDPSFSWG 315 + ++S H + + + + +++ G + + + +W Sbjct: 107 VDLSNNSRIHWKTYQDAILELVWNLL------PGDKRY--TGYLISYGNTADIAVNTTWD 158 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 K+ + K G A+ DA+ A E ++ IV++ Sbjct: 159 SDKIADKVRKM-----KPGGGAALYDAIYLACTR-------RELVKGEPYEPRRVIVVIG 206 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 DG + + A+ + I ++ Sbjct: 207 DGHDNASKHNLEEVLELAQRNLVTIYAVSTMA 238 >gi|325116955|emb|CBZ52508.1| unnamed protein product [Neospora caninum Liverpool] Length = 765 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 44/129 (34%), Gaps = 9/129 (6%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + K+ ND V F+ V S T ++ G+T +Q Sbjct: 104 LSKLPIGNDEVNTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGTTNTAAGLQ 163 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG--IRIMT 402 T + +E K ++ +TDGE + + + + +G I +++ Sbjct: 164 TCNQMLFDYPREER------QTVPKLVIAMTDGE-SDSDFHTVNEAKVIRERGGIITVLS 216 Query: 403 IAFSVNKTQ 411 + VN + Sbjct: 217 VGMYVNHNE 225 >gi|297529200|ref|YP_003670475.1| von Willebrand factor A [Geobacillus sp. C56-T3] gi|297252452|gb|ADI25898.1| von Willebrand factor type A [Geobacillus sp. C56-T3] Length = 1077 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 62/228 (27%), Gaps = 23/228 (10%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQ-PLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 V+ Q +D R + + + D P + + + E Sbjct: 110 SAPGTMVTYQITVDAYRVLGNGKEDVYFSFPQTPYQYTRKTEASTAKLDFSLSFSQPEYA 169 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEE----HFVDSSSLRHVIKKKHLVRDALASVIRSI 285 P L L P + FV S K + AL + + Sbjct: 170 KPPNGDAQGRLDVTLIPQGAVSGIIRPPIDVVFVMDVSGSMTAMKLQSAKSALQAAVNYF 229 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF---AIDENEMGSTAINDA 342 K N N R F+D V +G + + + + G T + A Sbjct: 230 KS--NYNQNDRFALVPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAA 287 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAI 389 + A KKYI+ LTDG T + + I Sbjct: 288 LSLAKSYFTDPTR------------KKYIIFLTDGMPTVLNTVDTITY 323 >gi|242066912|ref|XP_002454745.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] gi|241934576|gb|EES07721.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] Length = 737 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 60/191 (31%), Gaps = 34/191 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + Sbjct: 308 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQSL-LAV 360 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 G T I + ++ S E + KN + + I+LL+DG++T Sbjct: 361 NSLTSNGGTNIAEGLRK------GSKVIEERQAKNPVCS---IILLSDGQDTYTVSPTAG 411 Query: 388 ------AIC----NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 C + +Q I + F + L + + S +F + Sbjct: 412 VHKGAPEYCALLPSTNGNQQIPVHVFGFGADHDSVS-----LHSISQTSGGTFSFIETEA 466 Query: 436 ELNKIFRDRIG 446 + F IG Sbjct: 467 AIQDAFAQCIG 477 >gi|6017001|gb|AAF01565.1|AF061273_1 thrombospondin-related adhesive protein homolog [Neospora caninum] Length = 765 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 44/129 (34%), Gaps = 9/129 (6%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + K+ ND V F+ V S T ++ G+T +Q Sbjct: 104 LSKLPIGNDEVNTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGTTNTAAGLQ 163 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG--IRIMT 402 T + +E K ++ +TDGE + + + + +G I +++ Sbjct: 164 TCNQMLFDYPREER------QTVPKLVIAMTDGE-SDSDFHTVNEAKVIRERGGIITVLS 216 Query: 403 IAFSVNKTQ 411 + VN + Sbjct: 217 VGMYVNHNE 225 >gi|113971308|ref|YP_735101.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113885992|gb|ABI40044.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 624 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 42/360 (11%), Positives = 105/360 (29%), Gaps = 35/360 (9%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + Q E++ + K + + ++ +++ D + + T+ ++ Sbjct: 33 SDKSDDQQKRAELADQTKLAAEQQAELKQQVELKDSVERQANRQRDAAIAIHEQATSTKL 92 Query: 123 NPRKSAYQ-VVLSSRYDLL------------LNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 + ++ + + + P ++ + T + Sbjct: 93 RTMNAEHRAYIAQPAATISAAPALNGDWPGAVPPERNRFEKQVQNGIMVAGETPVSTFAI 152 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 +++ ++ + + LN F K+ + + + Sbjct: 153 DVDTGSYTTLRRML-KEGRLPQKDTLRVEEMLNYFSYDYPLPSKNEAPFSVTTELAPSPY 211 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + M+ L + + +D S K L++ AL + + + D Sbjct: 212 NYDMMLLRIGLKGYEQSKAELGASNLVFLLDVSGSMASPDKLPLLQTALKMLTQQLGAQD 271 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 V+ V GA G + + GST +Q AY Sbjct: 272 KVSIVVYAGAAGVVLD---------GAAGNDSQTLNYALEQLSAGGSTNGAQGIQLAYQL 322 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFS 406 + H ++ + ++ TDG+ +E I + + K GI + T+ F Sbjct: 323 ------AKKHLVEGGINR---VIFATDGDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFG 373 >gi|219849077|ref|YP_002463510.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219543336|gb|ACL25074.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 546 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 54/168 (32%), Gaps = 15/168 (8%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 L+ ++ V +S K +V+ + + + I D R+G F Sbjct: 362 NSWSLNRKRADIVLVVDTSGSMEGDKLTMVKAGIETFLMRILPED------RLGLITFAS 415 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + R ++ G TA+ DA+ + + R+ Sbjct: 416 AARLVVPMA--PLSDNRIALQDAVQAMRASGRTALFDALVLGKQVLEQLPPADDDRI--- 470 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + IVLL+DG + I GI I +A+ + +Q Sbjct: 471 ----RAIVLLSDGADNSSQASLDQIRTLFDESGISIFPVAYGNDADRQ 514 >gi|45384196|ref|NP_990403.1| matrilin-3 precursor [Gallus gallus] gi|14548115|sp|O42401|MATN3_CHICK RecName: Full=Matrilin-3; Flags: Precursor gi|2326444|emb|CAA03885.1| matrilin-3 [Gallus gallus] Length = 452 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 58/208 (27%), Gaps = 28/208 (13%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 PLD + K L + I +D T R+ + Sbjct: 47 ACKNRPLDLVFIIDSSRSVRPEEFEKVKIFLSKM--------IDTLDVGERTTRVAVMNY 98 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 V + K + + T A+Q A D + + Sbjct: 99 ASTVKVEFPLRTYFDKASMKEAVSRIQPLSA--GTMTGLAIQAAMDEVFTEEMGTRPANF 156 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 N K ++++TDG + ++ + A++ GI I + Q L Sbjct: 157 N---IPKVVIIVTDG---RPQDQVENVAANARTAGIEIYAVGVGRADMQS------LRIM 204 Query: 423 AS---PNSFFEANS---THELNKIFRDR 444 AS F + +L FR+ Sbjct: 205 ASEPLDEHVFYVETYGVIEKLTSKFRET 232 >gi|62001450|gb|AAX58427.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L + A Q + Sbjct: 1357 SNNRVRLANSAGATAYARAQIN-LGHTNQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATITASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|56797867|emb|CAG27567.1| matrilin-4 [Danio rerio] Length = 548 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 63/174 (36%), Gaps = 25/174 (14%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTA 338 +I I ++D R+G ++ +V + S ++++ I + + + M A Sbjct: 45 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLA 104 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I AM A+ + + H V++TDG + + + A+ GI Sbjct: 105 IRYAMNVAFSAEEGARPNVPHVA----------VIVTDG---RPQDRVAEVAAAARESGI 151 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 I + + L ASP + F S +L F + +++ Sbjct: 152 EIYAVGVARADMTS------LRAMASPPFEDHVFLVESF-DLIHQFGLQFQDKL 198 >gi|328884707|emb|CCA57946.1| hypothetical protein SVEN_4660 [Streptomyces venezuelae ATCC 10712] Length = 535 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 60/387 (15%), Positives = 109/387 (28%), Gaps = 66/387 (17%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYE---HALKQAAQTAIITASVPLIQSLEEV 75 TG I+T+ PV LGV V + W + L QA A+ L + + Sbjct: 127 TGETPIMTS---PVALGVRPAAVARLGWDPAKVTWSQLHQAV------AAGKLTYGMTDP 177 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ + + R T ++ S + + Sbjct: 178 KRSHSGFSALVSVASGLSGAQSALTAQ--DVGEATPRLKEFFTGQKLTSGSSGWLATAYA 235 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 R + + +S L+ T E + S + V+ + + + + Sbjct: 236 R-------RTDVDALVNYESVLLSTPGELTVIRPS---DGVVTADYPLTLLTAARPEAKA 285 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + G+ P + + + V + + L + D L Sbjct: 286 AAGRLTAFLRTPDAQRQITVKTLRRPVVPGVDPAPALARDQRRELPFPGDRKVADGLLDA 345 Query: 256 EHFVDSSSLRHVI----------KKKHLVRDALASVIRSIKKIDNV----------NDTV 295 R V + ++ AL + + + V D V Sbjct: 346 YDNTLRRPSRTVYVLDTSGSMNGDRLERLKTALVELTGDFRDREEVTLMPFGSAVKRDEV 405 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R V + ++ A G TAI ++Q AY ++ S+ Sbjct: 406 RT------HTVDPASP------RQALDAIRADARKLTASGGTAIYSSLQEAYRSLGKSSG 453 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQD 382 D IVL+TDGENT Sbjct: 454 DTFTS----------IVLMTDGENTDG 470 >gi|270006429|gb|EFA02877.1| hypothetical protein TcasGA2_TC008029 [Tribolium castaneum] Length = 1868 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 46/154 (29%), Gaps = 25/154 (16%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRT---IVKTFAIDENEMGSTAINDAMQTAYDT 349 + R+ F+ V + K + K + E G T A + A + Sbjct: 114 NHTRVAIATFSSSVSKNIDQISDPRKENNKCFLLSKLLSKIEYTGGGTNTLKAFEVAKEI 173 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 S D K + L+TDG + + I + + K ++I TI Sbjct: 174 FTQSRNDSE----------KVLFLITDGFSNGG--DPIPLAAELKKDQVKIFTIGI---- 217 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + ++P + L F + Sbjct: 218 -ANGNYKELYELASTPGEIY-----SYLLDSFEE 245 >gi|148656823|ref|YP_001277028.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568933|gb|ABQ91078.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 851 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 61/181 (33%), Gaps = 27/181 (14%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVH 317 S I K + ++A S+++ D R+G F+ V+ G+ Sbjct: 403 SMGPETGISKFTMAKEAAIMATESLRQED------RIGVLAFDVSTRWVVDFQPVGVGLS 456 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 V+ G T I +A+Q + H VLLTDG Sbjct: 457 LAD---VQRRISTLPLGGGTDIYNALQEGLPALAQQPGRVRHA-----------VLLTDG 502 Query: 378 EN-TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + T D + + +A+SQ I + TIA + + A + A ++ Sbjct: 503 RSFTDDRQAYRMLLEEARSQNITLSTIAIGTDA-DINLLQELARWGA--GRYHYAAEPND 559 Query: 437 L 437 + Sbjct: 560 I 560 >gi|20093632|ref|NP_613479.1| Mg-chelatase subunit ChlI /Chld [Methanopyrus kandleri AV19] gi|19886501|gb|AAM01409.1| Mg-chelatase subunit ChlI and Chld (MoxR-like ATPase and vWF domain) [Methanopyrus kandleri AV19] Length = 818 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 53/148 (35%), Gaps = 24/148 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G FN + + V ++I + + G+T I DA++ Sbjct: 673 RVGIVGFNTKAEIVVDITSDVEEIITKV-----MSLKPGGATDIGDAIRV---------G 718 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEG----IAICNKAKSQGIRIMTIAFSVNKTQ 411 E+ R + +++LLTDG T+ + ++ A G+ I TI + Sbjct: 719 TELFRRCGRPDRDWHMILLTDGVPTKGEPDPETKALSEATAASRMGVTISTIGIKL---- 774 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHEL 437 E+ + + A S EL Sbjct: 775 PEEGIRLIEHIAGISGGRSHHITDPEEL 802 >gi|94309590|ref|YP_582800.1| hypothetical protein Rmet_0645 [Cupriavidus metallidurans CH34] gi|93353442|gb|ABF07531.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] Length = 434 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 53/166 (31%), Gaps = 4/166 (2%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ----SLEEV 75 G II L++ V++G G+ +D+ + + L+ A + + A+ L ++ E Sbjct: 20 GAVAIIVGLMIVVLVGFIGLALDLGKLYVSKSELQNRADSCALAAARDLTGATPLTVSEA 79 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + + N + ++D + +V Y S Sbjct: 80 AGLTAAARNLVLFQGNLEQQPNITSAESVTYSDSLANPFLDKNSVTYALNTIKYVKCDVS 139 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 R ++ + GI A A + S + + Sbjct: 140 RGNIANWFAQVLNAIPGIHIGANTVGAFAVATTTSAQTTCAIPVYI 185 >gi|108799422|ref|YP_639619.1| hypothetical protein Mmcs_2455 [Mycobacterium sp. MCS] gi|119868535|ref|YP_938487.1| hypothetical protein Mkms_2500 [Mycobacterium sp. KMS] gi|126435076|ref|YP_001070767.1| hypothetical protein Mjls_2492 [Mycobacterium sp. JLS] gi|122976988|sp|Q1B971|Y2455_MYCSS RecName: Full=UPF0353 protein Mmcs_2455 gi|166987492|sp|A3PZE9|Y2492_MYCSJ RecName: Full=UPF0353 protein Mjls_2492 gi|166987495|sp|A1UFT9|Y2500_MYCSK RecName: Full=UPF0353 protein Mkms_2500 gi|108769841|gb|ABG08563.1| von Willebrand factor, type A [Mycobacterium sp. MCS] gi|119694624|gb|ABL91697.1| von Willebrand factor, type A [Mycobacterium sp. KMS] gi|126234876|gb|ABN98276.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 335 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 59/183 (32%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + I K D TA + + TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNREATKTAIDKLQLADR-----TATGEGIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDEPPPARIVLFSDGKETVPSNPDNPKGAFTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKAR---------YFLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E L A S F A+S +L +++ + +IG E Sbjct: 242 ISFGTPYGYVEINEQRQPVPVDDQMLKKIADLSEGEAFTASSLEQLREVYANLQQQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|260061450|ref|YP_003194530.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] gi|88785582|gb|EAR16751.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] Length = 348 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 51/143 (35%), Gaps = 20/143 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + ++ +I + R+G + + + + + +++ Sbjct: 111 RLEKAKRLVSEIINELASD-------RVGIIAYAAQAFPQLPITTD-YGAAKMFLQSMNT 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D TAI++A++ A + + + L++DGE+ ++ + Sbjct: 163 DMLSSQGTAIHEAIELAATYFDDEEQTN-----------RILFLVSDGED-HAEDQVMDA 210 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ 412 +A QGIRI TI + Sbjct: 211 IEQATDQGIRIFTIGVGSARGAP 233 >gi|312879450|ref|ZP_07739250.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] gi|310782741|gb|EFQ23139.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] Length = 813 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 48/340 (14%), Positives = 92/340 (27%), Gaps = 22/340 (6%) Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM--GIKSWLIQTKAEAETVSRSYH 172 PR + +S ++N +++ G + I T + Sbjct: 193 EYVATYTNPPRSWPWSSDVSPSGAAIMNSQWNAMKNNDGGWLNHSIAAFYVTSTDTAQKR 252 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 Q ++ + + T G Sbjct: 253 MYGKSCWQVLLQAPSQDGVSGDTTVKPSRVQYSFSKPVTTSGDYLNAVTPGTHLAAAQAN 312 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + + M+Y LD S S D + A +++ S+ K Sbjct: 313 LKILWLAEATMVYQIVLDRSGSMGTNPD---KPDDPTPLSYAKTAACNLVDSLPK----- 364 Query: 293 DTVRMGATFFNDRVISDPSFSW-----GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 V +G F+D + R K GSTAI DA A Sbjct: 365 -NVYVGIVQFDDSTSQVYPITLIASNDAAAAATRAAAKAAINGLTSGGSTAIYDAASYAL 423 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 ++ + LLTDGE+ ++ + + ++Q + ++T+ + Sbjct: 424 SQFVAQKTALSADLLGVTY------LLTDGEDNSSSKSVGEVIGEYQAQKVPLITVGYGA 477 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 A L++ F L ++F +G Sbjct: 478 GGQAGSFALTQLADGTGGQYFASPVDQAALQQVFFAALGK 517 >gi|301627723|ref|XP_002943019.1| PREDICTED: complement C2-like [Xenopus (Silurana) tropicalis] Length = 678 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/296 (12%), Positives = 79/296 (26%), Gaps = 33/296 (11%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSM---LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 Y ++ + SR M R + + + S+ Sbjct: 92 TGVRYDMDNSIKYA----CSRGMSLVGSPHRTCLESRRWSGTEISCQYPYSFDLPEDVQE 147 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 LS + +S + + +F+ +S + ++ ++ Sbjct: 148 QFKASLSGILNIKERSASFGRTIKIKRDGILNVYFLLDASRSVGEANFDIYKECSVYLVD 207 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRTIVKTF-AIDENEMGSTA 338 + D G + S + I D + T Sbjct: 208 ELASFDMTIQF---GIISYATVPKVIIPIYDENSDNDAHVFEVIENDLKYSDHKDKTGTN 264 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE--EGIAICNK---- 392 I A++ Y+ + E + N I+LLTDG+ + I + Sbjct: 265 IKTALEEVYNMMSFQKETYKNESVWNSIHH-IIILLTDGKANIGGRPADTIKHIEEFLDI 323 Query: 393 --AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS----PNSFFEANSTHELNKIFR 442 + + + T + LS AS + F S +E+ +F+ Sbjct: 324 KKKREDYLDVYTFGI-----GPDVDMADLSEIASKKDGESHVFRMESANEMKTVFQ 374 >gi|170594383|ref|XP_001901943.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158590887|gb|EDP29502.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 415 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 56/159 (35%), Gaps = 11/159 (6%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ +D V + A +++ + F H +K + G+T +A+ Sbjct: 261 IRSLDLNRTAVHVAAIYYSGPKRARTLFHLRKHSRAENAIKDLQRAPSNGGTTRTGEAIY 320 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + +K I++ TDG + A+++GI + ++ Sbjct: 321 YATNEFNEKFGARKDA-------RKMIIIFTDG---HSQDNPTEASRTARNKGIELKAVS 370 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ ++ P+ + + + ++L F + Sbjct: 371 I-EDENIPPDTNQIIAITGDPSDAYSSKNFNKLQSFFDE 408 >gi|94970371|ref|YP_592419.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94552421|gb|ABF42345.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 356 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 67/199 (33%), Gaps = 21/199 (10%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-----ISDPSFSWGVHKLIR 321 +K+ + + +I S I + + A F +++ + + Sbjct: 125 DFRKETNLPLRVGLLIDSSNSIRDRFKFEQESAIEFLNQIIRPKFDKAFVIGFDTTAEVT 184 Query: 322 T-------IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 ++ G TA+ DA+ A D++ + N +K ++LL Sbjct: 185 QDFTDDTDLLGKGVRMLRPGGGTAMYDAIYYACR-------DKLLKENGNTAMRKAMILL 237 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANS 433 +DGE+ Q A+ + I I+ + + + + A+ F Sbjct: 238 SDGEDNQSRVTREEAVEMAQRAEVIIYAISTNTSGLKLRGDKVLERFAEATGGRAFFPFK 297 Query: 434 THELNKIFRDRIGNEIFER 452 ++ F + I +E+ + Sbjct: 298 ISDVANAFSE-IQDELRSQ 315 >gi|258624850|ref|ZP_05719778.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582848|gb|EEW07669.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 128 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 31/86 (36%), Gaps = 15/86 (17%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT-------------QQEKARY 417 ++LL+DG NT + + N AK I T+ + Q+ Sbjct: 1 MILLSDGSNTAGVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEK 60 Query: 418 FLSNCASP--NSFFEANSTHELNKIF 441 L AS +F A + +L I+ Sbjct: 61 TLQTIASTTGGQYFRARNQQDLQSIY 86 >gi|195941051|ref|ZP_03086433.1| von Willebrand factor, type A [Escherichia coli O157:H7 str. EC4024] Length = 325 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 36/181 (19%), Positives = 62/181 (34%), Gaps = 22/181 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ ++ + + +K D R+G F + S L I + Sbjct: 118 TRLQAVQQSVKKFVAA-RKSD------RIGLVIFANSAWPFAPVSEDKQALETRI--SQL 168 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + S+ + E K +LLTDG +T Sbjct: 169 TPGMAGQQTAIGDALGVTVKLLDSTGDKEA---------SKLAILLTDGNDTASQLTPRL 219 Query: 389 ICNKAKSQGIRIMTIAFS-VNKTQQEKAR-YFLSNCA--SPNSFFEANSTHELNKIFRDR 444 A S +++ TIAF VN + +K L + A + + A ++ Sbjct: 220 AAQLAVSHHVQLHTIAFGDVNSSGDDKVDLNLLQDLARMTGGRSWTAENSGASLDAVWKE 279 Query: 445 I 445 I Sbjct: 280 I 280 >gi|62001448|gb|AAX58426.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L + A Q + Sbjct: 1357 SNNRVRLANSAGATAYARAQIN-LGHTNQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATITASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|296475339|gb|DAA17454.1| cochlin precursor [Bos taurus] Length = 550 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 45/134 (33%), Gaps = 15/134 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++ A F FS+ + ++ G TA DA+ + Sbjct: 405 KIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNISYMSGGTATGDAISFTVRNVFGPVR 462 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 D K ++V++TDG++ D A A GI I ++ + + Sbjct: 463 DS--------PNKNFLVIVTDGQSYDDVRGPAA---AAHDAGITIFSVGVAWAPL--DDL 509 Query: 416 RYFLSNCASPNSFF 429 + S ++FF Sbjct: 510 KDMASKPKESHAFF 523 >gi|62001344|gb|AAX58374.1| AvrE [Pseudomonas viridiflava] gi|62001348|gb|AAX58376.1| AvrE [Pseudomonas viridiflava] gi|62001358|gb|AAX58381.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKERPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|126334857|ref|XP_001374633.1| PREDICTED: similar to leukocyte immune-type receptor TS32.15 L1.1a [Monodelphis domestica] Length = 3609 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 48/159 (30%), Gaps = 21/159 (13%) Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-----V 306 SL VD SS + ++ + ++ + R+ F+ + Sbjct: 108 SLELVFLVDESSSVGHANFLNELKF-VKKLLSDFPVVP---SATRVAIVTFSSKNNVVPR 163 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + S S + + G T A Q A + S E+ Sbjct: 164 VDYISSSRAHQHKCSLLNREIPNITYRGGGTYTKGAFQQAAQILRHSRENS--------- 214 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 K I L+TDG + + IA + G+ I T Sbjct: 215 -TKVIFLITDGYSNGGDPRPIAA--SLRDFGVEIFTFGI 250 >gi|145540134|ref|XP_001455757.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124423565|emb|CAK88360.1| unnamed protein product [Paramecium tetraurelia] Length = 522 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 71/187 (37%), Gaps = 30/187 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K HLV+ +L +++ ++ D R+ F+D+ + + + Sbjct: 128 KMHLVKKSLKHLLKMLQPND------RLCLIEFDDQNYRLTRLMRATQENMYKFL-IAID 180 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+T I +AM+ A + HR N A I LL+DGE + G Sbjct: 181 TIEANGATDIGNAMKMALSIL-------KHRRFKNPIAS--IFLLSDGE--DEGAAGRVW 229 Query: 390 CNKAKSQGIR----IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 N +S+ I+ I T F ++ +S A F+ + ++++ F + Sbjct: 230 -NDIQSKNIKEPFTINTFGF-----GRDCCPKIMSEIAHFKEGQFYYISEISKIDECFFE 283 Query: 444 RIGNEIF 450 +G E Sbjct: 284 ALGGEAS 290 >gi|115305395|gb|AAI23842.1| COCH protein [Bos taurus] Length = 550 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 45/134 (33%), Gaps = 15/134 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++ A F FS+ + ++ G TA DA+ + Sbjct: 405 KIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNISYMSGGTATGDAISFTVRNVFGPVR 462 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 D K ++V++TDG++ D A A GI I ++ + + Sbjct: 463 DS--------PNKNFLVIVTDGQSYDDVRGPAA---AAHDAGITIFSVGVAWAPL--DDL 509 Query: 416 RYFLSNCASPNSFF 429 + S ++FF Sbjct: 510 KDMASKPKESHAFF 523 >gi|219886181|gb|ACL53465.1| unknown [Zea mays] Length = 561 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 43/129 (33%), Gaps = 16/129 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKL 319 + + L++ A ++ ++ D R+ F+DR + Sbjct: 9 TERTRTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTADG 62 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R +++ G TA+ A + A + D R+ +IVLLTDG Sbjct: 63 RRNAIRSL-DQLEARGGTALVPAFEEAVKVLDGRQGDGGDRLG-------FIVLLTDGAE 114 Query: 380 TQDNEEGIA 388 ++ Sbjct: 115 DASGSFTLS 123 >gi|195614282|gb|ACG28971.1| retrotransposon protein [Zea mays] Length = 650 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 43/129 (33%), Gaps = 16/129 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKL 319 + + L++ A ++ ++ D R+ F+DR + Sbjct: 98 TERTRTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTADG 151 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R +++ G TA+ A + A + D R+ +IVLLTDG Sbjct: 152 RRNAIRSL-DQLEARGGTALVPAFEEAVKVLDGRQGDGGDRLG-------FIVLLTDGAE 203 Query: 380 TQDNEEGIA 388 ++ Sbjct: 204 DASGSFTLS 212 >gi|212722920|ref|NP_001131192.1| hypothetical protein LOC100192500 [Zea mays] gi|194690832|gb|ACF79500.1| unknown [Zea mays] Length = 650 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 43/129 (33%), Gaps = 16/129 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKL 319 + + L++ A ++ ++ D R+ F+DR + Sbjct: 98 TERTRTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTADG 151 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R +++ G TA+ A + A + D R+ +IVLLTDG Sbjct: 152 RRNAIRSL-DQLEARGGTALVPAFEEAVKVLDGRQGDGGDRLG-------FIVLLTDGAE 203 Query: 380 TQDNEEGIA 388 ++ Sbjct: 204 DASGSFTLS 212 >gi|189190514|ref|XP_001931596.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187973202|gb|EDU40701.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 1331 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 36/287 (12%), Positives = 91/287 (31%), Gaps = 30/287 (10%) Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLNPL--SLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + + + A +++++ +++ +N + + + ++ K+ + + + Sbjct: 927 QHRNAPVPASEHAIELIVNPDHEIFINDVVSTAAVPAVNQKTEELCLYRQKFSTLSRIPS 986 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + + + + C G D K+ S + Sbjct: 987 QVPL------------RFWTKLRNEGDGYCTGTNYDTHWKTMYSCFNREFCTGILDEEPC 1034 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 V ++ + PL L + S+ + + + +++ + I + + Sbjct: 1035 VHKVQNA-PINASQPLVFKLLLDTLSTPSNETNHLTRLDVLKQMFDAYINRVLA---YSF 1090 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G FN + + V + G TAI D++ A D I Sbjct: 1091 QPHIGLVTFNTKTQVAQKITNAVENSRHKLNN-----LAAYGDTAIWDSVALAQDQI--- 1142 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + H K AK I+ ++DGE+ + + GI + Sbjct: 1143 ---QQH-AKQYPNAKLRIICISDGEDNTSLNTVEDVAKRLTRCGIVV 1185 >gi|170079457|ref|YP_001736094.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887126|gb|ACB00840.1| Protein containing von Willebrand factor (vWF) type A domain [Synechococcus sp. PCC 7002] Length = 545 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 59/174 (33%), Gaps = 21/174 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + D S + K L++ A ++ ++ D V+ V GA ++ Sbjct: 179 PPSNLVFLFDVSGSMNDPDKLPLLKSAFRLLVNELRPEDRVSIVVYAGAAG----LVLPS 234 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + GSTA + ++ AY + D + Sbjct: 235 -----TSGAEKETILAALDNLEAGGSTAGGEGIELAYQEAADNFLDNGNNR--------- 280 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 I+L TDG+ + E I + + + Q I + + F + K +N Sbjct: 281 IILATDGDFNVGMSSDAELIRLIEQKREQDIFLTVLGFGTGNLKDAKMEQLANN 334 >gi|152999639|ref|YP_001365320.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151364257|gb|ABS07257.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 642 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 84/270 (31%), Gaps = 23/270 (8%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + + + LN F K+ + + + + M+ L Sbjct: 170 REGRLPEKGTVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYD 229 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + +D S K L++ AL + + D V+ V GA Sbjct: 230 LLKSQLGASNLVFLLDVSGSMASTDKLPLLQTALKLLTAQLSAQDKVSIVVYAGAAGVVL 289 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 G + + GST + AY + H + N Sbjct: 290 D---------GASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQL------AKKHFIPNG 334 Query: 365 LEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + ++L TDG+ D ++ A+ K K+ GI + T+ F + + Sbjct: 335 INR---VILATDGDFNVGVTDFDDLTALIEKEKAHGIGLTTLGFGLGNYNDQLMEQLADK 391 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ ++ +E K+ D + + +F Sbjct: 392 --GNGNYAYIDTLNEARKVLVDELSSTLFT 419 >gi|156404157|ref|XP_001640274.1| predicted protein [Nematostella vectensis] gi|156227407|gb|EDO48211.1| predicted protein [Nematostella vectensis] Length = 476 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 47/349 (13%), Positives = 111/349 (31%), Gaps = 26/349 (7%) Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNF--------TDREVRDIVRDTA 119 ++ +++ S+ + + +K+ E I + + + D I+ D A Sbjct: 15 MLYLIKDTSTTLQTAPGELAKKLAELAINGLGTSEMQGYYDKLTFKSLDLNGNSILNDLA 74 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV-S 178 + +V ++ + + ++ ++ + + + + Sbjct: 75 TRFANKLQT-KVTIARKIKDAVEVSYAKSATVTSRTECCKADTRWLKYDSRFRTKVNLDE 133 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 + +I + S Q + T+ + + + + SC+ Sbjct: 134 MCVIISGAASSNPKQLQDNVLQTMKQNIENNPTLTWQYFGSEEGLYTNYPMIRDSSSCSS 193 Query: 239 --SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV- 295 Y Y P + V S + + ++A +V+ ++ D V Sbjct: 194 YDPRYRPWYVEAASPQPKDVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFLAF 253 Query: 296 -----RMGATFFNDRVISDPSFSWGVHKLIR-TIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+ T + + S + I+K F E G T A A+D Sbjct: 254 ESGVRRVKVTSGDAKDEKCFESSLAKASPVNIDILKKFLDGEYASGGTMYAVAFNAAFDI 313 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 + D+ ++ KN + I+ +TDG D + K ++QG+ Sbjct: 314 L-----DKYYKEKNTTR-RPVILFMTDGAPNDDPGTILNTV-KMRNQGL 355 >gi|126336622|ref|XP_001380249.1| PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Monodelphis domestica] Length = 923 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 59/391 (15%), Positives = 121/391 (30%), Gaps = 36/391 (9%) Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 T+P + E+ + ++ + I++ T M + + V +++ L Sbjct: 89 TYPGEIKEKAAAQEKYSSAVA---QGQSAGIIKATGRSMEEFQVSVNVAPTAKVTFELVY 145 Query: 144 LSLFLRSMGIKSWLIQTKAEAET----VSRSYHKEHGVS------IQWVIDFSRSMLDYQ 193 L R +G +++ + + + + G+S D + ++ Q Sbjct: 146 EELLKRQLGKYELMLKVRPQQLVKHLQIDIHIFEPQGISSLETESTFMTNDLADALTKTQ 205 Query: 194 RDSEGQPL---NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC---NKSLYYMLYPG 247 ++ + + Q + + + + D N + P Sbjct: 206 NKTKAHIVFKPSLSQQQKEPGKEDTTVDGNFIVRYDVDRVTDAGDIQIENGYFVHHFAPA 265 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 L FV S +K R+A+ ++ ++ D N F+ V Sbjct: 266 GLPMVPKNVVFVIDKSGSMAGRKMRQTREAMVQILGDLRPEDQFN------LVIFDGHVF 319 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 K F + MG+T INDA+ A + SN K L Sbjct: 320 QWMPALLQASSQNVEQAKKFTSLISAMGATNINDAVLLAVKMLDDSNR------KEKLPP 373 Query: 368 KKY--IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNC 422 ++LLTDG+ T I K+ + + F + + L N Sbjct: 374 GSVSMVILLTDGDATDGETNPKKIQENVKAAIGGSYHLYCLGFGFDVNYAFLEKLALENG 433 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 +E + + + F + N + +V Sbjct: 434 GVARRIYEDSDSDLQLQDFYQEVANPLLTKV 464 >gi|15965603|ref|NP_385956.1| hypothetical protein SMc00158 [Sinorhizobium meliloti 1021] gi|15074784|emb|CAC46429.1| Hypothetical/unknown protein [Sinorhizobium meliloti 1021] Length = 577 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 17/65 (26%), Positives = 33/65 (50%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++ +KS TG+ + AL+MP+++ G+ +D + L+ A A I A+ + Sbjct: 8 FRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSVADLASIAAAADVSS 67 Query: 71 SLEEV 75 + E V Sbjct: 68 AEEAV 72 >gi|296474257|gb|DAA16372.1| complement component 2 precursor [Bos taurus] Length = 750 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 48/139 (34%), Gaps = 10/139 (7%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN-LEAKKYI 371 S V ++ ++ D T I +A+ Y ++++ + H E + I Sbjct: 314 SRDVTEVENSLRNINYKDHENGTGTNIYEALHAVY-IMMNNQMNRPHMNPGAWQEIRHAI 372 Query: 372 VLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 +LLTDG++ +A+ N+ + + I I ++ S Sbjct: 373 ILLTDGKSNMGGSPKVAVDNIKEVLNINQKRKDYLDIYAIGVGSLHVDWKELNNLGSKKD 432 Query: 424 SPNSFFEANSTHELNKIFR 442 F L+++F Sbjct: 433 GERHAFILKDVQALSQVFE 451 >gi|111120280|gb|ABH06325.1| complement component 2 precursor [Bos taurus] Length = 787 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 48/139 (34%), Gaps = 10/139 (7%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN-LEAKKYI 371 S V ++ ++ D T I +A+ Y ++++ + H E + I Sbjct: 314 SRDVTEVENSLRNINYKDHENGTGTNIYEALHAVY-IMMNNQMNRPHMNPGAWQEIRHAI 372 Query: 372 VLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 +LLTDG++ +A+ N+ + + I I ++ S Sbjct: 373 ILLTDGKSNMGGSPKVAVDNIKEVLNINQKRKDYLDIYAIGVGSLHVDWKELNNLGSKKD 432 Query: 424 SPNSFFEANSTHELNKIFR 442 F L+++F Sbjct: 433 GERHAFILKDVQALSQVFE 451 >gi|77735935|ref|NP_001029664.1| complement C2 precursor [Bos taurus] gi|115311857|sp|Q3SYW2|CO2_BOVIN RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|74267667|gb|AAI03358.1| Complement component 2 [Bos taurus] Length = 750 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 48/139 (34%), Gaps = 10/139 (7%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN-LEAKKYI 371 S V ++ ++ D T I +A+ Y ++++ + H E + I Sbjct: 314 SRDVTEVENSLRNINYKDHENGTGTNIYEALHAVY-IMMNNQMNRPHMNPGAWQEIRHAI 372 Query: 372 VLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 +LLTDG++ +A+ N+ + + I I ++ S Sbjct: 373 ILLTDGKSNMGGSPKVAVDNIKEVLNINQKRKDYLDIYAIGVGSLHVDWKELNNLGSKKD 432 Query: 424 SPNSFFEANSTHELNKIFR 442 F L+++F Sbjct: 433 GERHAFILKDVQALSQVFE 451 >gi|315186712|gb|EFU20470.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 332 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 62/194 (31%), Gaps = 42/194 (21%) Query: 289 DNVNDTVRM------GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAIND 341 D + + VR G F + + + + + A+ +G TA+ Sbjct: 118 DVIREFVRSYPHMAVGLVLFGKEAMLEVPP---TIDVEYFLERLEAVRLFSLGDGTALGM 174 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + + +H + N + +V+LTDG+NT AK I + Sbjct: 175 GVGIS----------LLHLSRVNASF-RAVVILTDGKNTTGEILPETAAEMAKDLDIPVF 223 Query: 402 TIA------FSVNKTQQEKARYF------------LSNCA--SPNSFFEANSTHELNKIF 441 T+ S++ + L A S FF + L++IF Sbjct: 224 TVGVGSDLPVSLDVIDPSTGTRYAGVLEEGYDEETLRRMAEMSGGQFFSGYTPTSLHRIF 283 Query: 442 RDRIGNEIFERVIR 455 + IG V R Sbjct: 284 Q-YIGATATADVRR 296 >gi|62001424|gb|AAX58414.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKERPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|126344397|ref|XP_001365113.1| PREDICTED: similar to calcium-dependent chloride channel-1, partial [Monodelphis domestica] Length = 660 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 56/162 (34%), Gaps = 23/162 (14%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G F+ + R + + + G T+I ++TA+ I + Sbjct: 100 TGMVTFDSSATIQSALIQIETDAQRNSLISR-LPTAAGGGTSICSGLRTAFTVIKNKFST 158 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAF--SVNKTQQE 413 + IVLLTDGE + I+ C ++ K G I T+A S + +E Sbjct: 159 DGSE----------IVLLTDGE-----DSTISSCFDEVKQSGAIIHTVALGPSADPGLEE 203 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 A+ SP A + + L F GN + Sbjct: 204 LAKMTGGMKTSPTD--NAQN-NGLIDAFSALSSGNGAITQRS 242 >gi|292624276|ref|XP_002665574.1| PREDICTED: collagen alpha-1(XXVIII) chain [Danio rerio] gi|225310547|emb|CAQ19234.1| collagen type XXVIII alpha 1 c precursor [Danio rerio] Length = 1170 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 68/198 (34%), Gaps = 22/198 (11%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 S E FV SS +V+D + S+I + + R+G ++ + Sbjct: 795 KCRTSPLELVFVIDSSESVGPDNYEVVKDFVNSLIDHVSVS---REATRVGVVLYSHVEV 851 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 S + VKT +G T A++ A ++ Sbjct: 852 VVASLQ---QLYDQAAVKTAVRRMPYLGEGTFTGSAIRRATQLFQAARPG---------- 898 Query: 367 AKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK-TQQEKARYFLSNCAS 424 +K V+LTDG + +D A S GI I + N +Q + + ++ AS Sbjct: 899 VRKVAVVLTDGLADNRDAVSLKDAAEGAHSAGIEIFVVGIVNNSDSQYAEFKNEMNILAS 958 Query: 425 P---NSFFEANSTHELNK 439 N + + +L+ Sbjct: 959 DPDENYVYLTDDFLKLHA 976 >gi|111024162|ref|YP_707134.1| hypothetical protein RHA1_ro07212 [Rhodococcus jostii RHA1] gi|110823692|gb|ABG98976.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 70/201 (34%), Gaps = 33/201 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +DA S D + + +G F S + R K Sbjct: 109 TRLAAAQDAAKSF------ADGLTPGINLGLVAFAGTASVLVSPT-----TNREASKVAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + TA +A+ + ++ S + ++ IVLL+DG EN D Sbjct: 158 DNLQLSERTATGEAI---FTSLQSIDTLAAVLGGSDQAPPARIVLLSDGKQTVPENPDDP 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEAN 432 G +AK + + I TI+F + + E L A S SFF A+ Sbjct: 215 RGGFTAARQAKDKDVPISTISFGTSYGKVEIEDERIPVPVDDPSLREIANLSGGSFFTAS 274 Query: 433 STHELNKIF---RDRIGNEIF 450 S EL ++ ++IG E Sbjct: 275 SLEELRDVYDTLEEQIGFETT 295 >gi|158316887|ref|YP_001509395.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158112292|gb|ABW14489.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 319 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 59/194 (30%), Gaps = 28/194 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + + + + +G F R V++ Sbjct: 107 TRLEAAKQGAQAFVDQL---PPRIN---LGLVSFAGSAAVLVP-----ASTDRESVRSGI 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +TA+ + + + I ++ + + IVLL+DGE T+ Sbjct: 156 RGLQLGPATAVGEGIYASLQAIATAGQRLSD--EGQPPPPAAIVLLSDGETTRGRPNT-Q 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHEL 437 A+ I + TIA+ + + L A + S+ A S EL Sbjct: 213 AATAARDAEIPVDTIAYGTSDGTLDVGGQQIPVPVNEEALRELADQTGGSYHRATSGDEL 272 Query: 438 NKIFR---DRIGNE 448 ++R IG Sbjct: 273 QSVYRGLGSSIGYR 286 >gi|331694298|ref|YP_004330537.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948987|gb|AEA22684.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 362 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 45/151 (29%), Gaps = 23/151 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A+ + R+G F+ + + R + Sbjct: 107 RLAAAQEAVRRFVEEQDSG------TRIGLVVFSGFAELAVAPT-----TDRDAITRALD 155 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMK-----------NNLEAKKYIVLLTDGE 378 T + A+ + D I + D A + +VLLTDG Sbjct: 156 GLTTGRGTTVGSAILKSVDAISEIDPDVAPSDPAPGTVPPPPRAPGTYAPEIVVLLTDGA 215 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 NT A +G+R+ I F ++ Sbjct: 216 NTTG-VTPEDAAKTAAERGVRVYPIGFGTDE 245 >gi|297190882|ref|ZP_06908280.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197722677|gb|EDY66585.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 518 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 76/212 (35%), Gaps = 27/212 (12%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR------ 296 Y P ++ FV S ++ +R+ + + VR Sbjct: 317 ADYGDPERDRAAQVIFVLDFSSSMRGERITALRETIDGLAGGDDS--PSGKFVRFYRGET 374 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + F RV+ + + ++ + + + A D+ GSTAI ++ AY + D Sbjct: 375 LTVMRFGGRVLEERNITYDGPRDLDRLRGVVASDDFA-GSTAIWSSLDHAYRAVARDLVD 433 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK----SQGIRIMTIAFS-VNKTQ 411 R + IVL+TDGEN + + A+ ++ +R TI + + + Sbjct: 434 RPERRVS-------IVLMTDGENNAGMDVDAFVRAHARLPEDARRVRTYTIRYGEADTRE 486 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ A+ +A L F++ Sbjct: 487 LDRGAR-----ATGGRMVDATD-RSLLSAFKE 512 >gi|156405002|ref|XP_001640521.1| predicted protein [Nematostella vectensis] gi|156227656|gb|EDO48458.1| predicted protein [Nematostella vectensis] Length = 308 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 53/167 (31%), Gaps = 12/167 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 V+D ++ I + R F+ F+ +K Sbjct: 112 VKDYKNGILALQTLITRAKEDTRYAGITFSTEANITFYFT-----DPLDAMKGLGGITYA 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A+ + + + R+ K+ ++ +TDG++ + E + + Sbjct: 167 PGMTNTQAALDICRTQLWLNKKSGFRRLS----FKRILI-VTDGQSNINMERTLYNAFQL 221 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 K+ GI I +A + S ++ + + L ++ Sbjct: 222 KNMGIEIFVVAVGKYLRGIAEIVGLAS--STDAHLYRVRNLRGLLEV 266 >gi|150251392|gb|ABR68008.1| matrilin-like 85 kDa protein [Lehmannia valentiana] Length = 716 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 43/250 (17%), Positives = 79/250 (31%), Gaps = 35/250 (14%) Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + V + + + + P C+ Y + +E D + Sbjct: 471 TTDKTKCVAADACNTNRKCEHVCTVIPTGRKCSCRDGYSIND-------EDETLCDEVEV 523 Query: 265 RHVIKKKHLVRDALASVIRS------------IKKIDNVNDTVRMGATFFNDRVISDPSF 312 ++ DA S++ IK ++ VR G F+ + + Sbjct: 524 CQEKADIIMLFDASNSILLENFDKQFIFAKRLIKNFKIGSNDVRFGGVVFSQKTQLLFNL 583 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 H + K + ST ++A + + S E K + IV Sbjct: 584 K--DHDDFDGLSKGLTNVKYLDSSTKTDEAFNLVVNDKMFSVE------KGGRVSAPDIV 635 Query: 373 LL-TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFE 430 LL TDG N + I + K GI I+++A ++ L +S P E Sbjct: 636 LLFTDG-NPTSPTKTITSADTVKKNGISIISLAI-----GKDLDMDILRTISSKPEFAIE 689 Query: 431 ANSTHELNKI 440 A + L+ + Sbjct: 690 ATNYDMLDYV 699 >gi|111221591|ref|YP_712385.1| hypothetical protein FRAAL2157 [Frankia alni ACN14a] gi|111149123|emb|CAJ60806.1| conserved hypothetical protein; putative membrane protein [Frankia alni ACN14a] Length = 319 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 60/193 (31%), Gaps = 28/193 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + + + + +G F R V+ Sbjct: 108 RLEAAKQGAEAFVDQL---PPRIN---LGLVSFAGSATVLVP-----ASTDRESVRAGIR 156 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +TAI + + + I ++ + IVLL+DGE T+ A Sbjct: 157 GLQLGPATAIGEGIFASLQAINTAGKRFSDA--GQSPPPAAIVLLSDGETTRGRPNTQAT 214 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHELN 438 + A+ + + TIA+ + + L+ A + S+ A + EL Sbjct: 215 -DAARQAHVPVDTIAYGTSDGTLDVGGQEVPVPVNEQALNEIADQTEGSYHRAATGDELR 273 Query: 439 KIFR---DRIGNE 448 +++ IG Sbjct: 274 SVYKGLGSSIGYR 286 >gi|168699403|ref|ZP_02731680.1| hypothetical protein GobsU_07777 [Gemmata obscuriglobus UQM 2246] Length = 354 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 83/276 (30%), Gaps = 52/276 (18%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS----- 262 A ++ +++ G +R +V+C L ++ +D S Sbjct: 44 ARPGRRATAARRGGPALRGLACLLLVVACAGPRRPDLVTQLPAKGIALVVALDVSGSMGA 103 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR----MGATFFNDRVISDPSF 312 + + R AL + D R +G F + Sbjct: 104 EDVVWTPGAPSVSRLEAARRALKLFLAGGAAPDGTAFDPRPGDAVGLVAFAAVPETVCPA 163 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + L + + G T I D++ A + ++++ + ++ Sbjct: 164 TLNHSVLFKVADALQPKGGADAG-TNIGDSLAEAVIRLDAADQKS-----------RVLI 211 Query: 373 LLTDGENT--------------QDNEEGIAICNKAKSQGIRIMTIAFSVNK--------- 409 LL+DGE+ + A + G+R+ TI + Sbjct: 212 LLSDGEHNILKEDVRDAQRPGIDRTLKPREAAQLAANLGVRVYTIDAGGDPPLGAPPDAV 271 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 Q+ R L + A + F+A S EL +R+ Sbjct: 272 AQRFAGRKALKDVAEMTGGKSFQATSGAELLSAYRE 307 >gi|149051203|gb|EDM03376.1| coagulation factor C homolog (Limulus polyphemus) (predicted), isoform CRA_a [Rattus norvegicus] gi|169642483|gb|AAI60874.1| Coch protein [Rattus norvegicus] Length = 552 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 68/187 (36%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 358 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 413 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ A G TA DA+ + D Sbjct: 414 T--YDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAISFTVRNVFGPIRDS----- 466 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D A + A GI I ++ + + + S Sbjct: 467 ---PNKNFLVIVTDGQSYDDVRGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 518 Query: 423 ASPNSFF 429 ++FF Sbjct: 519 KESHAFF 525 >gi|119596272|gb|EAW75866.1| matrilin 4, isoform CRA_a [Homo sapiens] Length = 391 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 172 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 226 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 227 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 280 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 281 RAKEEGIVMYAVGVGKAVEAE------LREIAS 307 >gi|56417742|emb|CAI21077.1| matrilin 4 [Homo sapiens] Length = 432 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 213 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 267 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 268 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 321 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 322 RAKEEGIVMYAVGVGKAVEAE------LREIAS 348 >gi|330904068|gb|EGH34640.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 84 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 32/93 (34%), Gaps = 12/93 (12%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + +R + I +TA+ DA+ A + + +VL Sbjct: 3 NNPRRTVRVWLDEARIGI-AGKNTALGDAIGLALKRLRMRPATS-----------RALVL 50 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 +TDG N + I A +G++I I Sbjct: 51 VTDGANNAGQIDPITAARLAAEEGVKIYPIGIG 83 >gi|313219850|emb|CBY30766.1| unnamed protein product [Oikopleura dioica] Length = 1473 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 67/180 (37%), Gaps = 23/180 (12%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S SL K ++ + ++ I + V++G T F++ + + + + Sbjct: 1250 SGSLTSAPNKDQVLMNFTNNLANMYDTI----NQVKIGLTSFSESSVLEMPLDFYNQLEL 1305 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + V + T I ++TA + + + ++L+TDG + Sbjct: 1306 QDGVS---NMTWQGSFTNITSGVETALNDMD-----------TSDAVDDVMILITDGFQS 1351 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + ++AK++G+R++ + F Y + N + A + EL I Sbjct: 1352 TNTTLMFQMIDQAKAEGVRLIALGFFGEFAFYSPNLYLM-----TNEVYHAANYAELLAI 1406 >gi|223936328|ref|ZP_03628240.1| von Willebrand factor type A [bacterium Ellin514] gi|223894846|gb|EEF61295.1| von Willebrand factor type A [bacterium Ellin514] Length = 657 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 28/116 (24%), Positives = 45/116 (38%), Gaps = 14/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 RMG F + ++I +E G TA+ +A+ TA +T Sbjct: 130 RMGLVAFAGTAFLQCPLTLDDAAFSQSIDSLDTRTISE-GGTALAEAINTARET------ 182 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 KN + K +VL TDGE + ++ KA ++G+ I TI + Sbjct: 183 -----FKNEKDNHKVLVLFTDGE--DQDMGAVSAAEKAAAEGMLIFTIGIGTPDGE 231 >gi|953237|gb|AAA99719.1| collagen type XII alpha-1 precursor [Mus musculus] Length = 3067 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 56/171 (32%), Gaps = 29/171 (16%) Query: 278 LASVIRSIKK---IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + + + K I V++ ++ ++ + + I+K Sbjct: 463 VRAFLEVLAKSFEISPNR--VQISLVQYSRDPHTEFTLK--EFNRVEDIIKAINTFPYRG 518 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 GST AM + I N+ + K ++L+TDGE++ + K + Sbjct: 519 GSTNTGKAMTYVREKIFVPNKGSRSNVP------KVMILITDGESSDAFRDP---AIKLR 569 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFR 442 + + I + ++ R L ASP F F+ Sbjct: 570 NSDVEIFAVGV------KDAVRSELEAIASPPAETHVFTVED----FDAFQ 610 >gi|74224199|dbj|BAE33710.1| unnamed protein product [Mus musculus] Length = 552 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 68/187 (36%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 358 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 413 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ A G TA DA+ + D Sbjct: 414 T--YDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVFGPIRDS----- 466 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D A + A GI I ++ + + R S Sbjct: 467 ---PNKNFLVIVTDGQSYDDVRGPAAAAHDA---GITIFSVGVAWAPL--DDLRDMASKP 518 Query: 423 ASPNSFF 429 ++FF Sbjct: 519 KESHAFF 525 >gi|6680956|ref|NP_031754.1| cochlin precursor [Mus musculus] gi|311771523|ref|NP_001185764.1| cochlin precursor [Mus musculus] gi|12644458|sp|Q62507|COCH_MOUSE RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801415|gb|AAC39949.1| Coch-5B2 gene product [Mus musculus] gi|26324626|dbj|BAC26067.1| unnamed protein product [Mus musculus] gi|28277390|gb|AAH45137.1| Coagulation factor C homolog (Limulus polyphemus) [Mus musculus] gi|74178965|dbj|BAE42713.1| unnamed protein product [Mus musculus] gi|74209551|dbj|BAE23310.1| unnamed protein product [Mus musculus] Length = 552 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 68/187 (36%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 358 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 413 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ A G TA DA+ + D Sbjct: 414 T--YDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVFGPIRDS----- 466 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D A + A GI I ++ + + R S Sbjct: 467 ---PNKNFLVIVTDGQSYDDVRGPAAAAHDA---GITIFSVGVAWAPL--DDLRDMASKP 518 Query: 423 ASPNSFF 429 ++FF Sbjct: 519 KESHAFF 525 >gi|307302722|ref|ZP_07582478.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|307318570|ref|ZP_07598004.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306895910|gb|EFN26662.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306903086|gb|EFN33677.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 577 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 17/65 (26%), Positives = 33/65 (50%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++ +KS TG+ + AL+MP+++ G+ +D + L+ A A I A+ + Sbjct: 8 FRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSVADLASIAAAADVSS 67 Query: 71 SLEEV 75 + E V Sbjct: 68 AEEAV 72 >gi|310641808|ref|YP_003946566.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246758|gb|ADO56325.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 600 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 53/178 (29%), Gaps = 30/178 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P V +S K ++ + + K+ G Sbjct: 33 AASPSASKVDAVLVVDVSNSMNTSDPGKIGNEAMKMFIDMLSTQNDKV---------GIV 83 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + D V + + + + +KTF N T + ++ A + + H Sbjct: 84 AYTDVVQREKALLNISSEADKQELKTFIDGLNRGAYTDTSVGVKEAIRILQD-GKTAGHA 142 Query: 361 MKNNLEAKKYIVLLTDGEN----------TQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 IV+L DG N +Q ++ +AK G+ I TI + + Sbjct: 143 P--------MIVMLADGNNDFNKTTGRTESQSAQDMAQAVAEAKKSGVPIYTIGLNAD 192 >gi|94498564|ref|ZP_01305119.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] gi|94422007|gb|EAT07053.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] Length = 634 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 57/172 (33%), Gaps = 27/172 (15%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 ++ A L+PV+ G +D R + L+ A + + + ++ Sbjct: 31 RGSTLALMAAGLIPVI-AALGAGIDAGRLYLVKSQLQAGVDAAALAGARAFAVTDGSPAA 89 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 R ++ Y NF ++ + D + + + V ++ Sbjct: 90 R--------DKQASAYFYGNFASDYMGVSNLQLTPDFKTVGGINVTTITARAIVPMT--- 138 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 F+R G + +Q A+AE R + + V+D + SM Sbjct: 139 ---------FMRIFGFQPRTMQAVAKAELQPR------PLEVMVVLDDTGSM 175 >gi|327270784|ref|XP_003220168.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 952 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 59/184 (32%), Gaps = 30/184 (16%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S + ++ A + I + D +G FN IR Sbjct: 310 SGSMYGVRVARLKQAAEIFLLQI-----IEDGSWVGIVTFNSAATIKTGLQQITSDSIRR 364 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + + + G T I + ++ + E IVLLTDGE Sbjct: 365 SLTGY-LPVTAGGGTRICNGVEAGFKVFKQKYASEKGCE---------IVLLTDGE---- 410 Query: 383 NEEGIAIC-NKAKSQGIRIMTIAF--SVNKTQQEKARYF--LSNCASPNSFFEANSTHEL 437 + I+ C ++ K G I TIA S + +E A L A+ + ++ L Sbjct: 411 -DSTISYCLDEVKRSGSIIHTIALGRSADPGLEELADMTGGLKFSATDS-----LDSNSL 464 Query: 438 NKIF 441 F Sbjct: 465 IDAF 468 >gi|62001414|gb|AAX58409.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|62001328|gb|AAX58366.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|258544594|ref|ZP_05704828.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] gi|258520172|gb|EEV89031.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] Length = 563 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 60/190 (31%), Gaps = 25/190 (13%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S K LV+ + +++ D R+ ++ Sbjct: 201 PPANLVFLIDTSGSMDDPDKLPLVKKTVCHFAEALRADD------RISLITYSGSTAEIL 254 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS-NEDEVHRMKNNLEAKK 369 + + G+TA +A++ AYD + +D ++R Sbjct: 255 PP---TAGDQKETIIAALKPLRAHGATAGGEALRMAYDAAAKNYRKDGINR--------- 302 Query: 370 YIVLLTDGENTQDNEEG---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+L TDG+ + + GI + T+ + E A Sbjct: 303 -ILLATDGDFNVGISDPATLKNYVADKRKSGISLTTLGYGSGNYNDEMMEQLAD--AGDG 359 Query: 427 SFFEANSTHE 436 ++ +S E Sbjct: 360 NYSYIDSEAE 369 >gi|224090449|ref|XP_002195035.1| PREDICTED: integrin, alpha 1 [Taeniopygia guttata] Length = 1184 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 67/176 (38%), Gaps = 20/176 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ +D ++G + V+ + + ++ A + G T A+ Sbjct: 200 LRNMDIGPQQTQVGIVQYGQTVVHEFYL--NTYSTTEDVMAA-ASRIRQRGGTQTMTALG 256 Query: 345 --TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 TA + E H + + +K +V++TDGE + DN + + + + I+ Sbjct: 257 IDTAREEAF----TEAHGARRGV--QKVMVIVTDGE-SHDNYRLQEVIDDCEDENIQRFA 309 Query: 403 IAFSVNKTQ----QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA + ++ EK + + AS FF + L I + +G IF Sbjct: 310 IAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALLTI-VEALGERIFA 364 >gi|194387934|dbj|BAG61380.1| unnamed protein product [Homo sapiens] Length = 589 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 81/210 (38%), Gaps = 35/210 (16%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIQGLTRMPGG--RDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY++++TDG E ++ G+ N+AK G+++ ++A + + + LS Sbjct: 149 ENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEANSTHELNKIFRDRIGNEI 449 A+ +++ F A + + I I Sbjct: 204 ATDHTYRRNFTAADWGQSRDA-EEAISQTI 232 >gi|194386506|dbj|BAG61063.1| unnamed protein product [Homo sapiens] Length = 404 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 81/210 (38%), Gaps = 35/210 (16%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIQGLTRMPGG--RDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY++++TDG E ++ G+ N+AK G+++ ++A + + + LS Sbjct: 149 ENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEANSTHELNKIFRDRIGNEI 449 A+ +++ F A + + I I Sbjct: 204 ATDHTYRRNFTAADWGQSRDA-EEAISQTI 232 >gi|62001334|gb|AAX58369.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|119629725|gb|EAX09320.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] gi|119629726|gb|EAX09321.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] Length = 726 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 81/210 (38%), Gaps = 35/210 (16%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIQGLTRMPGG--RDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY++++TDG E ++ G+ N+AK G+++ ++A + + + LS Sbjct: 149 ENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEANSTHELNKIFRDRIGNEI 449 A+ +++ F A + + I I Sbjct: 204 ATDHTYRRNFTAADWGQSRDA-EEAISQTI 232 >gi|114684807|ref|XP_001158390.1| PREDICTED: collagen, type VI, alpha 1 isoform 1 [Pan troglodytes] gi|114684809|ref|XP_001158445.1| PREDICTED: collagen alpha-1(VI) chain isoform 2 [Pan troglodytes] Length = 1028 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 81/210 (38%), Gaps = 35/210 (16%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIQGLTRMPGG--RDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY++++TDG E ++ G+ N+AK G+++ ++A + + + LS Sbjct: 149 ENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEANSTHELNKIFRDRIGNEI 449 A+ +++ F A + + I I Sbjct: 204 ATDHTYRRNFTAADWGQSRDA-EEAISQTI 232 >gi|87196339|ref|NP_001839.2| collagen alpha-1(VI) chain precursor [Homo sapiens] gi|125987811|sp|P12109|CO6A1_HUMAN RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|30851190|gb|AAH52575.1| Collagen, type VI, alpha 1 [Homo sapiens] gi|119629727|gb|EAX09322.1| collagen, type VI, alpha 1, isoform CRA_b [Homo sapiens] Length = 1028 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 81/210 (38%), Gaps = 35/210 (16%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIQGLTRMPGG--RDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY++++TDG E ++ G+ N+AK G+++ ++A + + + LS Sbjct: 149 ENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEANSTHELNKIFRDRIGNEI 449 A+ +++ F A + + I I Sbjct: 204 ATDHTYRRNFTAADWGQSRDA-EEAISQTI 232 >gi|30032|emb|CAA33888.1| precursor polypeptide (AA -19 to 237) [Homo sapiens] Length = 256 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 81/210 (38%), Gaps = 35/210 (16%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIQGLTRMPGG--RDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY++++TDG E ++ G+ N+AK G+++ ++A + + + LS Sbjct: 149 ENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEANSTHELNKIFRDRIGNEI 449 A+ +++ F A + + I I Sbjct: 204 ATDHTYRRNFTAADWGQSRDA-EEAISQTI 232 >gi|300727143|ref|ZP_07060562.1| BatB protein [Prevotella bryantii B14] gi|299775687|gb|EFI72278.1| BatB protein [Prevotella bryantii B14] Length = 340 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 22/127 (17%), Positives = 42/127 (33%), Gaps = 15/127 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 +D+ N+ ++G F + + + G T I A+ T Sbjct: 121 NLVDHFNND-QIGLVVFAGQSYVQLPITSDYVSAKMFLQDIQPSLIQTQG-TDIAGAINT 178 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 +H N + K I+++TDGE+ + I A +GI + + Sbjct: 179 C-----------MHAFTPNDKVGKAIIVITDGEDHEG--GAIEAAKAAHDRGINVFILGI 225 Query: 406 SVNKTQQ 412 +K Sbjct: 226 GDSKGAP 232 >gi|224054051|ref|XP_002190865.1| PREDICTED: collagen, type VI, alpha 1 [Taeniopygia guttata] Length = 1023 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 56/179 (31%), Gaps = 16/179 (8%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 C + + GP D +L + S K V+ + + +++V Sbjct: 814 CPDYTCPITFTGPADITLLVDSSTSVGSKNFETTK-KFVKQLSGRFLEA---SKPTDESV 869 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ ++ R + + + I K E +T +N A+Q + S Sbjct: 870 RISVVQYSGRNQQKVEAQFQYNYTV--IAKAIDNMEFMNDATDVNSALQFITELYRRSAR 927 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQE 413 K+ +V +DG + I A+ GI I +A + Sbjct: 928 AAAK--------KRVLVF-SDGHSQGITARAIERAVQDAQKAGIEIYVLAVGSQANEPN 977 >gi|312197712|ref|YP_004017773.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311229048|gb|ADP81903.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 372 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 47/145 (32%), Gaps = 10/145 (6%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A + I K R+G F+ + KL+ + Sbjct: 107 RITAAEKAATAFI----KAQPAGS--RIGLVTFSGIAGLLVPPTTDSQKLLDALQNLTTS 160 Query: 330 DENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +G DA+ A ++ + V A IV+LTDG NTQ + Sbjct: 161 RGTAIGQGILTSIDAIADADPSV-APTGSAVSGNGTGPYAADVIVVLTDGANTQG-VDPQ 218 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +A ++ +R+ TI F Sbjct: 219 TAAKQAAARRLRVYTIGFGTTTPAP 243 >gi|189461338|ref|ZP_03010123.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] gi|189431867|gb|EDV00852.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] Length = 341 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 45/144 (31%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ ++ + ND V M F + + + ++T Sbjct: 110 SRLEKSKKLISRLVETF-----NNDKVAM--IVFAGEAFTQLPITSDYISA-KMFLETIN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T I A+ A + N + IVL+TDGEN + + Sbjct: 162 PSLISTQGTDIAGAINLAMKSFT-----------PNEGVGRAIVLITDGENHEG--GAVE 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 +A +G+R+ + Sbjct: 209 AAQEAAKKGVRVFVLGVGSPDGAP 232 >gi|77464595|ref|YP_354099.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389013|gb|ABA80198.1| Putative membrane protein with von Willebrand (VWA) domain [Rhodobacter sphaeroides 2.4.1] Length = 651 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 35/277 (12%), Positives = 84/277 (30%), Gaps = 26/277 (9%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + + + + +N F + + +P + +L + Sbjct: 227 RAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSVTRTPWNPETQLVHVALQGRM 286 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L+ +D+S K L++ + ++ ++ D V G+ Sbjct: 287 PAIEDRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPEDQVAIVTYAGSAG--- 343 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 V++ R+ + + + GSTA + + AY M Sbjct: 344 EVLAP------TAANQRSTILSALDRLDAGGSTAGEEGLALAYR--------TASEMAGA 389 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 E + +VL TDG+ + + + GI + + F + N Sbjct: 390 GEVTR-VVLATDGDFNLGISDPEELARLVAHERDTGIYLSVLGFGRGNLDDATMQALAQN 448 Query: 422 CASPNSFFEANSTHE--LNKIFRDR---IGNEIFERV 453 ++ ++ + + L I +++ +V Sbjct: 449 GNGQAAYIDSLNEAQKVLVDQLSGALFPIADDVKVQV 485 >gi|156402981|ref|XP_001639868.1| predicted protein [Nematostella vectensis] gi|156226999|gb|EDO47805.1| predicted protein [Nematostella vectensis] Length = 240 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 47/146 (32%), Gaps = 19/146 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G ++ +G + ++ V+ T A++ + S + Sbjct: 78 TRVGVGLYSTFASVPIP--FGKYTSLQETVEGIKKLRYPGEGTRTGRALKLMKTHLFSQS 135 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + H K +++LTDG ++ A + G+ + + + +E Sbjct: 136 RPKAH---------KVLIVLTDG---TSVDDVKAPAKALRESGVEVFAVGIGEHYRPRE- 182 Query: 415 ARYFLSNCASPNSFFEANSTHELNKI 440 L + A+ +L + Sbjct: 183 ----LKDIATDTGHVLTAGFRDLMSV 204 >gi|115375477|ref|ZP_01462737.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|310821370|ref|YP_003953728.1| von willebrand factor, type a [Stigmatella aurantiaca DW4/3-1] gi|115367520|gb|EAU66495.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|309394442|gb|ADO71901.1| Von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] Length = 562 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 69/189 (36%), Gaps = 24/189 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 VD+S H K L ++A+ +R++ + D V G + + + P+ + V Sbjct: 219 LVDTSGSMHSQDKLPLAKEAMKVAVRNLNENDTVAIVTYAG----STQDVLPPTPATEVQ 274 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ I G TA+ M+ AY + + IV LTDG Sbjct: 275 RIHTAID-----LLQSGGGTAMGSGMELAYRHAVKKASGNA--------ISRVIV-LTDG 320 Query: 378 ENTQ----DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + E ++ K ++G+ + TI F + + + + F +S Sbjct: 321 DANIGPNLSAESMLSGIEKYVAEGVTLSTIGFGMGNYRDDLMERLADK--GNGNCFYVDS 378 Query: 434 THELNKIFR 442 E K+F Sbjct: 379 YQEAKKVFE 387 >gi|257469960|ref|ZP_05634052.1| von Willebrand factor (vWA) type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] Length = 322 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 46/144 (31%), Gaps = 24/144 (16%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L ++++ +K R+G F+D + I Sbjct: 98 YPNRLEAAKRTLENLLQGLKGD-------RIGFIPFSDSAYIQMPLTDDYSIGKNYINAL 150 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T + A++ A + N D K I++L+DG +++ Sbjct: 151 DTN-LISGGGTELYQALELAEKSFKEINSDN-----------KTIIVLSDG--GDFDDKS 196 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT 410 + K + + +I ++ Sbjct: 197 LKF---VKDNKMNVFSIGIGTDEG 217 >gi|194207263|ref|XP_001489838.2| PREDICTED: coagulation factor C homolog, cochlin (Limulus polyphemus) [Equus caballus] Length = 549 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 355 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 410 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + D Sbjct: 411 T--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPMRDS----- 463 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D A + A GI I ++ + + + S Sbjct: 464 ---PNKNFLVIVTDGQSYDDVRGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 515 Query: 423 ASPNSFF 429 ++FF Sbjct: 516 KESHAFF 522 >gi|153008592|ref|YP_001369807.1| hypothetical protein Oant_1261 [Ochrobactrum anthropi ATCC 49188] gi|151560480|gb|ABS13978.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 576 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 27/63 (42%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +++ G+ + AL+ P+ L V VD L+ A A + + L Q+ Sbjct: 7 RFLRARGGNLATMAALVSPIFLAVAAFSVDTSSLFLERRQLQSMADFAAVAGAASLSQAN 66 Query: 73 EEV 75 + V Sbjct: 67 DAV 69 >gi|149034209|gb|EDL88979.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Rattus norvegicus] Length = 706 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 77/216 (35%), Gaps = 24/216 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S KK R+AL +++ + D N V Sbjct: 256 NGYFVHHFAPEDLPTMAKNVLFVIDKSGSMAGKKIQQTREALIKILKDLSTQDQFNIIVF 315 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G + + +++ + +A G T IN A+ +A + + SN+ Sbjct: 316 SGEAN-QWEQLLVQATEENLNRAV-----DYASKIPAQGGTNINKAVLSAVELLDKSNQA 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQE 413 E+ K+ I+LLTDGE T I +A + + + F + Sbjct: 370 ELLPSKSVSL----IILLTDGEPTVGETNPKIIQKNTQEAINGRYSLFCLGFGFDVNYP- 424 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIFRD 443 FL A N + +++S +L +++ Sbjct: 425 ----FLEKLALDNGGLARRIYEDSDSALQLQDFYQE 456 >gi|51103206|gb|AAT96347.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|47219514|emb|CAG09868.1| unnamed protein product [Tetraodon nigroviridis] Length = 1450 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 65/185 (35%), Gaps = 13/185 (7%) Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + D SP ++C + + ++ F+ S + + V + S+ Sbjct: 662 MKWTDCSFSPSALNCCLDKHLYFLATVCKGAKADLVFLIDGSWSIGDESFNKVIQFVTSM 721 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 I + + I +++ ++D ++ + ++ G+T Sbjct: 722 IGAFEVISPNG--MQVSLVQYSDDAKTEFKL--NTYYNKGIVISALKSVRYRGGNTKTGI 777 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A++ Y+ + +S+ + K +V+LTDG + ++ K + G + Sbjct: 778 ALKHVYEKVFTSDSGMRRNVP------KVLVVLTDG---RSQDDVKKSAEKLQHSGYSVF 828 Query: 402 TIAFS 406 + + Sbjct: 829 VVGVA 833 >gi|303235711|ref|ZP_07322318.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484158|gb|EFL47146.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 341 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 36/117 (30%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + + T I A+ E Sbjct: 130 KIGIVVFAGDAFVQLPITSDYISAKMFLNNISP-ELIGSQGTDIGKAI-----------E 177 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 H K I+++TDGEN + E + +A+ QGIR+ + + Sbjct: 178 LSEHSFSEKANFGKAIIIITDGENHEKGAE--EMAREAQKQGIRVFILGIGSPQGAP 232 >gi|198274643|ref|ZP_03207175.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] gi|198272090|gb|EDY96359.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] Length = 339 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 23/144 (15%), Positives = 46/144 (31%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ ++ + ND V M F + + + ++T + Sbjct: 110 SRLEKSKKLISRLVETF-----NNDKVAM--IVFAGEAFTQLPITSDYVSA-KMFLETIS 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T I A+ A + N + IVL+TDGEN + I Sbjct: 162 PSLITTQGTDIRGAIDLAMKSFT-----------PNEGVGRAIVLITDGENHEG--GAIE 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 +A +G+R+ + Sbjct: 209 AAQEAAKKGMRVFVLGVGSPDGSP 232 >gi|13473814|ref|NP_105382.1| hypothetical protein mll4535 [Mesorhizobium loti MAFF303099] gi|14024565|dbj|BAB51168.1| mll4535 [Mesorhizobium loti MAFF303099] Length = 373 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 28/231 (12%), Positives = 66/231 (28%), Gaps = 1/231 (0%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +S +G + L++ ++ G VD +QA A+++ + S Sbjct: 22 FARSRSGSMMPLFFLMLVPIISAVGFSVDYTSAIQTRSNQQQALDAALLSITTMDTTSTL 81 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN-PRKSAYQVV 132 A + + +F + + M R + + Sbjct: 82 AQRQTALQDSFIANGGLGTATLNSFVAGTTTTPATGQASASFSMPTIFMKIARIDSVPIA 141 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++S ++ + + W +T T + + +SI++ + Y Sbjct: 142 VASAVSKPPALVNATFKIAKVSGWWNKTMTLYGTQFGATAAKPLMSIEYTYNGFGDPKGY 201 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + N G V+S V + ++ + S Y Sbjct: 202 GTTNVYTITNNGGADIKTLVQSQVCATASVTTFTGLPADAILQTSGSRKYS 252 >gi|296133928|ref|YP_003641175.1| hypothetical protein TherJR_2435 [Thermincola sp. JR] gi|296032506|gb|ADG83274.1| hypothetical protein TherJR_2435 [Thermincola potens JR] Length = 621 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 44/117 (37%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ +K+ G + ++ +M+ G+ +D+ R ++ L++ A A + S Sbjct: 9 ERFLKNEKGTVTVYLVIVFMIMVIFIGLFIDLARIKTAQNQLRRVANAAACSVLADYHTS 68 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 ++ Q +Y+ N + +NF + R V + K A Sbjct: 69 TKQDFGLFTYKGANYDQDFAKYVKANLTFSADQNFNLLDYRYEGSKADVSNSLDKEA 125 >gi|109094740|ref|XP_001104627.1| PREDICTED: matrilin-4-like, partial [Macaca mulatta] Length = 222 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 24 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVE 78 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 79 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 132 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 133 RAKEEGIAMYAVGVGKAVEAE------LREIAS 159 >gi|160837835|ref|NP_001104272.1| integrin, alpha D [Canis lupus familiaris] Length = 1168 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 43/300 (14%), Positives = 95/300 (31%), Gaps = 46/300 (15%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + T +A +S S W++ +M ++ C + Sbjct: 85 IPLHTPPDAVNMSLGLSLSAAASRPWLLACGPTMHRACGENMYAEGFCLLLDSHLQTIWT 144 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-EEHFVDSSSLRHVIKKKHLV 274 E +++ + S+ + D + F +++L +I+ HL+ Sbjct: 145 VPAALPECPSQEMDIVFLIDGSGSIEQSDFKQMKDFVRAVMGQFEGTNTLFSLIQYSHLL 204 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + + SW L+ IV+ Sbjct: 205 K--IHFTFTQFQS-------------------------SWNPLSLVDPIVQLD------- 230 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + S K K ++++TDG+ +D E + +A+ Sbjct: 231 GLTYTATGIRKVVEELFHSKNGARKSAK------KILIVITDGQKYKDPLEYSDVIPQAE 284 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 GI I + + A+ L N S + F ++ L+ I ++++ +IF Sbjct: 285 RAGIIRYAIGVG-DAFWKPSAKQELDNIGSEPAQDHVFRVDNFAALSSI-QEQLQEKIFA 342 >gi|296225414|ref|XP_002758468.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Callithrix jacchus] Length = 904 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 71/219 (32%), Gaps = 27/219 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K R+AL ++ + D N Sbjct: 256 NGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFN---- 311 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ ++FA + +G T INDA+ A + SN + Sbjct: 312 --LITFSSEATQWSPSLVPASAENVNKARSFAAAIHALGGTNINDAVLMAVQLLDRSNRE 369 Query: 357 EVHRMKNNLEAKKYIVLLTDGENT-----QDNEEGIAICNKAKSQGIRIMT----IAFSV 407 E ++ I+LLTDG+ T N + + + + F V Sbjct: 370 ERLPTRSVSL----IILLTDGDPTVGEGPASNSKTRCTGENVREAVSGQYSSLPGLGFDV 425 Query: 408 NKTQQEKARY---FLSNCASPNSFFEANSTHELNKIFRD 443 + ++ A + +++S +L +++ Sbjct: 426 SYAPEKLALDTGGLARRI-----YEDSDSAMQLQDFYQE 459 >gi|194226345|ref|XP_001488401.2| PREDICTED: similar to Collagen alpha-1(VI) chain [Equus caballus] Length = 1027 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 85/223 (38%), Gaps = 40/223 (17%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + + + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKAFTKRF--IDNLRDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K G T + A++ + ++ ++L+ Sbjct: 99 IIRGLTRMPSG--RDELKASVDAVKYFGKGTYTDCAIKKGLEELLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY++++TDG E ++ G+ N+AK GI++ ++A + + + LS Sbjct: 149 ENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEA------NSTHELNKIFRDRIGNEIFERVIRI 456 A+ +++ F A E+ D I + I V ++ Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEIISQTIDTITDMIKNNVEQV 246 >gi|62001442|gb|AAX58423.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|317064189|ref|ZP_07928674.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313689865|gb|EFS26700.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 325 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 46/144 (31%), Gaps = 24/144 (16%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L ++++ +K R+G F+D + I Sbjct: 101 YPNRLEAAKRTLENLLQGLKGD-------RIGFIPFSDSAYIQMPLTDDYSIGKNYINAL 153 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T + A++ A + N D K I++L+DG +++ Sbjct: 154 DTN-LISGGGTELYQALELAEKSFKEINSDN-----------KTIIVLSDG--GDFDDKS 199 Query: 387 IAICNKAKSQGIRIMTIAFSVNKT 410 + K + + +I ++ Sbjct: 200 LKF---VKDNKMNVFSIGIGTDEG 220 >gi|62001418|gb|AAX58411.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|62001332|gb|AAX58368.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|62001384|gb|AAX58394.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|62001330|gb|AAX58367.1| AvrE [Pseudomonas viridiflava] gi|62001336|gb|AAX58370.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|149732356|ref|XP_001492551.1| PREDICTED: similar to complement component 2 [Equus caballus] Length = 751 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 45/139 (32%), Gaps = 10/139 (7%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T +A+ + Y + + + E + ++ Sbjct: 313 SRDVMEVINSLDNIHYKDHENGTGTNTYEALNSVYIMMNNQMQRLGMNTVAWQEIRHAVI 372 Query: 373 LLTDGENTQDNEEGIAICNKAK---------SQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 LLTDG++ +A+ + K + + I I + S Sbjct: 373 LLTDGKSNMGGSPKLAV-DNIKELLNIKQKRNDYLDIYAIGVGNLDVDWRELNELGSKKD 431 Query: 424 SPNSFFEANSTHELNKIFR 442 F L+++F Sbjct: 432 GERHAFILKDAEALSQVFE 450 >gi|149694147|ref|XP_001503972.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Equus caballus] Length = 495 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 65/194 (33%), Gaps = 23/194 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + V+ L+ VI S+ N R+G + V + Sbjct: 42 VFVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNA---TRVGLVNYASAVKQEFPLR--A 96 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H +++ + T A+Q A S E R + + K ++++TD Sbjct: 97 HGSKAALLQAVRRIQPLSTGTMTGLAIQFAITRAFSEGEGGRAR---SPDISKVVIVVTD 153 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G + A ++++ GI + I + L AS S Sbjct: 154 GRPQDSVRDVSA---RSRASGIELFAIGVG------RVDKATLREIASEPQDEHVDYVES 204 Query: 434 ---THELNKIFRDR 444 +L+K F++ Sbjct: 205 YSVIEKLSKKFQEA 218 >gi|305665950|ref|YP_003862237.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] gi|88710725|gb|EAR02957.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] Length = 349 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 46/143 (32%), Gaps = 20/143 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + ++ +I + R+G + + + + Sbjct: 111 RLEKAKRLVSEIINQLASD-------RIGIIAYAGQAFPQLPITTDYGAAKMFLQNMNTN 163 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G TAIN+A++ A + + + +++DGE+ + Sbjct: 164 MLTSQG-TAINEAIELATTYYDDEEQTN-----------RVLFIISDGED-HSEGTTLKA 210 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ 412 + A +GI+I TI +K Sbjct: 211 VDDAIEEGIQIFTIGVGKSKGAP 233 >gi|86131263|ref|ZP_01049862.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] gi|85818674|gb|EAQ39834.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] Length = 344 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 38/117 (32%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + + D TAI +A++ A + Sbjct: 130 RIGIIAYAGSAYPQLPITTDYSSA-KLFLSQMNTDMLSSQGTAIGEAIELAKTYYNDEEQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +++DGE+ +A +A +GIRI TI + Sbjct: 189 TN-----------RVLFIISDGEDHVGEASSLA--EQANKEGIRIFTIGVGKTEGGP 232 >gi|294055720|ref|YP_003549378.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293615053|gb|ADE55208.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 326 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 64/198 (32%), Gaps = 24/198 (12%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S + ++ I + V+ L + ++ D R+G F Sbjct: 101 DLSGSMDARDFTNPEGERIDRLSAVKGVLDEFLTR-REGD------RVGLIVFGSAAFVQ 153 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ ++ + +T T DA+ + +E E + Sbjct: 154 VPFTQDLNACRILLEETTVRMAGPR--TVFGDALGLGIT-LFERSEVEE----------R 200 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCASP-- 425 I+ LTDG +T A +++ + T E L+ AS Sbjct: 201 VIIALTDGNDTGSRVPPAEAAKIANDNTVKVHVVGVGDPTTTGEDVLDEEALNAVASTTG 260 Query: 426 NSFFEANSTHELNKIFRD 443 +F AN+ EL I+ + Sbjct: 261 GRYFHANNREELEDIYTE 278 >gi|109900221|ref|YP_663476.1| vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] gi|109702502|gb|ABG42422.1| Vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] Length = 701 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 60/440 (13%), Positives = 136/440 (30%), Gaps = 88/440 (20%) Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS-----AY 129 + R P + + + + K + ++ ++ +T + P +S Y Sbjct: 78 IGERRIKGKIMPNAQAKAHFEQAKTQGRKASLIEQHRPNLFTNTIANIGPNESVSITIEY 137 Query: 130 QVVL-------SSRYDLLLNPLSLFLRSMGIKSWL-IQTKAEAETVSRSYHK-------E 174 Q V+ S R+ + + P + + + T+ ++V+ + Sbjct: 138 QQVVGFDEQTFSLRFPMTITPRYSPNNATDKSTVTTVNTQGWGQSVTAISQQIKTADEPA 197 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ-----------PADRTVKSYSSQNGKVG 223 + + + +D ++ SE P+N Q + + + Sbjct: 198 NPIRLSVELDSGFALTADDITSEHHPINISQQGEKNSGYHIELAQEHIANQDFALTWQPA 257 Query: 224 IRDEKLSPYMVSCNKSLYYML----------------YPGPLDPSLSEEHFVDSSSLRHV 267 + D + + Y L PS +D+S Sbjct: 258 LSDAPSAAHFSETQGKYRYGLVMLTPPVQDAYHSTGGAVAQQMPSREVVFLLDTSG-SMA 316 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + A+ + ++ DNVN FND + + + + Sbjct: 317 GESIVQAKRAVDFALTQLRPEDNVN------IIQFNDAPQALWKRAMPATAKHIQRARNW 370 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK---KYIVLLTDGENTQDNE 384 + G T + A+ A + +HR ++L + +V +TDG + NE Sbjct: 371 VASLHADGGTEMAPALTLALNK------PSLHRDDSDLLGSHKLRQVVFITDG--SVSNE 422 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++K R+ TI ++PNS+F + F Sbjct: 423 DALMSLIESKLADNRLFTIGIG----------------SAPNSYFMTQAAQAGRGTFT-Y 465 Query: 445 IGN------EIFERVIRITK 458 IG+ ++ ++T+ Sbjct: 466 IGDIQQVQHKMTALFNKLTR 485 >gi|32475534|ref|NP_868528.1| BatB [Rhodopirellula baltica SH 1] gi|32446076|emb|CAD75905.1| BatB [Rhodopirellula baltica SH 1] Length = 747 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 45/112 (40%), Gaps = 15/112 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + V +T+ I G + + DA++ A D + Sbjct: 167 RVGLVVFAGETRQTLPLTRHVEDFKQTLDS-VGIHSVRRGGSRLGDAIRVASDAFLDKTT 225 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFS 406 D K +V+LTDGE+ + + ++ +A + QGIRI TI Sbjct: 226 DH-----------KAMVILTDGEDQES--DPVSEAKRAYEEQGIRIFTIGLG 264 >gi|298207016|ref|YP_003715195.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] gi|83849650|gb|EAP87518.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] Length = 346 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 41/117 (35%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + D TAIN+A+Q A + Sbjct: 129 RVGIIAYAGSAFPQLPITTDYASAKMFLQNMN-TDMLSSQGTAINEAIQLAKTYYNDDEQ 187 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +++DGE+ + + + I +A +GIRI TI K + Sbjct: 188 TN-----------RVLFIISDGEDHEGDS--VNIAEEASEEGIRIFTIGVGTTKGGR 231 >gi|309361725|emb|CAP28912.2| hypothetical protein CBG_09720 [Caenorhabditis briggsae AF16] Length = 675 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 59/157 (37%), Gaps = 10/157 (6%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 +I + + VR+G ++ + H +++ + E G+T DA Sbjct: 520 SAISSLPISQEAVRVGLISYSGPGRTHVRVYLDKHNDKEKLIEEMFLMERHGGTTRTADA 579 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 ++ A E H + N+ KK +V+ TDG + A+++G++++ Sbjct: 580 IRYATKIF----EGMAHPARKNV--KKVLVVFTDG---YSQDHPRDAARGARAKGLQLIA 630 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 +A ++ + F + + EL + Sbjct: 631 VAVK-DRLAPPDEEQLAEIGGHAKNVFISPNGRELRE 666 >gi|222082657|ref|YP_002542022.1| hypothetical protein Arad_9368 [Agrobacterium radiobacter K84] gi|221727336|gb|ACM30425.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 405 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 44/380 (11%), Positives = 110/380 (28%), Gaps = 84/380 (22%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEE 92 ML G D +R +++ A+I A + + + + + + + Sbjct: 3 MLLAVGASFDYIRAYNVRQSMQSDLDAALIAAVKNVDAGDTDALKQKVSDWFHAQTE--- 59 Query: 93 YLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 + + DI DT + +++ ++ + + ++ Sbjct: 60 --------------SSYSLGDIEIDT--------TNHRITATASGTVP----TTLMKLAN 93 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I + + + + + SY +++ VID S SML + Q + Sbjct: 94 INTVPVSVASAVKGPASSY-----LNVYIVIDKSPSMLLAATTAGQQAM----------- 137 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 +C+ + + + + S+ + + + Sbjct: 138 --------------YNGIGCQFACHTGDSHTIGTATYSNNYAY------STEKKIKLRAD 177 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + DA+ VI I D ++ +++G D + + + + + + Sbjct: 178 VAVDAVHEVIDMISASDTNHERIKVGLYSLGDTITEVLAPTLDTTAAGKRVDSDLTSATS 237 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI----- 387 + + + ++ K ++LLTDG +Q Sbjct: 238 T-----TYTYFDVSLAALKNKVGTGGDG-SSSATPLKLVLLLTDGVQSQREWVTSGAKYQ 291 Query: 388 --------AICNKAKSQGIR 399 A C+ K Q Sbjct: 292 PKVAPLNPAWCDYIKKQSAT 311 >gi|303240107|ref|ZP_07326628.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302592376|gb|EFL62103.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 329 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 50/150 (33%), Gaps = 20/150 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ I + + R+G F+ + + L R + D G T + A+ Sbjct: 112 IESIIDNLEGDRIGFIPFSSAAYIQMPLT-DDYDLARMYLDVIDTDMIAGGGTNVGTALN 170 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A ++ ++ + +++L+DGE + N + I + +++ TI Sbjct: 171 LAENSFEETSSA-----------DRVVIILSDGE--EHNSNSVDILKSFNDEHLKVFTIG 217 Query: 405 FS------VNKTQQEKARYFLSNCASPNSF 428 V + + S F Sbjct: 218 IGTAKGGLVPDYGSDGGQKSGYKKDSNGEF 247 >gi|73745523|emb|CAI61969.2| putative TerY1 protein [Escherichia coli] Length = 239 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 55/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 48 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLT--------DLLSFQMPA 99 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 100 LTASGTTSLGEALTLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 157 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + + L + + + F+ Sbjct: 158 KAART-GVVV------ACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFK 202 >gi|291394751|ref|XP_002713732.1| PREDICTED: collagen, type XXVIII [Oryctolagus cuniculus] Length = 1132 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 39/187 (20%), Positives = 68/187 (36%), Gaps = 20/187 (10%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATF 301 + F+ SS I +D + + I ++ + +R+ A Sbjct: 38 NDFWASICFIDIVFIVDSSESSKIVHFDKQKDFVERLSDKIFQVTPGHSLKYDIRLAALQ 97 Query: 302 FNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVH 359 F+ V DP FS W K K A N +G T A+ SN + Sbjct: 98 FSSSVQIDPPFSSWKDLKTF----KQRAKSLNLIGQGTFSYYAI---------SNVTRLF 144 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + + K +L+TDG + N + +I A++ GI +TI S + + + + Sbjct: 145 KREGRKNGVKVALLMTDGIDHPKNPDVKSISEDARTSGISFITIGHSTDVNEAK--LRLI 202 Query: 420 SNCASPN 426 S +S Sbjct: 203 SGDSSSE 209 >gi|62001412|gb|AAX58408.1| AvrE [Pseudomonas viridiflava] gi|62001422|gb|AAX58413.1| AvrE [Pseudomonas viridiflava] gi|62001428|gb|AAX58416.1| AvrE [Pseudomonas viridiflava] gi|62001430|gb|AAX58417.1| AvrE [Pseudomonas viridiflava] gi|62001432|gb|AAX58418.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNATVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|260810222|ref|XP_002599902.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] gi|229285186|gb|EEN55914.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] Length = 1201 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 53/180 (29%), Gaps = 23/180 (12%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 +C S PLD +D S + + + A+ + D Sbjct: 652 YTACRGSAGSSSCAAPLD----LFFLLDGSGSVNAANFVKVKQFAVNV----VNTFDVSL 703 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+G ++DR + V+K + V + G T A+Q Sbjct: 704 TATRVGVVQYSDRNTLVFNLGNKVNK--PSTVSAINNIVYQSGGTNTGAALQYV------ 755 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + K I++LTDG+++ + G+ + I Q Sbjct: 756 ----RQYAAWRGGNVPKVIIVLTDGKSSDSVSGPSQ---NLVAAGVEVYAIGVGSFDHGQ 808 >gi|254444377|ref|ZP_05057853.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] gi|198258685|gb|EDY82993.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] Length = 808 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 53/409 (12%), Positives = 119/409 (29%), Gaps = 60/409 (14%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 +A +A + + L + R PK++ + N + ++ Sbjct: 70 EAIYSAPVPEAGALSELTIWAGERVLQGEVVPKEEADRIYEEEKSQG---NEVGKADKNA 126 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 ++ + P ++ V + Y PL + +G ++ ++ + S + Sbjct: 127 YQNFEFSVYPVPASGSVRMVYSY---YEPLKIDT-GVGRYTYPLEEGGTDDEASAFWTLN 182 Query: 175 HGVS----IQWVIDFS------------RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS- 217 VS ++ V+ + + + + G D Y Sbjct: 183 DVVSVDFSMEVVLKSAYPVAKTRVPGYGGKVEETETGELLYRFESQGAILDEDFVFYYML 242 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPG----PLDPSLSEEHFVDSSSLRHVIKKKHL 273 + G + + ++ PG PL+ +D S K H Sbjct: 243 EENLPGRLEVLTYRENEDKPGTFMMVMTPGVDLHPLEGGADFVFALDVSGSMQG--KLHT 300 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT---FAID 330 + + I +K D R FN+ + + G ++ Sbjct: 301 LASGVKKAIGQLKPED------RFRVVAFNN---TAFDLNRGWVSATEANLRETFARLDQ 351 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 N G T + + A + + + ++L+TDG Q + A Sbjct: 352 LNSNGGTNVYAGVHLALERLDADRVAT-------------LILVTDGVTNQGIVDPKAFY 398 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANSTHELN 438 Q +R F + + + C AS S+ +++ ++ Sbjct: 399 KLMHKQDLRFY--GFLLGNSSNWPLMQLM--CDASGGSYRAVSNSDDII 443 >gi|328712316|ref|XP_001943179.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Acyrthosiphon pisum] Length = 830 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 43/133 (32%), Gaps = 6/133 (4%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K F +T + DA+ A + + ++ + K IV LTDGE + Sbjct: 371 KKFIQALEPDSTTNMEDALNKA---LSIAKLGKMRFKDSAKTPKPIIVFLTDGEMNEGIT 427 Query: 385 EG---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + I ++ F + + L+N +EA+ F Sbjct: 428 NPQALMKYVSDINVDNYPIYSLGFGKGADIEFLKKLSLNNTGFARVIYEASDASLQLHNF 487 Query: 442 RDRIGNEIFERVI 454 I + + V Sbjct: 488 YKEISSPVLSNVT 500 >gi|297460736|ref|XP_599315.5| PREDICTED: collagen, type XXII, alpha 1 [Bos taurus] Length = 1605 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 62/187 (33%), Gaps = 20/187 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + VR +A+++ + + + R+G ++DR + G Sbjct: 40 VFVLDSSSSVGKENFEKVRQWVANLVDTFEVGP---ERTRVGVVRYSDRPATAFEL--GR 94 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + G T DA++ + + K+ +LLTD Sbjct: 95 FGSRAAVRAAARQLAYHGGHTHTGDALRFITRHSFTP---RAGGRPGDRAFKQVAILLTD 151 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + A+ GIRI + E R L AS F + Sbjct: 152 G---RSQDLVLPAATAARRAGIRIFAVGVG------EALREELEEIASEPTAAHVFHVSD 202 Query: 434 THELNKI 440 ++KI Sbjct: 203 FDAIDKI 209 >gi|297482250|ref|XP_002692646.1| PREDICTED: collagen, type XXII, alpha 1-like [Bos taurus] gi|296480820|gb|DAA22935.1| collagen, type XXII, alpha 1-like [Bos taurus] Length = 1605 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 62/187 (33%), Gaps = 20/187 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 FV SS + VR +A+++ + + + R+G ++DR + G Sbjct: 40 VFVLDSSSSVGKENFEKVRQWVANLVDTFEVGP---ERTRVGVVRYSDRPATAFEL--GR 94 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + G T DA++ + + K+ +LLTD Sbjct: 95 FGSRAAVRAAARQLAYHGGHTHTGDALRFITRHSFTP---RAGGRPGDRAFKQVAILLTD 151 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 G + + + A+ GIRI + E R L AS F + Sbjct: 152 G---RSQDLVLPAATAARRAGIRIFAVGVG------EALREELEEIASEPTAAHVFHVSD 202 Query: 434 THELNKI 440 ++KI Sbjct: 203 FDAIDKI 209 >gi|291569213|dbj|BAI91485.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 412 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 50/157 (31%), Gaps = 25/157 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A +I + D R+ F+ R +K Sbjct: 59 LETVKQAAKELIDRLNVGD------RISVVAFDHRAKVLVP---NQDIADPDGIKKKIDG 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAI 389 G TAI++ ++ + + +D + + LLTDGEN DN+ + + Sbjct: 110 LRCSGGTAIDEGLKLGIEELGKGKQDRISQGF----------LLTDGENEHGDNKRCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 A + I ++ F + + L A Sbjct: 160 AKLATEYKLTINSLGF-----GNDWNQDILEKIADAG 191 >gi|302382107|ref|YP_003817930.1| hypothetical protein Bresu_0994 [Brevundimonas subvibrioides ATCC 15264] gi|302192735|gb|ADL00307.1| Protein of unknown function DUF3520 [Brevundimonas subvibrioides ATCC 15264] Length = 625 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 66/208 (31%), Gaps = 23/208 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P L+ VD S K L + + +I ++ D R+ T++ Sbjct: 250 PAGERRPLNLTFMVDVSGSMQSPDKLGLAQQTMNLIIDRLRPED------RVAVTYYASD 303 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V + + G KL ++ N GSTA M AY+ + Sbjct: 304 VGTAVGPTPGSEKLK---LRCAVAALNAGGSTAGAQGMVNAYEQ--------AEAAFSPD 352 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNK-AKSQGIRIM-TI-AFSVNKTQQEKARYFLSNC 422 + + ++ TDG+ + + + A +G I ++ F Q + + Sbjct: 353 KVNRILMF-TDGDFNVGVTDDRRLEDYVADKRGTGIYLSVYGFGRGNYQDARMQTIAQAG 411 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIF 450 ++ + E +F F Sbjct: 412 NGVAAY--VDDLDEARCLFGPAFDRGAF 437 >gi|198434986|ref|XP_002126110.1| PREDICTED: similar to RIKEN cDNA E330026B02 [Ciona intestinalis] Length = 1715 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 52/161 (32%), Gaps = 17/161 (10%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA-MQTAYDTI 350 + R+G + D ++ + + + + G + A +Q ++ Sbjct: 282 SQFTRVGMMQYGDEPHTEFDL--NTFQNGSQVFEAISNVTQIGGESGPYAAILQVLRRSL 339 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + I+ +TDG D+EE I N+ + G + TI + Sbjct: 340 TAQYGSRE-------NVSQIIIFVTDGGVVDDSEESQTILNELRFSGALVYTIGVGRMVS 392 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTH-ELNKIFRDRIGNEI 449 + + L AS P S + +I + I Sbjct: 393 RPQ-----LRMIASRPASHHVTTIASYSELSATKSQIIDRI 428 >gi|254450361|ref|ZP_05063798.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|254450938|ref|ZP_05064375.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264767|gb|EDY89037.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198265344|gb|EDY89614.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 75 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 9/80 (11%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELN 438 T+ + IC A++QG+ I T+AF + L +CA SP+ F+ + +++ Sbjct: 4 TEADARLSDICAAARAQGVVIYTVAFEA----PSGGQSALQDCASSPSHHFDV-NGTDIS 58 Query: 439 KIFRDRIGNEIFERVIRITK 458 F I ++I R +++T+ Sbjct: 59 SAF-SAIASDI--RALKLTQ 75 >gi|327313514|ref|YP_004328951.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326944388|gb|AEA20273.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 331 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 44/116 (37%), Gaps = 14/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + G + + + T I A+Q + ++ Sbjct: 131 KIGLIVFAGDAFVQLPIT-GDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFT---- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 N + K I+L+TDGE+ + E A+ +A+++GI++ + + Sbjct: 186 -------PNSKVGKAIILITDGEDNEGGAE--AMAKQARNKGIKVFILGIGSKEGS 232 >gi|311264542|ref|XP_003130217.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Sus scrofa] Length = 998 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 60/153 (39%), Gaps = 16/153 (10%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS- 313 F+ SS I +D + S+ + ++ V +++ A F+ V DP FS Sbjct: 52 FIVDSSESSKIFLFDKQKDFVDSLSDKLFQLTPVGSLKYDIKLAALQFSSSVQIDPPFSS 111 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 W + VK+ T A+ A + + + ++ K +L Sbjct: 112 WKDLHTFKQRVKSM---NFIGQGTFSYYAIANATRLL---------KREGRKDSVKVALL 159 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 +TDG + N + +I A++ GI +TI S Sbjct: 160 MTDGIDHPKNPDVQSISEDARNAGIIFITIGLS 192 >gi|325678986|ref|ZP_08158584.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109490|gb|EGC03708.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 782 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 65/161 (40%), Gaps = 18/161 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 ID +D R+G + F F+ +R ++ ++ T A++ Sbjct: 320 SLIDKFDDDFRIGISKFTGTYTKMCDFT-DDRTELRKVLNRIRTEDEIFDGTYNQTALKK 378 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-AICNKAKSQGIRIMTIA 404 + ++ + + IV+L+DGE+ + + E I ++ N A + + ++T+ Sbjct: 379 CINEFSAAGDGK---------YVNIIVMLSDGESDEVDAETIESLSNLANEKSVIVLTVG 429 Query: 405 FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +E R +L A + ++ A+ L+ +++ Sbjct: 430 L-----GREIDRAWLQEVAYSTGGKYYSASDATSLDDVYKQ 465 >gi|219852403|ref|YP_002466835.1| hypothetical protein Mpal_1806 [Methanosphaerula palustris E1-9c] gi|219546662|gb|ACL17112.1| conserved hypothetical protein [Methanosphaerula palustris E1-9c] Length = 316 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 65/194 (33%), Gaps = 35/194 (18%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + + +++ + +I+S+ D G F + S +++ Sbjct: 101 AANDYQPTRLEAAKESASVLIKSLDPKD------YAGVVIFESGATTAAYLSPDKDRVME 154 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 A E + G TA+ D + A D + N KK +VLL+DG Sbjct: 155 KT----AAIEQKNGQTALGDGLALAV--------DMADSIPNQ---KKVVVLLSDGVGNA 199 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSV--------NKTQQEKARYF----LSNCA--SPNS 427 A +++ T+ + T + L + A + + Sbjct: 200 GVISPEDATAFAAQNKVQVFTVGLGSKSPVLLGTDPTGTPQYATLDEAALQSIAEKTGGT 259 Query: 428 FFEANSTHELNKIF 441 ++ + L++I+ Sbjct: 260 YYTSVDEQTLHQIY 273 >gi|294102191|ref|YP_003554049.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] gi|293617171|gb|ADE57325.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] Length = 329 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 14/91 (15%), Positives = 29/91 (31%), Gaps = 1/91 (1%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 K+L +S + A ++ V+LG G + +D R L+ A + + L Sbjct: 5 NRKELKRSRGAVLVWVAASMV-VLLGAGALSLDYGRLVVARWRLQTAVDAGSLAGAWELG 63 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 + R ++ + Sbjct: 64 NKSASQALREASAAQVAGSVASDNKSEGAYA 94 >gi|254780914|ref|YP_003065327.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254780929|ref|YP_003065342.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] gi|254040591|gb|ACT57387.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254040606|gb|ACT57402.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 68/471 (14%), Positives = 136/471 (28%), Gaps = 83/471 (17%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + F+ +L K G F +ITA+L+ + + ++VD V L++ Sbjct: 1 MFQNKNFLLGVLRLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTAWLQEVLDHV 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 I S + L E F + +IE+ L N N+ + + + Sbjct: 61 IYRTSPKNLYDLREAGRD-----NFIRHQIEKAL--NTYNSRDLSNIGSIESIVKDAVIL 113 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 N Q + + L+ + + + Y + G+ + Sbjct: 114 TKNVNSLPLQFTVDIALSTTVQLRGSLLQMFSQSKGKVDISRRKKVM---YKQNIGL-MI 169 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + L + ++ + PY C Sbjct: 170 MPFAWDGYWLASRGKVADSKVHPPKYLEYSHYYQQYLNRNTLVKNFLSQIPYKNFCMAPY 229 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 +Y ++ A+ ++ S+ + Sbjct: 230 HYS----------------------------SILYWAVGTLTYSVDNKTTTREY------ 255 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + D + +W H I F + N+ G + ++N H Sbjct: 256 -YKDPYYA----TWD-HFPYSFIKNVFDMTSNQFGDG----------QVLTNTN----HC 295 Query: 361 MKNNLEAKKYIVLLTDGENTQDNE-------EGIAICNKAKS------QGIRIMTIAFSV 407 + KY+++L G + + + C+ I I ++ FS Sbjct: 296 FPHGASQNKYMLMLAIGNQLSRSSVEKEKIEKVLQDCHYMHKRHRTGRDAITIFSVGFSP 355 Query: 408 NKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + + RY L CAS P+ ++E NS + I + N I + T Sbjct: 356 D----QDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVITNWFSQFT 402 >gi|326382237|ref|ZP_08203929.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] gi|326198967|gb|EGD56149.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] Length = 330 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 43/185 (23%), Positives = 69/185 (37%), Gaps = 32/185 (17%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D++ D + +G F + S + + + D+ TA + + A Sbjct: 121 ADDLTDGINLGLISFAGTASTLVSPTPDHSATKNALDRLKLADK-----TATGEGIFAAL 175 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDNEEGIAICNKAKSQGIRIMT 402 I + N V + + IVLL+DG E+ D G KAK +G+ + T Sbjct: 176 QQIDTLN--AVLGGPSGAPPAR-IVLLSDGKQTVPESPDDPRGGFTAARKAKEKGVPVST 232 Query: 403 IAFS-----VNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHELNKIF---RD 443 I+F V+ L A S FF A+S ELNK++ + Sbjct: 233 ISFGTLTGTVDLETPGGGVERVPVPVDDESLRKIANLSGGDFFTASSLDELNKVYSTLQK 292 Query: 444 RIGNE 448 +IG E Sbjct: 293 QIGYE 297 >gi|326675074|ref|XP_003200270.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 164 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 9/107 (8%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 +NDR ++ I+ I G TA A++ A + + + Sbjct: 2 VLYNDRPSAEFYL--DTFANKNDIMNYIKIIPYRGGGTATGAALKFAQNNLFTQKRGSRK 59 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + K+ +++TDGE ++ + + G+ + + Sbjct: 60 AL----GVKQIAIVMTDGE---SEDDVTTTAAELRRSGVTVYALGVK 99 >gi|325860337|ref|ZP_08173459.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482216|gb|EGC85227.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 331 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 44/116 (37%), Gaps = 14/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + G + + + T I A+Q + ++ Sbjct: 131 KIGLIVFAGDAFVQLPIT-GDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFT---- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 N + K I+L+TDGE+ + E A+ +A+++GI++ + + Sbjct: 186 -------PNSKVGKAIILITDGEDNEGGAE--AMAKQARNKGIKVFILGIGSKEGS 232 >gi|227822378|ref|YP_002826350.1| hypothetical protein NGR_c18330 [Sinorhizobium fredii NGR234] gi|227341379|gb|ACP25597.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 602 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 48/367 (13%), Positives = 102/367 (27%), Gaps = 24/367 (6%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + + E ++ + + + D A Sbjct: 41 AAKSEPKIQVAKPEAPVAKEERKAEPDSEAFHAPGAASGAMQSVDQAADAAAPMSTMGGA 100 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 V + R + + D+L P R + +++ A + S + S Sbjct: 101 VGLAARSRMETIPAPAPADMLPPPAENRERFGNADANPVKSVAAEPVSTFSVDVDTA-SY 159 Query: 180 QWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 +V + M + + +N F R + V + + + Sbjct: 160 SFVRRSLMAGEMPNPDAVRVEEMVNYFPYDWPRPTTAAEPFKATVTVTPTPWNAGTRLMH 219 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 ++ P + +D S K L+++A ++ ++ D V+ Sbjct: 220 VAIKGYEVVQKEAPRANLVFLIDVSGSMDEPDKLPLLKNAFRLLVDRLRPDDTVSIVTYA 279 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G N + +P +T + + GSTA + AY Sbjct: 280 G----NAGTVLEP-----TAVKDKTKILSAIDTLQPGGSTAGAAGIDAAYQ--------L 322 Query: 358 VHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + I+L TDG+ +EE + + GI + + F Sbjct: 323 AEKAFVRDGVNR-ILLATDGDFNVGPSSDEELKRMVETKRRSGIFLSVLGFGRGNYNDAL 381 Query: 415 ARYFLSN 421 + N Sbjct: 382 MQTIAQN 388 >gi|109077204|ref|XP_001095246.1| PREDICTED: integrin alpha-2 [Macaca mulatta] Length = 1180 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 86/296 (29%), Gaps = 28/296 (9%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + + + SI V + +M + F + +Q G Sbjct: 88 STATCEKLNLQTSTSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTG 147 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + + + +S + S P S V + I V++ L ++ Sbjct: 148 VCSDISPDFQLSASFS-------PAAQPCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQ 200 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-IVKTFAIDENEMGSTAINDA 342 + G + + + +K IV T ++ T A Sbjct: 201 GLDIGPTKTQV---GLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQHGGDLTNTFGA 255 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 +Q A S+ A K +V++TDGE + D A+ ++ I Sbjct: 256 IQYARKYAYSAASG------GRRSATKVMVVVTDGE-SHDGSMLKAVIDQCNHDNILRFG 308 Query: 403 IAFSV----NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA N + + AS FF + L + +G +IF Sbjct: 309 IAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEK-AGTLGEQIFS 363 >gi|301766292|ref|XP_002918563.1| PREDICTED: cochlin-like [Ailuropoda melanoleuca] Length = 550 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 356 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 411 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + D Sbjct: 412 T--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDS----- 464 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D A + A GI I ++ + + + S Sbjct: 465 ---PNKNFLVIVTDGQSYDDVRGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 516 Query: 423 ASPNSFF 429 ++FF Sbjct: 517 KESHAFF 523 >gi|114048546|ref|YP_739096.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113889988|gb|ABI44039.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 625 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 43/360 (11%), Positives = 106/360 (29%), Gaps = 35/360 (9%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + Q E++ + K + + ++ +++ D + + T+ ++ Sbjct: 33 SDKSDDQQKRAELADQTKLAAEQQAELKQQVELKDSVERQANRQRDAAIAIHEQATSTKL 92 Query: 123 NPRKSAYQ-VVLSSRYDLL------------LNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 + ++ + + + P ++ + T + Sbjct: 93 RTMNAEHRAYIAQPAATISAAPALNGDWPGAVPPERNRFEKQVQNGIMVAGETPVSTFAI 152 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 +++ ++ + + LN F K+ + + + Sbjct: 153 DVDTGSYTTLRRML-KEGRLPQKDTLRVEEMLNYFSYDYPLPSKNEAPFSVTTELAPSPY 211 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + M+ L + + +D S K L++ AL + + + D Sbjct: 212 NDDMMLLRIGLKGYEQSKAELGASNLVFLLDVSGSMASPDKLPLLQTALKMLTQQLGAQD 271 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 V+ V GA G + + GST +Q AY Sbjct: 272 KVSIVVYAGAAGVVLD---------GAAGNDSQTLNYALEQLSAGGSTNGAQGIQLAYQL 322 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFS 406 + H ++ + ++L TDG+ +E I + + K GI + T+ F Sbjct: 323 ------AKKHLVEGGINR---VILATDGDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFG 373 >gi|301788516|ref|XP_002929674.1| PREDICTED: complement C2-like isoform 3 [Ailuropoda melanoleuca] Length = 617 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 46/138 (33%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T A+ + + + + + + E + I+ Sbjct: 181 SRDVTEVINSLNNINYKDHENGTGTNTYAALNSVHIMMNNQMQRLGMKTAAWQEIRHAII 240 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ +A+ N+ +S + I I + S Sbjct: 241 LLTDGKSNMGGSPKLAVDNIREILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDG 300 Query: 425 PNSFFEANSTHELNKIFR 442 F T L ++F Sbjct: 301 ERHAFILQDTEALYQVFE 318 >gi|301788514|ref|XP_002929673.1| PREDICTED: complement C2-like isoform 2 [Ailuropoda melanoleuca] Length = 749 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 46/138 (33%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T A+ + + + + + + E + I+ Sbjct: 313 SRDVTEVINSLNNINYKDHENGTGTNTYAALNSVHIMMNNQMQRLGMKTAAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ +A+ N+ +S + I I + S Sbjct: 373 LLTDGKSNMGGSPKLAVDNIREILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L ++F Sbjct: 433 ERHAFILQDTEALYQVFE 450 >gi|297626137|ref|YP_003687900.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921902|emb|CBL56462.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 324 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 62/185 (33%), Gaps = 28/185 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +DA + ++ NV+ G + R V Sbjct: 115 SRLAAEKDAATKFVAALPAQYNVSVVTLSG---------HPNTLV--PPTTDRAPVNQGI 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI ++ + + + + + L +VLL+DG T + +A Sbjct: 164 KTLELADGTAIASSIDVGLEALKQAPAGDDGKQAPGL-----MVLLSDGSETGGG-DPVA 217 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNC--ASPNSFFEANSTHEL 437 +KAK Q + I TIAF + + F L AS +A S +L Sbjct: 218 SADKAKQQNVPIYTIAFGTQNGYVDLDGQRFNVAPDTDMLKRIADASSGKALDAASASQL 277 Query: 438 NKIFR 442 + +++ Sbjct: 278 DDVYK 282 >gi|283769330|ref|ZP_06342229.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] gi|283103987|gb|EFC05371.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] Length = 209 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 43/108 (39%), Gaps = 1/108 (0%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI-ITAS 65 +K IK G + +I AL + V+LG+ + VDV ++ + + Q + + Sbjct: 1 MKMNIRKWIKEEKGSYIVIFALFLTVLLGMISLAVDVGMMYLKKNRMYEIVQVMRDLRFT 60 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 +L V + ++ ++ + FE ++ + + D Sbjct: 61 KYAESTLIHVLHGKDPAREISQEMVKYARLNGFEGEIRITYVEEHPYD 108 >gi|301788512|ref|XP_002929672.1| PREDICTED: complement C2-like isoform 1 [Ailuropoda melanoleuca] gi|281345620|gb|EFB21204.1| hypothetical protein PANDA_019912 [Ailuropoda melanoleuca] Length = 748 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 46/138 (33%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T A+ + + + + + + E + I+ Sbjct: 312 SRDVTEVINSLNNINYKDHENGTGTNTYAALNSVHIMMNNQMQRLGMKTAAWQEIRHAII 371 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ +A+ N+ +S + I I + S Sbjct: 372 LLTDGKSNMGGSPKLAVDNIREILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDG 431 Query: 425 PNSFFEANSTHELNKIFR 442 F T L ++F Sbjct: 432 ERHAFILQDTEALYQVFE 449 >gi|227832539|ref|YP_002834246.1| hypothetical protein cauri_0711 [Corynebacterium aurimucosum ATCC 700975] gi|227453555|gb|ACP32308.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 693 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 37/256 (14%), Positives = 79/256 (30%), Gaps = 34/256 (13%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ + A V ++ G+ + + S + P + Sbjct: 35 HMENTSRFSLTPLLTALIAVLGTTALAVVSGLLPVASAEEETNAPSSSSSTMAPTMVVFD 94 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG---------ATFFN 303 S + + + + +DA + I + G A Sbjct: 95 SSGSMITNDAGGQ---TRIDAAKDAARTFITEAGDDAPLGLVTYGGNTGEAPEDEAAGCQ 151 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D + P + K+I + G T I ++++ A + Sbjct: 152 DITVVTPPEAGNSEKMIAHMD-----GLQPRGFTPIGESLRKAAAEL------------- 193 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSN 421 E ++ I+L++DG T + + K QGI I T+ F+V Q++ + Sbjct: 194 PKEGQRSIILVSDGVATCTPPPVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQCIAD- 252 Query: 422 CASPNSFFEANSTHEL 437 A+ ++ A+ L Sbjct: 253 -ATGGTYANASDADSL 267 >gi|327273523|ref|XP_003221530.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Anolis carolinensis] Length = 1091 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 81/240 (33%), Gaps = 25/240 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 R+ + L A + Y + R + + Y P D Sbjct: 203 REEDQSLLWQVFGSATGLARYYPASPWVDKSRTQNKIDLYDVRRRPWYIQGAASPKD--- 259 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 VD+S + L+R ++ ++ ++ D VN FN+ + F+ Sbjct: 260 -MLILVDASGSVSGLT-LKLIRTSVIEMLETLSDDDFVN------VVSFNENAQNVSCFN 311 Query: 314 WGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 V +R +K G T A++ +++ N N K I Sbjct: 312 HLVQANVRNKKKLKEAVYKIQAKGITDYKKGFSYAFEQLLNHNHSVFRANCN-----KII 366 Query: 372 VLLTDGENTQDNEEGIAICNKAK-SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 +L TDG E+ I +K + +R+ T FSV + +K CA+ ++E Sbjct: 367 MLFTDG----GEEKAQEIFDKYNVEKKVRVFT--FSVGQHNYDKGPIQWMACANKGYYYE 420 >gi|292491521|ref|YP_003526960.1| hypothetical protein Nhal_1422 [Nitrosococcus halophilus Nc4] gi|291580116|gb|ADE14573.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4] Length = 398 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 13/80 (16%), Positives = 33/80 (41%), Gaps = 1/80 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 S G ++ + M ++G+ G+ +D+ + L+ A A ++ + ++ + +V Sbjct: 9 HSQRGVTMVLFTIGMVAIIGMAGLALDMGHAYLNKTRLQNALDAAALSGA-KVLNDMHDV 67 Query: 76 SSRAKNSFTFPKQKIEEYLI 95 + T +E L Sbjct: 68 GQATAAALTTFNMHLEGELA 87 >gi|291528739|emb|CBK94325.1| von Willebrand factor type A domain [Eubacterium rectale M104/1] Length = 410 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 40/113 (35%), Gaps = 18/113 (15%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TA+ + +A + + D + ++++TDG N Q + N A + Sbjct: 2 TALYSGINSATTEFKNYSTDA----------SRIMIVVTDGYNNQSGASSATVINNAIEE 51 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGN 447 + I + + L N + ++ N +LN IF + I Sbjct: 52 NVIIYCVGVGSVNS------TVLKNISESTGGCYYYINQFSQLNGIFENIISE 98 >gi|257454382|ref|ZP_05619644.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] gi|257448148|gb|EEV23129.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] Length = 550 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 58/161 (36%), Gaps = 25/161 (15%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S K LV+ +L + + ++ D + + R Sbjct: 189 PPANLVFLVDVSGSMSDNDKLPLVKSSLKMLTKQLRPQDT------ISIVTYAGRTQVTL 242 Query: 311 SFSWG--VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + G K++ I + GST A++ AY ++H K+ + Sbjct: 243 PATRGSDTDKILAAIDS-----LDASGSTNGEAAIKLAYQQ------AKIHYKKDGINR- 290 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFS 406 I+++TDG+ + + + + + G+ + T F Sbjct: 291 --ILMMTDGDFNVGVSDVDEMLDIIRRERDSGVSLSTFGFG 329 >gi|74011920|ref|XP_548489.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Canis familiaris] Length = 897 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 48/306 (15%), Positives = 92/306 (30%), Gaps = 39/306 (12%) Query: 152 GIKSWLIQTK------AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 GI + + + K+ VS + +D RS C Sbjct: 199 GISTLDAEASFITNDLLGSALTKSFSGKKGRVSFKPSLDQQRS-----------CPTCTD 247 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + N + + ++ Y V + P L FV S Sbjct: 248 SLLNGDFIITYDVNRESPANVQIVNGYFV-------HFFAPQGLPVVPKNVVFVIDVSGS 300 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG-ATFFNDRVISDPSFSWGVHKLIRTIV 324 +K +DAL ++ +K D +N + G + D ++ Sbjct: 301 MHGRKMEQTKDALLKILGDMKGEDYLNFILFSGDVITWKDDLVQATP-------ENIEEA 353 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + F + ++ G T IND + + + E HR+ + I++LTDG+ Sbjct: 354 RIFVKNIHDRGLTNINDGLLRGISMLNRARE--EHRV--PERSTSIIIMLTDGDANVGES 409 Query: 385 EGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I ++ + + F N L N +E + + + F Sbjct: 410 RPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANLQLQGF 469 Query: 442 RDRIGN 447 + + N Sbjct: 470 YEEVAN 475 >gi|260775644|ref|ZP_05884540.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] gi|260608060|gb|EEX34229.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] Length = 407 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 71/464 (15%), Positives = 141/464 (30%), Gaps = 81/464 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + GH I+ A+L+P++ GV + D R + ++ A+ A+ + + + Sbjct: 1 MHHKQQGHASILFAMLIPLLFGVFALGSDGARAIQSKARIEDAS-----EAAALALSARD 55 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + + T + IEEYL + E D+ Q + Sbjct: 56 DEHAMSDENKTIVQAYIEEYLPVEDSDVTILGIERLECDDMPECRQGSGRGEARYTQYSV 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 D + + + + + ++ + V I + DFS SM Sbjct: 116 RVSAD-----QTPWFGGGSPEVEVPEVWRSQGGAKARKYQSNAVDIVFAADFSGSMA--- 167 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S + + RD ++ + + L P D Sbjct: 168 --------------------SPWTGGSQPKYRD------LIDILEKVTVELAPYNFDSQR 201 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S + + A+ ++ + + V D R A + + PS S Sbjct: 202 YNSSVGVSGFNALTYRNELC---AVNNLEKQ--GLLGVVDYSRTVARMWETKSCRPPSIS 256 Query: 314 W--GVHKLIRTIV----KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 G H + T G TA Y ++S H N Sbjct: 257 NSAGFHDVPLTDDYSTFNRTVDRFTARGGTAS-------YQAVMSGARLLDHGSNNRQ-- 307 Query: 368 KKYIVLLTDG-ENTQDNEEGI---AICNKA------------KSQGIRIMTIAFSVNKTQ 411 +++++DG +N ++ G+ +C + R+ I F + Sbjct: 308 --ILIVISDGQDNNLNHTNGLVNAGMCRDIISRLEGRPSANGRDVSARLAFIGFDFEPSM 365 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + C ++ F+A +T EL + I E+ R Sbjct: 366 NPA----MVRCVGEDNVFKAENTDELFEQIMFLIREEVGHLATR 405 >gi|91789735|ref|YP_550687.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91698960|gb|ABE45789.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 346 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 66/206 (32%), Gaps = 39/206 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + + TV++G F + L+ I + Sbjct: 106 SRLVAAQTAAKAFLTELPR------TVKVGLVAFAGSAQVAQIPTVNREDLVSAIDRFQL 159 Query: 329 IDENEMGS-------TAINDA------MQTAYDTIIS---SNEDEVHRMKNNLEAKKY-- 370 +G+ T DA MQ+ + E + + + Y Sbjct: 160 QRGTAIGNGIVMSLATLFPDAGIDLQSMQSGRERQRGFAIDQEKKEAKEFTPVAPGSYNS 219 Query: 371 --IVLLTDGENTQDNEEGIAICNKAKSQGIRIM----------TIAFSVNKTQQEKARYF 418 I+LLTDG+ T + + A +G+R+ TI F + Sbjct: 220 AAIILLTDGQRTTG-VDSLDAAKLAADRGVRVYTVGIGTVDGETIGFEGWSMRVRLDEET 278 Query: 419 LSNC--ASPNSFFEANSTHELNKIFR 442 L A+ +F A + +L K++ Sbjct: 279 LKGIARATQAEYFYAGTATDLKKVYE 304 >gi|84498071|ref|ZP_00996868.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] gi|84381571|gb|EAP97454.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] Length = 651 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 62/180 (34%), Gaps = 32/180 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG----------VHK 318 K + AL V+ ++ DT ++G + V + + Sbjct: 60 TKIEAAKKALTGVVGALP------DTAQVGLRVYGATVDGKGKPTPAACADTQLIHPIAA 113 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 L +T + T +G T I ++ A + K+ IVL++DGE Sbjct: 114 LDKTKLTTTIAAIKALGETPIAHSLTEALKDL-------------GTSGKRNIVLVSDGE 160 Query: 379 NT-QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + I A ++I T+ F VN + + + A ++++A L Sbjct: 161 ESCVPDPCPIVKKLTAAGVDLQIDTVGFGVNAKARTQLQCIAD--AGKGTYYDAKDAGAL 218 >gi|254414936|ref|ZP_05028700.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196178425|gb|EDX73425.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 576 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 74/199 (37%), Gaps = 23/199 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D S K L+++A ++ ++ D V+ V GA + P Sbjct: 211 PPSNLVFLLDVSGSMSDANKLPLLKEAFRLLVDQLRDEDKVSIVVYAGAAG----TVLPP 266 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + ++ I K GSTA ++ AY + N E + Sbjct: 267 TPGNQKDTILAAIDK-----LEAGGSTAGGQGIKLAY-KLAQDNFIESGNNR-------- 312 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ +E+ +++ + + Q I + + F Q K + + Sbjct: 313 VILATDGDFNVGISSDEQLVSLIEEKREQDIFLTVLGFGTGNLQDAKMEKIANK--GNGN 370 Query: 428 FFEANSTHELNKIFRDRIG 446 + ++ E NK+ + IG Sbjct: 371 YAYIDNILEANKVLVNEIG 389 >gi|149624862|ref|XP_001517471.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Ornithorhynchus anatinus] Length = 238 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 56/185 (30%), Gaps = 28/185 (15%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ S+ + G ++ V + K R I Sbjct: 34 ELVKKFINQIVDSLDVSEQNAQV---GLVQYSSSVRQEFPLGRFTSK--RDIKAAVKKMT 88 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A+ +I + + +K ++ TDG + + Sbjct: 89 YMEKGTMTGTALNY----LIDNTFAISSGARPGA--QKVGIVFTDGRSQDYISD---AAK 139 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNE 448 KAK G ++ + + L AS +F + ++IG + Sbjct: 140 KAKDLGFKMFAVGVGNAVEDE------LREIASDPVAEHYFYTADFKTI-----NQIGKK 188 Query: 449 IFERV 453 + +++ Sbjct: 189 LQKKI 193 >gi|126733489|ref|ZP_01749236.1| von Willebrand factor, type A [Roseobacter sp. CCS2] gi|126716355|gb|EBA13219.1| von Willebrand factor, type A [Roseobacter sp. CCS2] Length = 699 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 43/393 (10%), Positives = 104/393 (26%), Gaps = 39/393 (9%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + ++ +E + + + + L + L D ++ + +R Sbjct: 134 DAPADAGAELILGPMEAPEQE-LSFADRSRPAVPKELPQVTNEALMPQMEDADIDEAMRQ 192 Query: 118 TAVEMNPRKSAYQ--------VVLSSRYDLLLNPLSLFLRSMGIKSWLIQT------KAE 163 + + + + R + + + K Sbjct: 193 QQTRAVQGRQSLNAAPAAPSEITAAPRIVVPAPSVDDVTIMPAPNTETFANDDPNPLKIT 252 Query: 164 AETVSRSYHKEHGVSIQWVIDFS---RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 AE ++ + V+ S + Q + +N F + Sbjct: 253 AEEPVSTFSIDVDTPAYAVVRSSLSRGQLPPAQAVRIEELVNYFPYDYPTPDAGEAPFRP 312 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 V + + +L + P L+ +D+S K L++ + Sbjct: 313 TVTTFQTPWNADTQLVHIALQGQMPEVAARPPLNLVFLIDTSGSMDDPTKLPLLKQSFRL 372 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ D V G+ +V+ RT + GST Sbjct: 373 MLDQLRPEDQVAIVEYAGSAG---QVLVP------TSASERTTILQAIQSLGAGGSTNGQ 423 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---G 397 ++ AY + ED ++L TDG+ A+ + + G Sbjct: 424 GGLEQAYSVAEAMREDGEVNR---------VILATDGDFNVGLSNPDALKDFIADKRETG 474 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 + + F + N ++ + Sbjct: 475 TYLSVLGFGRGNLDDATMQALAQNGNGTAAYID 507 >gi|294011439|ref|YP_003544899.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] gi|292674769|dbj|BAI96287.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] Length = 771 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 46/283 (16%), Positives = 87/283 (30%), Gaps = 23/283 (8%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + D + P + DR S + K + + P + Sbjct: 501 GNPSDDWLNYPSSPSDAIDMDIDRVPDSDPATRWKPLLPNAVWGPKGTLVGNTWSGDYTT 560 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P+ + +L + + + + ++++ V G ++ Sbjct: 561 DPVKAGENTSSGDTGRNLSNNSCVTASRKLTNYNGVGGNPSAQDLSNYV--GTLVPHNNT 618 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGS---------TAINDAMQTA-YDTIISSNED 356 D WG + T + G+ T A Y + D Sbjct: 619 YHDIGLLWGARLMSPTGIFASENATTGGGAQIQRHLIFMTDGATATTVNNYASYGLEWWD 678 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 Y D N +N A+C K++ I + I + + + Sbjct: 679 RRQIAPAGPNDANY----DDNLNAVNNARSNALCTAIKNKNITLWVIYYG---SSDTATK 731 Query: 417 YFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L+NCA SP+ F+EA +T L FR+ I + I +R+T+ Sbjct: 732 TRLTNCATSPSYFYEARNTTLLIGKFRE-IADRISN--LRLTQ 771 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 62/187 (33%), Gaps = 28/187 (14%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + +F +L + G+ ITA + +G+ G D+ R + L+ A + Sbjct: 6 KRALFILMRLYHNQAGNILAITAAAIIPTIGLVGGAFDMARIYAVKTRLQSACDAGALAG 65 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + ++ N+ + N +++++ + Sbjct: 66 RRIMGSGRWTDNNGRPNTTALATFDLNFAQNSFGAENRTRSYSESDG------------- 112 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 V ++ D+ + +R + + + ++ E + + + +V+D Sbjct: 113 -----TVSGTASADVPMT----LMRVLNVPTKRVEVTCEGQMRIPNTD------VMFVLD 157 Query: 185 FSRSMLD 191 S SM + Sbjct: 158 NSGSMNE 164 >gi|218659662|ref|ZP_03515592.1| hypothetical protein RetlI_08405 [Rhizobium etli IE4771] Length = 81 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 33/71 (46%), Gaps = 1/71 (1%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ G+ +T + MP++LG +++DV R S L+ A + + L Sbjct: 7 RRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGAREL-DG 65 Query: 72 LEEVSSRAKNS 82 ++ +RA+ + Sbjct: 66 RDDAITRAQTA 76 >gi|162454179|ref|YP_001616546.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] gi|161164761|emb|CAN96066.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] Length = 940 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 41/123 (33%), Gaps = 20/123 (16%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 R+ + G T I A+ AY + + KK+++LLTDG+ + Sbjct: 532 RSRIAGEIARIQPGGGTEIFSALDAAYQDMTVT-----------QARKKHVILLTDGKAS 580 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELN 438 + A+S I + T+ + L A F + L Sbjct: 581 TGGIRDLVSAMIAES--ITVTTVGL-----GNDLDEQLLKMIADVGGGRFHAVPDPNNLP 633 Query: 439 KIF 441 +IF Sbjct: 634 RIF 636 >gi|114765751|ref|ZP_01444846.1| hypothetical protein 1100011001350_R2601_23570 [Pelagibaca bermudensis HTCC2601] gi|114541858|gb|EAU44894.1| hypothetical protein R2601_23570 [Roseovarius sp. HTCC2601] Length = 493 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 24/117 (20%), Positives = 45/117 (38%), Gaps = 2/117 (1%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +++ +G TA+L+P +L ML D++ + + L+ A A + A+ + Sbjct: 1 MLRDESGSVTAATAVLLPGILIGMAMLFDLLWLNNHRSHLQAQADMAALEAARYTGERPS 60 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENN--LKKNFTDREVRDIVRDTAVEMNPRKSA 128 V + + E R E +F+D + D R A + R A Sbjct: 61 AVRQARVSVAVNDSFRAERLASRQIELGRWQDGSFSDMDASDPRRPNAARVTVRSEA 117 >gi|32477849|ref|NP_870843.1| hypothetical protein RB13068 [Rhodopirellula baltica SH 1] gi|32448406|emb|CAD77921.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 499 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 9/61 (14%), Positives = 25/61 (40%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G I+ +L+ + + G+L+D+ + ++ + A + L + ++ Sbjct: 108 RGGAVLILIVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGANQTTT 167 Query: 78 R 78 R Sbjct: 168 R 168 >gi|62001326|gb|AAX58365.1| AvrE [Pseudomonas viridiflava] Length = 1721 Score = 45.4 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1357 SNNRVRLANSAGVTAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1415 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + +++ +Q V + +L L + T+ Sbjct: 1416 FGPNAAVTASIDSRTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPTQKELTRLADA 1475 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 K IQ +D ++ + + + T K S+ + + Sbjct: 1476 KQPEYADKTPKEKIQAHLDGLNTLFKDRPSNNSAQKAALLALSRATTKHDSAIDKHSVLD 1535 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1536 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAQLIAEEPNLKSLIGQMKASP 1595 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + V G + L + F + G T Sbjct: 1596 GTMARVRLEPKDEMMQKVDQGTRDGSITQKEIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1655 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1656 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1715 Query: 397 GIR 399 G+ Sbjct: 1716 GME 1718 >gi|182414211|ref|YP_001819277.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841425|gb|ACB75677.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 611 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/127 (16%), Positives = 45/127 (35%), Gaps = 16/127 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ S L + G T + + TA ++ Sbjct: 132 RVGLIVFSGTAFLQSPLSSDYEILREFLPALDPTFL-PEGGTNYDALINTALTAFGATG- 189 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQE 413 A +++++L+DGE T+D + + + K++GIR++ + + Sbjct: 190 ----------AADRFLIILSDGEATED--DWRSHVAELKNRGIRVIALGVGTTAGAMIPD 237 Query: 414 KARYFLS 420 A + Sbjct: 238 GAGGLVK 244 >gi|332256727|ref|XP_003277467.1| PREDICTED: collagen alpha-1(VI) chain, partial [Nomascus leucogenys] Length = 1104 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 86/223 (38%), Gaps = 40/223 (17%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR--------MGATFFNDRVI 307 S S+ +K + D + S + IDN+ D R GA ++D V Sbjct: 41 VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRCDRNLVWNAGALHYSDEVE 98 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R +K+ G T + A++ + ++ ++L+ Sbjct: 99 IIQGLTRMPGG--RDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG--------SHLK 148 Query: 367 AKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KY++++TDG E ++ G+ N+AK G+++ ++A + + + LS Sbjct: 149 ENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPR-----LSII 203 Query: 423 ASPNSF---FEA------NSTHELNKIFRDRIGNEIFERVIRI 456 A+ +++ F A E+ D I + I V ++ Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEVISQTIDTIVDMIKNNVEQV 246 >gi|332305539|ref|YP_004433390.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332172868|gb|AEE22122.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 1359 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 53/395 (13%), Positives = 112/395 (28%), Gaps = 65/395 (16%) Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAY-----------QVVLSSRYDLLLNPLSLFL 148 ++ N +R + + N + Y Q + S L Sbjct: 299 DSQSNNAEERYNGYQSTGSELTFNANSNDYIGLRFRNIALPQGAVVSNAYLEFTAYQNSY 358 Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR-SMLDYQRDSEGQPLNCFGQP 207 + + +A+ + + S + + ++ S + + + P Sbjct: 359 NNSASMTIEAANEADPRSF-------NNYSRYLLRNKAKTSAVTWSGIERWYRNREYQSP 411 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + ++ + G ++ + N Y P + F ++ Sbjct: 412 SVASIVNQLVNRGDWQSGNDMMFILSDFNNTRGAYTYSERPSGAAKLVIEFQGQATPGQT 471 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK-T 326 + + + + S + DT+ G ++ + D + G + + T+ + T Sbjct: 472 STVREHLVSKVDELSAS--GYTPIVDTLYEGVLYYG-GLDVDYGLTRGNNSVSNTVRRNT 528 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY----------IVLLTD 376 D + SSN ++ IVLL+D Sbjct: 529 RVSHRLSYTGQDATLPSGCEEDNLSSSNCITQQIVQGARYLSPITDRQCQVNNHIVLLSD 588 Query: 377 GE-NTQDNEEGIAICNKAKSQG-------------------------IRIMTIAFSVNKT 410 GE N + + I A G I TI F+ N Sbjct: 589 GEANNNHSVDEIETLLSASCTGSGGEKCGLSLVRNIADTEESVIDSRIITHTIGFAAN-- 646 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +A FL+ A F++A+++ EL F+ Sbjct: 647 --TQANSFLNQIALQGGGGFYQADNSQELLGAFQS 679 >gi|217978821|ref|YP_002362968.1| hypothetical protein Msil_2684 [Methylocella silvestris BL2] gi|217504197|gb|ACK51606.1| conserved hypothetical protein [Methylocella silvestris BL2] Length = 429 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 56/169 (33%), Gaps = 17/169 (10%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ G ++ L +PV++G+ + ++ Y + ++ A +A + + L Sbjct: 20 RRFRYDRGGGVALMIGLALPVIIGMIALGTEISFLLYKKFQMQSVADSAALGGAAALQSG 79 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 A+ +F F + V V + + +A V Sbjct: 80 HPAPGIEARGISSFL----------GFVDG------AAGVTVTVNNPPATGSAANNASAV 123 Query: 132 VLSSRYDLLLNPLSLFLRSM-GIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + L+ +SLF+ + + + + T+ + S+ Sbjct: 124 EVIISQPQTLSMVSLFVSGLFTVGARAVATRGTTSSCVLQLGSGGQFSM 172 >gi|256822867|ref|YP_003146830.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796406|gb|ACV27062.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 986 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 48/343 (13%), Positives = 91/343 (26%), Gaps = 96/343 (27%) Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 + ++ I + + A+ + + V++ V+D S SM Sbjct: 7 LISALLIVAGSVPALLHADDTEVYFGQSQPVNLLLVLDVSGSMAWTTDACR--------- 57 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 ++ Y YPG + S Sbjct: 58 ---------------------------LNRWGQPYPSCYPGNGEKSR------------- 77 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 ++++AL + + N V + + + + + T Sbjct: 78 ----LDIMKEALELFLDDL----PDNVKVGILTYSAGNNIDLLHEVKQLSDNNHKATLLT 129 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T A+ A ++ + IV LTDG+ + G Sbjct: 130 TIDGLEANGGTLTAGALYEAGSYFRGQYDNLPSPITPGCSNASNIVFLTDGQPNSMSYNG 189 Query: 387 IAI-----------C---NKAKS--------------------QGIRIMTIAFSVNKTQQ 412 + C + K ++ TIAF + Sbjct: 190 YSYRNSIINMTGSSCARSDDGKECSEKLAGFLSTVDQIEDLTPSKVKTHTIAF---ALED 246 Query: 413 EKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 AR FL N A + + A+ST L F+ I +I + + Sbjct: 247 NNARTFLENVADAGNGQSYTADSTDGLVDAFKSSIQTDIEQSM 289 >gi|260823774|ref|XP_002606843.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] gi|229292188|gb|EEN62853.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] Length = 1317 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 55/158 (34%), Gaps = 24/158 (15%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDR------VISDPSFSWGVHKLIRTIVKTFAIDE 331 + ++ +N R+ F+ R V S+ HK + + + Sbjct: 98 VKKLLADFTLAENAA---RVAIVTFSSRNKVVNHVDHLSKPSYHKHKC-SLLEEELPRIK 153 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T AM A + + + + K + L+TDG + + + Sbjct: 154 YAGGGTYTKGAMIKAQEVLRHARPNA----------TKAVFLMTDGYSNGG--DPLPEAR 201 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 K K ++I T F + ++ + ++ A +S+F Sbjct: 202 KLKQNDVQIFT--FGIRSGNVKELQNMATDPAEEHSYF 237 >gi|163786709|ref|ZP_02181157.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878569|gb|EDP72625.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 345 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 42/117 (35%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + + D TAI++A++ A + Sbjct: 130 RVGIIAYAGKAFPQLPITTDYASAKMFLQNMN-TDMLSSQGTAISEAIELAKTYYDDEEQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + +++++DGE+ E + I +A +GIRI+T+ K Sbjct: 189 TN-----------RVLIIISDGEDHGG--EAVDIAEEANEEGIRILTVGVGDVKGGP 232 >gi|86134840|ref|ZP_01053422.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821703|gb|EAQ42850.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 349 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/117 (17%), Positives = 43/117 (36%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + H ++ D TAIN+A++ A + + Sbjct: 130 RVGVIIYAGNSYPLLPITTD-HAAANMFLQNANPDMVSSQGTAINEALELAKTYYNNDEQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 +++V+++DGE+ Q+ E + + G++I TI K Sbjct: 189 TN-----------RFLVIISDGEDHQE--ETKQVAQNLSNDGVKIYTIGVGTEKGGP 232 >gi|255535988|ref|YP_003096359.1| BatB [Flavobacteriaceae bacterium 3519-10] gi|255342184|gb|ACU08297.1| BatB [Flavobacteriaceae bacterium 3519-10] Length = 335 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 54/139 (38%), Gaps = 18/139 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A +I ++ K+ N +G F S + + V + Sbjct: 112 LQQAKNLIINAMGKMTNDK----VGIIVFAGEASSIMPLTTDFTAVETY-VGGVETSIVK 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 M T AMQTA + +N + + +VLL+DGE+ + NE+ A A Sbjct: 167 MQGTDFLKAMQTA-----------ADKFRNVAKGSRKVVLLSDGEDNEGNEK--AAAKLA 213 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 +GIR++++ + Sbjct: 214 NREGIRVISVGIGSEEGAP 232 >gi|209527269|ref|ZP_03275780.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492336|gb|EDZ92680.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 414 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 50/157 (31%), Gaps = 25/157 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A +I + D R+ F+ R +K Sbjct: 59 LETVKQAAKELIDRLNVGD------RISVVAFDHRAKVLVP---NQDLTDPDGIKKKIDG 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAI 389 G TAI++ ++ + + +D + + LLTDGEN DN+ + + Sbjct: 110 LRCSGGTAIDEGIKLGIEELGKGKQDRISQGF----------LLTDGENEHGDNKRCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 A + I ++ F + + L A Sbjct: 160 AKLATEYKLTINSLGF-----GDDWNQDILEKIADAG 191 >gi|213964310|ref|ZP_03392536.1| BatB protein [Capnocytophaga sputigena Capno] gi|213953052|gb|EEB64408.1| BatB protein [Capnocytophaga sputigena Capno] Length = 345 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 48/139 (34%), Gaps = 18/139 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A +I ++ R+G + + + H + ++ D Sbjct: 112 LEKAKRIAFETISQLKGD----RVGIVAYAASAYPQLALTTD-HSAAKMFLQGMNTDMLS 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI +A++ A + + + +L+DGE+ + I ++A Sbjct: 167 SQGTAIQEAIRMASNYFDDKTPTA-----------RLLFILSDGEDHE--MGATEIASEA 213 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 + +G+ I TI K Sbjct: 214 QEKGVHIYTIGIGTEKGAP 232 >gi|330466229|ref|YP_004403972.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328809200|gb|AEB43372.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 319 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 73/197 (37%), Gaps = 25/197 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + A + + + NV G F S + + ++ A Sbjct: 107 RLSAAKSAARDFVDGLPREFNV------GLVAFAGSAAVLVPPSTD-REALHDGIRRLAE 159 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI +A+ T+ + + +E I++L+DG NT + + Sbjct: 160 GITGVQGTAIGEAISTSLGAVKALDEQA-----TTQPPPARIIVLSDGANTSG-MDPMEA 213 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELN 438 +A + + + TI+F ++ L A + +F +A+++ EL+ Sbjct: 214 AAEAVAFEVPVHTISFGTPGGFVDRGGRPIQVPVDGETLQAVAEQTGGAFHQADTSDELH 273 Query: 439 KIFRDRIGNEIFERVIR 455 ++ D IG+ + R R Sbjct: 274 AVY-DDIGSSVGWRKER 289 >gi|269104660|ref|ZP_06157356.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] gi|268161300|gb|EEZ39797.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] Length = 3986 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 45/291 (15%), Positives = 86/291 (29%), Gaps = 33/291 (11%) Query: 174 EHGVSIQWVIDFSRSM-------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 + ++ +VID S SM LD + + F + +Q + Sbjct: 3450 DVPTNVNFVIDTSGSMYYGRLLNLDSIHMNSAEKYKVFVNYGATLTAADGTQLYNGSSQS 3509 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK-HLVRDALASVIRSI 285 ++ L Y Y + D + + + A ++ I Sbjct: 3510 GWVTVTYDQMKAGLQYDGYRAEDPIYIKSSIGEDQTYKLTDFPSVFDMTKQAYQVLVDEI 3569 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSF-----SWGVHKLIRTIVKTFAIDENEMGSTAIN 340 N ++ FN V D SF S T + + G T Sbjct: 3570 LTNTNDKSSLNFNVVTFNSTVGGDSSFHYDAESNSFVNSRGTDIHNYLNSLIAGGGTEFE 3629 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 ++T D I++ + + LTDG++ + A + + Sbjct: 3630 APLKTISDHIVTDGNT-----------RNVVYFLTDGKDNTGFSNSANNSDYAALKHAEV 3678 Query: 401 MTIAFSVNKTQQEKARYFLS--------NCASPNSFFEANSTHELNKIFRD 443 ++IA ++ N + P+ +T+EL IF+D Sbjct: 3679 ISIAVG-PSGDADQVNQIAQLGEGYNNNNDSEPSYSKVITNTNELTDIFKD 3728 >gi|239906053|ref|YP_002952792.1| hypothetical protein DMR_14150 [Desulfovibrio magneticus RS-1] gi|239795917|dbj|BAH74906.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 391 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 56/149 (37%), Gaps = 9/149 (6%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 +G + +AL+M V+ G+ + VD Y L+ AA A + + L+++ ++ + Sbjct: 13 ESGSVVVFSALIMIVLAGLATLAVDYGFLQYKRSQLQTAADAAALAGAADLLRNGDDFDA 72 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + F ++ + E + + ++++ +V Sbjct: 73 VRATAVDFGQRNLGE-------QDTVASAVTTGDVELLKGETPAAGATPDTVRVTAGRTA 125 Query: 138 DLLLNPLSLFL-RSMGIKSWLIQTKAEAE 165 NP+ +FL +G + + A A Sbjct: 126 Q-RGNPVDMFLGPVLGWNTQDLTATASAS 153 >gi|310818002|ref|YP_003950360.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309391074|gb|ADO68533.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 568 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 69/189 (36%), Gaps = 23/189 (12%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S ++ + LV+ +LA ++ + D + V + D + Sbjct: 200 IDVSGSMNMENRLELVKRSLAMLVEKLDSRDTLAIVV------YGDTARTVLEP---TRI 250 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + R+ + + GST + +Q AY +++ ++ ++L +DG Sbjct: 251 MDRSRILEAINALHPEGSTNVQAGLQVAYAI-------AASQVREGATSR--VILCSDGV 301 Query: 379 NTQDNEEGIAICNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + +I K+ QG+R+ T+ F + E + ++ Sbjct: 302 ANNGITQADSIFQSVKAYAQQGVRLTTVGFGMGNYNDELMERLSH--VGDGQYAYVDALP 359 Query: 436 ELNKIFRDR 444 E +IF ++ Sbjct: 360 EARRIFIEQ 368 >gi|281340555|gb|EFB16139.1| hypothetical protein PANDA_003424 [Ailuropoda melanoleuca] Length = 191 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 59/169 (34%), Gaps = 18/169 (10%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 N VR+ ++ + + +++ + K + G+T + + ++ A Sbjct: 31 DNPKVRISFITYSTDGHTLMKITSDKNEIRENLAKL--QNVVPSGATHMQEGLRKA---- 84 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAICNKAKSQGIRIMTIAFSVNK 409 NE +A I+ LTDG EE +++ G + I + Sbjct: 85 ---NEQIEQENAGEKKAPIVILALTDGTLLPFPFEETKMEAEESRRLGATVYCIGVKDYR 141 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 Q L SP+ F ++ K ++ +G + I +TK Sbjct: 142 KDQ-----LLDIADSPDHMFGVDNG---FKGLQNIVGPLASKSCIDVTK 182 >gi|293334601|ref|NP_001168718.1| hypothetical protein LOC100382510 [Zea mays] gi|223950381|gb|ACN29274.1| unknown [Zea mays] Length = 629 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 16/118 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + Sbjct: 198 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQSL-LAV 250 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I + ++ S E + KN + + I+LL+DG++T Sbjct: 251 NSLTANGGTNIAEGLRK------GSKVIEERQSKNPVCS---IILLSDGQDTYTVSPT 299 >gi|115377250|ref|ZP_01464460.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365726|gb|EAU64751.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 520 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 69/189 (36%), Gaps = 23/189 (12%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S ++ + LV+ +LA ++ + D + V + D + Sbjct: 152 IDVSGSMNMENRLELVKRSLAMLVEKLDSRDTLAIVV------YGDTARTVLEP---TRI 202 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + R+ + + GST + +Q AY +++ ++ ++L +DG Sbjct: 203 MDRSRILEAINALHPEGSTNVQAGLQVAYAI-------AASQVREGATSR--VILCSDGV 253 Query: 379 NTQDNEEGIAICNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + +I K+ QG+R+ T+ F + E + ++ Sbjct: 254 ANNGITQADSIFQSVKAYAQQGVRLTTVGFGMGNYNDELMERLSH--VGDGQYAYVDALP 311 Query: 436 ELNKIFRDR 444 E +IF ++ Sbjct: 312 EARRIFIEQ 320 >gi|54025448|ref|YP_119690.1| hypothetical protein nfa34780 [Nocardia farcinica IFM 10152] gi|81374389|sp|Q5YU15|Y3478_NOCFA RecName: Full=UPF0353 protein NFA_34780 gi|54016956|dbj|BAD58326.1| hypothetical protein [Nocardia farcinica IFM 10152] Length = 335 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 69/192 (35%), Gaps = 38/192 (19%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 +D + + +G F S + R VK + TA + + TA Sbjct: 122 VDGLTQGINLGFVTFAGTASVMQSPT-----TNREAVKAAIDNIKLAERTATGEGILTAL 176 Query: 348 DTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGI-------AICNKAKSQGIR 399 +I E + IVL++DG+ T +++ + AKS+GI Sbjct: 177 QSI----ETLATVLGGAETPPPARIVLMSDGKQTVPDDKDVDNPRHAFTAARLAKSKGIP 232 Query: 400 IMTIAFSV--------NKTQQEKARYF--------LSNCA--SPNSFFEANSTHELNKIF 441 + TI+F ++ Q ++ L A S F+ A+S EL ++ Sbjct: 233 VSTISFGTEWGSVEIPDQDGQGGSQRVKVPVDNESLREIAKLSGGEFYTASSLEELTAVY 292 Query: 442 ---RDRIGNEIF 450 ++IG E Sbjct: 293 DTLEEQIGYETT 304 >gi|170589747|ref|XP_001899635.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158593848|gb|EDP32443.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 634 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 14/125 (11%) Query: 291 VNDTVRMGATFFNDRVISDP-SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 D VR+ + + FS+G ++ + I G T A+ A Sbjct: 103 DKDDVRIAMIQY---AETPIVEFSFGTYRDLPDITNHIMTINLHSGGTRTGKALLAAKGE 159 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVN 408 + S + K IVL TDG ++ I + + + I+I + + Sbjct: 160 LFSEEKGARKNA------SKIIVLFTDG---LSVDDPIKHAQQLREIEKIKIYVVYVGSD 210 Query: 409 KTQQE 413 +QE Sbjct: 211 GFEQE 215 >gi|134093164|gb|ABO53024.1| matrilin 4 isoform 1 precursor, 3 prime [Chlorocebus aethiops] Length = 243 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 17/153 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + ++ + + R+G F+ RV ++ G + + + E Sbjct: 24 ELVKRFVNQIVDFLDVSP---EGTRVGLVQFSSRVRTEFPL--GRYGTAVEVKQAVLAME 78 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S + R N + +V TDG + ++ Sbjct: 79 YMERGTMTGLALRHMVEHSFSEAQGARPRALN--VPRVGLVF-TDG---RSQDDISVWAA 132 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 133 RAKEEGIVMYAVGVGKAVEAE------LREIAS 159 >gi|262193846|ref|YP_003265055.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077193|gb|ACY13162.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 344 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 27/144 (18%), Positives = 49/144 (34%), Gaps = 22/144 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + +A + +++ R+G F R + + R I+ Sbjct: 115 TRLARAKAEVAELSSALRG-------HRIGLVAFAGRASVLAPLTPD-YGFFRMILDGVD 166 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I A++ A R + K I+L+TDGE+ E Sbjct: 167 TKSVSRGGTEIGQALRKAV------------RSFDPGPGAKMILLITDGEDHGGYAE--D 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 +A G+R++ I F + Q Sbjct: 213 AAREALEAGVRVVAIGFGSEQGSQ 236 >gi|118349482|ref|XP_001008022.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289789|gb|EAR87777.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 632 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 56/180 (31%), Gaps = 22/180 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ L ++ + D R+ FN + K I K Sbjct: 225 KIQNVKTTLLQLLDMLNSND------RLSLILFNSYPTLLCNLRKVDDKNTPNIQK-IIN 277 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T IN M A++ + I LL+DG++ +E+ Sbjct: 278 SITAEEYTDINSGMLMAFNILQKRQ---------FFNPVSSIFLLSDGQDNGADEKIKKY 328 Query: 390 CN---KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 N K++ I + F + R +F+ +++++ F D +G Sbjct: 329 INSNQSLKNECFSIHSFGFGSDHDGPLMNRICQLK---DGNFYYVEKINQVDEFFVDALG 385 >gi|62001316|gb|AAX58360.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 99/363 (27%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G + L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|85374662|ref|YP_458724.1| von Willebrand factor type A domain-containing protein [Erythrobacter litoralis HTCC2594] gi|84787745|gb|ABC63927.1| von Willebrand factor type A domain protein [Erythrobacter litoralis HTCC2594] Length = 580 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 45/275 (16%), Positives = 77/275 (28%), Gaps = 29/275 (10%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 M + +N F DR + L Sbjct: 149 GQMPPKAAVRTEEFINYFRYDYDRPQDRSQPFTVNFDAARTPWNEDTRLIRIGLAGYDIE 208 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P + +D S K LV+ ALA + ++ D V+ V GA Sbjct: 209 RSERPPANLVFLMDVSGSMGRPDKLPLVKTALAGLAGELQPQDKVSIVVYAGAAG----- 263 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + K+ + GSTA +Q AY Sbjct: 264 -LVLEPTNDTRKIRAAL-----NQLQAGGSTAGGAGIQLAYQI-------AEDNFIEGGV 310 Query: 367 AKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + ++L TDG+ + I + K + GI + T+ F + + A Sbjct: 311 NR--VILATDGDFNVGVSSRDALIEMIEKKRDSGITLTTLGFGTGNYNEA----MMEQIA 364 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + + A L + +G+E+ + I K Sbjct: 365 NHGNGNYAYIDSALEA--KKVLGDEMSSTLFTIAK 397 >gi|193788521|dbj|BAG53415.1| unnamed protein product [Homo sapiens] Length = 328 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 52/163 (31%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + + G T+I +++A+ I + Sbjct: 58 GMVTFDSAAHVQSELIQINSGSDRDTLAKR-LPAAASGGTSICSGLRSAFTVIRKKYPTD 116 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ---- 412 IVLLTDGE+ I+ C N+ K G I T+A + Q+ Sbjct: 117 GSE----------IVLLTDGEDN-----TISGCFNEVKQSGAIIHTVALGPSAAQELEEL 161 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 K L AS + + + L F GN + Sbjct: 162 SKMTGGLQTYASD----QVQN-NGLIDAFGALSSGNGAVSQRS 199 >gi|116876155|gb|ABK30937.1| complement component 2/factor B variant 2 [Carcinoscorpius rotundicauda] Length = 889 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 36/254 (14%), Positives = 80/254 (31%), Gaps = 25/254 (9%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S + ++ + N F ++ K +++ ++ R + + + +PG Sbjct: 364 SQISFRHIFKTSSNNNFRMSTNKVSKMLANKGFEMSRRSIFQKSPRIEPRRRRIDLNFPG 423 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV-IRSIKKIDNVNDTVRMGATFFNDRV 306 L + S ++ + + + ++ R GA F+ V Sbjct: 424 RLVIYFVFDA-SGSIGRKYFNSSIKFAKGLVTRMGVKEFGT--------RFGAVSFSSTV 474 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + ++ + G TAI+ A+ +I H + Sbjct: 475 SASFLPQ--DYTTEEEVLNALDKFDFTEGGTAISSALDFVKTQMI---PLSKH-TFADRA 528 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASP 425 K I LLTDG+ + + + K+ I +IA + + + L AS Sbjct: 529 MKTIIFLLTDGKANMRG-DPKQVAKELKADVKAEIYSIALTGDYDINK-----LREVASS 582 Query: 426 --NSFFEANSTHEL 437 + + L Sbjct: 583 KKDHVYILKDYETL 596 >gi|115315541|gb|AAV65032.2| complement component 2/factor B variant 1 [Carcinoscorpius rotundicauda] Length = 889 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 36/254 (14%), Positives = 80/254 (31%), Gaps = 25/254 (9%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S + ++ + N F ++ K +++ ++ R + + + +PG Sbjct: 364 SQISFRHIFKTSSNNNFRMSTNKVSKMLANKGFEMSRRSIFQKSPRIEPRRRRIDLNFPG 423 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV-IRSIKKIDNVNDTVRMGATFFNDRV 306 L + S ++ + + + ++ R GA F+ V Sbjct: 424 RLVIYFVFDA-SGSIGRKYFNSSIKFAKGLVTRMGVKEFGT--------RFGAVSFSSTV 474 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + ++ + G TAI+ A+ +I H + Sbjct: 475 SASFLPQ--DYTTEEEVLNALDKFDFTEGGTAISSALDFVKTQMI---PLSKH-TFADRA 528 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASP 425 K I LLTDG+ + + + K+ I +IA + + + L AS Sbjct: 529 MKTIIFLLTDGKANMRG-DPKQVAKELKADVKAEIYSIALTGDYDINK-----LREVASS 582 Query: 426 --NSFFEANSTHEL 437 + + L Sbjct: 583 KKDHVYILKDYETL 596 >gi|290543406|ref|NP_001166514.1| cochlin [Cavia porcellus] gi|195970365|gb|ACG60666.1| coagulation factor C [Cavia porcellus] Length = 553 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 68/191 (35%), Gaps = 19/191 (9%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S M+ S++ +D SS + ++ + ++++ ++ + D ++ Sbjct: 355 SHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIA 410 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 A F FS+ + ++ G TA DA+ + D Sbjct: 411 AVQFT--YDQRTEFSFTDYSTKENVLAVIRSIRYMSGGTATGDAISFTVRNVFGPVRDS- 467 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K ++V++TDG++ D A + A GI I ++ + + + Sbjct: 468 -------PNKNFLVIITDGQSYDDVRGPAAAAHDA---GITIFSVGVAWAPL--DDLKDM 515 Query: 419 LSNCASPNSFF 429 S ++FF Sbjct: 516 ASKPKESHAFF 526 >gi|320101795|ref|YP_004177386.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319749077|gb|ADV60837.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 764 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 53/376 (14%), Positives = 100/376 (26%), Gaps = 35/376 (9%) Query: 44 VRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS--RAKNSFTFPKQKIEEYLIRNFENN 101 + L + A I V ++ E V + + + Sbjct: 76 TSMTLEAQYLFPVPENAAIRNLVLMVDGKELVGKLMPRDEARRVYEGIVRSKKDPALLEY 135 Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 + V I + + S D S I I+ Sbjct: 136 TGRGLIQTSVFPIPPGADRTITLKYSTLLPRTFGAVDFTFPFGGRGFTSKPIGKLRIEVD 195 Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSR---SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 ++ H S ID +M+ +QR+S + + + Sbjct: 196 LRTTNDLKTIHSP---SHDVAIDRKGNRDAMVTFQRESFLPDQDFRLLFNEGEGALGAMV 252 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 +E + ++ S P P+ + +D S KK R A+ Sbjct: 253 LSHPPTANEDGTLLLL---ASPTIETTPNKTPPAKTVVLILDRSGSMSG-KKIEQARAAM 308 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 V+ ++ + D N ++D V + R F GST Sbjct: 309 KFVVENLNQDDLFN------LILYDDTVEMFKPELLRCNAENRAEALRFIEGVRPGGSTD 362 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---- 394 I+ ++ I + Y++ LTDG T + I A+ Sbjct: 363 IDQGLRAGLKLIADESRPN------------YVIFLTDGLPTSGETNELKIAEAARAANP 410 Query: 395 -SQGIRIMTIAFSVNK 409 + + + + VN Sbjct: 411 LKAKLFVFGVGYDVNA 426 >gi|126306129|ref|XP_001365364.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 895 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 54/163 (33%), Gaps = 27/163 (16%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + T + G T+I ++TA+ I + Sbjct: 345 GMVTFDSSATIQSELIQIETDAQRNSLITR-LPTVAGGGTSICSGLRTAFTVIKKKFSTD 403 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAF--SVNKTQQEK 414 IVLLTDGE+ I+ C ++ K G I T+A S + ++ Sbjct: 404 GSE----------IVLLTDGEDN-----TISTCFDEVKQSGAIIHTVALGPSADPGLEKL 448 Query: 415 ARYF--LSNCASPNSFFEANSTHELNKIFRD-RIGNEIFERVI 454 A + A+ N+ + L F GN + Sbjct: 449 AEMTGGMKTTATDNA-----QNNGLIDAFSALSSGNGAITQRS 486 >gi|297287373|ref|XP_001118050.2| PREDICTED: hypothetical protein LOC721855 [Macaca mulatta] Length = 2077 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 91/241 (37%), Gaps = 40/241 (16%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR- 296 + +L P P S S+ +K + D + S + IDN+ D R Sbjct: 1090 AKAPLLRLRPDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRC 1147 Query: 297 -------MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYD 348 GA ++D V + R +K+ G T + A++ + Sbjct: 1148 DRNLVWNAGALHYSDDVEIIQGLTRMPGD--RDTLKSRVDAIKYFGKGTYTDCAIKKGLE 1205 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIA 404 ++ ++L+ KY++++TDG E ++ G+ N+AK G+++ ++A Sbjct: 1206 QLLVGG--------SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVA 1257 Query: 405 FSVNKTQQEKARYFLSNCASPNSF---FEA------NSTHELNKIFRDRIGNEIFERVIR 455 + + + LS A+ +++ F A E+ D I + I V + Sbjct: 1258 ITPDHLEPR-----LSIIATDHTYRRNFTAADWGQSRDAEEVISQTIDTIVDMIKNNVEQ 1312 Query: 456 I 456 + Sbjct: 1313 V 1313 >gi|256821839|ref|YP_003145802.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795378|gb|ACV26034.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 958 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 47/272 (17%), Positives = 75/272 (27%), Gaps = 36/272 (13%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + G D + S S + R + Y CN L+Y P L+ Sbjct: 307 METVNSASSSSIGVDIDVSGGSSGSGLFDMMGRYLGVLSYGTDCN--LHYCPSPSVLNDI 364 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT----------VRMGATFF 302 + V + VI + + + I + + RMG F Sbjct: 365 ATTPVPVPDQDIMLVIDRSGSMSGDAGTGQSKIDEAKDSASLFVQLVEASAGHRMGLVSF 424 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS------TAINDAMQTAYDTIISSNED 356 + D ++ +G T+I D + A + Sbjct: 425 STSASIDEGIGNLNPGKKNQLIGPAPYSGGAVGGLIPDGWTSIGDGIDKAQSELT----- 479 Query: 357 EVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K I+LLTDG +NT E N RI I Sbjct: 480 ------GGANP-KTILLLTDGLQNTPPMIETAT--NDIGDT--RIHAIGLGTEANLNGGL 528 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 L+ ++ ++ A EL K F G+ Sbjct: 529 LSDLTQ-STGGAYTRAGDGLELKKFFALAFGD 559 >gi|170571356|ref|XP_001891697.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158603658|gb|EDP39502.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 319 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 30/229 (13%), Positives = 77/229 (33%), Gaps = 13/229 (5%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D + + + + AD T K K + + + Sbjct: 102 AVDKNELLEITGNINHIILQSGRKLAADITRKLLRQAQEKCRTTTTTTTTTTTTTTATTT 161 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + + +++ K + + + ++ R+ Sbjct: 162 TTNPISGCEQDVVLVMDLSTTTNPVYRKYIEMAEELINRLLI-------GRRFSRIALIT 214 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + F+ + + IV E+ G+TA+ + ++ + +D+ H Sbjct: 215 FSSVGKTRTQFNLDRYFNGKDIVTAIRRLESSGGTTAVGEGIR-----LGIEQKDKQHGG 269 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + N AKK +++ TDG + + + + AK G + T+ + + K+ Sbjct: 270 RPNEIAKKAMLVFTDGWSNKGP-DVEEMSRNAKGAGFTLYTVVYEIQKS 317 >gi|331085807|ref|ZP_08334890.1| hypothetical protein HMPREF0987_01193 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406730|gb|EGG86235.1| hypothetical protein HMPREF0987_01193 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 1321 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 31/268 (11%), Positives = 78/268 (29%), Gaps = 35/268 (13%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + Y D+ Q T + + + + N + Sbjct: 437 ERQEDFTQYVTDTVNQKAASVSISGIDTKEFETVRAVVSLEEGIADTEEKFRGNVEILDC 496 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIK------KKHLVRDALASVIRSIKKIDNVNDTVRM 297 P E+ + +L K ++ A+++ + + D V + Sbjct: 497 GVEIPDYKVKKLEYDTVNIALCCDNSGSMEGEKIENLKKAVSTFVGKL------ADEVNI 550 Query: 298 GATFFNDRVISDPSFSWGV--HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 G F V+ GV R ++ T I ++ + + Sbjct: 551 GIVPFGSGVLE------GVCEPGSSREKLEQSVESFRSDSGTNIYSGVEYTLSMLAKEKD 604 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMTIAFSVNKTQQEK 414 V+++DG+++ +EE + A G I + ++ + + Sbjct: 605 AL-----------NIAVIMSDGQDSIPSEEQLQKITSACENGNILLYSMGLGADVESEVL 653 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFR 442 + Y + A ++ + ++ L ++ Sbjct: 654 STY---SDAGNGAYVFVSDSNSLYSFYQ 678 >gi|160899637|ref|YP_001565219.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160365221|gb|ABX36834.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 244 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 60/174 (34%), Gaps = 18/174 (10%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V DA+ ++ + +N + + F +V + + Sbjct: 35 KIRNVNDAVRDMLDTFSDTENGETEIHVAIITFGSQVALHQPLA--------SASDIHWQ 86 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-A 388 D + G T + A+Q A I + + + +VL++DG E+ + A Sbjct: 87 DLSAGGMTPLGTALQMAKAMIEDK------DVIPSRAYRPTVVLVSDGGPNDAWEKPLNA 140 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + +S + +A + + ++ + N F A + +L F+ Sbjct: 141 FISDGRSAKCDRLAMAIGADADEAVLGKFIE---GTSNRLFYAENAKQLRDFFK 191 >gi|190410234|ref|YP_001965735.1| terY1 [Klebsiella pneumoniae] gi|146151027|gb|ABQ02793.1| terY1 [Klebsiella pneumoniae] Length = 239 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 55/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 48 IEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSARQAVPLT--------DLLSFQMPA 99 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 100 LTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 157 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + + L + + + F+ Sbjct: 158 KAART-GVVV------ACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFK 202 >gi|254453558|ref|ZP_05066995.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] gi|198267964|gb|EDY92234.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] Length = 676 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 55/427 (12%), Positives = 119/427 (27%), Gaps = 44/427 (10%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK-QAAQTAII------TASVP 67 + S TG + TA++ + D++ + + A++ + + Sbjct: 71 LTSRTG-LTVTTAIVACGFMIGMPQGRDILNLAPTARLAQPTPPDAAVLDPAGSVSVAPA 129 Query: 68 LIQSLEEVSSRAKNSFTFPKQK----IEEYLIRNFENNLKKNFTDREVRDIVRDTAVE-- 121 L+ + ++ A + + + E R ++ N + R Sbjct: 130 LVDETTDDANVAALAEPVMEMAPAVVLSESSSREEADSAIGNPATAAPLSLTRAAPTNDL 189 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + QVV + ++ ++ E + S + + Sbjct: 190 VAEGMALGQVVSAPEPNVAGIGELDTESFANDTPNPLKITTEEPVSTLSIDVDTAA---Y 246 Query: 182 VIDFS----RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + S + + +N F V +V Sbjct: 247 ALIRSSLTAGQLPPADAVRIEEMINYFPYAYPAPDGQPFQPTINVFETPWNADTQLVHIG 306 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 L + P+L+ +D+S K L+R + ++ ++ D V Sbjct: 307 --LQGEMPSIQDRPALNLVFLIDTSGSMESADKLPLLRQSFRLMLDNLAPEDEVAIVTYA 364 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G+T + RT + N GST ++ AY Sbjct: 365 GSTSIALQPTQASE---------RTTILAALNALNAGGSTNGQGGLEQAY--------AL 407 Query: 358 VHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 MK + + + ++L TDG+ D A + G + + F Q Sbjct: 408 AETMKTDGDVSR-VILATDGDFNVGLSDPRGLQAYIEDKRDTGTYLSVLGFGRGNLQDAT 466 Query: 415 ARYFLSN 421 + N Sbjct: 467 MQSLAQN 473 >gi|328784200|ref|XP_003250409.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Apis mellifera] Length = 2258 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 31/103 (30%), Gaps = 18/103 (17%) Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + K G T A+ AY + + + KK + L+TDG + Sbjct: 172 LNKQLTNISYTGGGTYTRGALLEAYRILEKARSNA----------KKAVFLITDGFSNGG 221 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + N K G I T + L + AS Sbjct: 222 --DPRPAANLLKGAGATIFTFGIRTGNVDE------LHDIASS 256 >gi|239827908|ref|YP_002950532.1| hypothetical protein GWCH70_2571 [Geobacillus sp. WCH70] gi|239808201|gb|ACS25266.1| Ig domain protein group 2 domain protein [Geobacillus sp. WCH70] Length = 942 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 43/203 (21%), Positives = 67/203 (33%), Gaps = 23/203 (11%) Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 + + FS + D G+ + D T+ S + K D + + K Sbjct: 10 LFIALFFSFYLGDATNIVFGESNDSNNATLDFTITSSQLEYAKPPNGDAQGRLDVTLIPK 69 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + P+D FV S K + AL S + K N ND R Sbjct: 70 GRVDNIVRPPIDV-----VFVFDVSGSMTPLKLQSAKYALQSAVDYFKANANPND--RFA 122 Query: 299 ATFFNDRVISDP--SFSWGVHKLIRTI--VKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F+ V + F G + + + + + A D G T ++Q A Sbjct: 123 LIPFSSDVQYNKVVPFPTGAYDVKQHLERIANVANDLRAYGGTNYTQSLQQA-------- 174 Query: 355 EDEVHRMKNNLEAKKYIVLLTDG 377 N+ KKYI+ LTDG Sbjct: 175 ----QSFFNDPTRKKYIIFLTDG 193 >gi|326529585|dbj|BAK04739.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 742 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 41/113 (36%), Gaps = 16/113 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + Sbjct: 300 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRKQSL-LAV 352 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 G T I + ++ S E + KN + + I+LL+DG++T Sbjct: 353 NSLTSNGGTNIAEGLRK------GSKVIEERQAKNPVCS---IILLSDGQDTY 396 >gi|224081306|ref|XP_002190595.1| PREDICTED: matrilin 1, cartilage matrix protein [Taeniopygia guttata] Length = 493 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 64/194 (32%), Gaps = 23/194 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ SS + ++ ++ VI ++ N R+G + V ++ S G Sbjct: 41 VFIIDSSRSVRPHEFEKIKVFVSRVIEALDVGPNA---TRVGVINYASAVRNELSLQ-GP 96 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H + I G T A+Q A S+ E N KK +++TD Sbjct: 97 HSKAALLQAVRRIQPLSTG-TMTGLAIQFAISRAFSAAEGGRGSAPN---FKKVAIVVTD 152 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G ++ A +A++ GI I I L AS S Sbjct: 153 GRPQDGVQDVSA---RARAAGIEIFAIGVGRVDMGT------LRQMASEPLDEHVDYVES 203 Query: 434 ---THELNKIFRDR 444 +L F++ Sbjct: 204 YSVIEKLTHKFQEA 217 >gi|45384200|ref|NP_990400.1| integrin alpha-1 [Gallus gallus] gi|2582830|dbj|BAA23160.1| alpha1 integrin [Gallus gallus] Length = 1171 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 64/174 (36%), Gaps = 16/174 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ +D ++G + V+ + + I + T + Sbjct: 187 LRNMDIGPQQTQVGIVQYGQTVVHEFYL-NTYSTTEEVMDAALRIRQRGGTQTMTALGID 245 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + E H + + +K +V++TDGE + DN + +K + + I+ IA Sbjct: 246 TAREEAF----TEAHGARRGV--QKVMVIVTDGE-SHDNYRLQEVIDKCEDENIQRFAIA 298 Query: 405 FSVNKTQ----QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 + ++ EK + + AS FF + L I + +G IF Sbjct: 299 ILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALVTI-VEALGERIFA 351 >gi|260778153|ref|ZP_05887046.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] gi|260606166|gb|EEX32451.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] Length = 397 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 59/437 (13%), Positives = 138/437 (31%), Gaps = 66/437 (15%) Query: 14 LIKSCTGHFFIIT-ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 ++K TG + ALL+P+++ ++ + A++ A A + + Sbjct: 1 MLKQHTGSVSLSFLALLIPLVVLSAATIMIGFQVQLSSRAMQ-AVDAASLACA------F 53 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S + N EY N + +I + E+N YQ+ Sbjct: 54 ADYSDPSVNQAYL------EYYQPNVK---------LVKSEIYSASGCELN---MGYQLT 95 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L S +S ++ + A + V+D S SM Sbjct: 96 GLFS-SLKFAQASYSAQSGSVEQAHVNQSASVT----------PTEMTLVLDISSSMAGS 144 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ R ++ N ++ R + + + Sbjct: 145 --------IDTLKSILTRAIERIEQDNVQIDGRRAISISIVPFSDGV-----------SA 185 Query: 253 LSEEHFVDSSSLRHV-IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + D + K+ + ++++ +I + F + + Sbjct: 186 RNADWLDDKGVFCIDGLTKESGGSVLVNETVQNLDRIHSEKAVSHRAPDEFLADCSASAT 245 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + VKT G T + +I E +L K+ + Sbjct: 246 LVPLTDNMSE--VKTAINALTTTGGTRSYQGVIWGARQLIPRWRQEWGYNPYSLAPKQKL 303 Query: 372 VLLTDGENTQDNEEGI---AICNKAKSQG-IRIMTIAFSVNKTQQEKARYFLSNCASP-- 425 +L+TDG ++ + + +C++ ++ I + I F+V ++ + + ++ + Sbjct: 304 ILMTDGVDSGYVLDDLIDAGLCDRLANEFAIELNFIGFNVQDSRLAQFQSCINAANTDGI 363 Query: 426 -NSFFEANSTHELNKIF 441 F A +T +L++ F Sbjct: 364 KGQVFSATNTEKLDEYF 380 >gi|194223903|ref|XP_001494710.2| PREDICTED: integrin, alpha 1 [Equus caballus] Length = 1208 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 74/223 (33%), Gaps = 16/223 (7%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 C+ + P +D + + + + ++++D Sbjct: 177 CSDVSPTFQVVNSIAPVRECSSQLDIVIVLDGSNSIYPWKSVTDFLNDLLERMDIGPKQT 236 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G + + V + + +V I + T + TA + Sbjct: 237 QVGIVQYGENVTHEFNL-NKYSSTEEVLVAANKIVQRGGRQTMTALGIDTARKEAFTEAR 295 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---- 411 H +K K +V++TDGE + DN + + + Q I+ +IA + + Sbjct: 296 GARHGVK------KVMVIVTDGE-SHDNHQLNQVIQDCEKQNIQRFSIAILGHYNRGNLS 348 Query: 412 QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 EK + + AS FF + L I + +G IF Sbjct: 349 TEKFVEEIKSIASEPTEKHFFNVSDELALVTI-VEALGERIFA 390 >gi|304393605|ref|ZP_07375533.1| conserved hypothetical protein [Ahrensia sp. R2A130] gi|303294612|gb|EFL88984.1| conserved hypothetical protein [Ahrensia sp. R2A130] Length = 170 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 59/138 (42%), Gaps = 5/138 (3%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALL-MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + + ++ + G+ II A +P+ +GV G +++ ++S + L++A+ A ++ Sbjct: 12 KPWRQFLRRFANNERGNVMIIFAAAALPMAIGVAG-ALEISQYSQLKSQLQEASDRAALS 70 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD-TAVEM 122 A L + + +A+ + ++ + L T + R + + +++ Sbjct: 71 AMAALREGPRAMRQQARLVMKQTGDNAK--GLKGASSALGGRVTGKGRRLALSHRSTIKV 128 Query: 123 NPRKSAYQVVLSSRYDLL 140 + + + +S + + Sbjct: 129 DGSMTGLKSFVSDKITVT 146 >gi|257456195|ref|ZP_05621392.1| BatA protein [Treponema vincentii ATCC 35580] gi|257446281|gb|EEV21327.1| BatA protein [Treponema vincentii ATCC 35580] Length = 332 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 49/197 (24%), Gaps = 40/197 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + S + +G T + H + + + Sbjct: 111 TRLEAAKRIIKSFAEKYEGDS-------LGLTALGSSAAVLIPPTIDRHTFLTRLDQLQV 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI + +A + + H I+L TDG+N A Sbjct: 164 GEL--GDGTAIGMGLASAVLHLTQYSTLPSH-----------IILFTDGDNNTGEIHPRA 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 + K + I I + K L A + Sbjct: 211 AADIIKHKKIGFYIIGLGKSGYAPVKYIDPIQKKEISGTLNTVFNETELQKIAGYGNGRY 270 Query: 429 FEANSTHELNKIFRDRI 445 F A S L IF I Sbjct: 271 FSAKSPELLTDIFNRFI 287 >gi|327542236|gb|EGF28725.1| BatB protein [Rhodopirellula baltica WH47] Length = 700 Score = 45.4 bits (105), Expect = 0.023, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 45/112 (40%), Gaps = 15/112 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + V +T+ I G + + DA++ A D + Sbjct: 120 RVGLVVFAGETRQTLPLTRHVEDFKQTLDS-VGIHSVRRGGSRLGDAIRVASDAFLDKTT 178 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFS 406 D K +V+LTDGE+ + + ++ +A + QGIRI TI Sbjct: 179 DH-----------KAMVILTDGEDQES--DPVSEAKRAHEEQGIRIFTIGLG 217 >gi|296282333|ref|ZP_06860331.1| von Willebrand factor type A domain-containing protein [Citromicrobium bathyomarinum JL354] Length = 571 Score = 45.4 bits (105), Expect = 0.023, Method: Composition-based stats. Identities = 55/413 (13%), Positives = 113/413 (27%), Gaps = 31/413 (7%) Query: 39 MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 + VV + A A ++T + +S E P + F Sbjct: 5 AAITVVSLALAGCATADNADEIVLTGAKA-ERSAEVPPPPPPPPPPPPPPPPPSPSAQAF 63 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 T + D + + V + P + + Sbjct: 64 AAPSAIMVTGSRISREEADATAQPQTTSPDLRYV--PSIVIPTVPDRERYDGKDVSEVAV 121 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 + T S S + M + +N F R + + Sbjct: 122 TLEQPVSTFSVDVDT-GAYSNARRMLTDGQMPPKGAVRTEEFVNYFRYDYPRPTSAQDAP 180 Query: 219 -NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + L P P+ + +D S K LV+ A Sbjct: 181 FTVNMDVARTPWDADTRLVRIGLAGYEAPKAERPAANLVFLLDVSGSMSSADKLPLVKTA 240 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + +++ + D V+ V GA V+ S S + + + + Sbjct: 241 MKTLVGQLTPKDRVSIVVYAGAAG---LVLEPTSDSREIMAALDQLQAGGSTAGGAG--- 294 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAK 394 ++ AY +S D ++R ++L TDG+ DN++ + + Sbjct: 295 -----LELAYKVAEASKVDGINR----------VILATDGDFNVGLSDNDKLLEYVEDKR 339 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 GI + + F + ++ +S E K+ +++G Sbjct: 340 KNGIAMSVLGFGRGNINEALMEQIADK--GNGNYGYIDSAIEARKVLGEQLGA 390 >gi|323498921|ref|ZP_08103904.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] gi|323316033|gb|EGA69061.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] Length = 418 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 11/57 (19%), Positives = 27/57 (47%), Gaps = 1/57 (1%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAK 80 ++T + V++GV + +D ++ L+ A TA + + ++ EE ++ Sbjct: 20 VMTVSMA-VIVGVAALSIDANHLMVSKNRLQNALDTAALAGATVANRTYEEDDAKEA 75 >gi|209546584|ref|YP_002278502.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537828|gb|ACI57762.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 698 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 48/394 (12%), Positives = 121/394 (30%), Gaps = 37/394 (9%) Query: 40 LVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFE 99 D + + QAA++A + + + +++++S R Sbjct: 134 AADSTQVLQDKQP--QAAKSAAELRADFDAGEIATLKNKSEDSAAALG-----MAKRAAP 186 Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 + + + + + P Q+ L + N + ++S+ + + Sbjct: 187 AAPGVVAQGQLLAEPMAVAPSPVPPADGHMQIQLDPSRERFANAAANPIKSVA--TDPVS 244 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 T + + ++ +M D Q + +N F K+ Sbjct: 245 TFSADVDSASYSFVRRSLT-------GGAMPDPQSVRVEEMINYFPYDWAGPEKADQPFK 297 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 V + + + ++ P + +D S K L++ A Sbjct: 298 ATVTVMPTPWNHDTELMHVAIKGYDIAPATAPHANLVFLIDVSGSMDEPDKLPLLKSAFR 357 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++ +K D V+ G N + +P ++ + + GST Sbjct: 358 LLVSKLKADDTVSIVTYAG----NAGTVLEP-----TRVAEKSKILSAIDRLEAGGSTGG 408 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQ 396 + ++ AY+ + +K+ + ++L TDG+ +E+ I + + Sbjct: 409 AEGIEAAYNLAKQA------FVKDGVNR---VMLATDGDFNVGPSSDEDLKRIIEEKRKD 459 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 GI + + F + N ++ + Sbjct: 460 GIFLTVLGFGRGNLNDSLMQTLAQNGNGSAAYID 493 >gi|325661940|ref|ZP_08150560.1| hypothetical protein HMPREF0490_01298 [Lachnospiraceae bacterium 4_1_37FAA] gi|325471792|gb|EGC75010.1| hypothetical protein HMPREF0490_01298 [Lachnospiraceae bacterium 4_1_37FAA] Length = 1321 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 65/205 (31%), Gaps = 29/205 (14%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 L+ +S +K ++ A+++ + + D V +G Sbjct: 500 IPDYKVKKLEYDTVNIALCCDNSGSMEGEKIENLKKAVSTFVGKL------ADEVNIGIV 553 Query: 301 FFNDRVISDPSFSWGV--HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F V+ GV R ++ T I ++ + + Sbjct: 554 PFGSGVLE------GVCEPGSSREKLEQSVESFRSDSGTNIYSGVEYTLSMLAKEKDAL- 606 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMTIAFSVNKTQQEKARY 417 V+++DG+++ +EE + A G I + ++ + + + Y Sbjct: 607 ----------NIAVIMSDGQDSIPSEEQLQKITSACENGNILLYSMGLGADVESEVLSTY 656 Query: 418 FLSNCASPNSFFEANSTHELNKIFR 442 + A ++ + ++ L ++ Sbjct: 657 ---SDAGNGAYVFVSDSNSLYSFYQ 678 >gi|304347707|gb|ADM25314.1| MIC2-like protein 1 [Neospora caninum] gi|325118031|emb|CBZ53582.1| hypothetical protein NCLIV_033690 [Neospora caninum Liverpool] Length = 756 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 59/169 (34%), Gaps = 11/169 (6%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF-AIDENEMGSTAI 339 V+ I + ++ V + F D +F + + F + GSTA Sbjct: 91 VLDFIDLVPISSEEVHLSVVTFADSPQDVFTFKQPQATNKQLAKEAFKYLRYRRGGSTAT 150 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 + + A + K +VL+TDGE + + + I ++A+++GI Sbjct: 151 DKGLIRARRYLTRP------VYGTRANVPKVLVLMTDGE-SDRHYDTIQAADQARAEGIS 203 Query: 400 IMTIAF-SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + N + P F ++ +EL + +G Sbjct: 204 VFVVGVGMANPVECRGVCGCGRYGPCPQ--FIMSNWNELVQTVDSIMGE 250 >gi|301623011|ref|XP_002940816.1| PREDICTED: integrin alpha-M-like [Xenopus (Silurana) tropicalis] Length = 907 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 68/225 (30%), Gaps = 22/225 (9%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 Y + S + + F+ S + + ++ VI K D Sbjct: 209 YSCDPSTSRCQRVPINECPVRSLDIVFLIDGSGSIYPDQFQEMLTFVSKVIEDFKGTD-- 266 Query: 292 NDTVRMGATFFNDRVISD-PSFSWGVHK--LIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 T F S + +K + G+T A+Q Sbjct: 267 --------TLFAIMQYSSWFVLHFDFNKFLATKDHNSLVRNINQLKGATWTATAIQKVLS 318 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + + +K ++++TDGE D+ + +A+ +GI I Sbjct: 319 QLFIPSRGARDGS------QKLLIVITDGEKYGDSLDYSIPIAEAERKGIVRFAIGVGRA 372 Query: 409 KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ ++ + S + L+K FR + ++IF Sbjct: 373 FSEDTAYSELITIASQPSDKYVYRVGDFSALSK-FRKDLQDKIFA 416 >gi|81897704|sp|Q8BVM2|ANTRL_MOUSE RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|26346064|dbj|BAC36683.1| unnamed protein product [Mus musculus] Length = 641 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 39/119 (32%), Gaps = 10/119 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +R+ ++ + ++ ++++ G T + ++ A N Sbjct: 110 LRISIITYSTEAEVILPLTSDSKEINKSLL--VLKSIVPQGLTHMQKGLRKA-------N 160 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENT-QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 E I+ LTDG + + + KA+ G + T+ + QQ Sbjct: 161 EQIRKSTLGGRIVNSVIIALTDGLLLLKPYLDTMEEAKKARRMGAIVYTVGVFMYSKQQ 219 >gi|307153048|ref|YP_003888432.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306983276|gb|ADN15157.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 413 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 54/177 (30%), Gaps = 29/177 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ A +++ + D R+ F+ R V Sbjct: 56 RPLETVKKAAIELVKQLNPED------RVSVIAFDHRAKVIVP---NQGIEDLNTVIEQI 106 Query: 329 IDENEMGSTAINDAMQTA-YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEG 386 G TAI++ ++ ++ + E I LLTDGEN DNE Sbjct: 107 KALRAAGGTAIDEGLKLGIKESALGKQERVSQ-----------IFLLTDGENEHGDNERC 155 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIF 441 + + A I + T+ F + L A S + + + F Sbjct: 156 LKLAQVASDYNITLNTLGF-----GNHWNQDVLEKIADSAGGSLSYIENPEKALEEF 207 >gi|288800164|ref|ZP_06405623.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333412|gb|EFC71891.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] Length = 339 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 17/117 (14%), Positives = 37/117 (31%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + + G T I A++ + + Sbjct: 130 KIGLVVFAGEAYVQLPITSDYVSAKMFLSDITPNLISAQG-TDIARAIRVSLSSFTQQKG 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K I+L+TDGE+ + + +AK +G+ + + +K Sbjct: 189 -----------VGKAIILITDGEDNEG--GALEAVKEAKEKGVNVFILGVGDSKGAP 232 >gi|163801668|ref|ZP_02195566.1| hypothetical protein 1103602000597_AND4_09447 [Vibrio sp. AND4] gi|159174585|gb|EDP59387.1| hypothetical protein AND4_09447 [Vibrio sp. AND4] Length = 524 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 73/227 (32%), Gaps = 20/227 (8%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 Y + + G I ALL+ ++G+ V+ R+ L+ +A+ A + ++ Sbjct: 6 YQNRSLHKQKGVAAIWMALLLVPIMGITFWAVEGTRYIQESSRLRDSAEAAALAVTIEDK 65 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 V + K E N + E D ++ T S + Sbjct: 66 PGAASVMAENYVRSYVRDIKSINVQAERREPG---NSRNEEAADFIQYTVNATTTHDSWF 122 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 +D ++ I ++ A S ++ + I +V DFS SM Sbjct: 123 ANSFIPSFD---------------ETQDIAGRSLARKYLSSVGGKN-IDIVFVSDFSGSM 166 Query: 190 L-DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 D+ + + ++ + Q+ + + +L P Sbjct: 167 NFDWMDPNGNKKIDDLKTAIRAISNKFICQDVRNEFVEGELKPVCHD 213 >gi|148699893|gb|EDL31840.1| procollagen, type VI, alpha 1, isoform CRA_b [Mus musculus] Length = 643 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 48/297 (16%), Positives = 100/297 (33%), Gaps = 43/297 (14%) Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + + + S R ++ + + + + S Sbjct: 1 LQQAETTSHSAFPSAHPDPGPHLTVTRRPARNMRLAHALLPLLLQACWVATQDIQGSKAI 60 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVR----- 296 P+D S S+ +K + D + S + IDN+ D R Sbjct: 61 AFQDCPVDLFF---VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLRDRYYRCDRNL 115 Query: 297 ---MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIIS 352 GA ++D V + R +K G T + A++ + ++ Sbjct: 116 VWNAGALHYSDEVEIIRGLTRMPSG--RDELKASVDAVKYFGKGTYTDCAIKKGLEELLI 173 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVN 408 ++L+ KY++++TDG E ++ G+ N+AK GI++ ++A + + Sbjct: 174 GG--------SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPD 225 Query: 409 KTQQEKARYFLSNCASPNSF---FEA------NSTHELNKIFRDRIGNEIFERVIRI 456 + LS A+ +++ F A E+ D I + I V ++ Sbjct: 226 HLEPR-----LSIIATDHTYRRNFTAADWGHSRDAEEVISQTIDTIVDMIKNNVEQV 277 >gi|114562801|ref|YP_750314.1| vault protein inter-alpha-trypsin subunit [Shewanella frigidimarina NCIMB 400] gi|114334094|gb|ABI71476.1| Vault protein inter-alpha-trypsin domain protein [Shewanella frigidimarina NCIMB 400] Length = 722 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 59/364 (16%), Positives = 117/364 (32%), Gaps = 52/364 (14%) Query: 56 AAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY-----LIRNFENNLKKNFTDRE 110 A + + +I + +A +F K++ ++ RN + N E Sbjct: 107 AVDSMQLRIGDKVIAGQIQPKKQALQTFENAKKQGKQASLLQQQRRNLFTSDVANLGPHE 166 Query: 111 VRDIVRDTAVEMNPRKSAYQV----VLSSRYDLLLN-----PLSLFLRSMGIKS------ 155 + ++ R + + ++ RY+ + PL S + Sbjct: 167 QLVVEISYQQKVEYRDGLFSLRFPLAITPRYNPQADRTTEQPLLAMPSSANTATSAKHVR 226 Query: 156 --------WLIQTKAEAETVSRSYH--KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 I E ++ YH K+ V + ++F+ + + N Sbjct: 227 PALDVKMQVNIDAGFELTSLDSLYHPIKQSNVGNHYSVNFAGKQIADRDFVLQWQANVGA 286 Query: 206 QPADRTV----KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-----EE 256 P T K++ + N + P V N ML P ++ S E Sbjct: 287 VPKAATFYQTGKTHLADNSDERSETAQRQPNPVDNNMYSLVMLMPPSVEVSEQHLIAREL 346 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI--SDPSFSW 314 V +S + + AL + ++ ID+ N FN V S S Sbjct: 347 ILVIDTSGSMSGQSITQAKQALQFALAGLRDIDSFN------IIEFNSDVTMLSATPLSA 400 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + F + G T + A+QTA ++ S + + + + E + ++ + Sbjct: 401 NSRNIGKA--NRFIQSLDADGGTEMRSALQTA---LVDSVQQDSDQTDAHSEMLRQVIFM 455 Query: 375 TDGE 378 TDG Sbjct: 456 TDGA 459 >gi|163858556|ref|YP_001632854.1| hypothetical protein Bpet4238 [Bordetella petrii DSM 12804] gi|163262284|emb|CAP44587.1| hypothetical protein Bpet4238 [Bordetella petrii] Length = 244 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 60/174 (34%), Gaps = 18/174 (10%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V DA+ ++ + +N + + F +V + + Sbjct: 35 KIRNVNDAVRDMLDTFSDTENGETEIHVAIITFGSQVALHQPLA--------SASDIHWQ 86 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-A 388 D + G T + A+Q A I + + + +VL++DG E+ + A Sbjct: 87 DLSAGGMTPLGTALQMAKAMIEDK------DVVPSRAYRPTVVLVSDGGPNDAWEKPLNA 140 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + +S + +A + + ++ + N F A + +L F+ Sbjct: 141 FISDGRSAKCDRLAMAIGADADEAVLGKFIE---GTSNRLFYAENAKQLRDFFK 191 >gi|332221823|ref|XP_003260064.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Nomascus leucogenys] Length = 921 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 25/116 (21%), Positives = 46/116 (39%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T+I ++ A+ I E+H + E +VLLTDGE+ + Sbjct: 376 PTYALGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VVLLTDGEDNTASS----- 421 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + F + ++A +SN + F+ ++ L F Sbjct: 422 CIDEVKQSGAIVH---FIALGSAADEAVIEMSNITGGSHFYASDEAQNNGLIDAFG 474 >gi|307718398|ref|YP_003873930.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] gi|306532123|gb|ADN01657.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] Length = 458 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 65/208 (31%), Gaps = 37/208 (17%) Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + + + + + A+ + + D R+G FN Sbjct: 102 WDALDGTPTEDPDRMRITHAKRAIREFLPLLSGRD------RVGLAVFNRTYRVIQPIVG 155 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 ++ + + + AY + S E+ + ++ +V+L Sbjct: 156 DPSLVLEKLDAIE------------RPSREQAYTELYRSMEEAL-TDFGEEGRRRVLVVL 202 Query: 375 TDGEN-----TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NS 427 +DGEN ++ + A GI I F + + + AS Sbjct: 203 SDGENFPVDPSESPSTPGTAIDLAHRYGITCYVIHFG------TEKDRLIGDLASETGGR 256 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIR 455 F+A + EL ++ I E+V++ Sbjct: 257 VFDARNALELASVYT-----AIQEQVLQ 279 >gi|45384390|ref|NP_990268.1| cochlin precursor [Gallus gallus] gi|7387581|sp|O42163|COCH_CHICK RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2293562|gb|AAC62253.1| Coch-5B2 [Gallus gallus] Length = 547 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 17/101 (16%), Positives = 36/101 (35%), Gaps = 11/101 (10%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 FS+ + ++ G TA DA+ + +D ++ Sbjct: 410 YDQRTEFSFTDYTTKEKVLSAIRNIRYMSGGTATGDAISFTTRNVFGPVKDGANKN---- 465 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 ++V+LTDG++ D A+ GI + ++ + Sbjct: 466 ----FLVILTDGQSYDDVRGP---AVAAQKAGITVFSVGVA 499 >gi|326678379|ref|XP_002666248.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Danio rerio] Length = 1089 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 82/250 (32%), Gaps = 25/250 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ + E + +++ S G +P + Y Sbjct: 184 LNWTQALERVFIENSRDDPSLLWQAFGSATGVTRYYPAAPWRAPDKIDLYDVRRRPWYIQ 243 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ D+ + R FN++ Sbjct: 244 GASSPKDMVILVDVSGSVSGLT-LKLIKASVTEMLDTLSD-DDYVNVAR-----FNEKAE 296 Query: 308 SDPSFSWG----VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + I K G+T A++ ++++N + Sbjct: 297 AVVP-CFDHLVQANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNTNVPRANCN-- 353 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I+L TDG +D + I ++ +R+ T + + ++ C+ Sbjct: 354 -----KIIMLFTDG--GEDRAQDIFEQYNWPNKTVRVFTFSVGQHNYDVTPLQWI--ACS 404 Query: 424 SPNSFFEANS 433 + +FE S Sbjct: 405 NKGYYFEIRS 414 >gi|257052678|ref|YP_003130511.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691441|gb|ACV11778.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 1100 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 59/393 (15%), Positives = 113/393 (28%), Gaps = 45/393 (11%) Query: 50 EHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 E L+ A A +T S EEVS ++ + + + E+ + + T Sbjct: 327 ETELELATTEATVTVVARDESSGEEVSKTITLDAPGFADEVYDIELTDPESGAEISVTGE 386 Query: 110 E-VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 V+ V +S Y P +S ++ + + Sbjct: 387 GIVKSDFVVDPVPAEENRSFY-----------AGPFIHIRNFSDFESATVEMPLDDDV-- 433 Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 + +S+ + + ++ A TV S+S + Sbjct: 434 --DPSDGNLSVY----KWDQHDEKPWHAVETDVHVENGTAVATVDSFSYFSVFWVDNWND 487 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 V+ + Y+ + FV S + + + + + + Sbjct: 488 AITDTVNLAEHPEYVANETEGSIEPIDLAFVIDESGSMGGARIQDAKASAKRFVGGLYED 547 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D R F S + + +I + A +Q A D Sbjct: 548 D------RAALVSFAGGATLGQSLTTDHGAVNASIDQLNAGGGTNT-----GAGLQKAVD 596 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + S+ E + I+LL DG T + + I A I I TI Sbjct: 597 ELTSNGEGDTQE----------IILLADG-GTGLGPDPVTIAQTADEHRITINTIGMGTG 645 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 QE A+ F++ + + EL ++F Sbjct: 646 IDAQELTSI---ADATGGEFYQVSDSSELPEVF 675 >gi|197098872|ref|NP_001126081.1| complement C2 [Pongo abelii] gi|55730287|emb|CAH91866.1| hypothetical protein [Pongo abelii] Length = 752 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDITEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIRELLNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|207079949|ref|NP_001128930.1| DKFZP469A1324 protein [Pongo abelii] gi|55726315|emb|CAH89929.1| hypothetical protein [Pongo abelii] Length = 740 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDITEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|29828547|ref|NP_823181.1| hypothetical protein SAV_2005 [Streptomyces avermitilis MA-4680] gi|29605651|dbj|BAC69716.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 420 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 32/141 (22%), Positives = 52/141 (36%), Gaps = 16/141 (11%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 V L RT KT + G T I A+ A D + HR IVL+T Sbjct: 108 VSTLDRTEAKTAVATLSPTGWTPIGPALLKAADDL--DGGTGSHR----------IVLIT 155 Query: 376 DGENTQDNEEGIAICNK--AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 DGE+T + + + AK G+ I T+ V ++ K ++ A+ ++ Sbjct: 156 DGEDTCAPLDPCEVAREIAAKGVGLTIDTLGL-VPNSKLSKQLSCIAE-ATGGTYTSVEH 213 Query: 434 THELNKIFRDRIGNEIFERVI 454 +L + + V Sbjct: 214 KEDLTDKVNQLVDRAADKVVT 234 >gi|326922323|ref|XP_003207399.1| PREDICTED: collagen alpha-2(VI) chain-like [Meleagris gallopavo] Length = 1022 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 85/246 (34%), Gaps = 23/246 (9%) Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS----EEHFVD 260 G + + + + G G P + C+ Y G D + FV Sbjct: 561 GPRGEPGSRGPTGEAGPEGTPGPPGDPGLTDCDVMTYVRETCGCCDCEKRCGALDIMFVI 620 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDT-VRMGATFFNDR-VISDPSFSWGV 316 SS L ++ + +V+ + I D ++T R+G ++ Sbjct: 621 DSSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHEGTFEAIKLDDER 680 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + E G T A+Q AY+ +I + E A+ + V++TD Sbjct: 681 INSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREK--------AQVFAVVITD 732 Query: 377 G--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-NCASPNSFFEANS 433 G + D++ A+C + + + TI Q E++ +S C P + Sbjct: 733 GRYDPRDDDKNLGALC----GRDVLVNTIGIGDIFDQPEQSETLVSIACNEPQRVQKMRL 788 Query: 434 THELNK 439 +L Sbjct: 789 FSDLVA 794 >gi|281337632|gb|EFB13216.1| hypothetical protein PANDA_007041 [Ailuropoda melanoleuca] Length = 524 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 66/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 330 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 385 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + D Sbjct: 386 T--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDS----- 438 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D A A GI I ++ + + + S Sbjct: 439 ---PNKNFLVIVTDGQSYDDVRGPAA---AAHDAGITIFSVGVAWAPL--DDLKDMASKP 490 Query: 423 ASPNSFF 429 ++FF Sbjct: 491 KESHAFF 497 >gi|224078385|ref|XP_002194338.1| PREDICTED: collagen, type XX, alpha 1 [Taeniopygia guttata] Length = 1505 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 42/281 (14%), Positives = 94/281 (33%), Gaps = 27/281 (9%) Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 S Y + + P++ + +++TK TV + +V++ S+ L +R Sbjct: 136 SGYKVRVKPMAGDSE----QEVMLKTKTPKATVGGLSPTKEYTLQVYVLNGSQEALFAKR 191 Query: 195 DSEGQPLNC----FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 P++ G + ++ + L P Sbjct: 192 KFGITPVDGRKIHQGHYPKSLALAQGVCKSQIAELRRATGRFSSLQLLLSVSNLSPAGSQ 251 Query: 251 PSLSEEHFVD-----SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 +D S L+++ L+++I + D +R+G + ++ Sbjct: 252 FQCDTPAMIDLVLLVDGSWSIGRNNFKLIKEFLSNLISPFSIAE---DKIRVGLSQYSSD 308 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 ++ S + +++ + G+T A+ + + D R+ Sbjct: 309 PRTEWELS--AYSTREQVLEAVRNLRYKGGNTFTGLALTHVLEQ--NLKPDAGARL---- 360 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 EA+K ++LLTDG + ++ K+ GI I I Sbjct: 361 EAEKLVILLTDG---KSQDDANLAAQTLKNLGIEIFAIGVK 398 >gi|198418440|ref|XP_002122148.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1270 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 79/250 (31%), Gaps = 21/250 (8%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M + + C Q + SQ S +++ S + P Sbjct: 115 MAMTMTSARDSVVMCSPQYMFSCPNAAVSQRQIGSCVKYDTSSTLITKLDSPCPTVCPQG 174 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + +D S L + + ++++S + ++ V +G F++ + Sbjct: 175 SSLIVDIIILLD-GSTSVFPSNFELGKSWIKNLLQSFSSDIDKHNVV-VGLYSFSNIIKR 232 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + S + T+ G T I+ A+ A + Sbjct: 233 EIPLS---ARTYSTLSGMIDAVRYPYGQTFIHTAINEAVQEYQRAGRAS---------VP 280 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNS 427 K ++++TDGE T + N A++ GI + + + + E L+ A + Sbjct: 281 KLLIVITDGEATVPS-AVAPSANAARAAGIILTAVGIGSSVNENE-----LTTIAGAAER 334 Query: 428 FFEANSTHEL 437 F + L Sbjct: 335 VFRVSDFSSL 344 >gi|332809378|ref|XP_003308230.1| PREDICTED: LOW QUALITY PROTEIN: epithelial chloride channel protein-like [Pan troglodytes] Length = 901 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 52/145 (35%), Gaps = 19/145 (13%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ + T + + G T+I ++ + I SN+ Sbjct: 346 GLVTFDSFAKIQSKLI-KIIDDNTYQKITANLPQEADGGTSICRGLKAGFQAISQSNQS- 403 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 I+LLTDGE+ Q I++C + K G I TIA + ++ Sbjct: 404 --------TFGSEIILLTDGEDYQ-----ISLCFGEVKQSGTVIHTIALGPSADEE---L 447 Query: 417 YFLSNCASPNSFFEANSTHELNKIF 441 LSN + F+ + + L F Sbjct: 448 ETLSNMTGGHRFYAHKNINGLIDAF 472 >gi|39937341|ref|NP_949617.1| hypothetical protein RPA4281 [Rhodopseudomonas palustris CGA009] gi|192293121|ref|YP_001993726.1| hypothetical protein Rpal_4760 [Rhodopseudomonas palustris TIE-1] gi|39651199|emb|CAE29722.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286870|gb|ACF03251.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 390 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 10/54 (18%), Positives = 25/54 (46%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + + ++ + + +I A+ + +LG G VD S ++ A +A++ Sbjct: 2 TPRFLRDRSANVAVIFAIALIPLLGAVGSAVDYTIASNQRMKMQTALDSAVLAG 55 >gi|255557532|ref|XP_002519796.1| protein binding protein, putative [Ricinus communis] gi|223541035|gb|EEF42592.1| protein binding protein, putative [Ricinus communis] Length = 477 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 54/177 (30%), Gaps = 22/177 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ +I+ + ID R+ F+ + + Sbjct: 78 KIEKLKAAILFMIKKLSSID------RLSIVTFSRDARRLCPLRQITENSQKDLENLI-N 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G+ I +QT + R A I+L++ E ++ Sbjct: 131 GLHAYGAANITAGLQTGLKVLND------RRFTGGRVAT--IMLVSSSEQNNGDD----- 177 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 ++ + + T F + + N + +F + + LNK F + Sbjct: 178 ADQILVGNVPVHTFGFGA-YHEPGVLKAIAHN-SIGGTFSDVQNMDNLNKAFSQCLA 232 >gi|62881|emb|CAA39981.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 918 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 84/246 (34%), Gaps = 23/246 (9%) Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS----EEHFVD 260 G + + + G G P + C+ Y G D + FV Sbjct: 561 GPRGEPGTRGPPGEAGPEGTPGPPGDPGLTDCDVMTYVRETCGCCDCEKRCGALDIMFVI 620 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDT-VRMGATFFNDR-VISDPSFSWGV 316 SS L ++ + +V+ + I D ++T R+G ++ Sbjct: 621 DSSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHEGTFEAIKLDDER 680 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + E G T A+Q AY+ +I + E A+ + V++TD Sbjct: 681 INSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREK--------AQVFAVVITD 732 Query: 377 G--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-NCASPNSFFEANS 433 G + D++ A+C + + + TI Q E++ +S C P + Sbjct: 733 GRYDPRDDDKNLGALC----GRDVLVNTIGIGDIFDQPEQSETLVSIACNEPQRVQKMRL 788 Query: 434 THELNK 439 +L Sbjct: 789 FSDLVA 794 >gi|218662717|ref|ZP_03518647.1| hypothetical protein RetlI_26604 [Rhizobium etli IE4771] Length = 295 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 67/210 (31%), Gaps = 42/210 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD-----PSFSWGVHKLIRTIV 324 + +VR A ++ + K +D RMG F + + + K+ Sbjct: 77 RIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTD 136 Query: 325 KTFAIDE-----NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-- 377 + N T + AM I D+ +N+ A+K + ++DG Sbjct: 137 AVDLMTIPYQNYNNDQITNFDSAMTQMNTII-----DQAGDGTSNISAEKILFFVSDGVG 191 Query: 378 ENTQDNEEGIAI-------------CNKAKSQGIRI---MTIAFSVNKTQ---------Q 412 ++ + + C K +G++I T + Q Sbjct: 192 DSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQ 251 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + CASP +FE + T + + Sbjct: 252 GEIPTKMQACASPGFYFEVSPTEGITDAMK 281 >gi|261822923|ref|YP_003261029.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261606936|gb|ACX89422.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 212 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 59/172 (34%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + ++I ++++ +T + F+ + ++ + D Sbjct: 21 MEAVKNGVQTLITTLRQDPYALETAYVSVITFDSSARQVVPLT--------DLINFKSPD 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+TA+ +A+ I + K + +I +TDG T D +G+A Sbjct: 73 LVANGTTALGEALSLVAQAIEREVQKTTAETKGDWRPLVFI--MTDGAPTDDWRKGVAEF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + Q L + ++ + F+ Sbjct: 131 TSART-GVVV------ACAAGQVAETKVLQEITEIVLQLDTADSNAIKAFFK 175 >gi|45384382|ref|NP_990679.1| collagen alpha-2(VI) chain precursor [Gallus gallus] gi|115352|sp|P15988|CO6A2_CHICK RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|62877|emb|CAA33144.1| type VI collagen alpha-2 subunit preprotein [Gallus gallus] gi|62882|emb|CAA39982.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 1022 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 84/246 (34%), Gaps = 23/246 (9%) Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS----EEHFVD 260 G + + + G G P + C+ Y G D + FV Sbjct: 561 GPRGEPGTRGPPGEAGPEGTPGPPGDPGLTDCDVMTYVRETCGCCDCEKRCGALDIMFVI 620 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDT-VRMGATFFNDR-VISDPSFSWGV 316 SS L ++ + +V+ + I D ++T R+G ++ Sbjct: 621 DSSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHEGTFEAIKLDDER 680 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + E G T A+Q AY+ +I + E A+ + V++TD Sbjct: 681 INSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREK--------AQVFAVVITD 732 Query: 377 G--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-NCASPNSFFEANS 433 G + D++ A+C + + + TI Q E++ +S C P + Sbjct: 733 GRYDPRDDDKNLGALC----GRDVLVNTIGIGDIFDQPEQSETLVSIACNEPQRVQKMRL 788 Query: 434 THELNK 439 +L Sbjct: 789 FSDLVA 794 >gi|332817190|ref|XP_003309914.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Pan troglodytes] Length = 1241 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 88/248 (35%), Gaps = 28/248 (11%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD---EKLSPYMVSCNKSLYYMLYP 246 L++ E + Q + + S G + LSP+ + + L Sbjct: 273 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPGRYQPLSPWALPXQRHLLGRYIQ 332 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 G P S S+ + K L++ ++ ++ ++ +D V + FN++ Sbjct: 333 GASSPKDMVIIVDVSGSVSGLTLK--LMKTSVCEMLDTLS----DDDYVNVA--SFNEKA 384 Query: 307 ISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 F+ V +R + K G+T + A+D + +SN + Sbjct: 385 QPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN--- 441 Query: 365 LEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 442 ----KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--AC 491 Query: 423 ASPNSFFE 430 A+ +FE Sbjct: 492 ANKGYYFE 499 >gi|322378392|ref|ZP_08052846.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] gi|322380073|ref|ZP_08054329.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321147480|gb|EFX42124.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321149148|gb|EFX43594.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] Length = 236 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 64/195 (32%), Gaps = 22/195 (11%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKL 319 S+++ + + D + ++I +K+ + ++ F V S Sbjct: 28 STNMNGGQTRIGCLNDCVQTMIDLLKEEAKRENVSKLAVITFGAGGVKLQTPLS------ 81 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I G+T + A++ D I + + Y+V+++DGE Sbjct: 82 --KIESIQFSPLGTGGNTPLGMALELTRDYIQNKD------TFPGKFYTPYVVMVSDGEP 133 Query: 380 TQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 D + + + +S ++ + + F SPN + AN Sbjct: 134 NDDWQGPLHDFIHNKENRSSKSVRYSVFIGNEGEEPQAVHDFS---GSPNQVYYANDVQS 190 Query: 437 LNKIFRDRIGNEIFE 451 L F+ I + + Sbjct: 191 LINCFK-AITASVTQ 204 >gi|211616|gb|AAA48705.1| type VI collagen, alpha-2 subunit [Gallus gallus] Length = 720 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 84/246 (34%), Gaps = 23/246 (9%) Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS----EEHFVD 260 G + + + G G P + C+ Y G D + FV Sbjct: 485 GPRGEPGTRGPPGEAGPEGTPGPPGDPGLTDCDVMTYVRETCGCCDCEKRCGALDIMFVI 544 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDT-VRMGATFFNDR-VISDPSFSWGV 316 SS L ++ + +V+ + I D ++T R+G ++ Sbjct: 545 DSSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHEGTFEAIKLDDER 604 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + E G T A+Q AY+ +I + E A+ + V++TD Sbjct: 605 INSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREK--------AQVFAVVITD 656 Query: 377 G--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-NCASPNSFFEANS 433 G + D++ A+C + + + TI Q E++ +S C P + Sbjct: 657 GRYDPRDDDKNLGALC----GRDVLVNTIGIGDIFDQPEQSETLVSIACNEPQRVQKMRL 712 Query: 434 THELNK 439 +L Sbjct: 713 FSDLVA 718 >gi|326920703|ref|XP_003206608.1| PREDICTED: cochlin-like, partial [Meleagris gallopavo] Length = 760 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 17/101 (16%), Positives = 36/101 (35%), Gaps = 11/101 (10%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 FS+ + ++ G TA DA+ + +D ++ Sbjct: 623 YDQRTEFSFTDYTTKEKVLSAIRNIRYMSGGTATGDAISFTTRNVFGPVKDGANKN---- 678 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 ++V+LTDG++ D A+ GI + ++ + Sbjct: 679 ----FLVILTDGQSYDDVRGP---AVAAQKAGITVFSVGVA 712 >gi|326505132|dbj|BAK02953.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 521 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 62/193 (32%), Gaps = 31/193 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ VI + +D R+ F+ V + + +K Sbjct: 74 KLQSMKRAMQFVIMKLTPVD------RLSVVSFSSSATRHCPL-RSVTQAAQAELKGIVD 126 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I + TA I + + L++DG+ T + + Sbjct: 127 GLVANGGTNIKAGLDTALAVIAGRATTKARTPN--------VFLMSDGQQTDGDARQVDP 178 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFE-ANSTHELNKIFRDRIG 446 N A + T F ++ LS+ A SP F ++ F +G Sbjct: 179 GNVA------VYTFGF-----GKDADHALLSDVAKKSPGGTFNSVPDGGNVSAPFSQLLG 227 Query: 447 N--EIFERVIRIT 457 I + +++T Sbjct: 228 GLLSIVAQDVQLT 240 >gi|26343093|dbj|BAC35203.1| unnamed protein product [Mus musculus] Length = 266 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 89/247 (36%), Gaps = 43/247 (17%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 S P+D S S+ +K + D + S + IDN+ Sbjct: 19 TQDIQGSKAIAFQDCPVDLFF---VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLR 73 Query: 293 D-TVR--------MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDA 342 D R GA ++D V + R +K G T + A Sbjct: 74 DRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSG--RDELKASVDAVKYFGKGTYTDCA 131 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGI 398 ++ + ++ ++L+ KY++++TDG E ++ G+ N+AK GI Sbjct: 132 IKKGLEELLIGG--------SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGI 183 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEA------NSTHELNKIFRDRIGNEI 449 ++ ++A + + + LS A+ +++ F A E+ D I + I Sbjct: 184 KVFSVAITPDHLEPR-----LSIIATDHTYRRNFTAADWGHSRDAEEVISQTIDTIVDMI 238 Query: 450 FERVIRI 456 V ++ Sbjct: 239 KNNVEQV 245 >gi|6753484|ref|NP_034063.1| collagen alpha-1(VI) chain precursor [Mus musculus] gi|543913|sp|Q04857|CO6A1_MOUSE RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|50479|emb|CAA47032.1| collagen alpha1 type VI-precursor [Mus musculus] gi|148699892|gb|EDL31839.1| procollagen, type VI, alpha 1, isoform CRA_a [Mus musculus] gi|162318378|gb|AAI56501.1| Collagen, type VI, alpha 1 [synthetic construct] gi|225000678|gb|AAI72708.1| Collagen, type VI, alpha 1 [synthetic construct] Length = 1025 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 89/247 (36%), Gaps = 43/247 (17%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 S P+D S S+ +K + D + S + IDN+ Sbjct: 19 TQDIQGSKAIAFQDCPVDLFF---VLDTSESVALRLKPYGALVDKVKSFTKRF--IDNLR 73 Query: 293 D-TVR--------MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDA 342 D R GA ++D V + R +K G T + A Sbjct: 74 DRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSG--RDELKASVDAVKYFGKGTYTDCA 131 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGI 398 ++ + ++ ++L+ KY++++TDG E ++ G+ N+AK GI Sbjct: 132 IKKGLEELLIGG--------SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGI 183 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEA------NSTHELNKIFRDRIGNEI 449 ++ ++A + + + LS A+ +++ F A E+ D I + I Sbjct: 184 KVFSVAITPDHLEPR-----LSIIATDHTYRRNFTAADWGHSRDAEEVISQTIDTIVDMI 238 Query: 450 FERVIRI 456 V ++ Sbjct: 239 KNNVEQV 245 >gi|325286052|ref|YP_004261842.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321506|gb|ADY28971.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 348 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 21/117 (17%), Positives = 36/117 (30%), Gaps = 13/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + + + G TAIN A+ A + Sbjct: 130 RIGIIAYAGQAYPQLPITTDYGAAKMFLQGLNTNMLSSQG-TAINQALDLASTYYDDDEQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +++DGE+ KA QGI+I TI K Sbjct: 189 TN-----------RVLFIISDGED-HSEGSTEGAVEKAVDQGIKIFTIGVGTEKGAP 233 >gi|145527514|ref|XP_001449557.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124417145|emb|CAK82160.1| unnamed protein product [Paramecium tetraurelia] Length = 606 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 62/184 (33%), Gaps = 24/184 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ +L ++ + + D R+ F+ + K Sbjct: 202 KIESVQQSLVQLLNFLSEKD------RLCLIVFDGGAKRHTPLKTLTEGNKKYF-KGAIA 254 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + GST I A+ I + +MKN + + I LL+DG+++ E Sbjct: 255 AISAGGSTNIAAGTDIAFQQI------QQRKMKNQVTS---IFLLSDGQDSGAAERIQKQ 305 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGN 447 ++ S + I + + + +S SF+ + L++ F D +G Sbjct: 306 KDRI-SDVVTIHSFGY-----GNDHDADLMSKICKVGQGSFYYIENVKLLDEFFADALGR 359 Query: 448 EIFE 451 Sbjct: 360 LSSA 363 >gi|120601101|ref|YP_965501.1| hypothetical protein Dvul_0050 [Desulfovibrio vulgaris DP4] gi|120561330|gb|ABM27074.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 389 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 35/81 (43%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+L+ G+ +I AL +LG+G M VD+ +++AA A + + LI S Sbjct: 5 KRLLNEERGNVALIVALSSFALLGLGTMAVDLGVVYTKRSQMQKAADIAALAGAQALINS 64 Query: 72 LEEVSSRAKNSFTFPKQKIEE 92 + T + + + Sbjct: 65 SGNTDMARTQAITTARANLAQ 85 >gi|326927692|ref|XP_003210025.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Meleagris gallopavo] Length = 1108 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 79/250 (31%), Gaps = 32/250 (12%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ + E + + + + S G +P + Y Sbjct: 192 LNWTQALEDVFIENRKEDPSLLWQVFGSATGVTRYYPATPWRAPNKIDLYDVRRRPWYIQ 251 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN Sbjct: 252 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVHEMLDTLS----DDDYVNVA--SFN---E 301 Query: 308 SDPSFSWGVHKLI-----RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 S H + + + K G+T + A+D + +SN + Sbjct: 302 KAKPVSCFKHLVQANIRNKKVFKEDVQGMVAKGTTDYKAGFEYAFDQLQNSNITRANCN- 360 Query: 363 NNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + QD E NK +R+ T + + ++ Sbjct: 361 ------KMIMMFTDGGEDRVQDVFEKYNWPNKT----VRVFTFSVGQHNYDVTPLQWM-- 408 Query: 421 NCASPNSFFE 430 CA+ +FE Sbjct: 409 ACANKGYYFE 418 >gi|291297006|ref|YP_003508404.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290471965|gb|ADD29384.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 313 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 65/188 (34%), Gaps = 32/188 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + +RS+ D +++G F + + ++I I Sbjct: 105 TRFEAAKQEAKNFVRSLP------DGIKVGLVSFAGYATLEAEPTTDHQRVIDQI----- 153 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI D + +++ + +DE + +VLL+DG T + + Sbjct: 154 ELLQMARRTAIGDGL---LESLRAIPKDENGKPLGPST----VVLLSDGR-TNSGVDPME 205 Query: 389 ICNKAKSQGIRIMTIAFS--VNKTQQEK---------ARYFLSNCA--SPNSFFEANSTH 435 + A+ G+ + TI N ++ L A + ++ A S Sbjct: 206 VAPFARDMGVVVHTIGLGRRSNPGDPDQYWGGYWMQFDEETLRAIAEATGGQYYAAGSAE 265 Query: 436 ELNKIFRD 443 L + +R+ Sbjct: 266 ALRQAYRN 273 >gi|224065787|ref|XP_002190547.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Taeniopygia guttata] Length = 1068 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 79/250 (31%), Gaps = 32/250 (12%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ + E + + + + S G +P + Y Sbjct: 155 LNWTQALEDVFIENRKEDPSLLWQVFGSATGVTRYYPATPWRAPNKIDLYDVRRRPWYIQ 214 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN Sbjct: 215 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVYEMLDTLS----DDDYVNVA--SFN---E 264 Query: 308 SDPSFSWGVHKLI-----RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 S H + + + K G+T + A+D + +SN + Sbjct: 265 KAKPVSCFKHLVQANIRNKKVFKEDVQGMVAKGTTDYKAGFEYAFDQLQNSNITRANCN- 323 Query: 363 NNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + QD E NK +R+ T + + ++ Sbjct: 324 ------KMIMMFTDGGEDRVQDVFEKYKWPNKT----VRVFTFSVGQHNYDVTPLQWM-- 371 Query: 421 NCASPNSFFE 430 CA+ +FE Sbjct: 372 ACANKGYYFE 381 >gi|46581745|ref|YP_012553.1| hypothetical protein DVU3344 [Desulfovibrio vulgaris str. Hildenborough] gi|46451168|gb|AAS97813.1| hypothetical protein DVU_3344 [Desulfovibrio vulgaris str. Hildenborough] gi|311235373|gb|ADP88227.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 389 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 35/81 (43%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+L+ G+ +I AL +LG+G M VD+ +++AA A + + LI S Sbjct: 5 KRLLNEERGNVALIVALSSFALLGLGTMAVDLGVVYTKRSQMQKAADIAALAGAQALINS 64 Query: 72 LEEVSSRAKNSFTFPKQKIEE 92 + T + + + Sbjct: 65 SGNTDMARTQAITTARANLAQ 85 >gi|319780570|ref|YP_004140046.1| hypothetical protein Mesci_0829 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166458|gb|ADV09996.1| hypothetical protein Mesci_0829 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 359 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 34/301 (11%), Positives = 91/301 (30%), Gaps = 7/301 (2%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I+S +G F I L M ++ G VD + A AI++ + S + Sbjct: 8 FIESRSGSFAPILILAMIPLITAIGFSVDYTSAVQTRSTEQAALDAAILSITTMDTASTK 67 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 A + + +F+ + T + T R + V + Sbjct: 68 PQRQVAMQASYMANGGQGTATLNSFDVSANGTATAQASASFAMPTVFMQIARIPSVAVGV 127 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S + + + G+ + + T+ + + ++I + + D + Sbjct: 128 TSAVNKAPALVEATFKVTGVSGYWNKKMTLYGTMFGAAAGKPLMTIDYAYGKTG---DPK 184 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 A +TV + + + + + ++ + + P+ Sbjct: 185 GYGTTTVSVLTTDSAGKTVTTVAQKQVCKLVDSSTPAGAVIQTDGFQTKYYCVDTMYPAD 244 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN----VNDTVRMGATFFNDRVISD 309 ++ S + + + + + ++ + N + + GA +N+ Sbjct: 245 GAGAQINVSQMAGLYLQMDVPSGNPSKLMSNDPTTSNRLYSGLKSDQTGAIDYNEMATGQ 304 Query: 310 P 310 Sbjct: 305 I 305 >gi|301770299|ref|XP_002920606.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Ailuropoda melanoleuca] Length = 1081 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 160 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 219 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 220 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 272 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 273 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 328 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 329 ---KVIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 379 Query: 424 SPNSFFE 430 + +FE Sbjct: 380 NKGYYFE 386 >gi|281338317|gb|EFB13901.1| hypothetical protein PANDA_009310 [Ailuropoda melanoleuca] Length = 1046 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 158 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 217 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 218 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 270 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 271 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 326 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 327 ---KVIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 377 Query: 424 SPNSFFE 430 + +FE Sbjct: 378 NKGYYFE 384 >gi|269125745|ref|YP_003299115.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310703|gb|ACY97077.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 228 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 51/164 (31%), Gaps = 13/164 (7%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + V S + D L ++ I V D R+ F+D + Sbjct: 9 YLVCDESYSMAGNPLQEINDQLPQIVTEIASNPTVADKARLCIISFSDTAEVLLPLA--- 65 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK--NNLEAKKYIVLL 374 + + + A A+ + + E ++ +K ++ + + L Sbjct: 66 -----DLNDVHQVPQLAPKGATSYGA---AFTLLRDTIERDIRDLKAAGHVPFRPTVFFL 117 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 TDG+ T + AK G R +AF + E R Sbjct: 118 TDGQPTDSDWATAHQRLTAKDFGPRPTILAFGFGDVRPETLRAV 161 >gi|161529149|ref|YP_001582975.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] gi|160340450|gb|ABX13537.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] Length = 316 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 64/194 (32%), Gaps = 35/194 (18%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + + ++A+ ++ I K+ ++ G F + + K + Sbjct: 101 AATDYEPTRLDAAKNAINNL---ILKMGPQHNV---GVVLFESGATTVSYLTPDKEKSVN 154 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I E +G+TAI D + D S + K ++LL+DG + Sbjct: 155 AISSI----EQGLGATAIGDGLALGVDMASSIPDK-----------KGVVILLSDGVHNS 199 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCA--SPNS 427 AK ++I TI E L A + + Sbjct: 200 GLVTPEEATEYAKINNVQIHTIGLGSIEPVFLRDDIYGEPQYAELDEETLVIIAQQTSGN 259 Query: 428 FFEANSTHELNKIF 441 ++++ LN+IF Sbjct: 260 YYKSLDEQTLNEIF 273 >gi|332308254|ref|YP_004436105.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332175583|gb|AEE24837.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 777 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 61/459 (13%), Positives = 141/459 (30%), Gaps = 84/459 (18%) Query: 51 HALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE 110 A + + + + R PKQ+ + Y E K + +++ Sbjct: 137 QDWVNAIYAFPLPENAAVDHLNMTIGERKITGEIMPKQQAKAYFETAKEQGRKASLIEQK 196 Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN----------PLSLFLRSMGIKSW--LI 158 ++ + + P ++ + ++ Y ++ P+++ R S LI Sbjct: 197 RPNLFTNAIANIGPNET---ISVTIEYQQVIRFDQQTFSVRFPMTITPRYSPRASQIPLI 253 Query: 159 QTKAEAETVSRSYHKEHG-----------------VSIQWVIDFSRSMLDYQRDSEGQPL 201 A A+ S + + +S++ FS S D + + Sbjct: 254 NQTAVAKVNSNGWGESVSVLSRQIKTPDEPANPIRISVELNSGFSISPEDIVSEHHPIDI 313 Query: 202 NCFGQPA---------DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML-------- 244 Q A + S K + D + + + Y + Sbjct: 314 TPKDQQAGHYQIELTKEHIANQDFSLRWKPTVGDIPSAAHFSETIGNYRYAMVMLTPPRQ 373 Query: 245 -----YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 S E F+ +S + + A+ + + D+VN Sbjct: 374 DDADDKSTKTPVSAREVVFLLDTSGSMAGESIVQAKRAVDFALTQLHPEDSVN------V 427 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 FND + + + + + + G T + A+ A + +N ++ + Sbjct: 428 IEFNDAPQALWNLAMPATANNIQRARNWVASLSANGGTEMAPALSMA---LHKTNLEQQN 484 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + + + +V +TDG + ++ I N+ R+ TI Sbjct: 485 INEGSPVQLRQVVFITDGSVSNEDALMSLIENQLADS--RLFTIGIG------------- 529 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGN--EIFERVIRI 456 ++PNS+F + F IG+ ++ +++ + Sbjct: 530 ---SAPNSYFMTQAAQAGRGTFT-YIGDINQVQQKMTEL 564 >gi|262183593|ref|ZP_06043014.1| hypothetical protein CaurA7_06346 [Corynebacterium aurimucosum ATCC 700975] Length = 604 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 60/180 (33%), Gaps = 31/180 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMG---------ATFFNDRVISDPSFSWGVHKL 319 + +DA + I + G A D + P + K+ Sbjct: 19 TRIDAAKDAARTFITEAGDDAPLGLVTYGGNTGEAPEDEAAGCQDITVVTPPEAGNSEKM 78 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I + G T I ++++ A + E ++ I+L++DG Sbjct: 79 IAHMD-----GLQPRGFTPIGESLRKAAAEL-------------PKEGQRSIILVSDGVA 120 Query: 380 TQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T + + K QGI I T+ F+V Q++ + A+ ++ A+ L Sbjct: 121 TCTPPPVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQCIAD--ATGGTYANASDADSL 178 >gi|163788218|ref|ZP_02182664.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] gi|159876538|gb|EDP70596.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] Length = 688 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 48/343 (13%), Positives = 106/343 (30%), Gaps = 38/343 (11%) Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KAEAETVSRSY 171 D RD + + + Y++ Y + + ++ I KA V R Sbjct: 203 DSARDGCIVITTKSGNYRIQYDESYAQITENDFERTNLSPLSTFSIDVDKASYSNVRRMI 262 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + ++ V M++Y + QP + + V N +R Sbjct: 263 NNGQVIASDAV--KIEEMINYFNYNYPQPTDDHPFSINTEVTDTPWHNKTQLVRIGLQGK 320 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + P+ + +D S K L++ A ++ +++ D V Sbjct: 321 SYADKDL------------PASNLTFLIDVSGSMSSHNKLPLLKSAFKLLVNQLREKDKV 368 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + V GA V+ +P+ K+I + + ++ AY + Sbjct: 369 SIVVYAGAAG----VVLEPTSGNNKEKIISALNNLQSGGSTAG-----GAGIKLAY-KLA 418 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVN 408 N + + ++L TDG+ + + + + + G+ + + F Sbjct: 419 EKNFKKKGNNR--------VILATDGDFNVGASSDNDMKTLIEEKRKSGVFLSVLGFGYG 470 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + K + ++ E K+F G +F Sbjct: 471 NYKDSKLETLADK--GNGNHAYIDNMQEAQKVFGKEFGGTLFT 511 >gi|328712314|ref|XP_001943110.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Acyrthosiphon pisum] Length = 884 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 31/133 (23%), Positives = 46/133 (34%), Gaps = 6/133 (4%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-- 382 K F D ST + DA+ A+ I E N K IV LTDGE T Sbjct: 452 KKFIQDLQSESSTNMEDALNKAHL-IAKLGETRFKDGAN--TPKPIIVFLTDGEPTTGIT 508 Query: 383 -NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +E I + + I ++ F + L+N +EA+ + F Sbjct: 509 EPQELIKYVSNTNEEKYPIYSLGFGEGADIDFLKKLSLNNTGFARVIYEASDASLQLRNF 568 Query: 442 RDRIGNEIFERVI 454 I + + V Sbjct: 569 YKEISSPVLSNVT 581 >gi|62087470|dbj|BAD92182.1| PREDICTED: integrin, alpha D variant [Homo sapiens] Length = 1177 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 44/111 (39%), Gaps = 7/111 (6%) Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + + H+ AKK ++++TDG+ +D E + +A+ GI I Sbjct: 243 TATGILTVVTQLFHHKNGARKSAKKILIVITDGQKYKDPLEYSDVIPQAEKAGIIRYAIG 302 Query: 405 FSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERV 453 + + ++P + F+ ++ L I ++ E++ Sbjct: 303 VGHAFQGPTARQELNTISSAPPQDHVFKVDNFAAL-----GSIQKQLQEKI 348 >gi|1167550|gb|AAB38547.1| leukointegrin alpha d chain [Homo sapiens] Length = 1162 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 44/111 (39%), Gaps = 7/111 (6%) Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + + H+ AKK ++++TDG+ +D E + +A+ GI I Sbjct: 227 TATGILTVVTQLFHHKNGARKSAKKILIVITDGQKYKDPLEYSDVIPQAEKAGIIRYAIG 286 Query: 405 FSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERV 453 + + ++P + F+ ++ L I ++ E++ Sbjct: 287 VGHAFQGPTARQELNTISSAPPQDHVFKVDNFAAL-----GSIQKQLQEKI 332 >gi|62548866|ref|NP_005344.2| integrin alpha-D precursor [Homo sapiens] gi|296434544|sp|Q13349|ITAD_HUMAN RecName: Full=Integrin alpha-D; AltName: Full=ADB2; AltName: Full=CD11 antigen-like family member D; AltName: Full=Leukointegrin alpha D; AltName: CD_antigen=CD11d; Flags: Precursor gi|162317970|gb|AAI56096.1| Integrin, alpha D [synthetic construct] gi|168275856|dbj|BAG10648.1| integrin alpha-D precursor [synthetic construct] Length = 1161 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 44/111 (39%), Gaps = 7/111 (6%) Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + + H+ AKK ++++TDG+ +D E + +A+ GI I Sbjct: 227 TATGILTVVTQLFHHKNGARKSAKKILIVITDGQKYKDPLEYSDVIPQAEKAGIIRYAIG 286 Query: 405 FSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERV 453 + + ++P + F+ ++ L I ++ E++ Sbjct: 287 VGHAFQGPTARQELNTISSAPPQDHVFKVDNFAAL-----GSIQKQLQEKI 332 >gi|186681467|ref|YP_001864663.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463919|gb|ACC79720.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 418 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 51/166 (30%), Gaps = 26/166 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ A ++ + D R+ F+ R +K Sbjct: 57 RSLETVKKAANRLVDRLNPSD------RLSVVVFDHRAKVLVP---SQSVEDPEKIKNQI 107 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-I 387 G TAI++ ++ + + +D V + LLTDGEN + + Sbjct: 108 NRLAADGGTAIDEGLRLGIEELAKGKKDTVSQAF----------LLTDGENEHGDNNRCL 157 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQ------QEKARYFLSNCASPNS 427 A S + + T+ F N Q + LS P Sbjct: 158 KFAQLAASYNLTLNTLGFGDNWNQDVLEKIADAGLGTLSYIQKPEE 203 >gi|315649635|ref|ZP_07902720.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275108|gb|EFU38483.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 595 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 56/152 (36%), Gaps = 20/152 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G + DR+ + + + +T +K F + T I+ + A + E Sbjct: 76 KVGVVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYTDISVGLDEAVKVLKQGME 135 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT----------QDNEEGIAICNKAKSQGIRIMTIAF 405 H IV+L DG N + ++ +AK GI I TI Sbjct: 136 -PAHAP--------MIVVLADGNNDLDPNTGKTSQEASDHLNQAVQEAKGSGIPIYTIGL 186 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + ++A L+N + F +S +L Sbjct: 187 NADGKLNKEALAELAN-QTGGKSFTTSSADDL 217 >gi|299138149|ref|ZP_07031329.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600079|gb|EFI56237.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 349 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 55/164 (33%), Gaps = 11/164 (6%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI--NDAMQTAYDT---IISS 353 F+ V ++ H+L R I K + T++ + + Sbjct: 142 LISFDVNVDLLSDYTNSAHELKRAIDKASINAASSSAGVPGIGGGPFPTSHPRGTLLYDA 201 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT--IAFSVNKTQ 411 H + +K +VLLTDG + E + A+ + IA N + Sbjct: 202 VYLAAHDKLQSQTGRKILVLLTDGGDQGSQETLKSATEAAQKANAILYVILIADRANFSY 261 Query: 412 QEKARYFLSNCA--SPNSFFEA-NSTHELNKIFRDRIGNEIFER 452 A + A + N+ +L + F D+I +E+ + Sbjct: 262 GFNADGQMEQLAHETGGRVINVGNNGKKLEEAF-DQIQDELRTQ 304 >gi|218442094|ref|YP_002380423.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218174822|gb|ACK73555.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 412 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 31/139 (22%), Positives = 52/139 (37%), Gaps = 20/139 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V+ A +++ + D R+ F+ R + G+ L I Sbjct: 56 KPLETVKQAAIELVKQLNVED------RLSIIAFDHRAKVLVP-NQGIDNLNTII--EQI 106 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGI 387 G TAI++ ++ + +D V + I LLTDGEN DNE + Sbjct: 107 NSLKPAGGTAIDEGLKLGIQESANGKKDRVSQ----------IFLLTDGENEHGDNERCL 156 Query: 388 AICNKAKSQGIRIMTIAFS 406 + + A I + T+ F Sbjct: 157 KLAHVASDYNITLNTLGFG 175 >gi|255011031|ref|ZP_05283157.1| putative outer membrane protein [Bacteroides fragilis 3_1_12] gi|313148836|ref|ZP_07811029.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313137603|gb|EFR54963.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 608 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 73/205 (35%), Gaps = 23/205 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + +D S + ++ LV+ +L ++ +++ D V V GA Sbjct: 240 PTENLPASNLIFLIDVSGSMYGPERLDLVKSSLKLLVNNLRDKDKVAIVVYSGAAG---- 295 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + ++ + GSTA + ++ AY I N + Sbjct: 296 ----EKLA-STPGSDKQKIREAIDELEAGGSTAGGEGIKLAY-KIARKNFITGGNNR--- 346 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ ++E + + + G+ + + + + + K + Sbjct: 347 -----IILCTDGDFNMGVSSDQELKKLIEQKRKSGVFLTVLGYGMGNYKDSKMQTLAEK- 400 Query: 423 ASPNSFFEANSTHELNKIFRDRIGN 447 + ++ E N++ + G Sbjct: 401 -GNGNHAYIDNLQEANRVLVNEFGA 424 >gi|319787647|ref|YP_004147122.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317466159|gb|ADV27891.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 585 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 58/198 (29%), Gaps = 23/198 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P + VD+S K L++ A A ++ ++ D R+ + Sbjct: 209 DKRELPPANLVLLVDTSGSMDDPAKLPLLKRAFAQLVPQLRAKD------RVSIVAYAGH 262 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + G + + GST + ++ AY + + H Sbjct: 263 AGLVLPPTPGNRHG---EILAALEGLHAAGSTNGGEGLRLAY-----AMARQGHVEGGVN 314 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ D + + + GI + T+ F A Sbjct: 315 R----ILLATDGDFNVGITDRNALLTLVADQRRSGIALSTLGFGSGNYNDAMAERLAD-- 368 Query: 423 ASPNSFFEANSTHELNKI 440 A ++ E + Sbjct: 369 AGNGQHLYIDTLDEARRA 386 >gi|300782091|ref|YP_003762382.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299791605|gb|ADJ41980.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 602 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 52/418 (12%), Positives = 123/418 (29%), Gaps = 56/418 (13%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK---NFTDREVRDIVRDTAV 120 ++ L ++ + S +E+ + F +N+++ ++ D + + Sbjct: 206 STAGLNATIGAYYAATGTSSDLTAAALEKPEAKQFVSNIEQAIVHYGDNTLTFLTNLQKA 265 Query: 121 EMNPRKSAYQ--VVLSSRYDLLLN-------PLSLFLRSMGIKSWLIQTKAEAETVSRS- 170 + +Y V + + N P + + + A+ S Sbjct: 266 DDRGAALSYISAVTVEESSLIGYNQGNPTNDPAKVGQHAPPKVPIVAIYPADGTLNSDHP 325 Query: 171 ---YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 + Q DF + + L ++ ++ + + Sbjct: 326 FVTLNWADPTRKQIAADFLGYLRGPETQQRFAALGFRSFDGKPGPQASTANGVQPDAKIS 385 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI----KKKHLVRDALASVIR 283 L P + L ++ V S V K L + A + Sbjct: 386 FLQPPSPTVLAKLLTTWTDLRKKANVLLVVDVSGSMGDEVKGTGKSKIDLAKQAAIDSLG 445 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPS-----FSWGVHKLIRTIVKTFAIDENEMGSTA 338 D ++G F + D + + + + T Sbjct: 446 QFVPRD------QVGLWQFATHLDGDKDYQELLPVQPLGSNGKETLASRLSGLTPQSGTG 499 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGI-----AICN 391 + D+ AY+ + K +L+ +V+LTDG N + + Sbjct: 500 LYDSSLAAYEYL-----------KAHLDPSAINAVVVLTDGRNEDPGGVDLDHLVPQLRP 548 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGN 447 + ++ +R+ TIA+ + + L A+ S ++++ +N++F I N Sbjct: 549 EGNAESVRLFTIAYGGDAD-----QNVLKQIAEATAGSEYDSSKPDSINQVFTSVISN 601 >gi|170748502|ref|YP_001754762.1| hypothetical protein Mrad2831_2084 [Methylobacterium radiotolerans JCM 2831] gi|170655024|gb|ACB24079.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 463 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 13/79 (16%), Positives = 29/79 (36%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + +G + L V+LG+ G +D R + L+ AA ++ A L ++ Sbjct: 7 RAVTDRSGSVALTFGLSAVVLLGLTGGGIDYARLAARRSQLQNAADAGVLAAGNYLKLAV 66 Query: 73 EEVSSRAKNSFTFPKQKIE 91 ++ + Sbjct: 67 ATSAAAKSIVVDTVHAQAA 85 >gi|124127041|gb|ABM92272.1| CD11b [Ovis canadensis] Length = 1152 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 73/207 (35%), Gaps = 20/207 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S ++ +++V+ +K + ++D Sbjct: 144 CPQQDSDIAFLIDGSGSIDPVDFDRMKKFVSTVMSRFQKSKTL-----FALMQYSD--DF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII-SSNEDEVHRMKNNLEA 367 F++ K + G T ++ + SSN H + Sbjct: 197 RTHFTFNDFKRNSDLELLVRPIGQLFGRTHTATGIRKVVRELFHSSNGARNHAL------ 250 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--- 424 K ++++TDGE D E + +A +GI I + +K+R L AS Sbjct: 251 -KIMIVITDGEKYLDPLEYRDVIPEADRKGIIRYVIGVG-DAFNSKKSRKELDTIASKPP 308 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFE 451 + F+ N+ L I ++++ +IF Sbjct: 309 ADHVFQVNNFEALKTI-QNQLQEKIFA 334 >gi|238020799|ref|ZP_04601225.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] gi|237867779|gb|EEP68785.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] Length = 554 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 65/204 (31%), Gaps = 23/204 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S K LV+ L + + ++ D V + Sbjct: 187 PPANLVFLVDVSGSMQAQDKLPLVKKTLRILTKRLRAEDKVT------LITYASNEKLVL 240 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + GSTA A+Q AY + +KN + Sbjct: 241 PP---TSGKDKDTILQAINQLEAGGSTAGEQALQMAYAQAQKA------YIKNGINR--- 288 Query: 371 IVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + + + + GI + T+ F ++ A + Sbjct: 289 ILLATDGDFNVGITDFNTLKDTVAEKRKAGISLTTLGFGTGNYNEQLMEQLAD--AGDGN 346 Query: 428 FFEANSTHELNKIFRDRIGNEIFE 451 + ++ E K+ + ++ + + Sbjct: 347 YSYIDNETEAKKVLQRQLSSTLAT 370 >gi|294997271|ref|NP_001171103.1| integrin alpha-D isoform 2 [Mus musculus] Length = 1169 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 63/187 (33%), Gaps = 16/187 (8%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++D + +++ + ++ ++ S L+ IV+ Sbjct: 176 MKDFVKALMGQLASTSTSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDAIVQL------- 228 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T +Q + S K K ++++TDG+ +D E + +A Sbjct: 229 QGLTYTASGIQKVVKELFHSKNGARKSAK------KILIVITDGQKFRDPLEYRHVIPEA 282 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYF--LSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + GI I + + + + S + F+ + L I + +I +IF Sbjct: 283 EKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSI-QRQIQEKIFA 341 Query: 452 RVIRITK 458 + + Sbjct: 342 IEVLYKQ 348 >gi|188026532|ref|ZP_02962486.2| hypothetical protein PROSTU_04608 [Providencia stuartii ATCC 25827] gi|188019325|gb|EDU57365.1| hypothetical protein PROSTU_04608 [Providencia stuartii ATCC 25827] Length = 197 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 57/172 (33%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++++ +T + FN + ++ D Sbjct: 6 IEAVKNGVQTLLSTLRQDPYALETAHVSIITFNSTAQQIVPLT--------DLINFSLPD 57 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+TA+ DA+ I + + K + +I +TDG T D + G+ Sbjct: 58 LQASGTTALGDALSVVAHCIENEVQRTTVESKGDWRPLVFI--MTDGAPTDDWKAGLNKF 115 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ I Q L + + + F+ Sbjct: 116 KAART-GLVI------ACAAGQSAQTKVLKEITEVVLQLDTADSTTIKSFFK 160 >gi|160892883|ref|ZP_02073672.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] gi|156865442|gb|EDO58873.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] Length = 596 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 68/216 (31%), Gaps = 30/216 (13%) Query: 215 YSSQNGKVGIRDEKLSPYMVSC-----NKSLYYMLYPGPLDPSLSEEH----FVDSSSLR 265 Y +G + + C K + L +D S + +D+S Sbjct: 168 YDYVSGPEAGKKFAVYTEYADCPWNKDTKLMMVGLNTAAIDMSEKKASNLVFLIDTSGSM 227 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + K L + A + ++ + D R+ + SD GV + Sbjct: 228 YEENKLPLAQKAFKMLAENLDEND------RISIVTY---AGSDTVVLNGVAGSEAYTIC 278 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QD 382 GST + + TAY+ E +K+ ++L TDG+ Sbjct: 279 EALDSLEASGSTNGSAGLITAYEI------AEQQFIKDGNNR---VILATDGDLNVGLTS 329 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + + + + K GI + + F + + K Sbjct: 330 ESDLVGLITEEKDSGIFLSVLGFGSDNLKDNKLEAL 365 >gi|151555227|gb|AAI48416.1| Integrin, alpha D [synthetic construct] Length = 1164 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 63/187 (33%), Gaps = 16/187 (8%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++D + +++ + ++ ++ S L+ IV+ Sbjct: 171 MKDFVKALMGQLASTSTSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDAIVQL------- 223 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T +Q + S K K ++++TDG+ +D E + +A Sbjct: 224 QGLTYTASGIQKVVKELFHSKNGARKSAK------KILIVITDGQKFRDPLEYRHVIPEA 277 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYF--LSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + GI I + + + + S + F+ + L I + +I +IF Sbjct: 278 EKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSI-QRQIQEKIFA 336 Query: 452 RVIRITK 458 + + Sbjct: 337 IEVLYKQ 343 >gi|119572524|gb|EAW52139.1| integrin, alpha D [Homo sapiens] Length = 366 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 44/111 (39%), Gaps = 7/111 (6%) Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + + H+ AKK ++++TDG+ +D E + +A+ GI I Sbjct: 227 TATGILTVVTQLFHHKNGARKSAKKILIVITDGQKYKDPLEYSDVIPQAEKAGIIRYAIG 286 Query: 405 FSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERV 453 + + ++P + F+ ++ L I ++ E++ Sbjct: 287 VGHAFQGPTARQELNTISSAPPQDHVFKVDNFAAL-----GSIQKQLQEKI 332 >gi|163848230|ref|YP_001636274.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526140|ref|YP_002570611.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669519|gb|ABY35885.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450019|gb|ACM54285.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 419 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 55/170 (32%), Gaps = 30/170 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +R A+ I + + D + F+ R RT + Sbjct: 60 KIERLRQAVVRAIEQLNEQDI------ISIVIFDHRTEVLVP---AQPVRQRTAILDLVH 110 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T I A++ + + + +VLLTDG+ T+ +E + Sbjct: 111 RIRDAGGTRIAPALEKGIQELQKMPQGV-----------RRLVLLTDGQ-TEHEKECLLR 158 Query: 390 CNKAKSQGIRIMTIAFSVNKTQ------QEKARYFLSNCASPN---SFFE 430 + A GI I + + + ++R A P +F+ Sbjct: 159 ADDAGRLGIPITALGIGKDWNEDLLIEMANRSRGVADYIAQPGEIVQYFQ 208 >gi|15929704|gb|AAH15276.1| Inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus] Length = 886 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 94/304 (30%), Gaps = 47/304 (15%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 S++ + K+ VS + +D RS C + Sbjct: 199 SFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------CPTCTDSLLNGDFTI 247 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 N + + ++ Y V + P L FV S +K Sbjct: 248 VYDVNRESPGNVQIVNGYFV-------HFFAPQGLPVVPKNIVFVIDVSGSMSGRKIQQT 300 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-----IVKTFAI 329 R+AL ++ +K+ D + F+ V +W H + T KTF Sbjct: 301 REALLKILDDVKEDD------YLNFILFSTDVT-----TWKDHLVQATPANLKEAKTFVK 349 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTDGENTQDNEEG 386 + ++ T IND + + + + ED + ++ I++LTDG+ Sbjct: 350 NIHDQSMTNINDGLLKGIEMLNKARED-------HTVPERSTSIIIMLTDGDANTGESRP 402 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 403 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 462 Query: 444 RIGN 447 + N Sbjct: 463 EVAN 466 >gi|188592039|ref|YP_001796637.1| hypothetical protein RALTA_B0200 [Cupriavidus taiwanensis LMG 19424] gi|170938413|emb|CAP63400.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424] Length = 562 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 9/56 (16%), Positives = 24/56 (42%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + ++ G ++ A+L+ + + +DV + L+ A ++A+ L Sbjct: 10 RTRRNTRGAVSVMAAVLIATVAIAALVSIDVGHVFMRQRQLQNMVDLAAMSAAQQL 65 >gi|226531069|ref|NP_001152771.1| inter alpha-trypsin inhibitor, heavy chain 4 isoform 2 [Mus musculus] Length = 941 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 264 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 317 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 318 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 376 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 377 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 427 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 428 KMALDNGGLARRIYEDSDSALQLQDFYHE 456 >gi|148692824|gb|EDL24771.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_d [Mus musculus] Length = 943 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 266 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 319 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 320 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 378 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 379 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 429 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 430 KMALDNGGLARRIYEDSDSALQLQDFYHE 458 >gi|148692823|gb|EDL24770.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_c [Mus musculus] Length = 927 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 266 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 319 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 320 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 378 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 379 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 429 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 430 KMALDNGGLARRIYEDSDSALQLQDFYHE 458 >gi|148692822|gb|EDL24769.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Mus musculus] Length = 904 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 266 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 319 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 320 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 378 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 379 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 429 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 430 KMALDNGGLARRIYEDSDSALQLQDFYHE 458 >gi|26340740|dbj|BAC34032.1| unnamed protein product [Mus musculus] Length = 941 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 264 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 317 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 318 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 376 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 377 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 427 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 428 KMALDNGGLARRIYEDSDSALQLQDFYHE 456 >gi|226531047|ref|NP_061216.2| inter alpha-trypsin inhibitor, heavy chain 4 isoform 1 [Mus musculus] gi|26340986|dbj|BAC34155.1| unnamed protein product [Mus musculus] gi|148692825|gb|EDL24772.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_e [Mus musculus] gi|151357764|emb|CAO78004.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 264 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 317 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 318 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 376 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 377 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 427 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 428 KMALDNGGLARRIYEDSDSALQLQDFYHE 456 >gi|16741341|gb|AAH16500.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] gi|62204734|gb|AAH92258.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 941 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 264 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 317 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 318 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 376 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 377 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 427 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 428 KMALDNGGLARRIYEDSDSALQLQDFYHE 456 >gi|2739028|gb|AAC25786.1| PK-120 precursor [Mus musculus] Length = 942 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 264 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 317 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 318 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 376 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 377 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 427 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 428 KMALDNGGLARRIYEDSDSALQLQDFYHE 456 >gi|63100270|gb|AAH94457.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 264 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 317 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 318 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 376 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 377 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 427 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 428 KMALDNGGLARRIYEDSDSALQLQDFYHE 456 >gi|159110717|ref|NP_032433.2| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Mus musculus] Length = 889 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 94/304 (30%), Gaps = 47/304 (15%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 S++ + K+ VS + +D RS C + Sbjct: 202 SFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------CPTCTDSLLNGDFTI 250 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 N + + ++ Y V + P L FV S +K Sbjct: 251 VYDVNRESPGNVQIVNGYFV-------HFFAPQGLPVVPKNIVFVIDVSGSMSGRKIQQT 303 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-----IVKTFAI 329 R+AL ++ +K+ D + F+ V +W H + T KTF Sbjct: 304 REALLKILDDVKEDD------YLNFILFSTDVT-----TWKDHLVQATPANLKEAKTFVK 352 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTDGENTQDNEEG 386 + ++ T IND + + + + ED + ++ I++LTDG+ Sbjct: 353 NIHDQSMTNINDGLLKGIEMLNKARED-------HTVPERSTSIIIMLTDGDANTGESRP 405 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 406 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 465 Query: 444 RIGN 447 + N Sbjct: 466 EVAN 469 >gi|126730249|ref|ZP_01746060.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] gi|126708982|gb|EBA08037.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] Length = 666 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 31/80 (38%), Gaps = 8/80 (10%) Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 Y T G N DN C A+ G+ + +AF ++ L CAS +S F Sbjct: 588 YTTFSTRG-NNLDNLHT--QCQLARDLGVTVFAVAFETTDADADE----LRLCASSDSHF 640 Query: 430 EANSTHELNKIFRDRIGNEI 449 E+ F D I +I Sbjct: 641 FHVQGTEIIDAF-DTIARQI 659 >gi|268561224|ref|XP_002646394.1| Hypothetical protein CBG15363 [Caenorhabditis briggsae] gi|187027190|emb|CAP33690.1| hypothetical protein CBG_15363 [Caenorhabditis briggsae AF16] Length = 400 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 51/145 (35%), Gaps = 10/145 (6%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQ-TAYDTII 351 T R+G +N + + I + + + +T A A + + Sbjct: 81 TTRVGLVTYNSVAKVNADL--NTFQSINDVYNGVFNYLSAVTDATDSYLATGLQAANALF 138 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +S R KK +++ + + + + ++ K G+ I+T+A+ ++ Sbjct: 139 ASQSFNSTRN----HYKKVVIVYASEYKSYGELDPVKVADEMKGSGVYIVTVAY--DQGG 192 Query: 412 QEKARYFLSNCASPNSFFEANSTHE 436 + L+ A+P F + Sbjct: 193 NGQLLKDLAGIATPGYSFSNTDDSD 217 >gi|268572467|ref|XP_002648969.1| Hypothetical protein CBG21291 [Caenorhabditis briggsae] Length = 427 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 62/172 (36%), Gaps = 12/172 (6%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +D +++ +K + ++ A F + F+ + +++ +++ Sbjct: 260 KDLSKRLVQQLKIGPH---YTQVAAVTFATVGRTRVRFNLKKYSTQEEVLRGIDKLQSKG 316 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+TAI ++ A I DE + + K IV TDG + + + A Sbjct: 317 GTTAIGAGIEKALTQI-----DESEGARPGIATKVMIVF-TDGWSNKGP-DPEKRARDAV 369 Query: 395 SQGIRIMTIAFSV-NKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDR 444 + G + T+A++ LS + S F + L + R Sbjct: 370 NAGFEMYTVAYTARAPGSVTLNNETLSAISGSSGHAFTDVTFQTLVDKIKQR 421 >gi|154759255|ref|NP_001032852.2| collagen alpha-1(XXVIII) chain precursor [Homo sapiens] gi|167009138|sp|Q2UY09|COSA1_HUMAN RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor Length = 1125 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 67/174 (38%), Gaps = 18/174 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + S+ I ++ +++ A F+ V DP FS Sbjct: 50 VFIVDSSESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + + + + K ++ Sbjct: 110 SWKDLQTFKQKVKSMNLIGQ---GTFSYYAISNATRLL---------KREGRKDGVKVVL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 L+TDG + N + +I A+ GI +TIA S + + +S +S Sbjct: 158 LMTDGIDHPKNPDVQSISEDARISGISFITIALSTVVNEAK--LRLISGDSSSE 209 >gi|149911407|ref|ZP_01900025.1| hypothetical protein PE36_11192 [Moritella sp. PE36] gi|149805515|gb|EDM65520.1| hypothetical protein PE36_11192 [Moritella sp. PE36] Length = 450 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 10/63 (15%), Positives = 26/63 (41%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G+ I+ + + ++G+ + +D + L+ +A + A+ L + + Sbjct: 13 QNGNVLIVFTIALFALIGMASLALDGGHLLLNKGKLQNLVDSAALHAATELDEGATHEQA 72 Query: 78 RAK 80 RA Sbjct: 73 RAA 75 >gi|119614001|gb|EAW93595.1| hCG2042895 [Homo sapiens] Length = 713 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 67/174 (38%), Gaps = 18/174 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + S+ I ++ +++ A F+ V DP FS Sbjct: 50 VFIVDSSESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + + + + K ++ Sbjct: 110 SWKDLQTFKQKVKSMNLIGQ---GTFSYYAISNATRLL---------KREGRKDGVKVVL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 L+TDG + N + +I A+ GI +TIA S + + +S +S Sbjct: 158 LMTDGIDHPKNPDVQSISEDARISGISFITIALSTVVNEAK--LRLISGDSSSE 209 >gi|115374996|ref|ZP_01462267.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|310820519|ref|YP_003952877.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|115368023|gb|EAU66987.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|309393591|gb|ADO71050.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 476 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 66/189 (34%), Gaps = 24/189 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A ++ +K D R+ + V S P R I + Sbjct: 111 KLEQAKQAARHLVTLLKDDD------RLAIVHYGSDVKSLPGLQATPANRERMI--QYIE 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---G 386 + G T I+ + + ++ D ++L++DG+ T+ + + Sbjct: 163 GIWDEGGTNISAGLLAGQAQVETARSDYRVNR---------LILISDGQPTEGSTDEGSL 213 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + +++GI + +I + + + + F A S+ +L +F+ + Sbjct: 214 KQVVKDIRTRGITVSSIGVGTDFNE-DLMQAFAEYGA--GSYGFLEDAGKLATLFQKDL- 269 Query: 447 NEIFERVIR 455 + +V R Sbjct: 270 QQASTQVAR 278 >gi|51095061|gb|EAL24305.1| similar to Matn2-prov protein [Homo sapiens] Length = 651 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 67/174 (38%), Gaps = 18/174 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + S+ I ++ +++ A F+ V DP FS Sbjct: 50 VFIVDSSESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + + + + K ++ Sbjct: 110 SWKDLQTFKQKVKSMNLIGQ---GTFSYYAISNATRLL---------KREGRKDGVKVVL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 L+TDG + N + +I A+ GI +TIA S + + +S +S Sbjct: 158 LMTDGIDHPKNPDVQSISEDARISGISFITIALSTVVNEAK--LRLISGDSSSE 209 >gi|51095062|gb|EAL24306.1| similar to Matn2-prov protein [Homo sapiens] Length = 668 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 67/174 (38%), Gaps = 18/174 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + S+ I ++ +++ A F+ V DP FS Sbjct: 50 VFIVDSSESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + + + + K ++ Sbjct: 110 SWKDLQTFKQKVKSMNLIGQ---GTFSYYAISNATRLL---------KREGRKDGVKVVL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 L+TDG + N + +I A+ GI +TIA S + + +S +S Sbjct: 158 LMTDGIDHPKNPDVQSISEDARISGISFITIALSTVVNEAK--LRLISGDSSSE 209 >gi|220922748|ref|YP_002498050.1| hypothetical protein Mnod_2796 [Methylobacterium nodulans ORS 2060] gi|219947355|gb|ACL57747.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 135 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 32/78 (41%), Gaps = 2/78 (2%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK-TQQEKARYFLSNCASPNSF-FEANSTHEL 437 + AK+ GI + TI FS + +K LSNCAS +S F AN + L Sbjct: 58 NALDALTTQAYTNAKAAGISVYTIGFSTPSDSIDDKGLSLLSNCASSSSQAFVANDANTL 117 Query: 438 NKIFRDRIGNEIFERVIR 455 F + R+ R Sbjct: 118 ISAFNQIAKSVGSLRLTR 135 >gi|148685682|gb|EDL17629.1| mCG133494, isoform CRA_a [Mus musculus] gi|148685684|gb|EDL17631.1| mCG133494, isoform CRA_a [Mus musculus] Length = 828 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 41/297 (13%), Positives = 91/297 (30%), Gaps = 44/297 (14%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 +Q EA +S + W++ ++ R++ CF + Sbjct: 81 LQVPPEAVNISLGLSLAAATNPSWLLACGPTVHHTCRENIYLTGLCFLLSSSFKQSQNFP 140 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + ++ +++ + S+ + LD + + S R + + Sbjct: 141 TAQQECPKQDQDIVFLIDGSGSISSTDFEKMLDFVKAVMSQLQRPSTRFSLMQF------ 194 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 +D R+ FN+ + + S G T Sbjct: 195 --------------SDYFRVHF-TFNNFISTSSPLS------------LLGSVRQLRGYT 227 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A++ + ++ +A K ++++TDG DN ++ A++ Sbjct: 228 YTASAIKHVITELFTTQSGAR------QDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS 281 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEIFE 451 I I E ++ L AS S F + L I +++ +IF Sbjct: 282 IIRYAIGVG-KAFYNEHSKQELKAIASMPSHEYVFSVENFDALKDI-ENQLKEKIFA 336 >gi|10946646|ref|NP_067309.1| integrin alpha-X precursor [Mus musculus] gi|48428495|sp|Q9QXH4|ITAX_MOUSE RecName: Full=Integrin alpha-X; AltName: Full=CD11 antigen-like family member C; AltName: Full=Leukocyte adhesion glycoprotein p150,95 alpha chain; AltName: Full=Leukocyte adhesion receptor p150,95; AltName: CD_antigen=CD11c; Flags: Precursor gi|6684131|gb|AAF23492.1|AF211864_1 leukocyte adhesion glycoprotein p150,95 alpha integrin subunit [Mus musculus] gi|189442099|gb|AAI67225.1| Integrin alpha X [synthetic construct] Length = 1169 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 41/297 (13%), Positives = 91/297 (30%), Gaps = 44/297 (14%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 +Q EA +S + W++ ++ R++ CF + Sbjct: 81 LQVPPEAVNISLGLSLAAATNPSWLLACGPTVHHTCRENIYLTGLCFLLSSSFKQSQNFP 140 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + ++ +++ + S+ + LD + + S R + + Sbjct: 141 TAQQECPKQDQDIVFLIDGSGSISSTDFEKMLDFVKAVMSQLQRPSTRFSLMQF------ 194 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 +D R+ FN+ + + S G T Sbjct: 195 --------------SDYFRVHF-TFNNFISTSSPLS------------LLGSVRQLRGYT 227 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A++ + ++ +A K ++++TDG DN ++ A++ Sbjct: 228 YTASAIKHVITELFTTQSGAR------QDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS 281 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEIFE 451 I I E ++ L AS S F + L I +++ +IF Sbjct: 282 IIRYAIGVG-KAFYNEHSKQELKAIASMPSHEYVFSVENFDALKDI-ENQLKEKIFA 336 >gi|74215118|dbj|BAE41792.1| unnamed protein product [Mus musculus] Length = 1169 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 41/297 (13%), Positives = 91/297 (30%), Gaps = 44/297 (14%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 +Q EA +S + W++ ++ R++ CF + Sbjct: 81 LQVPPEAVNISLGLSLAAATNPSWLLACGPTVHHTCRENIYLTGLCFLLSSSFKQSQNFP 140 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + ++ +++ + S+ + LD + + S R + + Sbjct: 141 TAQQECPKQDQDIVFLIDGSGSISSTDFEKMLDFVKAVMSQLQRPSTRFSLMQF------ 194 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 +D R+ FN+ + + S G T Sbjct: 195 --------------SDYFRVHF-TFNNFISTSSPLS------------LLGSVRQLRGYT 227 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A++ + ++ +A K ++++TDG DN ++ A++ Sbjct: 228 YTASAIKHVITELFTTQSGAR------QDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS 281 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEIFE 451 I I E ++ L AS S F + L I +++ +IF Sbjct: 282 IIRYAIGVG-KAFYNEHSKQELKAIASMPSHEYVFSVENFDALKDI-ENQLKEKIFA 336 >gi|695636|emb|CAA49843.1| inter-alpha-inhibitor H3 chain [Mus musculus] Length = 886 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 94/304 (30%), Gaps = 47/304 (15%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 S++ + K+ VS + +D RS C + Sbjct: 199 SFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------CPTCTDSLLNGDFTI 247 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 N + + ++ Y V + P L FV S +K Sbjct: 248 VYDVNRESPGNVQIVNGYFV-------HFFAPQGLPVVPKNIVFVIDVSGSMSGRKIQQT 300 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-----IVKTFAI 329 R+AL ++ +K+ D + F+ V +W H + T KTF Sbjct: 301 REALLKILDDVKEDD------YLNFILFSTDVT-----TWKDHLVQATPANLKEAKTFVK 349 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTDGENTQDNEEG 386 + ++ T IND + + + + ED + ++ I++LTDG+ Sbjct: 350 NIHDQSMTNINDGLLKGIEMLNKARED-------HTVPERSTSIIIMLTDGDANTGESRP 402 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 403 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 462 Query: 444 RIGN 447 + N Sbjct: 463 EVAN 466 >gi|116623628|ref|YP_825784.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] gi|116226790|gb|ABJ85499.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] Length = 543 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 36/264 (13%), Positives = 85/264 (32%), Gaps = 32/264 (12%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + +IT LL +M+ + G+ +D + L+ A A + A L Sbjct: 7 RRSRKSGSVMVLITLLLPSIMIPLVGLAIDASVARLVQLRLQAAVDGAAMGAGRLLG--- 63 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFE-NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + P+ E+L NF + + ++ + + + Sbjct: 64 ---------TPAVPETLAAEFLASNFRTDGSAGTWGAHDLHSTI------VYTPGITKII 108 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + + L LFLR +G S ++ + + VID S +M Sbjct: 109 DIDATAQVPL----LFLRILGKTSATVRARGSGTRTDSR--------VMLVIDRSGTMDV 156 Query: 192 YQRDSEGQPL-NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + N + +++ ++G+ S Y+ + Sbjct: 157 SDGTGLPTRIENAKTVAQTLFIPAFTEGADEIGLVAFDGSAYVAYPPSQPGWDPTTTSSS 216 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLV 274 + +F D ++ ++I + + + Sbjct: 217 RGGPDMYFKDPNNPNNMINQVNAI 240 >gi|194291599|ref|YP_002007506.1| hypothetical protein RALTA_B0833 [Cupriavidus taiwanensis LMG 19424] gi|193225503|emb|CAQ71449.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative exported protein [Cupriavidus taiwanensis LMG 19424] Length = 356 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 52/173 (30%), Gaps = 37/173 (21%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA------------ 367 R V T G TA+ + M A T++ + R+ N+ Sbjct: 156 AREAVATAIERLQPQGGTALGNGMLIALTTLLPELTPDAERLMNDDTPPPRKPRALANPP 215 Query: 368 ------------KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 IVL +DGE+ A A G+RI T+ + Sbjct: 216 ADTEPVKPGSYTSGAIVLFSDGESNAGPAALRAA-QLAAEHGVRIYTVGVGTPEGVVLSV 274 Query: 411 -----QQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + L A+ +F EL +++R F++ ++ Sbjct: 275 DGWSARVRLDEKVLKEVADATSAEYFRLEDAAELKRVYRALNARLAFDKRSQV 327 >gi|145587695|ref|NP_001038174.2| anthrax toxin receptor 2a [Danio rerio] gi|141796884|gb|AAI39637.1| Anthrax toxin receptor 2a [Danio rerio] Length = 478 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 44/113 (38%), Gaps = 13/113 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +R+ F+ R + ++ + + + N G T +++ ++ A + + Sbjct: 70 LRVSFIVFSSRAEIVLPLTGDRSEINKGLKTLSEV--NPAGETYMHEGIKLATEQMK--- 124 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFS 406 K ++ IV LTDG+ T ++ I + A+ G R+ + Sbjct: 125 -------KEPKKSSSIIVALTDGKLETYIHQLTIDEADSARKYGARVYCVGVK 170 >gi|89513613|gb|ABD74633.1| capillary morphogenesis protein 2A [Danio rerio] gi|122891370|emb|CAM13145.1| novel protein similar to vertebrate anthrax toxin receptor family protein [Danio rerio] Length = 478 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 44/113 (38%), Gaps = 13/113 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +R+ F+ R + ++ + + + N G T +++ ++ A + + Sbjct: 70 LRVSFIVFSSRAEIVLPLTGDRSEINKGLKTLSEV--NPAGETYMHEGIKLATEQMK--- 124 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFS 406 K ++ IV LTDG+ T ++ I + A+ G R+ + Sbjct: 125 -------KEPKKSSSIIVALTDGKLETYIHQLTIDEADSARKYGARVYCVGVK 170 >gi|12836422|dbj|BAB23649.1| unnamed protein product [Mus musculus] Length = 902 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 264 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 317 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 318 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 376 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 377 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 427 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 428 KMALDNGGLARRIYEDSDSALQLQDFYHE 456 >gi|327278400|ref|XP_003223950.1| PREDICTED: integrin alpha-M-like [Anolis carolinensis] Length = 1160 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 39/86 (45%), Gaps = 3/86 (3%) Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-- 425 K ++++TDGE + D + + +A+ GI I + + +S + P Sbjct: 254 SKVLIVITDGEKSGDPLQYSDVIPEAERAGIIRFAIGVGKAFSGGTAKQELISIASQPED 313 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 + F ++ L I ++++ ++IF Sbjct: 314 DHVFPVDNFDALKDI-QNKLQDKIFA 338 >gi|148692826|gb|EDL24773.1| inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus] Length = 886 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 94/304 (30%), Gaps = 47/304 (15%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 S++ + K+ VS + +D RS C + Sbjct: 200 SFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------CPTCTDSLLNGDFTI 248 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 N + + ++ Y V + P L FV S +K Sbjct: 249 VYDVNRESPGNVQIVNGYFV-------HFFAPQGLPVVPKNIVFVIDVSGSMSGRKIQQT 301 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-----IVKTFAI 329 R+AL ++ +K+ D + F+ V +W H + T KTF Sbjct: 302 REALLKILDDVKEDD------YLNFILFSTDVT-----TWKDHLVQATPANLKEAKTFVK 350 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTDGENTQDNEEG 386 + ++ T IND + + + + ED + ++ I++LTDG+ Sbjct: 351 NIHDQSMTNINDGLLKGIEMLNKARED-------HTVPERSTSIIIMLTDGDANTGESRP 403 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 404 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 463 Query: 444 RIGN 447 + N Sbjct: 464 EVAN 467 >gi|120407060|ref|NP_766396.2| anthrax toxin receptor-like precursor [Mus musculus] Length = 641 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 40/119 (33%), Gaps = 10/119 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +R+ ++ + ++ ++++ + G T + ++ A N Sbjct: 110 LRISIITYSTEAEVILPLTSDSKEINKSLL--VLKNIVPQGLTHMQKGLRKA-------N 160 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENT-QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 E I+ LTDG + + + KA+ G + T+ + QQ Sbjct: 161 EQIRKSTLGGRIVNSVIIALTDGLLLLKPYLDTMEEAKKARRMGAIVYTVGVFMYSKQQ 219 >gi|308476046|ref|XP_003100240.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] gi|308265764|gb|EFP09717.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] Length = 879 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 52/132 (39%), Gaps = 10/132 (7%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 I+ ++ + D VR+G ++D ++ + S + I + T Sbjct: 57 IKIVRDLPIHEDAVRVGLIQYSDAAKTEFNLSR-YSERNDIITHLETLTFMPGEDTRTGV 115 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ A + + + + L+A + I+L TDG + ++ + +G++I Sbjct: 116 ALDKADEEMFN------YIGGARLKATRLIILFTDGLSM---DKPTKSAKTLRRKGVKIY 166 Query: 402 TIAFSVNKTQQE 413 TI+ + E Sbjct: 167 TISVNSIGFVPE 178 >gi|161784288|sp|Q61704|ITIH3_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor Length = 889 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 94/304 (30%), Gaps = 47/304 (15%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 S++ + K+ VS + +D RS C + Sbjct: 202 SFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRS-----------CPTCTDSLLNGDFTI 250 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 N + + ++ Y V + P L FV S +K Sbjct: 251 VYDVNRESPGNVQIVNGYFV-------HFFAPQGLPVVPKNIVFVIDVSGSMSGRKIQQT 303 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-----IVKTFAI 329 R+AL ++ +K+ D + F+ V +W H + T KTF Sbjct: 304 REALLKILDDVKEDD------YLNFILFSTDVT-----TWKDHLVQATPANLKEAKTFVK 352 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTDGENTQDNEEG 386 + ++ T IND + + + + ED + ++ I++LTDG+ Sbjct: 353 NIHDQSMTNINDGLLKGIEMLNKARED-------HTVPERSTSIIIMLTDGDANTGESRP 405 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 406 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 465 Query: 444 RIGN 447 + N Sbjct: 466 EVAN 469 >gi|190149857|ref|YP_001968382.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|307263180|ref|ZP_07544801.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|189914988|gb|ACE61240.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|306871542|gb|EFN03265.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 530 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 59/525 (11%), Positives = 145/525 (27%), Gaps = 102/525 (19%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 72 LEE---------VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + + N + ++ + + L + D+ V T Sbjct: 70 RKANDYKLGGSNPNDDSFNISSEVGKRDHAIVTTFVKTFLPQTNDDKMNLIPVCKTVNNT 129 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQTKAEAETVSRSYHK-EHGVSIQ 180 + + ++ + S F L+ ++ Q +++ + + + + Sbjct: 130 SGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVDVASKSKAFKKNTFNIPIDLM 189 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 V D S SM ++E + + + + + D + +L Sbjct: 190 VVADLSGSMNFDLDNNETKKTGKPSKISILKEVLVELADKTLLSEDANQHNRIYVTPFAL 249 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR---- 296 + S + S K ++ L + D +N+ V Sbjct: 250 GAEINKNSCALPYSWDIESSS--------KIENIKKILNKENSQYNRADLINNLVYRIST 301 Query: 297 -------MGATFFND---------RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 G +N + + + W + G+T + Sbjct: 302 KETLNNINGKQKYNVTFPKNTFCLKDMKTSNQGWYTRSDKSKFTS-YVQSIKASGATLAS 360 Query: 341 DAMQTAYDTIISSNEDEVH----------------------------------RMKNNLE 366 + A + +I R+ NL Sbjct: 361 SGVLVAANNMIKDGSRTEQLKEQTKRVILVLSDGNDEIIKSDPNSKVPFLNYTRITENLI 420 Query: 367 AKKYIVLLTDGE----------------NTQDNEEGIAICNKAKS--------QGIRIMT 402 K V L+ + +TQ E +C + + +I+ Sbjct: 421 YGKQEVFLSQKQKISLSLSHSTIETYLTDTQPRNETDGMCKVIRDRLDTLNNDKNTKIVF 480 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + F KA+ +C +++ AN L F+ IG Sbjct: 481 VEF----GYASKAKQAWQHCVGNGNYYSANDKESLLNSFKQAIGE 521 >gi|151357765|emb|CAO78005.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 903 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L FV S KK R+AL +++ + D N F+ Sbjct: 264 APENLPTMSKNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFN------LIEFSG 317 Query: 305 RVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + + V +A G T IN+A+ A + + SN+ E+ K+ Sbjct: 318 EANQWKQSLVQATEENLNKAVN-YASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKS 376 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T I N + + + F + FL Sbjct: 377 VSL----IILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYP-----FLE 427 Query: 421 NCASPNS------FFEANSTHELNKIFRD 443 A N + +++S +L + + Sbjct: 428 KMALDNGGLARRIYEDSDSALQLQDFYHE 456 >gi|147815707|emb|CAN70517.1| hypothetical protein VITISV_016246 [Vitis vinifera] Length = 715 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 45/145 (31%), Gaps = 18/145 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N S + P P S S+ K L++ A+ VI+++ D R Sbjct: 251 NMSNSPLNSHNPRAPVDLVTVLDISGSMAG--TKLALLKRAMGFVIQNLGSSD------R 302 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + F+ + ++ G T I + ++ + E Sbjct: 303 LSVIAFSSTARRLFPLRRMTDAGRQQALQA-VNSLVANGGTNIAEGLRKGAKVMEDRKE- 360 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQ 381 I+LL+DG++T Sbjct: 361 --------RNPVSSIILLSDGQDTY 377 >gi|270007556|gb|EFA04004.1| hypothetical protein TcasGA2_TC014153 [Tribolium castaneum] Length = 813 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 44/236 (18%), Positives = 75/236 (31%), Gaps = 20/236 (8%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV + L +K + DA+ ++ + + D D VR Sbjct: 280 NGYFVHFFSPSGLKPLPKHVVFVLNHGLTMHGRKIDQLIDAMQKILSELTENDAF-DIVR 338 Query: 297 MGAT--FFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 GAT ++ L + K F ++ I A T YD Sbjct: 339 FGATPSVWDSTRHKFIRLPDLRHYGNLEPYVKKLFLPRTSKAVRQNIEAARSTIYDKSGL 398 Query: 353 SNEDEVHRMKNNL---------EAKKY---IVLLTDG---ENTQDNEEGIAICNKAKSQG 397 + V+ ++ L +Y I+ LTD E I K + Sbjct: 399 GLSNPVYALEVGLFLAKRIQDNLPNRYQPMIIFLTDSYPTVGMTSQNEIINTVTKVNNNR 458 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I I +++F + + + N +EA F I + + +V Sbjct: 459 IPIFSLSFGEDVDKNFMRQLAAKNLGFSGHIYEALDASVQILNFYRSISSPVLSQV 514 >gi|258516146|ref|YP_003192368.1| von Willebrand factor type A [Desulfotomaculum acetoxidans DSM 771] gi|257779851|gb|ACV63745.1| von Willebrand factor type A [Desulfotomaculum acetoxidans DSM 771] Length = 219 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 56/188 (29%), Gaps = 16/188 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ + +I +KK +T + F D + + Sbjct: 29 IEAVKQGVKYMISELKKEPQAIETAYISVITFGSDARQDVQLT--------ELAAFKEPQ 80 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ A+ + + K + + +I +TDGE ++ Sbjct: 81 IEANGTTSLGAALHILNNCFDNEVRKSTPTQKGDYKPLVFI--MTDGE---PTDDWENAA 135 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + K + ++ I + L + + + FR + + Sbjct: 136 REIKQKSGKVANI--VAVGCGPDVNTDTLKKITDIVLLMSSYQPEDFKQFFR-WVSQSVK 192 Query: 451 ERVIRITK 458 + I+ TK Sbjct: 193 QASIKFTK 200 >gi|148685685|gb|EDL17632.1| mCG133512, isoform CRA_a [Mus musculus] Length = 1164 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 61/180 (33%), Gaps = 16/180 (8%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++D + +++ + ++ ++ S L+ IV+ Sbjct: 176 MKDFVKALMGQLASTSTSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDAIVQL------- 228 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T +Q + S K K ++++TDG+ +D E + +A Sbjct: 229 QGLTYTASGIQKVVKELFHSKNGARKSAK------KILIVITDGQKFRDPLEYRHVIPEA 282 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYF--LSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + GI I + + + + S + F+ + L I + +I +IF Sbjct: 283 EKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSI-QRQIQEKIFA 341 >gi|148680074|gb|EDL12021.1| mCG3350, isoform CRA_a [Mus musculus] Length = 513 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 44/280 (15%), Positives = 89/280 (31%), Gaps = 34/280 (12%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 182 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 241 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 242 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 302 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 361 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 362 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 412 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQ 412 LTDGE+ GI C +A G I TIA + ++ Sbjct: 413 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSAARE 447 >gi|148685686|gb|EDL17633.1| mCG133512, isoform CRA_b [Mus musculus] Length = 1168 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 61/180 (33%), Gaps = 16/180 (8%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++D + +++ + ++ ++ S L+ IV+ Sbjct: 176 MKDFVKALMGQLASTSTSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDAIVQL------- 228 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T +Q + S K K ++++TDG+ +D E + +A Sbjct: 229 QGLTYTASGIQKVVKELFHSKNGARKSAK------KILIVITDGQKFRDPLEYRHVIPEA 282 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYF--LSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + GI I + + + + S + F+ + L I + +I +IF Sbjct: 283 EKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSI-QRQIQEKIFA 341 >gi|88911344|sp|Q3V0T4|ITAD_MOUSE RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|74215609|dbj|BAE21419.1| unnamed protein product [Mus musculus] Length = 1168 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 61/180 (33%), Gaps = 16/180 (8%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++D + +++ + ++ ++ S L+ IV+ Sbjct: 178 MKDFVKALMGQLASTSTSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDAIVQL------- 230 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T +Q + S K K ++++TDG+ +D E + +A Sbjct: 231 QGLTYTASGIQKVVKELFHSKNGARKSAK------KILIVITDGQKFRDPLEYRHVIPEA 284 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYF--LSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + GI I + + + + S + F+ + L I + +I +IF Sbjct: 285 EKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSI-QRQIQEKIFA 343 >gi|121606137|ref|YP_983466.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120595106|gb|ABM38545.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 354 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 69/214 (32%), Gaps = 53/214 (24%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + + + + VR+G F + + R + Sbjct: 111 RLVASQNAAKAFLADLPR------NVRVGVVAFAGTAAVVQPPT-----VSREDLTAAID 159 Query: 330 DENEMGSTAINDAMQTAYDTII----------SSNEDEVHRM------KNNLEAKKY--- 370 TAI + + + + +N + H + K++ KK Sbjct: 160 KFQLQRGTAIGNGIIVSLAELFPEAGIDLESMENNRERKHGLSLDQAGKDDGNGKKAFTP 219 Query: 371 ----------IVLLTDGENTQDNEEGIAICNKAKSQGIRIM----------TIAFSVNKT 410 I+LLTDG+ T + + A +GIR+ TI F Sbjct: 220 VAPGSYTSAAIILLTDGQRTTG-IDSLDAAKVAADRGIRVYTVGVGTVEGETIGFEGWSM 278 Query: 411 QQEKARYFLSNC--ASPNSFFEANSTHELNKIFR 442 + + L A+ +F A + +L K+++ Sbjct: 279 RVKLDEETLKGIARATQAEYFYAGTATDLKKVYQ 312 >gi|28899191|ref|NP_798796.1| hypothetical protein VP2417 [Vibrio parahaemolyticus RIMD 2210633] gi|28807415|dbj|BAC60680.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 431 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 69/226 (30%), Gaps = 5/226 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ G ++ ++++ ++LGV +D+ + L+ A TA + +V + E+V Sbjct: 20 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAV-VADKTEDV 78 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 E A P Y + + Sbjct: 79 DQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYV-- 136 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY--Q 193 R + +S +L ++ + A A + + + I D + ++ D Sbjct: 137 RVAVTDMGISQYLSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAWGY 196 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 R P +K N +G + +L + + S Sbjct: 197 RPPGYDPNVDMDPSLVHELKVGDQNNTDMGPGNFQLLDFGQATGNS 242 >gi|148680075|gb|EDL12022.1| mCG3350, isoform CRA_b [Mus musculus] Length = 527 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 44/280 (15%), Positives = 89/280 (31%), Gaps = 34/280 (12%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 196 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 255 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 256 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 315 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 316 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 375 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 376 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 426 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQ 412 LTDGE+ GI C +A G I TIA + ++ Sbjct: 427 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSAARE 461 >gi|149918184|ref|ZP_01906676.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] gi|149820944|gb|EDM80351.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] Length = 522 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 69/216 (31%), Gaps = 27/216 (12%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P + +D S H K L+ D+L ++ + + D+V V GA+ Sbjct: 147 PERELPPRNLVFLLDVSGSMHDQDKLPLLTDSLRVLVNQLGERDHVAIVVYAGASG---- 202 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V+ P R + + GST + +Q AY + Sbjct: 203 VVLPP-----TRGSDRGTILAAISELRAGGSTNGGEGIQKAY--------ALAREHFDPQ 249 Query: 366 EAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + ++L TDG+ + + + G+ + + F Sbjct: 250 GINR-VILATDGDFNVGTTTESGLENLIERERESGVFLTVLGFGRGNLGDRTMEMLADK- 307 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ +S E K+ +G E ++ I K Sbjct: 308 -GNGNYAYIDSLAEARKV----LGTEAGSTLVTIAK 338 >gi|124783268|ref|NP_073725.2| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 precursor [Mus musculus] gi|171769535|sp|A2AVA0|SVEP1_MOUSE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=Polydom; Flags: Precursor gi|123210319|emb|CAM21214.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] gi|123229801|emb|CAM23597.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] Length = 3567 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 50/162 (30%), Gaps = 21/162 (12%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--- 305 L SL VD SS + ++ + ++ V+ R+ F+ + Sbjct: 79 LSGSLELVFLVDESSSVGQTNFLNELKF-VRKLLSDF---PVVSTATRVAIVTFSSKNNV 134 Query: 306 --VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + S S + + G T A Q A + S E+ Sbjct: 135 VARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQILRHSRENS------ 188 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 K I L+TDG + + IA + G+ I T Sbjct: 189 ----TKVIFLITDGYSNGGDPRPIAA--SLRDFGVEIFTFGI 224 >gi|26330612|dbj|BAC29036.1| unnamed protein product [Mus musculus] Length = 440 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 50/162 (30%), Gaps = 21/162 (12%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--- 305 L SL VD SS + ++ + ++ V+ R+ F+ + Sbjct: 79 LSGSLELVFLVDESSSVGQTNFLNELKF-VRKLLSDF---PVVSTATRVAIVTFSSKNNV 134 Query: 306 --VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + S S + + G T A Q A + S E+ Sbjct: 135 VARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQILRHSRENS------ 188 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 K I L+TDG + + IA + G+ I T Sbjct: 189 ----TKVIFLITDGYSNGGDPRPIAA--SLRDFGVEIFTFGI 224 >gi|26342893|dbj|BAC35103.1| unnamed protein product [Mus musculus] Length = 848 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 50/162 (30%), Gaps = 21/162 (12%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--- 305 L SL VD SS + ++ + ++ V+ R+ F+ + Sbjct: 79 LSGSLELVFLVDESSSVGQTNFLNELKF-VRKLLSDF---PVVSTATRVAIVTFSSKNNV 134 Query: 306 --VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + S S + + G T A Q A + S E+ Sbjct: 135 VARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQILRHSRENS------ 188 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 K I L+TDG + + IA + G+ I T Sbjct: 189 ----TKVIFLITDGYSNGGDPRPIAA--SLRDFGVEIFTFGI 224 >gi|11177164|gb|AAG32160.1|AF206329_1 polydom protein [Mus musculus] Length = 3567 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 50/162 (30%), Gaps = 21/162 (12%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--- 305 L SL VD SS + ++ + ++ V+ R+ F+ + Sbjct: 79 LSGSLELVFLVDESSSVGQTNFLNELKF-VRKLLSDF---PVVSTATRVAIVTFSSKNNV 134 Query: 306 --VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + S S + + G T A Q A + S E+ Sbjct: 135 VARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQILRHSRENS------ 188 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 K I L+TDG + + IA + G+ I T Sbjct: 189 ----TKVIFLITDGYSNGGDPRPIAA--SLRDFGVEIFTFGI 224 >gi|56421171|ref|YP_148489.1| hypothetical protein GK2636 [Geobacillus kaustophilus HTA426] gi|56381013|dbj|BAD76921.1| hypothetical protein [Geobacillus kaustophilus HTA426] Length = 960 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 39/124 (31%), Gaps = 18/124 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF-- 327 K + AL + + K N N R F+D V +G + + + + Sbjct: 97 KLQSAKSALQAAVNYFKS--NYNQNDRFALIPFSDGVREASVVPFGKYSNVASQLDAILN 154 Query: 328 -AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEE 385 G T + A+ A KKYI+ LTDG T + + Sbjct: 155 TGNSLTAGGGTNYSAALSLAKSYFTDPTR------------KKYIIFLTDGMPTVLNTVD 202 Query: 386 GIAI 389 I Sbjct: 203 TITY 206 >gi|239832733|ref|ZP_04681062.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] gi|239825000|gb|EEQ96568.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] Length = 579 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 15/69 (21%), Positives = 27/69 (39%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + + + G+ + AL+ P+ L V VD L+ A A + + Sbjct: 1 MRGFVSRFLGARGGNLATMAALVSPLFLAVAAFCVDTSSLFLERRQLQNMADLAAVAGAA 60 Query: 67 PLIQSLEEV 75 L Q+ E V Sbjct: 61 SLSQANEAV 69 >gi|225449026|ref|XP_002273050.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 710 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 45/145 (31%), Gaps = 18/145 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N S + P P S S+ K L++ A+ VI+++ D R Sbjct: 246 NMSNSPLNSHNPRAPVDLVTVLDISGSMAG--TKLALLKRAMGFVIQNLGSSD------R 297 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + F+ + ++ G T I + ++ + E Sbjct: 298 LSVIAFSSTARRLFPLRRMTDAGRQQALQA-VNSLVANGGTNIAEGLRKGAKVMEDRKE- 355 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQ 381 I+LL+DG++T Sbjct: 356 --------RNPVSSIILLSDGQDTY 372 >gi|16358975|gb|AAH10260.1| Clca1 protein [Mus musculus] Length = 513 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 44/280 (15%), Positives = 89/280 (31%), Gaps = 34/280 (12%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I++ T+ V + + + V+ D + + + + G Sbjct: 182 IEATRCSTRITGTNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPDKIQTAGASIMFMQ 241 Query: 213 KSYSSQNGKVGIRDEKLSPYMVS--CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S +P + + CN+ + + D + + Sbjct: 242 NLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPPPPTFSL 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRM--GATFFNDRVISDPSF----SWGVH-KLIRTI 323 R + V+ +D + +RM A + +++ S ++ + + Sbjct: 302 LKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQIVEKESMVGLVTFDSAAHIQNYL 361 Query: 324 VK----------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K T + + G T+I +Q + I SS++ IVL Sbjct: 362 IKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE---------IVL 412 Query: 374 LTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQ 412 LTDGE+ GI C +A G I TIA + ++ Sbjct: 413 LTDGEDN-----GIRSCFEAVSRSGAIIHTIALGPSAARE 447 >gi|73972314|ref|XP_860410.1| PREDICTED: similar to complement component 2 precursor isoform 5 [Canis familiaris] Length = 748 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 46/138 (33%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S ++I ++ K D T A+ + + + + + + E + I+ Sbjct: 312 SRDATEVINSLNKVNYKDHENGTGTNTYAALNSVHIMMNNQMDRLGMKTAAWQEIRHAII 371 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ +A+ N+ +S + I I + S Sbjct: 372 LLTDGKSNMGGSPKLAVDNIKEILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDG 431 Query: 425 PNSFFEANSTHELNKIFR 442 F T L ++F Sbjct: 432 ERHAFILQDTEALYQVFE 449 >gi|73972312|ref|XP_860371.1| PREDICTED: similar to complement component 2 precursor isoform 4 [Canis familiaris] Length = 682 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 46/138 (33%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S ++I ++ K D T A+ + + + + + + E + I+ Sbjct: 246 SRDATEVINSLNKVNYKDHENGTGTNTYAALNSVHIMMNNQMDRLGMKTAAWQEIRHAII 305 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ +A+ N+ +S + I I + S Sbjct: 306 LLTDGKSNMGGSPKLAVDNIKEILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDG 365 Query: 425 PNSFFEANSTHELNKIFR 442 F T L ++F Sbjct: 366 ERHAFILQDTEALYQVFE 383 >gi|328712312|ref|XP_003244777.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Acyrthosiphon pisum] Length = 919 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 31/133 (23%), Positives = 46/133 (34%), Gaps = 6/133 (4%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-- 382 K F D ST + DA+ A+ I E N K IV LTDGE T Sbjct: 452 KKFIQDLQSESSTNMEDALNKAHL-IAKLGETRFKDGAN--TPKPIIVFLTDGEPTTGIT 508 Query: 383 -NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +E I + + I ++ F + L+N +EA+ + F Sbjct: 509 EPQELIKYVSNTNEEKYPIYSLGFGEGADIDFLKKLSLNNTGFARVIYEASDASLQLRNF 568 Query: 442 RDRIGNEIFERVI 454 I + + V Sbjct: 569 YKEISSPVLSNVT 581 >gi|320007358|gb|ADW02208.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 428 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 49/136 (36%), Gaps = 16/136 (11%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT KT G T I A+ A D + + + IVL+TDGE+T Sbjct: 114 RTEAKTAVATLAPTGWTPIGPALLGAADDLDGGDST------------RRIVLITDGEDT 161 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + ++GI I T+ + +++ A+ ++ +T EL+ Sbjct: 162 CGPLDPCEVARDIAARGIHLVIDTLGLVPDAKIRQQLTCIAE--ATGGTYTAVQNTDELS 219 Query: 439 KIFRDRIGNEIFERVI 454 + + V Sbjct: 220 GRVKQLVDRAAEPVVT 235 >gi|255033973|ref|YP_003084594.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254946729|gb|ACT91429.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 625 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 67/203 (33%), Gaps = 27/203 (13%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D S + K L++ A + ++ D ++ G+ V++ Sbjct: 250 NLVFLIDVSGSMNEANKLPLLKQAFKLLADQLRVEDKISIVAYAGSAG---MVLAP---- 302 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK--KYI 371 + +K GSTA + ++ AYD K + K + Sbjct: 303 --TSGSEKKTIKDALDKLEAGGSTAGGEGIELAYD-----------LAKKHFLPKGNNRV 349 Query: 372 VLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +L TDG+ + E + + + GI + + F + + ++ Sbjct: 350 ILATDGDFNVGISNESELQKLIEEKRKAGIFLSVMGFGMGNYKDSHVETLADK--GNGNY 407 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 ++ E K+F G +F Sbjct: 408 AYIDNIQEARKVFVQEFGGTLFT 430 >gi|209546481|ref|YP_002278399.1| hypothetical protein Rleg2_4401 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537725|gb|ACI57659.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 534 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 14/78 (17%), Positives = 29/78 (37%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ G+ +T + MP++LG +++DV R S L+ A + + L Sbjct: 7 RRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGARELDGR 66 Query: 72 LEEVSSRAKNSFTFPKQK 89 + +S Sbjct: 67 DDAISRAQTAIEKIANSA 84 >gi|153836806|ref|ZP_01989473.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149749952|gb|EDM60697.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|328474272|gb|EGF45077.1| hypothetical protein VP10329_16235 [Vibrio parahaemolyticus 10329] Length = 418 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 69/226 (30%), Gaps = 5/226 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ G ++ ++++ ++LGV +D+ + L+ A TA + +V + E+V Sbjct: 7 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAV-VADKTEDV 65 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 E A P Y + + Sbjct: 66 DQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYV-- 123 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY--Q 193 R + +S +L ++ + A A + + + I D + ++ D Sbjct: 124 RVAVTDMGISQYLSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAWGY 183 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 R P +K N +G + +L + + S Sbjct: 184 RPPGYDPNVDMDPSLVHELKVGDQNNTDMGPGNFQLLDFGQATGNS 229 >gi|126465452|ref|YP_001040561.1| von Willebrand factor, type A [Staphylothermus marinus F1] gi|126014275|gb|ABN69653.1| von Willebrand factor, type A [Staphylothermus marinus F1] Length = 416 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 65/207 (31%), Gaps = 32/207 (15%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 +S +K + A ++ ++ D +G F + V R Sbjct: 47 TSYSMDGEKIFRAKQAALRLLDILRDKD------YVGVYGFAGKFYKVLEP---VPATNR 97 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 V+ I T I YDT+ E+ +++ + I+ +TDGE T Sbjct: 98 NEVEKAIIGLKLGSGTNI-------YDTLKKLVEETKKVLESGAISLVRIIFITDGEPTT 150 Query: 382 DNEEGIAICNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHE 436 ++ I AK G + I ++ LS A F + Sbjct: 151 GQKKPEKILEMAKKLREAGASALIIGVGTEYNEK-----LLSRMAMVLNGEFEHVSDPAS 205 Query: 437 LNKIFRD------RIGNEIFERVIRIT 457 L K+ + I + + R++ Sbjct: 206 LEKLISEYAKSTQEISAKNVAVLFRLS 232 >gi|187927679|ref|YP_001898166.1| hypothetical protein Rpic_0583 [Ralstonia pickettii 12J] gi|187724569|gb|ACD25734.1| conserved hypothetical protein [Ralstonia pickettii 12J] Length = 414 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 31/65 (47%), Gaps = 1/65 (1%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 + L++ V+LG+ G+++D+ + L+ A + ++A+ L + +RA N+ Sbjct: 16 MVGLMLAVLLGMAGLVIDLSGLFVAKTELQSAVDSCALSAAQEL-DGASDALTRATNAGV 74 Query: 85 FPKQK 89 Sbjct: 75 TAGNA 79 >gi|148657647|ref|YP_001277852.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569757|gb|ABQ91902.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 966 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 70/190 (36%), Gaps = 33/190 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L ++A+ + ID ++G F+D L + A+ Sbjct: 432 KLDLAKEAVYQASLGLTPID------QVGLVVFDDTANWVLQL----QPLPSMVEIERAL 481 Query: 330 DENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I ++ A + S++ H ++LLTDG + + IA Sbjct: 482 GSFGIGGGTNIRPGIEQAALALASTDAKIKH-----------VLLLTDGIAESNYSDLIA 530 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF-RDRI-- 445 + ++ GI I T+A ++ + A + S E+ +IF ++ I Sbjct: 531 ---QMRASGITISTVAVGLDAN--PNLVDVAN--AGGGRSYRVTSIDEVPRIFLQETIIA 583 Query: 446 -GNEIFERVI 454 G +I E+ I Sbjct: 584 AGRDIIEQPI 593 >gi|297157667|gb|ADI07379.1| hypothetical protein SBI_04258 [Streptomyces bingchenggensis BCW-1] Length = 528 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 28/127 (22%), Positives = 47/127 (37%), Gaps = 11/127 (8%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 ++ A G TAI +++ AY + + IVL+TDGENT + Sbjct: 411 IRADAKALTADGETAIFSSLRAAYRHLAERASALGDDRFTS------IVLMTDGENTAGD 464 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIF 441 + G + T F + ++ L N AS F+A L++ F Sbjct: 465 SADDFESFYRRLPGAQRTTPVFPILFGDSDRGE--LENIASLTGGRLFDATK-GSLDQAF 521 Query: 442 RDRIGNE 448 + G + Sbjct: 522 EEIRGYQ 528 >gi|3183041|sp|Q90615|ITA1_CHICK RecName: Full=Integrin alpha-1; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1 gi|497990|gb|AAA59067.1| alpha 1 integrin [Gallus gallus] Length = 285 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 64/174 (36%), Gaps = 16/174 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ +D ++G + V+ + + I + T + Sbjct: 88 LRNMDIGPQQTQVGIVQYGQTVVHEFYL-NTYSTTEEVMDAALRIRQRGGTQTMTALGID 146 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + E H + + +K +V++TDGE + DN + +K + + I+ IA Sbjct: 147 TAREEAF----TEAHGARRGV--QKVMVIVTDGE-SHDNYRLQEVIDKCEDENIQRFAIA 199 Query: 405 FSVNKTQ----QEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 + ++ EK + + AS FF + L I + +G IF Sbjct: 200 ILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALVTI-VEALGERIFA 252 >gi|13591884|ref|NP_112256.1| integrin alpha-1 precursor [Rattus norvegicus] gi|124941|sp|P18614|ITA1_RAT RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|56494|emb|CAA36384.1| unnamed protein product [Rattus norvegicus] gi|149059385|gb|EDM10392.1| integrin alpha 1, isoform CRA_b [Rattus norvegicus] Length = 1180 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 65/188 (34%), Gaps = 16/188 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + +A + +K++D ++G + + V + + +V I Sbjct: 183 IYPWESVIAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAANKIG 241 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T + TA + +K K +V++TDGE + DN + Sbjct: 242 RQGGLQTMTALGIDTARKEAFTEARGARRGVK------KVMVIVTDGE-SHDNYRLKQVI 294 Query: 391 NKAKSQGIRIMTIAFSVNKTQ----QEKARYFLSNCAS---PNSFFEANSTHELNKIFRD 443 + + I+ +IA + + EK + + AS FF + L I + Sbjct: 295 QDCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVK- 353 Query: 444 RIGNEIFE 451 +G IF Sbjct: 354 ALGERIFA 361 >gi|311273688|ref|XP_003133992.1| PREDICTED: integrin alpha-2 [Sus scrofa] Length = 1186 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 47/296 (15%), Positives = 89/296 (30%), Gaps = 28/296 (9%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + T + + SI V + +M + F + SQ G Sbjct: 93 STTTCEKLNLQTSTSISNVTEMKTNMSLGLTLTRNAGTGGFLTCGPLWAQQCGSQYYTTG 152 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + + + + + S P +D V S V++ L ++ Sbjct: 153 VCSDVSPDFQLLTSFSPAVQACPSLIDV-----VVVCDESNSIY--PWDAVKNFLEKFVQ 205 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS-TAINDA 342 + G + + + K +V+ + G T A Sbjct: 206 GLDIGPTKTQV---GLIQYANNPRVVFNL--NTFKTKAEMVEATSHTTQYGGDLTNTFKA 260 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 +Q A D+ S+ A K +V++TDGE + D A+ ++ + I Sbjct: 261 IQYARDSAYSAA------AGGRPGATKVMVVVTDGE-SHDGSMLKAVIDQCNNDNILRFG 313 Query: 403 IAFSV----NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA N + + AS FF + +L + +G +IF Sbjct: 314 IAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEADLLEK-AGTLGEQIFS 368 >gi|309792347|ref|ZP_07686816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225613|gb|EFO79372.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 845 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 45/143 (31%), Gaps = 5/143 (3%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSLEEVSSRAKN 81 I AL++ +++ + G+ VDV E +A A + S + + Sbjct: 24 IIALMILILVAMVGLSVDVGNTFSKERQAVASANAASLAGMSAYMARSSSTLDTTIYQAI 83 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + + + N + N+ D + + V +A R L Sbjct: 84 TASLQSNGLVVGDGTNNTVEVTANYLDSQGNLLAGHPVV--GSGGTAPNGAAYIRVQLSG 141 Query: 142 NPLSLFLRSMGIKSWLIQTKAEA 164 + F R +G I A A Sbjct: 142 MVNTSFARVVGRDDLPINADAHA 164 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 61/219 (27%), Gaps = 28/219 (12%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + Y G + P D Sbjct: 397 PTSQKPIQYVVVLDASGSMSANFDGQCNNSGGVKQCANGPSGFPDVQVSNTGYDYWWTTE 456 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTV------RMGATFFNDRVI--SDPSFSWGVHK 318 ++ ++ + AL ++ ++ + +M +FND V +F+ Sbjct: 457 SQRRIYVAKKALERLV-TLSNMPGNPGYTNTRPSDQMAVVWFNDGVSSSQTQAFTNNPTT 515 Query: 319 LIRTIVKTFAIDEN--EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 L I ++ N G T + A + + V N+E K+ ++ +TD Sbjct: 516 LKNYITTLNNVNGNYRSAGGTNGAGGLYRA-SLLYQNAPKTVSFNGTNVEYKRVVLFVTD 574 Query: 377 GENT----------------QDNEEGIAICNKAKSQGIR 399 G + D + + C KS+ I Sbjct: 575 GVSNYFLNTSASDLKGPLSSYDTFKKNSTCYNMKSKVIE 613 >gi|269960459|ref|ZP_06174831.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] gi|269834536|gb|EEZ88623.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] Length = 420 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 13/115 (11%), Positives = 46/115 (40%), Gaps = 14/115 (12%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G ++ ++++ ++LG+ +D+ + L+ A +A + +V + ++ Sbjct: 10 KRTQKGITLVLISMVLLILLGMAAFGIDLNHQVLNKTRLQNAVDSAALAGAVVVDEN--- 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 E + +++ + + E+ +TAV + ++ + Sbjct: 67 -----------GNVSAAETAAKATLSSISASDGNAELVFTDSNTAVTFSTDRATF 110 >gi|332823604|ref|XP_003311225.1| PREDICTED: complement C2 isoform 2 [Pan troglodytes] Length = 620 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 181 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 240 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 241 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 300 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 301 ERHAFILQDTKALHQVFE 318 >gi|332823602|ref|XP_003311224.1| PREDICTED: complement C2 isoform 1 [Pan troglodytes] Length = 752 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|332254890|ref|XP_003276566.1| PREDICTED: integrin alpha-2 [Nomascus leucogenys] Length = 1181 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 85/296 (28%), Gaps = 28/296 (9%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + + + SI V + +M + F + +Q G Sbjct: 88 STATCEKLNLQTSTSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTG 147 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + + + +S + S P +D V S V++ L ++ Sbjct: 148 VCSDISPDFQLSASFSPAAQPCPSLIDV-----VVVCDESNSIY--PWDAVKNFLEKFVQ 200 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-IVKTFAIDENEMGSTAINDA 342 + G + + + +K IV T + T A Sbjct: 201 GLDVGPTKTQV---GLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGA 255 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 +Q A S+ A K +V++TDGE + D A+ ++ I Sbjct: 256 IQYARKYAYSAASG------GRRSATKVMVVVTDGE-SHDGSMLKAVIDQCNHDNILRFG 308 Query: 403 IAFSV----NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA N + + AS FF + L + +G +IF Sbjct: 309 IAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEK-AGTLGEQIFS 363 >gi|225543438|ref|NP_001139375.1| complement C2 isoform 2 preproprotein [Homo sapiens] Length = 620 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 181 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 240 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 241 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 300 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 301 ERHAFILQDTKALHQVFE 318 >gi|194385074|dbj|BAG60943.1| unnamed protein product [Homo sapiens] Length = 506 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 67 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 126 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 127 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 186 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 187 ERHAFILQDTKALHQVFE 204 >gi|194374891|dbj|BAG62560.1| unnamed protein product [Homo sapiens] Length = 723 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 284 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 343 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 344 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 403 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 404 ERHAFILQDTKALHQVFE 421 >gi|194374835|dbj|BAG62532.1| unnamed protein product [Homo sapiens] Length = 620 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 181 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 240 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 241 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 300 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 301 ERHAFILQDTKALHQVFE 318 >gi|126031547|pdb|2ODP|A Chain A, Complement Component C2a, The Catalytic Fragment Of C3- And C5-Convertase Of Human Complement gi|126031548|pdb|2ODQ|A Chain A, Complement Component C2a, The Catalytic Fragment Of C3- And C5-Convertase Of Human Complement Length = 509 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 70 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 129 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 130 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 189 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 190 ERHAFILQDTKALHQVFE 207 >gi|118138230|pdb|2I6Q|A Chain A, Complement Component C2a gi|118138231|pdb|2I6S|A Chain A, Complement Component C2a Length = 517 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 78 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 137 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 138 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 197 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 198 ERHAFILQDTKALHQVFE 215 >gi|110598613|ref|ZP_01386880.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339782|gb|EAT58290.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 343 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 46/111 (41%), Gaps = 6/111 (5%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R F + + + + ++ + D E T A++ A++ + S+E Sbjct: 129 RRAIILFAAKPLVQCPLTADMD-AFEALLGMASPDLIEAQGTDFRSALELAHNVLEPSSE 187 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 R+ + + +K +VLL+DGE+ + A N+ K+ I + I Sbjct: 188 S---RLASAAKGEKIMVLLSDGEDHAGDLP--AAANRIKNGRIHVFAIGVG 233 >gi|62897125|dbj|BAD96503.1| complement component 2 precursor variant [Homo sapiens] Length = 752 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|62896633|dbj|BAD96257.1| complement component 2 precursor variant [Homo sapiens] Length = 752 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|34628|emb|CAA28169.1| unnamed protein product [Homo sapiens] Length = 752 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|15277207|dbj|BAB63292.1| C2 [Homo sapiens] Length = 577 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|14550407|ref|NP_000054.2| complement C2 isoform 1 preproprotein [Homo sapiens] gi|3915642|sp|P06681|CO2_HUMAN RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|298124|gb|AAB97607.1| complement component C2 [Homo sapiens] gi|2347131|gb|AAB67975.1| complement component C2 [Homo sapiens] gi|28175369|gb|AAH43484.1| Complement component 2 [Homo sapiens] gi|33346923|gb|AAQ15273.1| complement component 2 [Homo sapiens] gi|55961814|emb|CAI17451.1| complement component 2 [Homo sapiens] gi|57209923|emb|CAI41858.1| complement component 2 [Homo sapiens] gi|119623954|gb|EAX03549.1| complement component 2, isoform CRA_b [Homo sapiens] gi|123857990|emb|CAM25860.1| complement component 2 [Homo sapiens] gi|168983782|emb|CAQ06833.1| complement component 2 [Homo sapiens] gi|168984416|emb|CAQ09272.1| complement component 2 [Homo sapiens] gi|168985077|emb|CAQ07481.1| complement component 2 [Homo sapiens] gi|168985955|emb|CAQ07111.1| complement component 2 [Homo sapiens] gi|189069137|dbj|BAG35475.1| unnamed protein product [Homo sapiens] Length = 752 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|38257345|sp|Q8SQ74|CO2_PANTR RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|19110330|gb|AAL82821.1| complement C2 [Pan troglodytes] Length = 752 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|86144309|ref|ZP_01062641.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218676258|ref|YP_002395077.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] gi|85837208|gb|EAQ55320.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218324526|emb|CAV26007.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] Length = 438 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 54/379 (14%), Positives = 124/379 (32%), Gaps = 38/379 (10%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K IK G I+ L+P M+ + + + L +AA+ A + LI S Sbjct: 2 KSIKKNRGVAGILFVGLLPAMVIFMAFSMQMSQQMLAHSRLLEAAEVASLA----LIASP 57 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +E + +Y + + N D +V R + +++ Sbjct: 58 KEDEDKNV-----------KYARYLVDRYILDNSEDVDVAVFTRKCEYKDGCVQASG--E 104 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQT--KAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 L+ D +++ + + + + ++ V+R Y + V + ++ DFS SM Sbjct: 105 LAPFSDFVVSATAKYTSWISYEDVDLEPEFTVSGRAVTRKYLPQ-SVDVYFIGDFSGSMG 163 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP---YMVSCNKSLYYMLYPG 247 + + + L+ + R V N + R L ++ +K++ Y Sbjct: 164 NSWK-NGKMKLDVVKETIKRVVDDIEKFNTEEKSRVALLGYNPFHVKQTDKTVRVNAYGY 222 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN-DTVRMGATFFNDRV 306 + SS +++ ++ + + + + F Sbjct: 223 YGSWRKKYAYNYARSSPGTTVRRMFDKPKLYNEILEPKRGMSRYEVERLHTHNVNFAKYY 282 Query: 307 -ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 D + + ++ T G T+ + + A + NL Sbjct: 283 KFYDIPLTEDYDEFRSQLMNT---KLQAGGGTSSWNGIIAAAQEANKA---------TNL 330 Query: 366 EAKKYIVLLTDGENTQDNE 384 ++ ++L+DG++ N Sbjct: 331 NPEQVFIVLSDGQDGDKNY 349 >gi|149175890|ref|ZP_01854508.1| hypothetical protein PM8797T_24766 [Planctomyces maris DSM 8797] gi|148845337|gb|EDL59682.1| hypothetical protein PM8797T_24766 [Planctomyces maris DSM 8797] Length = 368 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 56/153 (36%), Gaps = 8/153 (5%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI---QSLE 73 G + TA+++ +LG+ VD L++AA A++ A LI + Sbjct: 17 HRRGAIAVFTAIMLVPLLGMVAFAVDYGYLLKKRADLQRAADAAVLAAVRDLIPDANGTQ 76 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 ++S + I E + + + + + + V + Sbjct: 77 DLSKVRATLRQYADLNIGE-IEGFQVLDSDIQIGRYNPESVYDN--FTILDWGTFDTVRV 133 Query: 134 SSRYDLLLN-PLSLFL-RSMGIKSWLIQTKAEA 164 + R+D N P+SLF R +GI + + A Sbjct: 134 TLRFDTQANSPVSLFFARLLGINESDLNATSTA 166 >gi|126463435|ref|YP_001044549.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126105099|gb|ABN77777.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 651 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 35/277 (12%), Positives = 85/277 (30%), Gaps = 26/277 (9%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + + + + +N F + I +P + +L + Sbjct: 227 RAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSITRTPWNPETRLVHVALQGRM 286 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L+ +D+S K L++ + ++ ++ D V G+ Sbjct: 287 PAIEDRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPEDQVAIVTYAGSAG--- 343 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 V++ R+ + + + GSTA ++ + AY M Sbjct: 344 EVLAP------TAANQRSTILSALDRLDAGGSTAGDEGLALAYR--------TASEMAGA 389 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 E + +VL TDG+ + + + G+ + + F + N Sbjct: 390 GEVTR-VVLATDGDFNLGISDPEELARLVAHERDTGVYLSVLGFGRGNLDDATMQALAQN 448 Query: 422 CASPNSFFEANSTHE--LNKIFRDR---IGNEIFERV 453 ++ ++ + + L I +++ +V Sbjct: 449 GNGQAAYIDSLNEAQKVLVDQLSGALFPIADDVKVQV 485 >gi|325287596|ref|YP_004263386.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323050|gb|ADY30515.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 696 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 69/205 (33%), Gaps = 23/205 (11%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 Y P+ + +D S K L+ A ++ + + D V+ V GA Sbjct: 330 YLNEELPASNLTFLIDVSGSMEDHNKLPLLISAFKLLVHQLIEKDKVSIVVYAGAAG--- 386 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + G K I+ GSTA ++ AY E + NN Sbjct: 387 ---VVLPPTNGDQK--EKIINALQ-KLEAGGSTAGGQGIKLAYKL----AEKNFKKNGNN 436 Query: 365 LEAKKYIVLLTDGE-NTQDNEEGI--AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 ++L TDG+ N + + + K ++ G+ + + F + + K Sbjct: 437 R-----VILATDGDFNVGASSDTAMEKLIEKKRASGVFLSVLGFGMGNYKDSKLETLADK 491 Query: 422 CASPNSFFEANSTHELNKIFRDRIG 446 + ++ E K+F D G Sbjct: 492 --GNGNHAYIDTMQEAQKVFGDEFG 514 >gi|182412149|ref|YP_001817215.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177839363|gb|ACB73615.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 859 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 45/306 (14%), Positives = 80/306 (26%), Gaps = 32/306 (10%) Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + + T E + + + + M D QR + N F D Sbjct: 412 GLDAPQAEVSTAKEPVSTFSLHVSDVSFQLAQAALARGEMPDPQRIRPEEFYNAFDY-GD 470 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 T S ++ L ++ L+ +D+S Sbjct: 471 PTPASADKIACRIEQAAHPLLQQRNLVRIAMKVPAAGRGAGQPLNLTVLLDTSGSMERTD 530 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + VR AL + + D R+ F + + + A Sbjct: 531 RATSVRAALGVLASLLTPDD------RVTLIGF---ARQPRLLAESLAGDQARQLVDLAS 581 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + A+ A E+ R +N A+ IVL+TDG N + + Sbjct: 582 TTPFTGGTNLEAALSLA---------GELARRHHNAAAQNRIVLITDGAANLGNADPAQL 632 Query: 390 C---NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF--------EANSTHELN 438 + QGI + ++ +A +L Sbjct: 633 ATRIETLRQQGIAFDACGVGTDGLDDAVLEALTRK--GDGRYYVLDAPENADAGFARQLA 690 Query: 439 KIFRDR 444 FR Sbjct: 691 GAFRPA 696 >gi|332559488|ref|ZP_08413810.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] gi|332277200|gb|EGJ22515.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] Length = 651 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 53/443 (11%), Positives = 122/443 (27%), Gaps = 44/443 (9%) Query: 37 GGMLVDVVRWS---YYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 G L V R S + L A + ++ ++ ++ + + P+ Sbjct: 61 AGFLNGVRRMSGSFFSRPLLAATGSVAALGLALVVVMPNARLAEPPQTAPDAPEADARLA 120 Query: 94 L---IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 A + A + + D L+ + + Sbjct: 121 AVPEAGGGAETAGAPVPAEPRARSAEGAAPQTFAADEAMPMAVPPAPDFALSKQAAEAPA 180 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQ--------WVIDFS----RSMLDYQRDSEG 198 + + A A E VS + I S + + Sbjct: 181 RALPQGDSEAFANAPDNPLRVTAEDPVSTFSIDVDTASYAILRSSLRAGQLPPREAVRIE 240 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + +N F + + +P + +L + P L+ Sbjct: 241 EMINYFPYDYPAPESGTPPFRPSLSVTRTPWNPETRLVHVALQGRMPAIEDRPPLNLVFL 300 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D+S K L++ + ++ ++ D V G+ V++ Sbjct: 301 IDTSGSMQDPAKLPLLKQSFGLMLGRLRPEDQVAIVTYAGSAG---EVLAP------TAA 351 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 R+ + + + GSTA + + AY M E + +VL TDG+ Sbjct: 352 NQRSTILSALDRLDAGGSTAGEEGLALAYR--------TASEMAGAGEVTR-VVLATDGD 402 Query: 379 NTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + + G+ + + F + N ++ ++ + Sbjct: 403 FNLGISDPEELARLVAHERDTGVYLSVLGFGRGNLDDATMQALAQNGNGQAAYIDSLNEA 462 Query: 436 E--LNKIFRDR---IGNEIFERV 453 + L I +++ +V Sbjct: 463 QKVLVDQLSGALFPIADDVKVQV 485 >gi|309355882|emb|CAP38139.2| hypothetical protein CBG_21291 [Caenorhabditis briggsae AF16] Length = 430 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 62/172 (36%), Gaps = 12/172 (6%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +D +++ +K + ++ A F + F+ + +++ +++ Sbjct: 263 KDLSKRLVQQLKIGPH---YTQVAAVTFATVGRTRVRFNLKKYSTQEEVLRGIDKLQSKG 319 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+TAI ++ A I DE + + K IV TDG + + + A Sbjct: 320 GTTAIGAGIEKALTQI-----DESEGARPGIATKVMIVF-TDGWSNKGP-DPEKRARDAV 372 Query: 395 SQGIRIMTIAFSV-NKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDR 444 + G + T+A++ LS + S F + L + R Sbjct: 373 NAGFEMYTVAYTARAPGSVTLNNETLSAISGSSGHAFTDVTFQTLVDKIKQR 424 >gi|298372685|ref|ZP_06982675.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275589|gb|EFI17140.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] Length = 345 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 50/154 (32%), Gaps = 20/154 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L+ +I + D+ +MG F + + +KT Sbjct: 110 SRLDYAKMLLSQLIDRLT--DD-----KMGLIVFAGDAFIQMPITSD-KVSAKMFLKTIQ 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D + TAI A+ A + + + I LLTD EN +DN + Sbjct: 162 PDLIQRQGTAIGSAIDLAVKSFNDTK----------QSGGRAIFLLTDAENHEDNA--VE 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 A+ + I + + + + +S Sbjct: 210 AAKMARDKNITVNVVGIGTPEGSPIPVKGTMSYI 243 >gi|193213113|ref|YP_001999066.1| hypothetical protein Cpar_1468 [Chlorobaculum parvum NCIB 8327] gi|193086590|gb|ACF11866.1| conserved hypothetical protein [Chlorobaculum parvum NCIB 8327] Length = 352 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 57/172 (33%), Gaps = 8/172 (4%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL G I+ A+ + V+LG + +D+ R + + L+ AA A + + L+ S Sbjct: 8 KLPNGQRGAVTIMFAIFLVVLLGFAALALDLGRMNLTKVQLQSAADAAALGGAGSLVNSS 67 Query: 73 EEV---SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + I + + + V + A Sbjct: 68 LSTYDWDAAEQKGLVLAQHNIVNGEQIQQATIEAGYWNSSDGFRHHGTSGVPVTGDVPAV 127 Query: 130 QVVLSSRYDLLLNPLSLFLRS-MGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + ++ PL LF +GI I A ++ Y GV + Sbjct: 128 RATVALTSTQNNGPLKLFFAPFLGINESNIP----ASAIAAIYPPAGGVGMF 175 >gi|118365082|ref|XP_001015762.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89297529|gb|EAR95517.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 755 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 23/119 (19%), Positives = 46/119 (38%), Gaps = 16/119 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 KK LVR +L +++ ++K D ++ F+ + + V+ + +K+ Sbjct: 63 KKAQLVRKSLKYLLKILEKGD------QISLVSFSSTAKTLCPLT-QVNDENKQQIKSAI 115 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 N G T + + + S E + +I+LLTDGE + + Sbjct: 116 KQINGQGGTFVIPGFKEVTKILNSRKE---------QREQTFILLLTDGEFGDIDSGKV 165 >gi|308510204|ref|XP_003117285.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] gi|308242199|gb|EFO86151.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] Length = 409 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 51/148 (34%), Gaps = 12/148 (8%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN---DAMQT 345 D T R+G +N ++ + N + +T + + Sbjct: 91 DYEPKTTRVGLVSYNADAKILAGL--DTYQSYDDLANGVFDSLNSVSATDESYLAKGLSA 148 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A + + + N + KK +++ + + ++ K+ G++I+T+AF Sbjct: 149 A-EKVFEEGKSTA----NRTQYKKVVIVYASSYKGTGELNPVPVADRMKTAGVKIITVAF 203 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANS 433 S N + LS AS + F Sbjct: 204 SQNND--DGLLKDLSEIASLDFDFANTD 229 >gi|198434614|ref|XP_002123557.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1105 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 36/259 (13%), Positives = 80/259 (30%), Gaps = 21/259 (8%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 G NC G + +++ S S +L + Sbjct: 796 NGGRTNCTGYRYGDNCTFTCPNGLVIAGGPTQMTCQSDSTWSSQPPCCDLKCPPHALVDL 855 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ SS +L+ + +++ + +R+G FN D G Sbjct: 856 MFLLDSSSSVGRSNWNLLINFTVALLDKFVISPDD---MRVGVARFNRHFDRDSEILIGN 912 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I + + T + + N +H N IV++TD Sbjct: 913 YSNISELRQKLRRMPYRGRGTLTG-------NALWHMNNHSLHAPGNRPGVPDVIVVITD 965 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 G ++E + N K Q +++ + + A+ L + +S + + Sbjct: 966 G---LASDEVLRAANALKEQDVKMYVVGLINRMNRMNLAQ--LQDISSGTEYLQIIDNG- 1019 Query: 437 LNKIFRDRIGNEIFERVIR 455 +R+ +E+ + + + Sbjct: 1020 -----YERLADELSDTLTQ 1033 >gi|83423290|emb|CAI67595.1| collagen, type XXVIII [Homo sapiens] gi|223462744|gb|AAI36893.1| Collagen, type XXVIII, alpha 1 [Homo sapiens] Length = 1125 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 66/174 (37%), Gaps = 18/174 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + S+ I ++ +++ A F+ V DP FS Sbjct: 50 VFIVDSSESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + + + + K ++ Sbjct: 110 SWKDLQTFKQKVKSMNLIGQ---GTFSYYAISNATRLL---------KREGRKDGVKVVL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 L+TDG + N + +I A+ GI +TI S + + +S +S Sbjct: 158 LMTDGIDHPKNPDVQSISEDARISGISFITIGLSTVVNEAK--LRLISGDSSSE 209 >gi|224004848|ref|XP_002296075.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|209586107|gb|ACI64792.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 868 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 48/163 (29%), Gaps = 26/163 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID-ENEMGSTAINDAMQTAYDTIISSN 354 R F++ + + AID + G T I A+ A + Sbjct: 173 RFALISFSEDAVIEVPMQ--KVNERNKQQALHAIDRLSVKGRTNIASAVSLAAQVVNGVA 230 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT--------IAFS 406 E R + LLTDG E I + K I + I+ Sbjct: 231 EPNKVRS---------VFLLTDGNANTGYTEAI---DLVKLTSIFVEANRNPHTPPISLH 278 Query: 407 VNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIG 446 E + L A S SF+ +++ F D IG Sbjct: 279 TFGYGPEPDQKLLRGMAMATSGGSFYSVRDNSQVSSAFGDAIG 321 >gi|148656912|ref|YP_001277117.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569022|gb|ABQ91167.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 543 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 51/168 (30%), Gaps = 15/168 (8%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 ++ ++ V S +K + L + + I D R+G Sbjct: 356 AAKNAWSINRKRADILLVVDVSGSMEGEKLEAAKSGLGTFLSRILPED------RVGLIV 409 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + R + N G TA+ DA+ T + R+ Sbjct: 410 FSTDARVVVPPA--PLSEARIALDDAIAQLNARGKTALYDALITGKQVFDDLPPPDEERI 467 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + IVLL+DG + + GI I +A+ + Sbjct: 468 -------RAIVLLSDGLDNASRTTLDQVRLAFDETGISIFPVAYGSDA 508 >gi|293415564|ref|ZP_06658207.1| yfbK protein [Escherichia coli B185] gi|291433212|gb|EFF06191.1| yfbK protein [Escherichia coli B185] Length = 575 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 268 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 269 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAAKG------FIKGGINR--- 314 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 315 ILLATDGDFNVGIDDPKSIESMIKKQRESGVTLSTFGVGDDNYNEA 360 >gi|159900556|ref|YP_001546803.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893595|gb|ABX06675.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 423 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 58/178 (32%), Gaps = 25/178 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + V++A +V+ + + D FNDR K +K Sbjct: 63 RLYQVKEACNNVVNQLNRQD------YFSVVSFNDRAEVVVPCQRPNDKDQ---IKRAIG 113 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + M I R + + +VLLTDG D + I Sbjct: 114 MIEAKGGTEMATGMMMGLQEIS--------RPMMSRGISR-MVLLTDGRTYGDESRCVEI 164 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF--EANSTHELNKIFRDRI 445 +A+S+GI I + E L AS + + ++ +F D I Sbjct: 165 ARRAQSKGIGITALGI-----GDEWNEDLLETIASAENSRTEYITNAQQIVNVFSDEI 217 >gi|110633696|ref|YP_673904.1| hypothetical protein Meso_1343 [Mesorhizobium sp. BNC1] gi|110284680|gb|ABG62739.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 571 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 12/58 (20%), Positives = 23/58 (39%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + ++ +G+ ++ LL PVML VD + A +TA + + Sbjct: 11 QALRDRSGNVAVMAGLLFPVMLLGAVFGVDQGSLYLERREAQALTDLAAVTAVANISR 68 >gi|323941033|gb|EGB37220.1| von Willebrand protein type A [Escherichia coli E482] Length = 565 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 203 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 258 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 259 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG------FIKGGINR--- 304 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 305 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 350 >gi|312139646|ref|YP_004006982.1| integral membrane protein [Rhodococcus equi 103S] gi|311888985|emb|CBH48298.1| putative integral membrane protein [Rhodococcus equi 103S] Length = 326 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 65/197 (32%), Gaps = 33/197 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S D + + +G F S + + I Sbjct: 109 SRLAAAQEAAKSF------ADGLTPGINLGLVAFAGTASVLVSPTPNRDETKAAIDNLTL 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + G + + ++ S + + IVLL+DG E+ D Sbjct: 163 SERTATG--------EAIFTSLQSIDTLAAVLGGSEQAPPARIVLLSDGKQTVPESPDDP 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNCA--SPNSFFEAN 432 G +AK +G+ I TI+F E + L A S +FF A+ Sbjct: 215 RGGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDPSLREIANLSGGNFFTAS 274 Query: 433 STHELNKIF---RDRIG 446 S EL ++ ++IG Sbjct: 275 SLEELRDVYDTLEEQIG 291 >gi|256261598|gb|ACU65921.1| CR4 receptor subunit alphaX [Ovis aries] Length = 1158 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 54/155 (34%), Gaps = 12/155 (7%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+DR ++ + G T A++ D ++S+ Sbjct: 190 LVQFSDRFQEHF--TFKDFATSSDPLNLLNSVWQLGGWTFTASAIRFVTDRLLSAAYGAR 247 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K ++++TDGE T+ + + +A++ GI I + + Sbjct: 248 KDA------SKILIVITDGEKTEK-VDYKEVIPRAEAAGIIRYAIGVGSAFQYRNSLQEL 300 Query: 419 LSNCASP--NSFFEANSTHELNKIFRDRIGNEIFE 451 + ++P F+ + L I + ++ +IF Sbjct: 301 IDIASTPSKEHVFQVENFDALRDI-QKQLKEKIFA 334 >gi|325676908|ref|ZP_08156581.1| von Willebrand factor [Rhodococcus equi ATCC 33707] gi|325552456|gb|EGD22145.1| von Willebrand factor [Rhodococcus equi ATCC 33707] Length = 326 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 65/197 (32%), Gaps = 33/197 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S D + + +G F S + + I Sbjct: 109 SRLAAAQEAAKSF------ADGLTPGINLGLVAFAGTASVLVSPTPNRDETKAAIDNLTL 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + G + + ++ S + + IVLL+DG E+ D Sbjct: 163 SERTATG--------EAIFTSLQSIDTLAAVLGGSEQAPPARIVLLSDGKQTVPESPDDP 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNCA--SPNSFFEAN 432 G +AK +G+ I TI+F E + L A S +FF A+ Sbjct: 215 RGGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDPSLREIANLSGGNFFTAS 274 Query: 433 STHELNKIF---RDRIG 446 S EL ++ ++IG Sbjct: 275 SLEELRDVYDTLEEQIG 291 >gi|308472935|ref|XP_003098694.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] gi|308268294|gb|EFP12247.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] Length = 405 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 51/153 (33%), Gaps = 7/153 (4%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G +N + ++ + I + ++ T Sbjct: 87 TTRLGLVTYNSVATQNADL----NQYQSIEDAYYGIYGALSTTVNTTESYLTTGLNAAVE 142 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 N +K I++ N + + + I N+ K+ G+ I+TIA+ + Sbjct: 143 LFSRQSFRSNRQHYRKVIIVYASEYNGRGEFDPVPIANRLKASGVNIITIAYEQPGS--A 200 Query: 414 KARYFLSNCASPNSFFEANS-THELNKIFRDRI 445 LS ASP F ++ L K + + Sbjct: 201 GLLQGLSQIASPGFSFSGDNIAGNLVKEIQTAL 233 >gi|301026928|ref|ZP_07190323.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|299879508|gb|EFI87719.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] Length = 575 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 268 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 269 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG------FIKGGINR--- 314 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 >gi|193788347|dbj|BAG53241.1| unnamed protein product [Homo sapiens] Length = 325 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 100 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 159 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 160 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 219 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 220 ERHAFILQDTKALHQVFE 237 >gi|220913390|ref|YP_002488699.1| hypothetical protein Achl_2645 [Arthrobacter chlorophenolicus A6] gi|219860268|gb|ACL40610.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 319 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 58/160 (36%), Gaps = 26/160 (16%) Query: 12 KKLIKS--CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 ++L G +I A+L+ +LG + VDV L+ A + I + Sbjct: 2 RRLGADNTEKGAVSVIVAILLVTLLGFVAIAVDVGAIYSERAQLQSGADASAIALAQKCA 61 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + ++ ++ + + +N + + + + K+A Sbjct: 62 R---DTANADCSTTSTLAGSLAN---QNSLDGMSNVY--------------SIQLDKTAR 101 Query: 130 QVVLSSRYDLLLNPLS----LFLRSMGIKSWLIQTKAEAE 165 V +++ +P + F +++GI S + KA A Sbjct: 102 TVSVTTSAKETGSPDNSVSLFFAKAIGIPSKEVGAKASAT 141 >gi|332256729|ref|XP_003277468.1| PREDICTED: collagen alpha-2(VI) chain [Nomascus leucogenys] Length = 1124 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 64/182 (35%), Gaps = 18/182 (9%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVR 275 G G P + C+ Y G D + FV SS L + Sbjct: 711 GPEGESGPPGDPGLTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEK 770 Query: 276 DALASVIRSIKKI--DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDE 331 + + +V+ + I D ++T R+G ++ + + + E Sbjct: 771 NFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE 830 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAI 389 G T A++ AYD +I + + R+ + V++TDG + D+ A+ Sbjct: 831 WIAGGTWTPSALKFAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRAL 882 Query: 390 CN 391 C+ Sbjct: 883 CD 884 >gi|225872598|ref|YP_002754053.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] gi|225793914|gb|ACO34004.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] Length = 313 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 60/182 (32%), Gaps = 34/182 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A +R+ + + R+ FN RV F+ + K+ R + + Sbjct: 85 KDLDEEKRAAREFLRA--TLRPED---RVEIVNFNTRVHEVVPFTNNLKKIDRGLNRL-- 137 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 Y I +E+ R +K +V+++DG+NT N Sbjct: 138 ----------SEGPATALYAAIAYGSEELAQR-----PGRKVLVVISDGDNTVANSSYQQ 182 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYF--------LSNCASPNSFFEANSTHELNKI 440 ++A I ++ + R LS ++EA+ L + Sbjct: 183 ALDRAVRAETMIFSV--IDLPVINDAGRDVGGEHAMIALSEATGGEYYYEAD--GNLQGV 238 Query: 441 FR 442 F+ Sbjct: 239 FK 240 >gi|145497681|ref|XP_001434829.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124401957|emb|CAK67432.1| unnamed protein product [Paramecium tetraurelia] Length = 648 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 59/185 (31%), Gaps = 24/185 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 KK V+ +L ++ + + D R+ F+ K Sbjct: 243 KKIASVQQSLVQLLDFLSEKD------RLCLITFDGSAQRLTPLKTLTQDNKNYFKKAIY 296 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + A++ I + +MKN + + I LL+DG++ Q E I Sbjct: 297 -SIRASGQTNIAKGTEIAFNQI------QQRKMKNQVTS---IFLLSDGQD-QGAAEYIQ 345 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIG 446 + I + + + +S SF+ L++ F D +G Sbjct: 346 RQKDVVEDIVTIHSFGYGSDHDA-----ALMSKICKVGQGSFYYIEDVKLLDEFFADALG 400 Query: 447 NEIFE 451 Sbjct: 401 RLSSA 405 >gi|114600323|ref|XP_526928.2| PREDICTED: integrin alpha-2 [Pan troglodytes] Length = 1181 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 45/296 (15%), Positives = 83/296 (28%), Gaps = 28/296 (9%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + + + SI V + +M + F + +Q G Sbjct: 88 STATCEKLNLQTSTSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTG 147 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + S S + P + D S+ + V++ L ++ Sbjct: 148 V----CSDISPDFQLSASFSPATQPCPSLIDVVVVCDESNSIYPW---DAVKNFLEKFVQ 200 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-IVKTFAIDENEMGSTAINDA 342 + G + + + +K IV T + T A Sbjct: 201 GLDIGPTKTQV---GLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGA 255 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 +Q A S+ A K +V++TDGE + D A+ ++ I Sbjct: 256 IQYARKYAYSAASG------GRRSATKVMVVVTDGE-SHDGSMLKAVIDQCNHDNILRFG 308 Query: 403 IAFSV----NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA N + + AS FF + L + +G +IF Sbjct: 309 IAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEK-AGTLGEQIFS 363 >gi|116625274|ref|YP_827430.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228436|gb|ABJ87145.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 320 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 45/236 (19%), Positives = 81/236 (34%), Gaps = 27/236 (11%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVI 282 I S + V NK + D +S +S S+R+ ++K A Sbjct: 60 ITTLPKSAFTVFENKIRQDIRKFKSEDVPVSLGLVIDNSGSMRNKLQKVEAAALA----- 114 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + K N +D V + FND D I ++ + G TA+ DA Sbjct: 115 --LVKASNRDDEVFI--VNFNDTAYLDNPKDKDFTNDIG-ELEQALKRIDARGGTAMRDA 169 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 +Q + D + K KK +V++TDG + I A + I Sbjct: 170 IQMSIDHL-----------KKGHRDKKVLVVITDGNDNSSVINMERIMKNAHQSDVLIYG 218 Query: 403 IAFSVNKTQQE--KARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + +E +A+ L++ A+ F E ++ ++I + Sbjct: 219 VGLLTEEEHREAARAKRALNDLAEATGGKTFFPKDLEE-VDAIASQVAHDIRSQYT 273 >gi|312130321|ref|YP_003997661.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906867|gb|ADQ17308.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 318 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 56/175 (32%), Gaps = 23/175 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ L +I D R+G FN ++ + TI Sbjct: 98 SRIDRAKNELQGLIDRF-SAD------RIGIILFNSNAYLLTPLTFDTENIRNTIGNL-K 149 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + GST + ++ + + ++ K +++TDGE +E Sbjct: 150 THMIDKGSTDFSPMLEMINEKLSVGTQNR----------GKVAIVVTDGETHYQIDE--Q 197 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ-EKARYFLSNCASPNSF--FEANSTHELNKI 440 + + K I + + + + R F + + E + +EL K Sbjct: 198 LAKRLKQNNIHLFWLGVGTLGGGKIPEGRGFKKDAEGRDVVTALEVKNINELAKA 252 >gi|149919617|ref|ZP_01908096.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] gi|149819560|gb|EDM78988.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] Length = 1168 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 39/85 (45%), Gaps = 5/85 (5%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ T ++ + D GSTAI DA+ A +++ ++ + + NN I L+TD Sbjct: 656 TEIDSTAIEEYLEDIEPDGSTAIGDAIDAAVAALMAHDDLDPNSSNNNA-----IFLITD 710 Query: 377 GENTQDNEEGIAICNKAKSQGIRIM 401 GE T +++ A + + Sbjct: 711 GEQTSGDKDVCDALEDAAKDDVPVY 735 >gi|327189769|gb|EGE56913.1| hypothetical protein RHECNPAF_550036 [Rhizobium etli CNPAF512] Length = 533 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 13/78 (16%), Positives = 29/78 (37%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ G+ +T + MP++LG +++DV R S L+ A + + L Sbjct: 7 RRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGARELDGR 66 Query: 72 LEEVSSRAKNSFTFPKQK 89 + ++ Sbjct: 67 DDAITRARTAIEKIANSA 84 >gi|323936560|gb|EGB32847.1| von Willebrand type A protein [Escherichia coli E1520] Length = 565 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 203 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 258 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 259 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG------FIKGGINR--- 304 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 305 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 350 >gi|194221273|ref|XP_001915997.1| PREDICTED: similar to Voltage-dependent calcium channel subunit alpha-2/delta-2 precursor (Voltage-gated calcium channel subunit alpha-2/delta-2) [Equus caballus] Length = 1127 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 231 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 290 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 291 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 343 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 344 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 399 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 400 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 450 Query: 424 SPNSFFE 430 + +FE Sbjct: 451 NKGYYFE 457 >gi|16130205|ref|NP_416773.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|89109088|ref|AP_002868.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|157161758|ref|YP_001459076.1| von Willebrand factor type A domain-containing protein [Escherichia coli HS] gi|238901445|ref|YP_002927241.1| hypothetical protein BWG_2044 [Escherichia coli BW2952] gi|256022046|ref|ZP_05435911.1| hypothetical protein E4_01620 [Escherichia sp. 4_1_40B] gi|300948978|ref|ZP_07163036.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|300956471|ref|ZP_07168759.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|301647634|ref|ZP_07247429.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|307138934|ref|ZP_07498290.1| hypothetical protein EcolH7_12533 [Escherichia coli H736] gi|331642908|ref|ZP_08344043.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] gi|2495629|sp|P76481|YFBK_ECOLI RecName: Full=Uncharacterized protein yfbK gi|1788606|gb|AAC75330.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|85675335|dbj|BAE76678.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|157067438|gb|ABV06693.1| von Willebrand factor type A domain protein [Escherichia coli HS] gi|238860346|gb|ACR62344.1| conserved protein [Escherichia coli BW2952] gi|260448637|gb|ACX39059.1| von Willebrand factor type A [Escherichia coli DH1] gi|300316719|gb|EFJ66503.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|300451549|gb|EFK15169.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|301074238|gb|EFK89044.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|309702582|emb|CBJ01910.1| putative lipoprotein [Escherichia coli ETEC H10407] gi|315136904|dbj|BAJ44063.1| hypothetical protein ECDH1ME8569_2207 [Escherichia coli DH1] gi|331039706|gb|EGI11926.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] Length = 575 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 268 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 269 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG------FIKGGINR--- 314 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 >gi|326789198|ref|YP_004307019.1| hypothetical protein Clole_0061 [Clostridium lentocellum DSM 5427] gi|326539962|gb|ADZ81821.1| Protein of unknown function DUF3520 [Clostridium lentocellum DSM 5427] Length = 670 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 62/171 (36%), Gaps = 21/171 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D S K L++ + + ++K+ D ++ V GA+ Sbjct: 166 PPSNLVFLLDVSGSMSDTNKLPLLKKSFNILTSNLKESDCISIVVYAGASGVVLD----- 220 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 GV +++ GSTA + + AY+ E H +K+ Sbjct: 221 ----GVAGNDESLINEALESLEAGGSTAGAEGIAMAYEL------AEKHFIKDGNNR--- 267 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 ++L TDG+ + I I K + +GI + + + + +K Sbjct: 268 VILATDGDFNVGPNSESDLIRIIEKKREKGIFLSVLGLGMGNYKDDKMESL 318 >gi|52548946|gb|AAU82795.1| conserved hypothetical protein [uncultured archaeon GZfos1C11] Length = 438 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 69/175 (39%), Gaps = 23/175 (13%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSW-GVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D++ D R+G FN S G + + +K ++ + G T ++ MQ A Sbjct: 229 LDHLEDDDRLGLVLFNTGAELAEPVSLVGAKNMQK--LKGDVLEISATGGTRLSAGMQMA 286 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTD-----GENTQDNEEGIAICNKAKSQGIRIM 401 + E N E + I+ LTD G+ ++++ G+ N K+ + Sbjct: 287 TELYDEFLE------VNQSEYENRIIFLTDAMPNSGQTSEESLLGMIEANANKN----VY 336 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 T + + +++ +++ +S + F++R+ +E V + Sbjct: 337 TTFIGIGVDFNTELVEYITKIRG-ANYYSVHSATQ----FKERMDDEFEYMVTPL 386 >gi|312072922|ref|XP_003139287.1| von Willebrand factor domain-containing protein [Loa loa] gi|307765555|gb|EFO24789.1| von Willebrand factor domain-containing protein [Loa loa] Length = 2142 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 52/148 (35%), Gaps = 31/148 (20%) Query: 285 IKKIDNV--NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 IK+I N+ +R+ A F+ + + I+ + G+T+ Sbjct: 928 IKQIPPSAFNNRIRVAAISFSSEAQINFQ--FNEFNNRTEILNALLSLTHSGGNTSSVSG 985 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + A I+ + + ++ IVL++DG + E+ + ++ + + Sbjct: 986 INLAIKEILE---------RGREDVRRMIVLMSDGNSQDCWEDLLDASDRLHATNTIVYA 1036 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFE 430 IA A+P+ +F Sbjct: 1037 IA------------------ANPDYYFR 1046 Score = 38.8 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 53/371 (14%), Positives = 117/371 (31%), Gaps = 33/371 (8%) Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM-NPRKSAYQVV 132 + S A + + + + +R+ R + + A+++ Y V Sbjct: 1212 KTSLAAGVEAALLEIASNQRPSARLVIVIFSSGNNRDTRQLAQSAALKLRETGGDIYAVT 1271 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV--IDFSRSML 190 LS Y+L + +G ++ ++ +++ V V +D S S+ Sbjct: 1272 LSKEYELTEYTGNAAKVYIGNRANNFTKDVSNVILNCKVNRKLPVKHVSVGRLDVS-SLT 1330 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR--DEKLSPYMVSCNKSLYYMLYPGP 248 +E + N + + V + + + L + N+ Y + Sbjct: 1331 GRAISNEMKQGNSSNEGGKKQVNRRRGFTAPLDDKSVESALVAKSLESNRCKYSKMD--- 1387 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 E S+S + V R+ S+I + ID V +G F Sbjct: 1388 -----LEIILDASASRQQVF---EHQRELALSLIERLP-IDAGETHVAVGINSFTSVPTL 1438 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + G+ + + + + GST A++ + + + Sbjct: 1439 RQTL--GLGRDKQMVRHAIEDIKYIGGSTFTAQAVELSVQDLERGRRPDA---------I 1487 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 + +VL+ DG + E+ + K+ G ++ +A + R S + Sbjct: 1488 QVVVLMNDGMSQDPWEKVLEASQLLKATGAKLFGVAL----GESIDLRELKHYIGSTDRI 1543 Query: 429 FEANSTHELNK 439 + NST Sbjct: 1544 YRDNSTERFLA 1554 Score = 37.3 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 12/147 (8%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + LA ++RS++ ID VR+ T F D + K I+ + Sbjct: 666 EVSKKFLAELLRSLQVID-FQSRVRISLTTFTDNAHIEIELRKPTAK--ENILYAVGKLQ 722 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 NE + +++ A+ A I E ++ V+LTDG + Sbjct: 723 NEYSNASVSAAVDVALAQISVPGEG---------PRQRIFVILTDGSTQDSMQTITTAAA 773 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYF 418 K + + + + N ++ E + Y Sbjct: 774 KLRQTDAEVYVVPITENYSKDELSLYV 800 >gi|123283202|emb|CAM24859.1| complement component 2 [Homo sapiens] gi|123857989|emb|CAM25859.1| complement component 2 [Homo sapiens] gi|168983781|emb|CAQ06832.1| complement component 2 [Homo sapiens] gi|168984348|emb|CAQ08707.1| complement component 2 [Homo sapiens] gi|168984415|emb|CAQ09271.1| complement component 2 [Homo sapiens] gi|168985076|emb|CAQ07480.1| complement component 2 [Homo sapiens] gi|168985954|emb|CAQ07110.1| complement component 2 [Homo sapiens] Length = 353 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 181 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 240 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 241 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 300 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 301 ERHAFILQDTKALHQVFE 318 >gi|115496702|ref|NP_001068594.1| collagen alpha-2(VI) chain [Bos taurus] gi|94574217|gb|AAI16098.1| Collagen, type VI, alpha 2 [Bos taurus] gi|296490819|gb|DAA32932.1| collagen, type VI, alpha 2 [Bos taurus] Length = 917 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 44/273 (16%), Positives = 89/273 (32%), Gaps = 30/273 (10%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S + R + G P + + G G P + C+ Y Sbjct: 542 GSKGEPGRKGQKGEPADPGPPGEPGPRGQRGAPGPEGEPGPPGDPGLTECDVMTYVRETC 601 Query: 247 GPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDT-VRMGA 299 G D + FV SS L ++ + +V+ + I D ++T R+G Sbjct: 602 GCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGV 661 Query: 300 TFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++ + + + E G T A++ AY+ +I + + Sbjct: 662 VQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYNKLIKESRRQK 721 Query: 359 HRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 R+ + V++TDG + D+ A+CN + + I ++ ++ Sbjct: 722 TRV--------FAVVITDGRHDPRDDDLNLRALCNHE----VTVTAIGIGDMFHEKHESE 769 Query: 417 YFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 S C P + + +F D + + Sbjct: 770 NLYSIACDKPQ---QVRNMT----LFSDLVAEK 795 >gi|268530440|ref|XP_002630346.1| C. briggsae CBR-CLEC-62 protein [Caenorhabditis briggsae] Length = 386 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 52/157 (33%), Gaps = 10/157 (6%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA--MQTAYD 348 TVR+G +N++ + F I S + A + A Sbjct: 71 DKRTVRVGLVTYNNQATVQADLNR-FQSADDLFNSVFQILPKLSASDEVYLAKGLDAAES 129 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + + ++ + ++ +D + E+ + KS G+ I T+AF + Sbjct: 130 VLSAGRKNATRSNYK----QLVLIYASDYRD-DGEEDPRPTAERMKSSGVSIATVAF--D 182 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 +T E + ASP F + +I I Sbjct: 183 QTGNEGVVKAIGEIASPGFNFTNEDADLVREIQGAMI 219 >gi|291575288|gb|ADE10213.1| hypothetical protein [Actinoplanes liguriensis] Length = 432 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 38/97 (39%), Gaps = 8/97 (8%) Query: 12 KKLIKS---CTGHFFIITALLMP-----VMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 ++LI + G +ITAL+ V+LG+ +++D+ L+ A A Sbjct: 2 RRLIHALFPPRGEHGVITALVAVLAGAGVLLGMAALVIDIGALYAEREQLQSGADAASWK 61 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 + + + A + + + Y RN ++ Sbjct: 62 VAQACAGTAGRDLTSATCTVAAQRDNAQRYADRNAKD 98 >gi|257870337|ref|ZP_05649990.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] gi|257804501|gb|EEV33323.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] Length = 1169 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 63/187 (33%), Gaps = 15/187 (8%) Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 D Q + G+P N + + E +P + L Sbjct: 277 DGQTEWNGEPTNKTNSYIEYGGTGSQADYAIRKYAKETTTPGLFDV--YLNIRGNVQKEI 334 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 L VD S + + V+ + + ++ + + D + MG ++ + Sbjct: 335 APLDLVLVVDWSGSMNDNNRIGEVQKGVDRFVDTLAE-SGITDNIHMGYVGYSSDGYKND 393 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 S + G ++ +KT + G T A++ A + + + N KK Sbjct: 394 SVAMGPFDSVKNAIKTI-TPSSTTGGTFTQKALRDAGNMLATPNGH-----------KKV 441 Query: 371 IVLLTDG 377 IVLLTDG Sbjct: 442 IVLLTDG 448 >gi|224113057|ref|XP_002316375.1| predicted protein [Populus trichocarpa] gi|222865415|gb|EEF02546.1| predicted protein [Populus trichocarpa] Length = 714 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 54/180 (30%), Gaps = 19/180 (10%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMV-SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 A SY + V I+ + N++ L P P S S+ Sbjct: 219 SAASCSNSYDNFTVLVHIKAAATVGRLNPRGNQASLPQLSQTPRAPVDLVTVLDISGSMA 278 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 279 G--TKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFPLRRMSDTGRQHALQ 330 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T I + ++ + E I+LL+DG++T Sbjct: 331 A-VNALVANGGTNIAEGLRKGAKVMEDRRE---------KNPVASIILLSDGQDTYTVSG 380 >gi|114612128|ref|XP_518969.2| PREDICTED: collagen alpha-1(XXVIII) chain [Pan troglodytes] Length = 1125 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 66/174 (37%), Gaps = 18/174 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + S+ I ++ +++ A F+ V DP FS Sbjct: 50 VFIVDSSESSKIVLFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + + + + K ++ Sbjct: 110 SWKDLQTFKQKVKSMNLIGQ---GTFSYYAISNATRLL---------KREGRKDGVKVVL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 L+TDG + N + +I A+ GI +TI S + + +S +S Sbjct: 158 LMTDGIDHPKNPDVQSISEDARISGISFITIGLSTVVNEAK--LRLISGDSSSE 209 >gi|148656885|ref|YP_001277090.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568995|gb|ABQ91140.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 412 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 52/143 (36%), Gaps = 20/143 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++DA+ VI ++ D V F+D V + ++ K + Sbjct: 58 KLAALKDAVKRVIETLTPQDIVA------IVLFDDTVQTLVPATFATDKAT---LIAQVD 108 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK-KYIVLLTDGENTQDNEEGIA 388 E G TA++ M + KN+ + ++LLTDG+ D + A Sbjct: 109 AIEEAGGTAMSGGMAAGIVELR----------KNHDPGRVGAMLLLTDGQTWGDEDRCRA 158 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQ 411 + + G+RI + + Sbjct: 159 LAQELARDGVRITALGLGAEWNE 181 >gi|52548788|gb|AAU82637.1| cell surface protein [uncultured archaeon GZfos18H11] Length = 1359 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 69/175 (39%), Gaps = 23/175 (13%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSW-GVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D++ D R+G FN S G + + +K ++ + G T ++ MQ A Sbjct: 1003 LDHLEDDDRLGLVLFNTGAELAEPVSLVGAKNMQK--LKGDVLEISATGGTRLSAGMQMA 1060 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTD-----GENTQDNEEGIAICNKAKSQGIRIM 401 + E N E + I+ LTD G+ ++++ G+ N K+ + Sbjct: 1061 TELYDEFLE------VNQSEYENRIIFLTDAMPNSGQTSEESLLGMIEANANKN----VY 1110 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 T + + +++ +++ +S + F++R+ +E V + Sbjct: 1111 TTFIGIGVDFNTELVEYITKIRG-ANYYSVHSATQ----FKERMDDEFEYMVTPL 1160 >gi|254517645|ref|ZP_05129701.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226911394|gb|EEH96595.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 979 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 76/246 (30%), Gaps = 66/246 (26%) Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I V+D S SM + + E N + R S N Sbjct: 79 IVLVLDISGSMDE---EIENPCTNKRVRYCTRHSSS--------------------DPNH 115 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 +++ + ++ E + ++ KK ++ A I +K + N+ +G Sbjct: 116 EEWFLSWHRWINDYCVEHNTSGEHNITANNKKIDELKRAANGFIERMKDVPNLK----IG 171 Query: 299 ATFFNDRVISDPSFSWGVHKLIR------------------------TIVKTFAIDENEM 334 ++ +P+ G K+ + + + + + Sbjct: 172 IVAYSSIATINPNSKSGTKKVKSLDSNSSHDVTNYNSLGANFLQSNDSRLTSVINNLEAL 231 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I + M+ A + + A K IVL+TDG T + G K Sbjct: 232 GGTNIGEGMRKAVYML----------DSGDKSASKTIVLMTDGLPTFYSVTGSN-----K 276 Query: 395 SQGIRI 400 + + I Sbjct: 277 NNYMTI 282 >gi|74214868|dbj|BAE33446.1| unnamed protein product [Mus musculus] Length = 1169 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 41/297 (13%), Positives = 91/297 (30%), Gaps = 44/297 (14%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 +Q EA +S + W++ ++ R++ CF + Sbjct: 81 LQVPPEAVNISLGLSLAAATNPSWLLACGPTVHHTCRENIYLTGLCFLLSSSFKQSQNFP 140 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + ++ +++ + S+ + LD + + S R + + Sbjct: 141 TAQQECPKQDQDIVFLIDGSGSISSTDFEKMLDFVKAVMSQLQRPSTRFSLMQF------ 194 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 +D R+ FN+ + + S G T Sbjct: 195 --------------SDYFRVHF-TFNNFISTSSPLS------------LLGSVRQLRGYT 227 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A++ + ++ +A K ++++TDG DN ++ A++ Sbjct: 228 YTASAIKHVITELFTTQSGAR------QDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS 281 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEIFE 451 I I E ++ L AS S F + L I +++ +IF Sbjct: 282 IIRYAIGVG-KAFYNEHSKQELKAIASMPSHEYVFSVENFDTLKDI-ENQLKEKIFA 336 >gi|125973851|ref|YP_001037761.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|256004304|ref|ZP_05429286.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281418012|ref|ZP_06249032.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|125714076|gb|ABN52568.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|255991738|gb|EEU01838.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281409414|gb|EFB39672.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|316939952|gb|ADU73986.1| hypothetical protein Clo1313_0918 [Clostridium thermocellum DSM 1313] Length = 233 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 57/171 (33%), Gaps = 19/171 (11%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 K L++ G F + A+++P ++ L+ V + Y + ++ A + Sbjct: 9 NCKGLVRDRKGSFTVEAAIIIPAVILTMFALILVSEFLYQKSCIQAIADRTAQRGAEIWN 68 Query: 70 QSLEEVSSRAKNSFTFPK--------QKIEEYLIRNFENNLKKNFTDREVRDI------- 114 +++ + + + + + + + Sbjct: 69 SPSKDMIYGQITLDNMDDIDLYWRIWEMSKRKKQKEEKIEKYAGYLLSDSPILGEPIELE 128 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI-KSWLIQTKAEA 164 + VE +V + ++Y NP S LR GI K+ I+ ++A Sbjct: 129 IEAGIVEDYIVYKKLRVSVKAKYK---NPFSSLLRVFGIGKTITIKAHSDA 176 >gi|332223236|ref|XP_003260773.1| PREDICTED: cochlin [Nomascus leucogenys] Length = 550 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 356 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 411 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + Sbjct: 412 T--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGP--------I 461 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D + A + A GI I ++ + + + S Sbjct: 462 RESPNKNFLVIVTDGQSYDDVQGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 516 Query: 423 ASPNSFF 429 ++FF Sbjct: 517 KESHAFF 523 >gi|194209663|ref|XP_001495019.2| PREDICTED: similar to collagen, type XXVIII [Equus caballus] Length = 1127 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 61/166 (36%), Gaps = 16/166 (9%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATF 301 + F+ SS I +D + S+ + ++ V +++ A Sbjct: 38 NDLQDSTCFIDVVFIVDSSESSKIVLFDKQKDFVDSLSDKVFQLTPVRSLKYDIKLAALQ 97 Query: 302 FNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ V D FS W + + VK+ + T A+ A + Sbjct: 98 FSSSVQIDSPFSSWKDLQTFKQRVKSMNLIGQ---GTFSYYAISNATRLLQREGR----- 149 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 K+ + K +L+TDG + N + +I A++ GI +TI S Sbjct: 150 -KDGV---KVALLMTDGIDHPKNPDVQSISEDARTAGILFITIGLS 191 >gi|329923737|ref|ZP_08279132.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328941103|gb|EGG37405.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 654 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 45/134 (33%), Gaps = 18/134 (13%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-NDRVISDPSFSWGVH 317 VD+S K+ V+ A+ S+++ D R+G F N+ + + Sbjct: 473 VDASGSMAARKRMTAVKGAILSLLQ-----DAYEKRDRIGMIAFRNNEAELILPVTRSIE 527 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + G T + D + AY + S R N+ IV TDG Sbjct: 528 AASKQLRS-----IPTGGKTPLADGLAQAYKVLHS-----EKRRNNDTLPVMIIV--TDG 575 Query: 378 ENTQDNEEGIAICN 391 + + + Sbjct: 576 RANESSIGLTVYAD 589 >gi|325954651|ref|YP_004238311.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437269|gb|ADX67733.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 336 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 17/123 (13%), Positives = 34/123 (27%), Gaps = 14/123 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R F + + + + T + A + A + + Sbjct: 129 RAALVIFAADGYTISPLTNDYAAIDSYLGSLSTN-LISNQGTDFSAAFREAVSVLKGAPN 187 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K +VLL+DGE+ + E A I +++I +K Sbjct: 188 TS-----------KLVVLLSDGEDHESGEN--QSIKLANDNQIHVVSIGIGTDKGAPIPV 234 Query: 416 RYF 418 + Sbjct: 235 QSM 237 >gi|256420212|ref|YP_003120865.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035120|gb|ACU58664.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 212 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 52/172 (30%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +I ++++ +T + F+ + + + Sbjct: 21 IEAVKNGVQVLISTLRQDPYALETAFLSLITFDSDARQLVPLT--------DLSSFQMPE 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T++ A++ D+I +K + + + L+TDG T + G+ Sbjct: 73 LKASGGTSLGSALELVADSINREVAKSTPDVKGDWKP--LVFLMTDGIPTDTWQNGLNAF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K GI + L + + + F+ Sbjct: 131 QNTK-IGITV------ACAAGNGADVNLLKQITNTVVSLDTADAATIKAFFK 175 >gi|254780135|ref|YP_003064548.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] gi|254039812|gb|ACT56608.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 74/471 (15%), Positives = 144/471 (30%), Gaps = 83/471 (17%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + F+ +L K G F +ITA+L+ + + ++VD V L++ Sbjct: 1 MFQNKNFLLGVLRLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTALLQEVLDHV 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 I S + L E ++ + Y R+ N + ++ V Sbjct: 61 IYRTSPKNLYDLREAGRDNFIRHQI-EKALNTYNSRDLSN--IGSIESIVKDAVILTKNV 117 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 P + + LS+ L + L +F +S G Sbjct: 118 NSLPLQFTVDIALSTTVQLRGSLLQMFSQSKGK--------------------------- 150 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +D SR + + G + F Y + + P + Sbjct: 151 --VDISRRKKVMYKQNIGLMIMPFAWDG------YWLASRGKVADSQVHRPKYLE----- 197 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y Y L+ + ++F+ ++ + + + + + N Sbjct: 198 YSHYYQQYLNRNTLVKNFLSQIPYKNFCMAPYHYKSIFYWAFENFQYHQDNNM-----LI 252 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++ R +W H I F + N+ G + ++N H Sbjct: 253 FYSHR--DFYRITWD-HFPYSFIKNVFDMTSNQFGDG----------QVLTNTN----HC 295 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-------------GIRIMTIAFSV 407 + KY+++ G + G ++ I I ++ FS Sbjct: 296 FPHGASHSKYMLMFAIGNQLSRSSVGKETIDRILQDCYYMHKHHRTGRGAITIFSVGFSP 355 Query: 408 NKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + + RY L CAS P+ ++E NS + I + N I + T Sbjct: 356 D----QDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVITNWFSQFT 402 >gi|239622779|ref|ZP_04665810.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239514776|gb|EEQ54643.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 401 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + G T I + + +A D + S +E IVL+TDG + D+++ Sbjct: 294 ADATDASGGTDIYEGLLSALDELPSESEASQ--------YTTAIVLMTDGRSNSDHQDEF 345 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K++ + + I +I F Q K+ L S F+ + +L +FR G Sbjct: 346 ESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGR-SGDLAAVFRQAKG 399 >gi|182676519|sp|P0C6B8|SVEP1_RAT RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; Flags: Precursor Length = 3564 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 49/162 (30%), Gaps = 21/162 (12%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--- 305 L L VD SS + ++ + ++ V+ R+ F+ + Sbjct: 79 LSDRLELVFLVDESSSVGQTNFLNELKF-VRKLLSDF---PVVSTATRVAIVTFSSKNNV 134 Query: 306 --VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + S S + + G T A Q A + S E+ Sbjct: 135 VARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTMGAFQQAAQILRHSRENS------ 188 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 K I L+TDG + + IA + G+ I T Sbjct: 189 ----TKVIFLITDGYSNGGDPRPIAA--SLRDFGVEIFTFGI 224 >gi|166367777|ref|YP_001660050.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] gi|166090150|dbj|BAG04858.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] Length = 420 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 50/139 (35%), Gaps = 19/139 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V+ A S+I S+ D R+ F+ R + T++++ Sbjct: 57 KPLETVKKAALSLIESLGVND------RLSVIAFDHRAKVILPSQ--SREDDLTLIRSKI 108 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGI 387 G TAI++ ++ + +I LLTDGEN +N+ + Sbjct: 109 QQLQAGGGTAIDEGIKLGIQE----------SSTGSKGYVSHIFLLTDGENEHGNNQRCL 158 Query: 388 AICNKAKSQGIRIMTIAFS 406 + A GI + T F Sbjct: 159 KLAEVAAEYGITLNTFGFG 177 >gi|170744425|ref|YP_001773080.1| hypothetical protein M446_6382 [Methylobacterium sp. 4-46] gi|168198699|gb|ACA20646.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 482 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 57/474 (12%), Positives = 122/474 (25%), Gaps = 79/474 (16%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 +G+ +I A + ++G+ G+ VD + L AA A + A V + + Sbjct: 20 SGNVSLIVAFSLIPLIGMIGLGVDYGMAVGAKTKLDHAADAAALAAVVTAKAYVAANARN 79 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 + N FT + I + + Y +S+ Sbjct: 80 WNVWDIAVAEGQARAANAFAVNAGSVPFTHFALDPIQLTRSGQTFEATVTYTATVSNN-- 137 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 F GI++ + +A A T SY + +ID S SM Sbjct: 138 --------FGPLFGIRTTAVSGRAVATTSVPSY-----LDFYLLIDVSGSMGLPSTTDGQ 184 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 L + V Q Y ++ ++ + + Sbjct: 185 AQLAALNRIDFFKVYQQGCQ---FACHFPGFVGYDLAVFNNIQLRSGAVNVAVCNLIKRA 241 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 + + ++ D ++ G N V + + Sbjct: 242 AQPEVANQYRVGLYPFITQMGTLQDL--TADTSALNLKAGCAASNPMVFTQLLDT----G 295 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG- 377 + G T ++ + + + ++ K ++ L+TDG Sbjct: 296 ATQLDSNGDPSTGVGSGGTHFETSL---TSMLATIKANGYGDGSTQIKPKPFVFLITDGM 352 Query: 378 ENTQ----------------------DNEEGI-----------AICNKAKSQGIRI---- 400 +N Q + C+ + G+ + Sbjct: 353 QNNQWYSIQINGKRYYSGSPSKFSAYPDANWSPGGSDPVPMDWGYCDTLRQAGVTVSVLL 412 Query: 401 ---------MTIAFSVNKTQQEKA-----RYFLSNCASPNSFFEANSTHELNKI 440 + ++ + CASP F A++ +++ Sbjct: 413 IPYIKIDFTYVKSDIADENNKVNGFSSGLPDVARQCASPGYFQMADTPEAIDRA 466 >gi|293347389|ref|XP_002726583.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Rattus norvegicus] Length = 3578 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 49/162 (30%), Gaps = 21/162 (12%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--- 305 L L VD SS + ++ + ++ V+ R+ F+ + Sbjct: 79 LSDRLELVFLVDESSSVGQTNFLNELKF-VRKLLSDF---PVVSTATRVAIVTFSSKNNV 134 Query: 306 --VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + S S + + G T A Q A + S E+ Sbjct: 135 VARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTMGAFQQAAQILRHSRENS------ 188 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 K I L+TDG + + IA + G+ I T Sbjct: 189 ----TKVIFLITDGYSNGGDPRPIAA--SLRDFGVEIFTFGI 224 >gi|109474969|ref|XP_001065678.1| PREDICTED: polydom [Rattus norvegicus] Length = 3583 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 49/162 (30%), Gaps = 21/162 (12%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--- 305 L L VD SS + ++ + ++ V+ R+ F+ + Sbjct: 79 LSDRLELVFLVDESSSVGQTNFLNELKF-VRKLLSDF---PVVSTATRVAIVTFSSKNNV 134 Query: 306 --VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + S S + + G T A Q A + S E+ Sbjct: 135 VARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTMGAFQQAAQILRHSRENS------ 188 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 K I L+TDG + + IA + G+ I T Sbjct: 189 ----TKVIFLITDGYSNGGDPRPIAA--SLRDFGVEIFTFGI 224 >gi|227819319|ref|YP_002823290.1| hypothetical protein NGR_b10840 [Sinorhizobium fredii NGR234] gi|227338318|gb|ACP22537.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 533 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 35/269 (13%), Positives = 83/269 (30%), Gaps = 27/269 (10%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + G+ +T + MP++LG +++DV R L+ A + + L Sbjct: 9 RFWDDNRGYVIALTLISMPLLLGFSLLVIDVGRTGNLHTDLQNAVDAMALAGARELDGRD 68 Query: 73 EEVSSRAKNSFTFPKQKI---------------EEYLIRNFENNLKK-NFTDR---EVRD 113 + ++ Y N + + + D Sbjct: 69 DAITRADAAIEALANSAAFGGGGNGMSLGSHITVAYDAGNAAGSTVAVTYLKEIPADDDD 128 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + + +P +++Y V+++ + ++F +G I AEA V RS Sbjct: 129 PITASMETTDPNEASYAWVIANDQAMT----TIFPVPVGFNRDTINVAAEAVAVYRSSAC 184 Query: 174 E-HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD---RTVKSYSSQNGKVGIRDEKL 229 + + I + + + + L+ + R ++ +S+ + G + Sbjct: 185 DVTPIYICNPFEAPGNSTEAGNQTAADALHENFAAGNLYGRQIELHSTSSSAPGPGNFGF 244 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + L L G ++ Sbjct: 245 LATYGNGANVLAEALATGSPGVCYKQDAL 273 >gi|192359934|ref|YP_001981670.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686099|gb|ACE83777.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 674 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 38/259 (14%), Positives = 76/259 (29%), Gaps = 21/259 (8%) Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 S + + + +N F + + + + D P Sbjct: 235 AAYSFTRRLLNQGQLPPKDAVRIEEMVNYFDYSYPLPSSAQTPFTTNITVLDSPWKPGNK 294 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + P P + +D S K LV+ ++ ++ ++K D V Sbjct: 295 LLHIGIQGYQLPAGHIPQSNLVFLLDVSGSMDEPSKLPLVKQSMELLLSTLKPEDTVAIV 354 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 V GA V+ + I+ GSTA + + AY Sbjct: 355 VYAGAAG---TVLEPTKV-----REKSKILAALHN-LQAGGSTAGGEGLALAYQ------ 399 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 N + I+L TDG+ +E + +++GI + + F Q Sbjct: 400 --LAEANFNPKGVNR-IILATDGDFNVGQTGDEPLQDFVERKRAKGIYLSVLGFGQGNYQ 456 Query: 412 QEKARYFLSNCASPNSFFE 430 + N ++ + Sbjct: 457 DALMQTLAQNGNGTAAYID 475 >gi|239831900|ref|ZP_04680229.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] gi|239824167|gb|EEQ95735.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] Length = 777 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 63/224 (28%), Gaps = 22/224 (9%) Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + D+ + ADR S + ++ + L Y+ P Sbjct: 314 ETTNDATREITLDGTAAADRDFVLEWSAVANDAPQVGLFREHVGKDDYVLAYVTPPAVAS 373 Query: 251 PSL--SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 E FV +S + +L + ++ D R F+D + Sbjct: 374 AKKAQREVVFVIDNSGSMGGTSIEQAKASLDYALSHLQPGD------RFNVIRFDDTLTR 427 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 S + + F + G TA+ A+ A D N Sbjct: 428 FFEVSVEASQQNIASARHFVMSLEAQGGTAMLPALHAALDDSHQGNGLRQ---------- 477 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 IV LTDGE NE+ + A+ RI + Sbjct: 478 --IVFLTDGE--ISNEQQLLDAIAARRGRSRIFMVGIGTAPNSY 517 >gi|255570578|ref|XP_002526246.1| protein binding protein, putative [Ricinus communis] gi|223534440|gb|EEF36143.1| protein binding protein, putative [Ricinus communis] Length = 513 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 58/178 (32%), Gaps = 24/178 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ +I+ + ID R+ F+ + Sbjct: 78 KIAKLKMAMLFMIKKLSPID------RLSIVTFSTDSTRLCPLRQITENSQKEFENLI-N 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIA 388 G T I ++TA + + + + I+L++DGE NT + + Sbjct: 131 RLKADGWTNITAGLETALKVLNDRSFNGGRVVG--------IMLMSDGEHNTDGDPAEVP 182 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + N + + T F N + + + + + +T+ L K F + Sbjct: 183 LGN------VPVHTFGFGRN-YEPRVLKAVAHK-SIGGTLSDVQNTNNLGKAFSQCLA 232 >gi|221640506|ref|YP_002526768.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] gi|221161287|gb|ACM02267.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] Length = 651 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 34/277 (12%), Positives = 84/277 (30%), Gaps = 26/277 (9%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + + + + +N F + + +P + +L + Sbjct: 227 RAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSVTRTPWNPETRLVHVALQGRM 286 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P L+ +D+S K L++ + ++ ++ D V G+ Sbjct: 287 PAIEDRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPEDQVAIVTYAGSAG--- 343 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 V++ R+ + + + GSTA + + AY M Sbjct: 344 EVLAP------TAANQRSTILSALDRLDAGGSTAGEEGLALAYR--------TASEMAGA 389 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 E + +VL TDG+ + + + G+ + + F + N Sbjct: 390 GEVTR-VVLATDGDFNLGISDPEDLARLVAHERDTGVYLSVLGFGRGNLDDATMQALAQN 448 Query: 422 CASPNSFFEANSTHE--LNKIFRDR---IGNEIFERV 453 ++ ++ + + L I +++ +V Sbjct: 449 GNGQAAYIDSLNEAQKVLVDQLSGALFPIADDVKVQV 485 >gi|148685680|gb|EDL17627.1| integrin alpha M, isoform CRA_b [Mus musculus] Length = 1168 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 145 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 197 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 198 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 252 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 311 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 312 HVFQVDNFEALNTI-QNQLQEKIFA 335 >gi|132626289|ref|NP_001076429.1| integrin alpha-M isoform 1 precursor [Mus musculus] Length = 1154 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 144 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 197 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 251 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 310 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 311 HVFQVDNFEALNTI-QNQLQEKIFA 334 >gi|132626295|ref|NP_032427.2| integrin alpha-M isoform 2 precursor [Mus musculus] gi|148685679|gb|EDL17626.1| integrin alpha M, isoform CRA_a [Mus musculus] gi|162318464|gb|AAI56095.1| Integrin alpha M [synthetic construct] gi|162319590|gb|AAI56992.1| Integrin alpha M [synthetic construct] Length = 1153 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 144 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 197 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 251 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 310 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 311 HVFQVDNFEALNTI-QNQLQEKIFA 334 >gi|74222716|dbj|BAE42227.1| unnamed protein product [Mus musculus] Length = 1064 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 54 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 106 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 107 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 160 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 161 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 220 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 221 HVFQVDNFEALNTI-QNQLQEKIFA 244 >gi|74212839|dbj|BAE33379.1| unnamed protein product [Mus musculus] Length = 967 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 144 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 197 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 251 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 310 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 311 HVFQVDNFEALNTI-QNQLQEKIFA 334 >gi|74213162|dbj|BAE41718.1| unnamed protein product [Mus musculus] Length = 1153 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 144 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 197 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 251 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 310 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 311 HVFQVDNFEALNTI-QNQLQEKIFA 334 >gi|74191759|dbj|BAE32836.1| unnamed protein product [Mus musculus] Length = 1166 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 144 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 197 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 251 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 310 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 311 HVFQVDNFEALNTI-QNQLQEKIFA 334 >gi|74178358|dbj|BAE32446.1| unnamed protein product [Mus musculus] Length = 1153 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 144 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 197 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 251 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 310 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 311 HVFQVDNFEALNTI-QNQLQEKIFA 334 >gi|124956|sp|P05555|ITAM_MOUSE RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: CD_antigen=CD11b; Flags: Precursor gi|52983|emb|CAA30479.1| unnamed protein product [Mus musculus] Length = 1153 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 144 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 197 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 251 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 310 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 311 HVFQVDNFEALNTI-QNQLQEKIFA 334 >gi|303246237|ref|ZP_07332517.1| Protein of unknown function DUF2134, membrane [Desulfovibrio fructosovorans JJ] gi|302492300|gb|EFL52172.1| Protein of unknown function DUF2134, membrane [Desulfovibrio fructosovorans JJ] Length = 382 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 55/168 (32%), Gaps = 18/168 (10%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F + + G +I AL M V+ G + VD Y L+ AA A + + Sbjct: 2 IRFTETRRPPAQAGSVAVIVALSMIVLAGFATLAVDYGFLEYKRSQLQNAADAAALAGAS 61 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L+Q + + + + N D + +++ + P Sbjct: 62 VLVQYGANQEAVTDTAVLYGQ----------------ANLNDSDSKEMAIRNSDVTYPDA 105 Query: 127 SAYQVVLSSRYDLLLNPLSLFL-RSMGIKSWLIQTKAEAETVSRSYHK 173 + + + R NP+ +FL R +G + I A K Sbjct: 106 VSVRATV-GRTQERGNPVEMFLGRILGWNTQDIAATGVAALFCSKSSK 152 >gi|270487809|ref|ZP_06204883.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|270336313|gb|EFA47090.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] Length = 207 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 56/172 (32%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + ++ ++++ +T + F+ + ++ + Sbjct: 21 IEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSARQAVPLT--------DLLNFKLPE 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+TA+ DA+ I + + K + +I +TDG T D +G++ Sbjct: 73 LVANGTTALGDALSLTAKCIGNEVQKTTADTKGDWRPLVFI--MTDGSPTDDWRKGLSDF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + L + + + F+ Sbjct: 131 KAART-GVVV------ACAAGHAVETKVLQEITEIVLQLDTADSSSIKAFFK 175 >gi|156147106|gb|ABU53697.1| CnPolydom [Hydractinia symbiolongicarpus] Length = 551 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 62/179 (34%), Gaps = 18/179 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + +++ ++ N + +G + R+ + + F Sbjct: 15 TGFQISKAFVKALLSEVRVASN-ATRIAIGTFSSDHRINFNYILNPSYANTKCKFNDDFK 73 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNED-EVHRMKNNLEAKKYIVLLTDGE-NTQDNEEG 386 + + T I ++Q AY+ S + D H + K ++LLTDG N N Sbjct: 74 KIKFDGFMTNIKGSLQDAYNVFRSLDSDPVTHSRRPRSN--KVVILLTDGVGNMVGNRVD 131 Query: 387 IAICNKA------KSQG-IRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHEL 437 A + A K G + + T+ + L A+ P+ F + +L Sbjct: 132 SAGADGAPEALRLKQTGYVELYTVGV-----THATDQNMLKKIATDPSLFLYSKDFTDL 185 >gi|313240178|emb|CBY32528.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 64/184 (34%), Gaps = 19/184 (10%) Query: 272 HLVRDALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ L +I ++ I + V M + + + K++R I + Sbjct: 237 EHVKRTLGLMIDNLCDGISPDTNRVAMLRYSSDVKEDLNFIEGSNEPKVMRNIQRLKYKP 296 Query: 331 ENE--MGSTAINDAMQTAYDTIISS-----NEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + GST AM A TI +S N +K E +V++TDGE+ + Sbjct: 297 ITDDRHGSTYTAHAMDKALKTIFTSEAGWRNGTTEDGIKVRTE----VVIITDGESNDPD 352 Query: 384 EEGIAICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 E K K GI++ + K + + + S F S +L F Sbjct: 353 ETFTIQGQKVKYDEYGIKVYALGVGDIKKDEIRQLTSMD----DESIFYLMSWKDL-AAF 407 Query: 442 RDRI 445 I Sbjct: 408 NRII 411 >gi|116626778|ref|YP_828934.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] gi|116229940|gb|ABJ88649.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] Length = 466 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 49/298 (16%), Positives = 94/298 (31%), Gaps = 28/298 (9%) Query: 15 IKSCTGHFFIITA-LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL- 72 + + G I A +++PV+ G+ G +D+ R L AA+ A I A+ LI + Sbjct: 30 VAAREGAISIQLAVIMVPVLFGMMGFALDLGRLYLVRGELNHAAEAAAIAAASHLIGTTG 89 Query: 73 --EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + ++ A+ + T K NL TD V A +P S Sbjct: 90 ALDAATTAAQQTLTLNKYNFGSLTPGEGSGNLTSTITDPAYFSTVAG-ATGNDPNGSQAD 148 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 + + L + L W + + ++ + G+S+ Sbjct: 149 GTTARHVQVSLTADAPLLF------WSLLSAGQSRKTPIAAQALAGISVPL--------- 193 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + F A S + G D+ + +C + L Sbjct: 194 -----CTACGIEPFAIAAKDA--SDLTDFGFGAPADDVHYTFYYNCTGTAPAFLPNSGQS 246 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + + D+ + + L R ++ S + TV +G D V + Sbjct: 247 AAYTIINRYDAGNTTVTDESDQLFRYGAGGLLSS-TTPNPTGSTVPLGCVGIGDSVEA 303 >gi|218709385|ref|YP_002417006.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] gi|218322404|emb|CAV18557.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] Length = 1883 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 37/282 (13%), Positives = 88/282 (31%), Gaps = 34/282 (12%) Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 D+ + L + +S G++ + E P + + S+ + Sbjct: 1196 DHTNVNGEDSLVLQIPVIAKDTSGLTSTGGQISVSIEDDQPVAKNIDISVSPETKS---N 1252 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI--S 308 ++ S K +++ A++ ++ + VR+ FN R Sbjct: 1253 TNVQLILDTSGSMSNSSNGKLAIMKAAVSKMLDQY----HDMGDVRVQLIDFNSRSTRLE 1308 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 +W + +V G T +DA++ A + ++ N Sbjct: 1309 FNGRAWMTVSEAKYLV----NRLTAGGGTDYDDAVKKARQSWDHDEHLQLDNANNVSY-- 1362 Query: 369 KYIVLLTDG-------ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR----- 416 ++DG + T + E N S GI +I + + + + + Sbjct: 1363 ----FISDGKPQDGHDDATISDNEETKWANHLISNGITSQSIGINSSGSLNDSGQIDRVS 1418 Query: 417 YFLSNCASPNSFFEA---NSTHELNKIFRDRIGNEIFERVIR 455 Y + +S + +A + L+ + I + + + Sbjct: 1419 YDGTKGSSSGADTDAIIIKDANGLSDVLTQSIIKTVSGHLTQ 1460 >gi|268561232|ref|XP_002646396.1| Hypothetical protein CBG15365 [Caenorhabditis briggsae] gi|187027192|emb|CAP33692.1| hypothetical protein CBG_15365 [Caenorhabditis briggsae AF16] Length = 381 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 61/161 (37%), Gaps = 15/161 (9%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 I + T R+G +N D ++ K ++ ++ + A Sbjct: 65 AQIGINPSNPKTTRIGMVTYNSNATVDAHLNYDWPK-NNDVLNFYSTMSEISEDSTSYVA 123 Query: 343 --MQTAYDTIISSN--EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +Q A + + S + + H KK I++ + I + N+ K +G+ Sbjct: 124 HGLQAAQNLLYSESFGSNRSH-------YKKVIIVCASTFKGTGKNDPIPVANRLKGRGV 176 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 +I+TIA+ EK L+ +SP F+ ++ L + Sbjct: 177 KILTIAY---DQGDEKVVEELAKISSPGLSFKQDANMTLIE 214 >gi|33985|emb|CAA30160.1| trypsin inhibitor [Homo sapiens] Length = 946 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 74/217 (34%), Gaps = 23/217 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P LDP FV S K +A+ +++ ++ D+ + Sbjct: 292 NGYFVHFFAPDNLDPIPKNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFS---- 347 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 FN + + + + + K I K + G T IN+A+ A + +N Sbjct: 348 --VIDFNQNIRTWRNDLFQLQKHRLQIAKRYIEKIQPSGGTNINEALLRAIFILNEANNL 405 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 + + I+L++DG+ T + I K I + ++ + Sbjct: 406 GLLDPNSVSL----IILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLG-----MGFD 456 Query: 414 KARYFLSNCASPNS-----FFEANSTHELNKIFRDRI 445 FL ++ N + T K F +++ Sbjct: 457 VDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYNQV 493 >gi|329922540|ref|ZP_08278115.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942084|gb|EGG38366.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 595 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 56/152 (36%), Gaps = 20/152 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G + DR+ + + + +T +K F + T ++ + A + Sbjct: 76 KVGIVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYTDMSVGLDEAVKVL-KQGM 134 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT----------QDNEEGIAICNKAKSQGIRIMTIAF 405 D H IV+L DG N + +E+ +AK GI I TI Sbjct: 135 DPAHAP--------MIVVLADGNNDLDPNTGRTSKEASEQLAQAVKEAKGSGIPIYTIGL 186 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + ++ L+N + F +S +L Sbjct: 187 NADGKLNKETLAELAN-QTGGKSFTTSSADDL 217 >gi|73976419|ref|XP_852853.1| PREDICTED: hypothetical protein XP_847760 [Canis familiaris] Length = 642 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 64/172 (37%), Gaps = 16/172 (9%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV---NDTV 295 +L+ + FV SS I +D + S+ + ++ V + Sbjct: 32 NLFARKNDLQDSTCFIDVVFVVDSSESSKIALFDKQKDFVNSLSDKVFQLTPVGFLKYDI 91 Query: 296 RMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 ++ A F+ V DP FS W + + VK+ T A+ A + Sbjct: 92 KLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSM---NFIGQGTFSYYAISNATMLL---- 144 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K +L+TDG + N + +I A++ GI +TI S Sbjct: 145 -----KREGRKDGVKVALLMTDGIDHPKNPDVQSISEDARTAGILFITIGLS 191 >gi|21224547|ref|NP_630326.1| secreted protein [Streptomyces coelicolor A3(2)] gi|3559963|emb|CAA20601.1| putative secreted protein [Streptomyces coelicolor A3(2)] Length = 421 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 47/121 (38%), Gaps = 16/121 (13%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT KT + G T I A+ A D + + + K IVL++DGE+T Sbjct: 114 RTEAKTAVATLSPTGWTPIGPALLKAADDL------------DGGDGSKRIVLISDGEDT 161 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + ++GI I T+ N +++ A+ ++ T EL Sbjct: 162 CAPLDPCEVAREIAAKGIGLTIDTLGLVPNTKMRQQLSCIAE--ATGGTYTSVEHTDELT 219 Query: 439 K 439 Sbjct: 220 D 220 >gi|268573930|ref|XP_002641942.1| Hypothetical protein CBG16647 [Caenorhabditis briggsae] gi|229553908|sp|A8XP79|CL160_CAEBR RecName: Full=C-type lectin protein 160; Flags: Precursor gi|187026423|emb|CAP34559.1| CBR-CLEC-160 protein [Caenorhabditis briggsae AF16] Length = 635 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 62/177 (35%), Gaps = 16/177 (9%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDR---VISDPSFSWGVHKLIRTIVKTFAI 329 +V+ + +++ + + V++G +++ V ++ Sbjct: 305 MVKAEINTLVGQMSLDPEIQKHVQVGLIKYSNESEIVFKPSEYTNEDEFTEDLWTDPRLE 364 Query: 330 DENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D +E ++ A+Q A I S + KK +V+ N + +++ Sbjct: 365 DVDEKTDEVNLHLALQQAAKMIGSMRKG----------VKKVVVIYAASYNDEGDDDARQ 414 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I + G I+T+AF ++ + + ASP F + L D + Sbjct: 415 IAANIRESGYEIITVAFVEPESSSLVMK--IGELASPRMNFTSFRDELLVDELEDAL 469 >gi|148685681|gb|EDL17628.1| integrin alpha M, isoform CRA_c [Mus musculus] Length = 1037 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 145 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 197 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 198 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 252 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 311 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 312 HVFQVDNFEALNTI-QNQLQEKIFA 335 >gi|62286486|sp|Q863A0|CO2_GORGO RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|29690202|gb|AAM10001.1| complement C2 [Gorilla gorilla] Length = 752 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + ++I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRILGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDRIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|309791847|ref|ZP_07686333.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226108|gb|EFO79850.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 542 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 62/173 (35%), Gaps = 21/173 (12%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + ++ ++ V S K + R L S + I D R+G Sbjct: 355 AVKNSWAINRKRADIILVVDISGSMEGDKLEMTRAGLESFLMRILPDD------RVGMIT 408 Query: 302 FN---DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ V++ + S +L I + A G TA+ DA++ A ++ + Sbjct: 409 FSSSATEVVAPAALSENRMQLQMAISEMSAT-----GKTAVFDAVELARQSLEALPSTGE 463 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 RMK IVLL+DG + + G+ I +A+ + + Sbjct: 464 DRMKA-------IVLLSDGADNASRITLADLERNFDETGVSIFPVAYGADADR 509 >gi|26333265|dbj|BAC30350.1| unnamed protein product [Mus musculus] Length = 1036 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 144 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 197 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 251 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 310 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 311 HVFQVDNFEALNTI-QNQLQEKIFA 334 >gi|189237279|ref|XP_973594.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 750 Score = 43.8 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 44/236 (18%), Positives = 75/236 (31%), Gaps = 20/236 (8%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L P FV + L +K + DA+ ++ + + D D VR Sbjct: 280 NGYFVHFFSPSGLKPLPKHVVFVLNHGLTMHGRKIDQLIDAMQKILSELTENDAF-DIVR 338 Query: 297 MGAT--FFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 GAT ++ L + K F ++ I A T YD Sbjct: 339 FGATPSVWDSTRHKFIRLPDLRHYGNLEPYVKKLFLPRTSKAVRQNIEAARSTIYDKSGL 398 Query: 353 SNEDEVHRMKNNL---------EAKKY---IVLLTDG---ENTQDNEEGIAICNKAKSQG 397 + V+ ++ L +Y I+ LTD E I K + Sbjct: 399 GLSNPVYALEVGLFLAKRIQDNLPNRYQPMIIFLTDSYPTVGMTSQNEIINTVTKVNNNR 458 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I I +++F + + + N +EA F I + + +V Sbjct: 459 IPIFSLSFGEDVDKNFMRQLAAKNLGFSGHIYEALDASVQILNFYRSISSPVLSQV 514 >gi|116671476|ref|YP_832409.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] gi|116611585|gb|ABK04309.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] Length = 353 Score = 43.8 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 65/194 (33%), Gaps = 18/194 (9%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G +I A+++ V+LG G + VDV L+ A A + + +S S Sbjct: 21 ERGAVSVIVAIMLVVLLGFGAVAVDVGMLYAERTQLRNGADAAALAVAQKCAKSAPSSSD 80 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 ++ + + + +N+K D R + + + Sbjct: 81 ADCSNTSTLAASLANSNANDGRSNIKSIILDTTNRKVTVTAGAQEKGKSP---------- 130 Query: 138 DLLLNPLSLFL-RSMGIKSWLIQTKAE---AETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 N +SLF R +G+ S + + + VS+ V + + M Q Sbjct: 131 ----NEVSLFFARVLGMNSAEVNAPSTVVWGSPEKGTSPFPITVSVCQVRNSTNIMQLLQ 186 Query: 194 RDSEGQPLNCFGQP 207 + L+C P Sbjct: 187 LHGKNANLDCNYGP 200 >gi|327270780|ref|XP_003220166.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 906 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 67/204 (32%), Gaps = 30/204 (14%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S + H ++ A + I + + G FN + + Sbjct: 308 LDVSGSMRDFNRIHRLKQAAELFLLQIIETGSWA-----GIVVFNSKASTKAFLQQITSD 362 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 +R + + G T+I ++ + + IVLLTDGE Sbjct: 363 SVRQTLSDH-LPTVADGGTSICSGIREGFQVFLQKYSSTEGCE---------IVLLTDGE 412 Query: 379 NTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE- 436 + ++ C + + G I TIA + ++ + ++ F A + + Sbjct: 413 -----DSSVSSCFAEVERSGSTIHTIALGPSAAKELEMLANMTG----GLTFSATDSLDS 463 Query: 437 --LNKIFR--DRIGNEIFERVIRI 456 L F +I ++ I++ Sbjct: 464 NGLIDAFSGISSGSGDISQQSIQL 487 >gi|168036243|ref|XP_001770617.1| predicted protein [Physcomitrella patens subsp. patens] gi|162678138|gb|EDQ64600.1| predicted protein [Physcomitrella patens subsp. patens] Length = 634 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 40/301 (13%), Positives = 82/301 (27%), Gaps = 35/301 (11%) Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 ++ R D T +E+ + Y L + PL + +++ L+ Sbjct: 162 ASYGARGSPDAEGGTKLEVKVTWFQSMTFDNGMYSLRV-PLVFPQEILPLETQLVSIIKV 220 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT----VKSYSSQN 219 ++ + V SM + +R+ L G S Sbjct: 221 KCAINTGTNDYVVVGAF-----GNSMEEAEREPGKVKLKKDGDDWKNQDFIASYKVWSDG 275 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYP---GPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + + P + S + P + F+ S K R Sbjct: 276 IFPNLIYQDGEPEELDSRGSFCLSISPPDPNKIKVFQRAVVFLLDRSGSMYGKPIEDARQ 335 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS-FSWGVHKLIRTIVKTFAIDENEMG 335 A + ++ + + F+ + S I + + G Sbjct: 336 A---LFFALDSLKPEDSF---NIVAFDHELTLFSSQMERATPNAIGWAREWAMTNCTARG 389 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T I +Q A++ + + Y+ L+TDG + IC +S Sbjct: 390 GTDILGPLQQAFNFLEN---------FPWAVP--YVFLITDGA----VSDEQNICLAMQS 434 Query: 396 Q 396 + Sbjct: 435 R 435 >gi|29789036|ref|NP_036843.1| integrin alpha-M [Rattus norvegicus] gi|8917587|gb|AAF81280.1| integrin beta 2 alpha subunit [Rattus norvegicus] Length = 1151 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 67/206 (32%), Gaps = 18/206 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S F+ S +++ +++V+ +K + ++D Sbjct: 144 CPQQESNIAFLIDGSGSINTIDFQKMKEFVSTVMDQFQKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K G T ++ + Sbjct: 197 RTHFTFNDFKRNPDPKSHVRPIRQLNGRTKTASGIRKVVRELFQKINGARDNA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D + +A+ GI I N + ++R L AS Sbjct: 251 KILVVITDGEKFGDPLNYEDVIPEAEEAGIIRYVIGVG-NAFHKPQSRRELDTIASKPAG 309 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 + F+ ++ LN I R+++ +IF Sbjct: 310 DHVFQVDNFEALNTI-RNQLQEKIFA 334 >gi|149067646|gb|EDM17198.1| integrin alpha M [Rattus norvegicus] Length = 1151 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 63/181 (34%), Gaps = 18/181 (9%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++ +++V+ +K + ++D F++ K Sbjct: 169 MKEFVSTVMDQFQKSKTL-----FSLMQYSD--EFRTHFTFNDFKRNPDPKSHVRPIRQL 221 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T ++ + K +V++TDGE D + +A Sbjct: 222 NGRTKTASGIRKVVRELFQKINGARDNA------AKILVVITDGEKFGDPLNYEDVIPEA 275 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIF 450 + GI I N + ++R L AS + F+ ++ LN I R+++ +IF Sbjct: 276 EEAGIIRYVIGVG-NAFHKPQSRRELDTIASKPAGDHVFQVDNFEALNTI-RNQLQEKIF 333 Query: 451 E 451 Sbjct: 334 A 334 >gi|319952790|ref|YP_004164057.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421450|gb|ADV48559.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 348 Score = 43.8 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 21/117 (17%), Positives = 40/117 (34%), Gaps = 13/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + + +++ D TAIN+A++ A + Sbjct: 130 RIGIIAYAAQAYPQLPITTDYS-AAKMFLQSMNTDMLSSQGTAINEAIELASTYYDDETQ 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +++DGE+ KA +GIRI TI K Sbjct: 189 TN-----------RVLFIISDGED-HAEGTTEDAVEKATEEGIRIFTIGVGKEKGAP 233 >gi|62001340|gb|AAX58372.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.8 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 98/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPKATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|116622522|ref|YP_824678.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225684|gb|ABJ84393.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 324 Score = 43.8 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 62/194 (31%), Gaps = 32/194 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K R A A K + ++ FND+ + V ++ + Sbjct: 111 KLQKSRMAAAEFF---KTANPDDEFF---LVEFNDQPKMVVPLTRDVEQIQNQLT----- 159 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G TA+ DA+ A + S ++ +K +++++DG + + Sbjct: 160 WAQSKGRTALLDAIFLAMSELKKSTKN-----------RKALLIISDGGDNSSRYTESEV 208 Query: 390 CNKAKSQGIRIMTIAFSVNKTQ------QEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 N + + I I + L+ + + A +EL I Sbjct: 209 RNLVRENDVLIYAIGVYEFAGGRMRTPEEAGGPGLLTELSEQTGGRHLPA-DANELPDI- 266 Query: 442 RDRIGNEIFERVIR 455 +IG E+ R + Sbjct: 267 AAKIGVELRNRYVL 280 >gi|309364363|emb|CAP25017.2| CBR-CLEC-62 protein [Caenorhabditis briggsae AF16] Length = 393 Score = 43.8 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 52/157 (33%), Gaps = 10/157 (6%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA--MQTAYD 348 TVR+G +N++ + F I S + A + A Sbjct: 78 DKRTVRVGLVTYNNQATVQADLNR-FQSADDLFNSVFQILPKLSASDEVYLAKGLDAAES 136 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + + ++ + ++ +D + E+ + KS G+ I T+AF + Sbjct: 137 VLSAGRKNATRSNYK----QLVLIYASDYRD-DGEEDPRPTAERMKSSGVSIATVAF--D 189 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 +T E + ASP F + +I I Sbjct: 190 QTGNEGVVKAIGEIASPGFNFTNEDADLVREIQGAMI 226 >gi|283779907|ref|YP_003370662.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438360|gb|ADB16802.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1040 Score = 43.8 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 22/105 (20%), Positives = 41/105 (39%), Gaps = 19/105 (18%) Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 AM A + + H +++++DG+ + + I N+ K GI+ Sbjct: 539 EPAMTMALKDLKPNPASVKH-----------MIIISDGDPSPPSG---TILNQYKQAGIK 584 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 I T+A T L N A+ ++ A + L +IF+ Sbjct: 585 ITTVAVG---THGPAGSTPLQNIANATGGKYYVATNPKALPRIFQ 626 >gi|109083237|ref|XP_001114797.1| PREDICTED: cochlin-like isoform 4 [Macaca mulatta] Length = 550 Score = 43.8 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 356 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 411 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + Sbjct: 412 T--YDQRTEFSFTDYSTKEDVLAVIRNIRYMSGGTATGDAISFTVRNVFGP--------I 461 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D + A + A GI I ++ + + + S Sbjct: 462 RESPNKNFLVIVTDGQSYDDVQGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 516 Query: 423 ASPNSFF 429 ++FF Sbjct: 517 KESHAFF 523 >gi|315615538|gb|EFU96170.1| von Willebrand factor type A domain protein [Escherichia coli 3431] Length = 575 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 268 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 269 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG------FIKGGINR--- 314 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 >gi|256784255|ref|ZP_05522686.1| secreted protein [Streptomyces lividans TK24] gi|289768140|ref|ZP_06527518.1| secreted protein [Streptomyces lividans TK24] gi|289698339|gb|EFD65768.1| secreted protein [Streptomyces lividans TK24] Length = 421 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 46/121 (38%), Gaps = 16/121 (13%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT KT + G T I A+ A D + + + K IVL++DGE+T Sbjct: 114 RTEAKTAVATLSPTGWTPIGPALLKAADDL------------DGGDGSKRIVLISDGEDT 161 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + ++GI I T+ N + + A+ ++ T EL Sbjct: 162 CAPLDPCEVAREIAAKGIGLTIDTLGLVPNTKMRRQLSCIAE--ATGGTYTSVEHTDELT 219 Query: 439 K 439 Sbjct: 220 D 220 >gi|114652511|ref|XP_509886.2| PREDICTED: coagulation factor C homolog, cochlin isoform 7 [Pan troglodytes] Length = 540 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 346 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 401 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + Sbjct: 402 T--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGP--------I 451 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D + A + A GI I ++ + + + S Sbjct: 452 RESPNKNFLVIVTDGQSYDDVQGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 506 Query: 423 ASPNSFF 429 ++FF Sbjct: 507 KESHAFF 513 >gi|169624118|ref|XP_001805465.1| hypothetical protein SNOG_15311 [Phaeosphaeria nodorum SN15] gi|111056124|gb|EAT77244.1| hypothetical protein SNOG_15311 [Phaeosphaeria nodorum SN15] Length = 1587 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 82/244 (33%), Gaps = 20/244 (8%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS-EGQPLNCFGQPADRTVKSYSS 217 T++ +V + Y + S ++ + D +G+ C + +Y + Sbjct: 1077 TTRSVGSSVFKYYRAKLATSPGTSVEGFFHLWYCLVDVGDGETTGCTINGPWERLSNYFN 1136 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL----SEEHFVDSSSLRHVIKKKHL 273 ++ G E+ + + PL L + + + + + + Sbjct: 1137 EDISTGGLTEEPCVAKADESGARSGRGMDQPLVFKLTILGNRKSSSSNVVKSPHLSRLDV 1196 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ + I + N +G F + + V + Sbjct: 1197 LKQMFDAFINRLLA---YNFQTHIGLVAFGSKASVAQPITNAVENFRHKLNNML-----A 1248 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G TAI D++ A D + + E A+ I+ ++DGE+T+ N++ + + ++ Sbjct: 1249 SGDTAIWDSIALAQDQLQTYAEKYP-------TARLRIICISDGEDTKSNQDKVNVTSRL 1301 Query: 394 KSQG 397 G Sbjct: 1302 CRNG 1305 >gi|34527040|dbj|BAC85316.1| unnamed protein product [Homo sapiens] Length = 401 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 68/191 (35%), Gaps = 19/191 (9%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S M+ S++ +D SS + ++ + ++++ ++ + D ++ Sbjct: 203 SHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIA 258 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 A F FS+ + ++ G TA DA+ + Sbjct: 259 AVQFT--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGP----- 311 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K ++V++TDG++ D + A + A GI I ++ + + + Sbjct: 312 ---IRESPNKNFLVIVTDGQSYDDVQGPAAAAHDA---GITIFSVGVAWAPL--DDLKDM 363 Query: 419 LSNCASPNSFF 429 S ++FF Sbjct: 364 ASKPKESHAFF 374 >gi|153946957|ref|YP_001399586.1| hypothetical protein YpsIP31758_0593 [Yersinia pseudotuberculosis IP 31758] gi|152958452|gb|ABS45913.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP 31758] Length = 518 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 67/498 (13%), Positives = 143/498 (28%), Gaps = 91/498 (18%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 IK+ G + L+PV +G+ + + + L A + A + S + Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDAIEQASLALSTENNYRND 76 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 S+ N + R + + + + + + +++MN YQ+ L Sbjct: 77 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNEI-LGYTEYNASLQMN-----YQLAL 130 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW------------ 181 + Y + + G + + AE + + + S+ Sbjct: 131 LNSYLKQTPSPTWDVNENGAARKYLS--SIAEPIDVVFVTDFSGSMNLPFGDIELNNRIT 188 Query: 182 VIDFSRS----MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 +D ++ + + ++G F + T + ++ SP + N Sbjct: 189 KLDELKAIFVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGN 248 Query: 238 K---SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND- 293 Y + + + + + K H + + I+K N+ Sbjct: 249 GHYLQRYTASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAI-------LSEIEKKHRDNEI 301 Query: 294 --TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R A F D+ S + I I+ + I+ M D Sbjct: 302 PTKTRDQAKNFLDKAYKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFF 361 Query: 352 SSNEDEVHRM-----------------------------------KNNLEAKKYIVLLTD 376 E + + K +++L+D Sbjct: 362 CLKETNAKSLNFDPNSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMIILSD 421 Query: 377 GENT-------QDNEEGI----------AICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 G++ D + GI +C K K GI+++ I + Sbjct: 422 GDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNIIDWE 479 Query: 420 SNCASPNSFFEANSTHEL 437 +C +F+ A + HEL Sbjct: 480 KDCVGTGNFYLAKNAHEL 497 >gi|149624864|ref|XP_001517479.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1), partial [Ornithorhynchus anatinus] Length = 249 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 46/126 (36%), Gaps = 8/126 (6%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + I+ +D + R+G + V + H+ ++++ E T Sbjct: 50 LTQVIESLDVGPNATRVGVVNYASAVKHEFPLK--AHRSKASLLQAVRRLEPLSTGTMTG 107 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A S E + K +++TDG D ++ A +A+ GI + Sbjct: 108 LAIQFAISRAFSEVEGARPL---SPALSKVAIVVTDGRPQDDVKDVSA---RAREAGIEL 161 Query: 401 MTIAFS 406 I Sbjct: 162 FAIGVG 167 >gi|126662670|ref|ZP_01733669.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] gi|126626049|gb|EAZ96738.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] Length = 347 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 42/116 (36%), Gaps = 15/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + + G TA NDA++ A D Sbjct: 130 RVGIVGYAGSAYPILPMTTDYSIAKMYLQSMNTNMVSSQG-TAFNDAIKLAVDY------ 182 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + K I+L++DGE+ + + AK +G+RI+TI K Sbjct: 183 ------FDVKDTSKLIILVSDGEDHGEGAS--EAIDLAKEKGVRILTIGVGTEKGA 230 >gi|48427894|sp|Q8SQ75|CO2_PONPY RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|19110309|gb|AAL82820.1| complement C2 [Pongo pygmaeus] Length = 752 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 43/138 (31%), Gaps = 8/138 (5%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + +I ++ D T A+ + Y + + E + I+ Sbjct: 313 SRDMTDVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAII 372 Query: 373 LLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ A+ N+ ++ + I I + S Sbjct: 373 LLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDG 432 Query: 425 PNSFFEANSTHELNKIFR 442 F T L+++F Sbjct: 433 ERHAFILQDTKALHQVFE 450 >gi|292619294|ref|XP_692164.4| PREDICTED: integrin alpha-M-like [Danio rerio] Length = 806 Score = 43.8 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 45/132 (34%), Gaps = 11/132 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 T A G T A+Q + + + N K K +V++TDGE + Sbjct: 202 TWESKVANIPYHEGGTFTASAIQKLVNYLFTPNGGTRPSAK------KILVVITDGE-SH 254 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELN 438 D ++A+ I I AR L+ AS + F+ + L Sbjct: 255 DRNLLKDAASQAEKNSIVRFAIGVG-KAFDYYNAREELNTIASDPDTDYVFKVTDFNALK 313 Query: 439 KIFRDRIGNEIF 450 I + GN I Sbjct: 314 NILQKLEGNIIA 325 >gi|114652507|ref|XP_001170996.1| PREDICTED: coagulation factor C homolog, cochlin isoform 3 [Pan troglodytes] Length = 534 Score = 43.8 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 340 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 395 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + Sbjct: 396 T--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGP--------I 445 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D + A + A GI I ++ + + + S Sbjct: 446 RESPNKNFLVIVTDGQSYDDVQGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 500 Query: 423 ASPNSFF 429 ++FF Sbjct: 501 KESHAFF 507 >gi|74212905|dbj|BAE33399.1| unnamed protein product [Mus musculus] Length = 1232 Score = 43.8 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 16/205 (7%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ KK + ++D Sbjct: 144 CPQQESDIVFLIDGSGSINNIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSD--EF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + + + G T ++ + Sbjct: 197 RIHFTFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENA------A 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--N 426 K +V++TDGE D + + +A G+ I + + R + + P Sbjct: 251 KILVVITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGE 310 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 F+ ++ LN I ++++ +IF Sbjct: 311 HVFQVDNFEALNTI-QNQLQEKIFA 334 >gi|225465131|ref|XP_002271188.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 768 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 66/224 (29%), Gaps = 56/224 (25%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ +I+++ D R+ F+ + R Sbjct: 300 SKLSLLKRAVCFLIQNLGPSD------RLSIVSFSSTARRIFPLRR-MSDNGREAAGLAI 352 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE--- 385 G T I + ++ + +E I+LL+DG++T + + Sbjct: 353 NSLTSSGGTNIVEGLKKGVRVLEERSE---------QNPVASIILLSDGKDTYNCDNVNR 403 Query: 386 ---------------------GIAIC-------NKAKSQGIRIMTIAFSVNKTQQEKARY 417 +IC ++ + I + T F + Sbjct: 404 RQTSHCASSNPRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHTFGFGSDHDSTA---- 459 Query: 418 FLSNCA--SPNSFFEANSTHELNKIFRDRIGN--EIFERVIRIT 457 + + S +F S + F IG + + +R+T Sbjct: 460 -MHAISDESGGTFSFIESVATVQDAFAMCIGGLLSVVAQELRLT 502 >gi|221039656|dbj|BAH11591.1| unnamed protein product [Homo sapiens] Length = 237 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 44/116 (37%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T+I ++ A+ I E+H + E ++LLTDGE+ + Sbjct: 89 PTYPLGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VLLLTDGEDNTASS----- 134 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + F ++A +S + F+ ++ L F Sbjct: 135 CIDEVKQSGAIVH---FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFG 187 >gi|158334872|ref|YP_001516044.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305113|gb|ABW26730.1| von Willebrand factor type A domain protein [Acaryochloris marina MBIC11017] Length = 419 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 58/184 (31%), Gaps = 26/184 (14%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 L G + L+ +D S + V++A S+I + D R+ F+ Sbjct: 37 LNNGTRNAPLNLCLILDHSGSMTG-RPLTTVKEAAQSLIDRLNPGD------RIAVVAFD 89 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 +K G TAI+D M+ + + K Sbjct: 90 HHAKVLVP---NQLVEDPEQIKALIQRLEPKGGTAIDDGMKLGIEELA--------VGKQ 138 Query: 364 NLEAKKYIVLLTDGENTQ-DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 ++ + LLTDGEN DN+ A I + T+ F + + L Sbjct: 139 GTISQAF--LLTDGENEHGDNQRCQQFAELAAGYNITLNTLGFGSHWNE-----DVLEGI 191 Query: 423 ASPN 426 A Sbjct: 192 ADSG 195 >gi|171912902|ref|ZP_02928372.1| hypothetical protein VspiD_17020 [Verrucomicrobium spinosum DSM 4136] Length = 652 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 47/138 (34%), Gaps = 21/138 (15%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A ++ + + R+G F R + +I I Sbjct: 115 AKLAAQDLLERL----PND---RVGVIAFAGRSYLQAPLTNDHEAVIECIQSLDHTTI-P 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G ++I A+Q A +TI + H +VL TDG+ T +E + A Sbjct: 167 RGGSSIASAIQLAVETI-DKVKGREHG----------MVLFTDGQET--DEATLTAARMA 213 Query: 394 KSQGIRIMTIAFSVNKTQ 411 +G+ ++ + + Sbjct: 214 AQKGLIVIPVGVGTTEGA 231 >gi|255531386|ref|YP_003091758.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344370|gb|ACU03696.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 344 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 23/127 (18%), Positives = 45/127 (35%), Gaps = 15/127 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + IDN+++ R+G F + + + D TAI A+ Sbjct: 121 QLIDNLHND-RIGIIIFAGEAYVQLPITTDYS-AAKLFLNNITTDIVPTQGTAIGAAIDM 178 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + N K ++L+TDGEN +D+ ++ +A ++ + I I Sbjct: 179 GMKSFNFVNGTS-----------KAMILMTDGENHEDDA--VSAAKRASAKDVAIHVIGV 225 Query: 406 SVNKTQQ 412 + Sbjct: 226 GSEEGAP 232 >gi|194433366|ref|ZP_03065646.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|194418460|gb|EDX34549.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|320178755|gb|EFW53718.1| hypothetical protein SGB_04028 [Shigella boydii ATCC 9905] gi|332090753|gb|EGI95846.1| von Willebrand factor type A domain protein [Shigella dysenteriae 155-74] Length = 575 Score = 43.8 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 268 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 269 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAAKG------FIKGGINR--- 314 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGDDNYNEA 360 >gi|261409467|ref|YP_003245708.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285930|gb|ACX67901.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 595 Score = 43.8 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 55/152 (36%), Gaps = 20/152 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G + DR+ + + + +T +K F + T ++ + A + Sbjct: 76 KVGIVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYTDMSVGLDEAVKVL-KQGM 134 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT----------QDNEEGIAICNKAKSQGIRIMTIAF 405 D H IV+L DG N + +E+ +AK GI I TI Sbjct: 135 DPAHAP--------MIVVLADGNNDLDPNTGRTSKEASEQLNQAVKEAKGSGIPIYTIGL 186 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + ++ L+ + F +S +L Sbjct: 187 NADGKLNKETLAELAK-QTGGKSFTTSSADDL 217 >gi|317491692|ref|ZP_07950127.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316920126|gb|EFV41450.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 416 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 64/465 (13%), Positives = 145/465 (31%), Gaps = 65/465 (13%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + ++ + +G F I ++ +L + ++ R+ L A + A Sbjct: 1 MFMQQPLLASIRRFKQDRSGAFAISFVMMSGFLLSMAAFGLEGSRYITERARLSDAMEQA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + E+ A+ ++T Y+ + D ++ + + Sbjct: 61 ALAL------TAEDNGDGAQRNYTLSSDYFRAYMRHD---------VDVFKPTVIVKSGI 105 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 N + +Y S L + S S+ Q +R + + + Sbjct: 106 SPNNQNLSYVEYRVSGQTLQ----DSWFSSTFFPSFDKQVVIGDNGAARKFRSN--MDVI 159 Query: 181 WVIDFSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 +V DFS SM + + ++ L + SY+ N + + C+ Sbjct: 160 FVTDFSGSMNEGFGGSTKLAELKRIVLKLSDELFSYNIDNKVGFVPFGWGGKEGIDCD-- 217 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + GP+ + + +L I + S+ I N +++ Sbjct: 218 -FPFVSHGPVPSDILAGG--NYKALEKYI-----------DISGSVAAIPNPVHDIQIPL 263 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + N S SW V + G T ++ + + S Sbjct: 264 SNVNGSTCLRNSHSWKVPLTSSLAEINQINGMSAEGGTLVSSGVLLGVPYLAS------- 316 Query: 360 RMKNNLEAKKYIVLLTDG----ENTQDNEEGIAI--CNKAK---SQGIRIMTIAFSVNKT 410 ++K +V+++DG +N I C+K + S + I+F Sbjct: 317 ----GTASRKVMVIVSDGTDDPKNVMITPNLINAGMCDKIRQVLSTDESVGKISFIGIAY 372 Query: 411 QQEKARYFLSNCASPNSFFEANST----HELNKIFRDRIGNEIFE 451 +C +F+ + +L + + +G+ I + Sbjct: 373 YPTVDW---KSCVGDKNFYLPQTIDELEEDLRRAVFEEVGHNILK 414 >gi|237728581|ref|ZP_04559062.1| TerY1 [Citrobacter sp. 30_2] gi|226910059|gb|EEH95977.1| TerY1 [Citrobacter sp. 30_2] Length = 212 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 55/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 21 IEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSARQAVPLT--------DLLSFQMPA 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 73 LTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + + L + + + F+ Sbjct: 131 KAART-GVVV------ACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFK 175 >gi|91773456|ref|YP_566148.1| hypothetical protein Mbur_1491 [Methanococcoides burtonii DSM 6242] gi|91712471|gb|ABE52398.1| hypothetical protein with von Willebrand factor type A domain [Methanococcoides burtonii DSM 6242] Length = 1258 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 56/152 (36%), Gaps = 9/152 (5%) Query: 302 FNDRVISDPSFSWG--VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + D S +W + ++ + + T D + + + +N + Sbjct: 876 WKDTTYSSTELNWKEWRATVTSSLSNNSLVHLSNSIDTITADGLTAIDEGLYEANNELSA 935 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 N+ +VL++DG + + I +AK I T+ N+ + + + Sbjct: 936 ITGNST-----VVLMSDGLDNAGHHSLIEEALRAKEHNTVIYTVGLGNNEDEVDPILCEI 990 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 +N + ++ A ++ L IF I +EI Sbjct: 991 ANI-TGGKYYFAPNSTVLEDIFIG-IASEITN 1020 >gi|84685162|ref|ZP_01013061.1| hypothetical protein 1099457000257_RB2654_09854 [Maritimibacter alkaliphilus HTCC2654] gi|84666894|gb|EAQ13365.1| hypothetical protein RB2654_09854 [Rhodobacterales bacterium HTCC2654] Length = 496 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 9/70 (12%), Positives = 28/70 (40%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G ++ AL++ ++ G+ + D R + + ++ A + A+ L + + Sbjct: 11 RDEGGAVLVVFALMLAILSGIVALSYDFGRAAATQSEMQSFADNVALAAAGELDGGADAL 70 Query: 76 SSRAKNSFTF 85 + + Sbjct: 71 TRAQAAAEQL 80 >gi|256821501|ref|YP_003145464.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795040|gb|ACV25696.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 582 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 56/389 (14%), Positives = 124/389 (31%), Gaps = 51/389 (13%) Query: 51 HALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE 110 + +K + + +++ + V L S + + ++ E R ++ ++ Sbjct: 7 NTVKTSLKISVLASMVALAISGCQNTD--------TGEQRAEQDKRVEIAKTEELQRQKK 58 Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA---ETV 167 ++ V T + V YD+ + +S GI + + + Sbjct: 59 EQEQVVVTGARIMSEAKERNV---PMYDMAMPA----PQSPGIWQQPEVNREQYQHLDES 111 Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQR---DSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 KE VS + + S + +R D P + +Y Q Sbjct: 112 GIFLAKEQPVSTFSIDVDTGSYSNVRRMLNDGYLPPEDAVRLEEFVNYFNYDYQTPDSTE 171 Query: 225 RDEKLSPYMVSCNKSLYYML---------YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + ++ ++ S + L P + + +D S + K LV+ Sbjct: 172 QPFAVNTHVFSAPWNSNAYLMEIGIKGFEPEQQELPPSNLVYLIDVSGSMNSEDKLGLVK 231 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +L + + D ++ V GA+ V+ R ++ + G Sbjct: 232 KSLKLLAQESSDQDRISIVVYAGASG---VVLEP------TKGNDRMAIEQALDRLSAGG 282 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE---NTQDNEEGIAICNK 392 ST ++ AY + +K+ + ++L TDG+ T + E+ I + + Sbjct: 283 STNGGAGIELAYKLAEQA------FIKDGINR---VILATDGDFNVGTINREQLIDLVER 333 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + GI T+ F + Sbjct: 334 KRESGISFTTLGFGSGNYNEHLMEQLADK 362 >gi|162454087|ref|YP_001616454.1| hypothetical protein sce5811 [Sorangium cellulosum 'So ce 56'] gi|161164669|emb|CAN95974.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 907 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 90/340 (26%), Gaps = 58/340 (17%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 G S + A + VS + + +D + R G A Sbjct: 378 GFGSGHGRAGASSPPVSARSAAAYAPEPEVALDPNGRFATTYRPGGGHLAAFEAALARGV 437 Query: 212 VKSYSSQ-NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS---------------- 254 V + + G V R P + L L L P Sbjct: 438 VPAAERELVGDVAARYAPEVPLALDKALGLRADLERAALGPGGGAFHLRLALRSAAAAAA 497 Query: 255 -----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 H V +S R A +++ + D+ + T F+ Sbjct: 498 ARPHLSVHLVLDTSGSMAGAPIDSARRAAQALVDRLAPADDFS------LTTFSSDAEVV 551 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN--EDEVHRMKNNLEA 367 G R ++ E G T I + Y ED V Sbjct: 552 IED--GPVGPRRAAIRRAIEGLREGGGTNIGAGLSLGYAQASRPGIPEDAV--------- 600 Query: 368 KKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 + ++L++DG T +E + A +GI+ + + Q +S AS Sbjct: 601 -RVVLLVSDGRATSGLTHSERLAWLALDAFQRGIQTSALGLGDDFDGQ-----LMSAIAS 654 Query: 425 PNS--FFEANSTHELNKIFRDRIGNE------IFERVIRI 456 + ++ ++ + E +R+ Sbjct: 655 DGAGGYYYLRHPEQIAPALSTELDKRLDPVATAVEVRVRL 694 >gi|116694147|ref|YP_728358.1| hypothetical protein H16_B0192 [Ralstonia eutropha H16] gi|113528646|emb|CAJ94993.1| Hypothetical protein H16_B0192 [Ralstonia eutropha H16] Length = 562 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 57/202 (28%), Gaps = 20/202 (9%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G ++ ALL+ + + +DV + L+ A ++A+ L ++ + A Sbjct: 17 GAISVMAALLIATVAIAALVSIDVGHVFMRQRQLQNVVDLAAMSAAQQLKRADSAANLNA 76 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTD---------------REVRDIVRDT--AVEM 122 T + T A ++ Sbjct: 77 AVLGTVRNIGAKNGYPSGIAMGCGDATGGGADAMTACLGVWDPASGGPKHFSATYEATKV 136 Query: 123 NPRKSAYQVVLSSRYDLLLNPLS---LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 +P Q + +++ S LF ++ S + + + +S+ Sbjct: 137 SPNAVRVQATQTVPILFVIDGGSGRQLFAEAIASGSPPVAAFSLGTGLLDVSTANSMLSV 196 Query: 180 QWVIDFSRSMLDYQRDSEGQPL 201 S S++D+Q Sbjct: 197 LLGNTVSLSLVDWQGLVNTTVT 218 >gi|168700938|ref|ZP_02733215.1| BatA [Gemmata obscuriglobus UQM 2246] Length = 317 Score = 43.8 bits (101), Expect = 0.065, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 55/175 (31%), Gaps = 24/175 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A+ + + +K D G TFF D + + V + + Sbjct: 118 RYDASMKAIDTFLD-FRKGDAF------GLTFFGDAFVHWVPLTTDVTAIRCSPPFMRPE 170 Query: 330 DENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G TAI A+ + +E K IVL+TDG + Sbjct: 171 TVPPPFGGTAIAKALNGCKTELRRRDE-----------GDKMIVLITDGFSYDLTGNDEE 219 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANSTHELNKIFR 442 I ++G+ + I E ++ C + F A+ L +F+ Sbjct: 220 IARTLSAEGVAVFCI----IVGGFEPQAEIVNICRLTGGEAFRADDPDALPAVFK 270 >gi|115623672|ref|XP_785426.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960633|ref|XP_001186588.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 1028 Score = 43.8 bits (101), Expect = 0.065, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 72/217 (33%), Gaps = 27/217 (12%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P L + FV S +K + A +++ ++ ID R Sbjct: 292 DGYFVHYFSPDGLPNTRKNVIFVIDVSGSMYGQKTRQTKRAFTTILDDVRPID------R 345 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYDTI 350 + F+ W +++ K + G T I D++ A + + Sbjct: 346 INIILFSSYAHV-----WREDQMVEATSDNIAAAKRHVNGLSVGGGTNIYDSLMKAVEIL 400 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + H + + I++LTDG+ + + + + + +I F Sbjct: 401 L------EHDTGDAMPL---IIMLTDGQVGNAAAIVRDVTSVIGGR-LSLFSIGFGNGVD 450 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + LSN A +E +S K F D + N Sbjct: 451 FPFLEKLSLSNQALARKVYEDSSASLQMKGFYDEVAN 487 >gi|116695550|ref|YP_841126.1| hypothetical protein H16_B1611 [Ralstonia eutropha H16] gi|113530049|emb|CAJ96396.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 354 Score = 43.8 bits (101), Expect = 0.065, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 61/217 (28%), Gaps = 49/217 (22%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A +I + +VR+G F + ++ I + Sbjct: 106 TRIGASKQAARDLIVGLPA------SVRLGMVSFAATATVVLPPTDNRQDMLDAIDRFQL 159 Query: 329 IDENEMG--------------STAINDAM----------------QTAYDTIISSNEDEV 358 G + + + A + E E Sbjct: 160 QLGTATGSGLIQALAVLFPDDGIDLEAILFSGESLAPGPGGRSLTEAAAADAVRKREQER 219 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + ++LL+DG T + + A +G+R+ T+ F + Sbjct: 220 PAAQPGSYRHGAVILLSDGRRTTGP-DPLDAARMAAQRGVRVYTVGFGTPQGGAAAESGL 278 Query: 419 ----------LSNCA--SPNSFFEANSTHELNKIFRD 443 L A + +F+A S +L++++R Sbjct: 279 SYYMQLDEPALRAVAAITNGEYFQAGSAADLSQVYRQ 315 >gi|332216457|ref|XP_003257368.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Nomascus leucogenys] Length = 1094 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 160 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 219 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 220 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 272 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 273 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 328 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 329 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 379 Query: 424 SPNSFFE 430 + +FE Sbjct: 380 NKGYYFE 386 >gi|297671247|ref|XP_002813757.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Pongo abelii] Length = 1074 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 156 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 215 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 216 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 268 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 269 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 324 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 325 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 375 Query: 424 SPNSFFE 430 + +FE Sbjct: 376 NKGYYFE 382 >gi|297671245|ref|XP_002813756.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Pongo abelii] Length = 1081 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 156 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 215 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 216 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 268 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 269 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 324 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 325 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 375 Query: 424 SPNSFFE 430 + +FE Sbjct: 376 NKGYYFE 382 >gi|297285706|ref|XP_001090735.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Macaca mulatta] Length = 1417 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 492 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 551 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 552 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 604 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 605 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 660 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 661 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 711 Query: 424 SPNSFFE 430 + +FE Sbjct: 712 NKGYYFE 718 >gi|296225305|ref|XP_002758277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Callithrix jacchus] Length = 1251 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 195 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 254 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 255 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 307 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 308 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 363 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 364 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 414 Query: 424 SPNSFFE 430 + +FE Sbjct: 415 NKGYYFE 421 >gi|291290994|ref|NP_001167522.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform c [Homo sapiens] Length = 1150 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 225 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 284 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 337 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 338 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 393 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 394 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 444 Query: 424 SPNSFFE 430 + +FE Sbjct: 445 NKGYYFE 451 >gi|238064101|ref|ZP_04608810.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885912|gb|EEP74740.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 626 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 39/274 (14%), Positives = 83/274 (30%), Gaps = 27/274 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 ++ + + + + + + ++ G G +E+ Y + Sbjct: 373 LLKDAGAQAALLKLGRRPAASVGLTLDNPDLTVFNPAWGIQGTINEQGIQYPSAPVIQAA 432 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT----VRM 297 Y ++ + +D S + A V + N+ T V Sbjct: 433 LDRYQTRYRRPVAAYYCLDGSGSMGDNDGWTGIEAAATQVFDPEQAAQNLLQTHPQDVTT 492 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFA---IDENEMGSTAINDAMQTAYDTIISSN 354 A FN V W V ++ A D G T + + A + Sbjct: 493 VAI-FNGGVTGGSP--WQVRGNDGDALRDLARSVADYEPEGGTNMYACLLRATTELTGQQ 549 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 N + K+ +VL+TDG++ + + + +S + ++ IAF + + Sbjct: 550 ---------NGDRKRLVVLMTDGQSGAEQRD--DALDALRSADVPVVAIAFGRDA-DPRQ 597 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 A+ +F + L R G + Sbjct: 598 LEEVAK--ATNGTFVREDD---LVAALRQAAGYK 626 >gi|119585523|gb|EAW65119.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Homo sapiens] Length = 1146 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 225 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 284 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 337 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 338 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 393 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 394 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 444 Query: 424 SPNSFFE 430 + +FE Sbjct: 445 NKGYYFE 451 >gi|56797873|emb|CAG27570.1| matrilin-4 [Danio rerio] Length = 428 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 53/153 (34%), Gaps = 25/153 (16%) Query: 295 VRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+G ++ +V + S ++++ I + + + M AI AM A+ Sbjct: 59 TRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEEG 118 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + H V++TDG + + + A+ GI I + + Sbjct: 119 ARPNVPHVA----------VIVTDG---RPQDRVAEVAAAARESGIEIYAVGVARADMTS 165 Query: 413 EKARYFLSNCASP---NSFFEANSTHELNKIFR 442 L ASP + F S +L F Sbjct: 166 ------LRAMASPPFEDHVFLVESF-DLIHQFG 191 >gi|2781441|gb|AAB96914.1| alpha 2 delta calcium channel subunit isoform II [Homo sapiens] Length = 1076 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 156 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 215 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 216 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 268 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 269 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 324 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 325 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 375 Query: 424 SPNSFFE 430 + +FE Sbjct: 376 NKGYYFE 382 >gi|54112394|ref|NP_006021.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform b [Homo sapiens] gi|7414316|emb|CAB86192.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] gi|119585522|gb|EAW65118.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Homo sapiens] Length = 1143 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 225 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 284 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 337 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 338 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 393 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 394 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 444 Query: 424 SPNSFFE 430 + +FE Sbjct: 445 NKGYYFE 451 >gi|54112392|ref|NP_001005505.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform a [Homo sapiens] gi|2781439|gb|AAB96913.1| alpha 2 delta calcium channel subunit isoform I [Homo sapiens] gi|3043640|dbj|BAA25484.1| KIAA0558 protein [Homo sapiens] gi|3695006|gb|AAC70914.1| putative tumor suppressor gene 26 protein alpha 2 delta calcium channel subunit [Homo sapiens] gi|119585525|gb|EAW65121.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_d [Homo sapiens] gi|156230959|gb|AAI52439.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Homo sapiens] gi|168267416|dbj|BAG09764.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 isoform b [synthetic construct] Length = 1145 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 225 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 284 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 337 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 338 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 393 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 394 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 444 Query: 424 SPNSFFE 430 + +FE Sbjct: 445 NKGYYFE 451 >gi|74725352|sp|Q9NY47|CA2D2_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|7414318|emb|CAB86193.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] Length = 1150 Score = 43.8 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 225 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 284 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 337 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 338 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 393 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 394 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 444 Query: 424 SPNSFFE 430 + +FE Sbjct: 445 NKGYYFE 451 >gi|326315855|ref|YP_004233527.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] gi|323372691|gb|ADX44960.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] Length = 355 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 61/214 (28%), Gaps = 48/214 (22%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +DA + I + + VR+G F + L + I Sbjct: 107 RLTAAQDAAKAFIAELPR------HVRVGIVAFAGSAQLAQLPTQNHEDLFKAIDSFQLQ 160 Query: 330 DENEMG--------------STA-INDAMQTAYDTIISSNE--DEVHRMKNNLEAKK--- 369 G + A S +E HR N A + Sbjct: 161 RGTATGNGILLSLATLFPDTGIDVSALGGRQAMPRPQSMDEIGRPPHRGSNGRGADRPAP 220 Query: 370 ---------YIVLLTDGENTQDNEEGIAICNKAKSQGIRIM----------TIAFSVNKT 410 I++LTDG+ T + + A +G+R+ TI F Sbjct: 221 VAPGSYSSAAIIMLTDGQRTTG-VDPMEAAQWAADRGVRVYTVGVGTVAGETIGFEGWSM 279 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + L A + +F A + +L K++ Sbjct: 280 RVRLDEDTLKAVAQRTNAEYFHAATAADLKKVYE 313 >gi|91082533|ref|XP_973629.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] gi|270007557|gb|EFA04005.1| hypothetical protein TcasGA2_TC014154 [Tribolium castaneum] Length = 824 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 40/314 (12%), Positives = 98/314 (31%), Gaps = 38/314 (12%) Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 +S++ K + +++ + +++++ D Q +++ + Sbjct: 211 TGNEISKNADKLDPSAKIEIMNSTSALVEFSPDKGKQKEFGQLLGSEKESGLAGQFIVQY 270 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 + + ++ + + P L P FV S +K + +A+ +++ Sbjct: 271 DVERDPQGGEVLVRDGYFVHFFSPSELKPLPKHIVFVLDHSGSMSGRKIDQLIEAMQNIL 330 Query: 283 RSIKKIDNVNDTVRMG--ATFFNDRVISDPSF-SWGVHKLIRTIVKTFAIDENEMGSTAI 339 +K+ D + + VR G A ++ ++ + + ++ + G Sbjct: 331 TDLKETD-LFNIVRFGDLAMVWDVSQNQFTQLPNFNEYGNLEPHLREINLPRAVNG---T 386 Query: 340 NDAMQTAYDTIISSNEDEVHRM-------------KNNLEAKKY---IVLLTDGENTQDN 383 + ++ A I + + M KY IV LTDG Sbjct: 387 EENIEAAKKIIEDKSRLGMTNMMYGLEVGLFLIKRTQEETPDKYQPMIVFLTDGHPNAGM 446 Query: 384 EEGIAICNKAK-----SQGIRIMTIAFSVNKTQQEKARYFLSNCAS-----PNSFFEANS 433 I N + I +++F + FL +S +E++ Sbjct: 447 SGRDEITNTVTSLNSGKKKASIFSLSF-----GDFADKRFLRKISSKNSGFSRHIYESSD 501 Query: 434 THELNKIFRDRIGN 447 + F I Sbjct: 502 ASLQLQDFYRAISA 515 >gi|4758022|ref|NP_004077.1| cochlin precursor [Homo sapiens] gi|205277471|ref|NP_001128530.1| cochlin precursor [Homo sapiens] gi|114652503|ref|XP_001171057.1| PREDICTED: cochlin isoform 6 [Pan troglodytes] gi|7387582|sp|O43405|COCH_HUMAN RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801413|gb|AAC39545.1| Coch-5B2 gene product [Homo sapiens] gi|37182918|gb|AAQ89259.1| COCH [Homo sapiens] gi|58802453|gb|AAW82432.1| coagulation factor C homolog, cochlin (Limulus polyphemus) [Homo sapiens] gi|119586367|gb|EAW65963.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|119586369|gb|EAW65965.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|158258885|dbj|BAF85413.1| unnamed protein product [Homo sapiens] Length = 550 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 356 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 411 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + Sbjct: 412 T--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGP--------I 461 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D + A + A GI I ++ + + + S Sbjct: 462 RESPNKNFLVIVTDGQSYDDVQGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 516 Query: 423 ASPNSFF 429 ++FF Sbjct: 517 KESHAFF 523 >gi|313247257|emb|CBY15545.1| unnamed protein product [Oikopleura dioica] Length = 409 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 76/237 (32%), Gaps = 14/237 (5%) Query: 173 KEHGVSIQWVIDFSRSMLDYQRD-SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + S+ S Y +E + F D + + + Q + ++ Sbjct: 1 MKLIFSLFLSFAKSEGFQKYITLFTEHHFSSEFTVIVDSSSQKETLQIPAENFENVQVLF 60 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 P + + + SS K L ++++ + + ID+ Sbjct: 61 EENDDEIEEISRKRPLLCESRPLDLVILLDSSRSIDEKSWLLQKESVERMASVLFPIDDF 120 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N R+ + S + K F ++ T ++ ++ A+ Sbjct: 121 N--TRISLIRYGYSAFLAHRLSEEQSYPMIKE-KLFRLEHTYEDQTNVHFGLRKAFTEFS 177 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK--AKSQGIRIMTIAFS 406 + D N +KK IV++TDGE T+ IA + AK + I +A Sbjct: 178 TCPRDRR-----NQNSKKAIVIITDGEFTEP---KIAYEDLKIAKKNEVEIFILAIG 226 >gi|284029341|ref|YP_003379272.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808634|gb|ADB30473.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 315 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 37/185 (20%), Positives = 64/185 (34%), Gaps = 28/185 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + ++A +R++ + +G F S + I + Sbjct: 107 RFTVAKEAATEFVRNLP------EQFNVGLVSFARTATVVAPPSTNHQAAVDAIEQLTLT 160 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D STAI +A+ T+ + S + IVLL+DG NT Sbjct: 161 D-----STAIGEAVLTSLQAVRSLDAQAA-----EDPPPARIVLLSDGGNTSG-RPIDEG 209 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNC--ASPNSFFEANSTHELN 438 A G+ + TIA+ + + + R L A+ SF+ A S EL Sbjct: 210 ARAATEAGVPVSTIAYGTPEGTIDLEGRSIPVPADTESLRGLADATSGSFYAAESDEELR 269 Query: 439 KIFRD 443 ++ D Sbjct: 270 DVYSD 274 >gi|22127455|ref|NP_670878.1| hypothetical protein y3581 [Yersinia pestis KIM 10] gi|45442682|ref|NP_994221.1| putative tellurium resistance protein [Yersinia pestis biovar Microtus str. 91001] gi|51597754|ref|YP_071945.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 32953] gi|108809183|ref|YP_653099.1| putative tellurium resistance protein [Yersinia pestis Antiqua] gi|108810630|ref|YP_646397.1| tellurium resistance protein [Yersinia pestis Nepal516] gi|145597696|ref|YP_001161772.1| tellurium resistance protein [Yersinia pestis Pestoides F] gi|150260367|ref|ZP_01917095.1| putative tellurium resistance protein [Yersinia pestis CA88-4125] gi|153949002|ref|YP_001399501.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162420775|ref|YP_001605636.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165928282|ref|ZP_02224114.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165937805|ref|ZP_02226366.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|166008850|ref|ZP_02229748.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211902|ref|ZP_02237937.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167398450|ref|ZP_02303974.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422005|ref|ZP_02313758.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167426387|ref|ZP_02318140.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167467477|ref|ZP_02332181.1| putative tellurium resistance protein [Yersinia pestis FV-1] gi|170022821|ref|YP_001719326.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896945|ref|YP_001874057.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|218927794|ref|YP_002345669.1| putative tellurium resistance protein [Yersinia pestis CO92] gi|229837273|ref|ZP_04457436.1| putative tellurium resistance protein [Yersinia pestis Pestoides A] gi|229840487|ref|ZP_04460646.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842969|ref|ZP_04463120.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. India 195] gi|229900823|ref|ZP_04515947.1| putative tellurium resistance protein [Yersinia pestis Nepal516] gi|294502682|ref|YP_003566744.1| putative tellurium resistance protein [Yersinia pestis Z176003] gi|21960549|gb|AAM87129.1|AE013961_1 hypothetical [Yersinia pestis KIM 10] gi|45437548|gb|AAS63098.1| putative tellurium resistance protein [Yersinia pestis biovar Microtus str. 91001] gi|51591036|emb|CAH22700.1| Putative tellurium resistance protein [Yersinia pseudotuberculosis IP 32953] gi|108774278|gb|ABG16797.1| tellurium resistance protein [Yersinia pestis Nepal516] gi|108781096|gb|ABG15154.1| putative tellurium resistance protein [Yersinia pestis Antiqua] gi|115346405|emb|CAL19277.1| putative tellurium resistance protein [Yersinia pestis CO92] gi|145209392|gb|ABP38799.1| tellurium resistance protein [Yersinia pestis Pestoides F] gi|149289775|gb|EDM39852.1| putative tellurium resistance protein [Yersinia pestis CA88-4125] gi|152960497|gb|ABS47958.1| putative tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162353590|gb|ABX87538.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165914217|gb|EDR32833.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|165919724|gb|EDR37057.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165992189|gb|EDR44490.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206648|gb|EDR51128.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166960142|gb|EDR56163.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167050954|gb|EDR62362.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167054610|gb|EDR64417.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169749355|gb|ACA66873.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699971|gb|ACC90600.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|229682162|gb|EEO78254.1| putative tellurium resistance protein [Yersinia pestis Nepal516] gi|229690035|gb|EEO82093.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. India 195] gi|229696853|gb|EEO86900.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705396|gb|EEO91406.1| putative tellurium resistance protein [Yersinia pestis Pestoides A] gi|262360712|gb|ACY57433.1| putative tellurium resistance protein [Yersinia pestis D106004] gi|262364659|gb|ACY61216.1| putative tellurium resistance protein [Yersinia pestis D182038] gi|294353141|gb|ADE63482.1| putative tellurium resistance protein [Yersinia pestis Z176003] Length = 212 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 56/172 (32%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + ++ ++++ +T + F+ + ++ + Sbjct: 21 IEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSARQAVPLT--------DLLNFKLPE 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+TA+ DA+ I + + K + +I +TDG T D +G++ Sbjct: 73 LVANGTTALGDALSLTAKCIGNEVQKTTADTKGDWRPLVFI--MTDGSPTDDWRKGLSDF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + L + + + F+ Sbjct: 131 KAART-GVVV------ACAAGHAVETKVLQEITEIVLQLDTADSSSIKAFFK 175 >gi|62001378|gb|AAX58391.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|86360582|ref|YP_472470.1| hypothetical protein RHE_PE00308 [Rhizobium etli CFN 42] gi|86284684|gb|ABC93743.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 533 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 32/108 (29%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + G+ +T + MP++LG +++DV R S L+ A + + L Sbjct: 8 RFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGARELDGRD 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + ++ + T D V Sbjct: 68 DAITRAQTAIEKIANSAAFSGGGTGMSLGSNSSVTYEAGNDAGNTVTV 115 >gi|55377967|ref|YP_135817.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] gi|55230692|gb|AAV46111.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] Length = 394 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 50/126 (39%), Gaps = 14/126 (11%) Query: 290 NVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 + +D V + F++ V + + + WG + D + G T + + A Sbjct: 70 DEDDYVSI--IAFDNEVTTVLAPTRWGTISRETAVDA--VADISAGGGTDMYSGLLEAKA 125 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 ++ D+ + ++LL+DG +N+ D E + + ++GIRI Sbjct: 126 SLQDLPTDD--------NTARRVLLLSDGKDNSHDPEAFGTLAREIDTEGIRIKAAGIGS 177 Query: 408 NKTQQE 413 + ++ Sbjct: 178 DYREET 183 >gi|62001372|gb|AAX58388.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|62001460|gb|AAX58432.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|62001356|gb|AAX58380.1| AvrE [Pseudomonas viridiflava] gi|62001370|gb|AAX58387.1| AvrE [Pseudomonas viridiflava] gi|62001380|gb|AAX58392.1| AvrE [Pseudomonas viridiflava] gi|62001382|gb|AAX58393.1| AvrE [Pseudomonas viridiflava] gi|62001444|gb|AAX58424.1| AvrE [Pseudomonas viridiflava] gi|62001468|gb|AAX58436.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|62001338|gb|AAX58371.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|327538080|gb|EGF24770.1| conserved hypothetical protein, secreted [Rhodopirellula baltica WH47] Length = 359 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 8/55 (14%), Positives = 25/55 (45%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 I+ +L+ + + G+L+D+ + ++ + A + L ++ ++R Sbjct: 2 ILVVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGADQTTTR 56 >gi|237737389|ref|ZP_04567870.1| BatB protein [Fusobacterium mortiferum ATCC 9817] gi|229421251|gb|EEO36298.1| BatB protein [Fusobacterium mortiferum ATCC 9817] Length = 322 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 56/184 (30%), Gaps = 28/184 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L ++I+S+K R+G F+D + + I Sbjct: 98 YPNRLEAGKRVLTNLIQSLKGD-------RVGFIPFSDSAYIQMPLTDDYNITQNYINA- 149 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T + A++ A K K +++++DG + + Sbjct: 150 IDTTLISGGGTELYQALELA-----------EKSFKEIGSENKTVIVISDG----GDFDK 194 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ--QEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 ++ + K I + +I + E S +LN F + Sbjct: 195 KSL-DFVKENKIDVYSIGVGTKEGNVIPEYLNGVKRGFIKDESGSAV--ISKLNSDFLQK 251 Query: 445 IGNE 448 I NE Sbjct: 252 ISNE 255 >gi|163758683|ref|ZP_02165770.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283973|gb|EDQ34257.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 587 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 75/231 (32%), Gaps = 28/231 (12%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 IR + S +++ P +D S+ V+ Sbjct: 5 IRFLATLALVCSVTSTMHAQENAVPSAKVGKVMIVLDGSNSMWGQVDGEAKITIAKDVMT 64 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISD---PSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + I N +D V +G + R D K+ R + + G T I+ Sbjct: 65 DL--ISNWDDAVDLGLMVYGHRRKGDCSDIEVVALPGKVNRPALIDKVQSISPRGKTPIS 122 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR- 399 + A ++ K +VL++DG T D + C +AK+ GI Sbjct: 123 KTLLLAATSV------------GYFSGKSSVVLVSDGLETCDADP----CAQAKALGIIN 166 Query: 400 ----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + I F V + + + + + + FF AN+ EL R + Sbjct: 167 PGFDVHVIGFDVTEEEFKSLQCIATE--TGGKFFRANNAEELKDALRRTVA 215 >gi|145552898|ref|XP_001462124.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429962|emb|CAK94751.1| unnamed protein product [Paramecium tetraurelia] Length = 533 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 59/182 (32%), Gaps = 28/182 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K LVR L ++ ++ D R+ F+ +V + ++ + Sbjct: 138 KIKLVRKTLKQMLTFLQPCD------RLCLIMFDCKVYRLTRLMRVTQENVQKF-RVAIS 190 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I + M+ A + HR N I LL+DG + E Sbjct: 191 SLQARGGTDIGNGMKMALSIL-------KHRKYKN--PVSAIFLLSDGVDEGAEERVRD- 240 Query: 390 CNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + + I T F ++ +S A F+ + +++ F + Sbjct: 241 -DLIQYNIRDSFTIKTFGF-----GRDCCPKIMSEIAHYKEGQFYFVPNLTNIDECFAEA 294 Query: 445 IG 446 +G Sbjct: 295 LG 296 >gi|51103165|gb|AAT96307.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|324997883|ref|ZP_08118995.1| hypothetical protein PseP1_03919 [Pseudonocardia sp. P1] Length = 329 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 58/168 (34%), Gaps = 24/168 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS-S 353 V +G F S + R VK + STA +A+ TA +I + S Sbjct: 126 VNLGLVSFAGTAAVLVSPT-----TDRNAVKNGVDNLQLAESTATGEAIFTAMQSIDTFS 180 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDG-------ENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + +VLL+DG + + + A+ +GI + TI+F Sbjct: 181 RSLQGGPDQQGTPPPARVVLLSDGTQTVPGPDGENEPRGSFTAASDAQRRGIPVSTISFG 240 Query: 407 VNKTQQEK---------ARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + E + A S FF A + EL ++ D Sbjct: 241 TSYGSIELDGGRTPVAVDDASMERIASLSGGRFFTAATESELRAVYSD 288 >gi|319649593|ref|ZP_08003749.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] gi|317398755|gb|EFV79437.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] Length = 461 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 42/108 (38%), Gaps = 19/108 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I +++ AY + +K+I+LLTDG++ + + + I Sbjct: 6 SITPGGGTEIFTSLEQAYSELEEL-----------QLQRKHIILLTDGQSATNGDYELLI 54 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTH 435 K + I + T+A Q+ R L + A F++ + Sbjct: 55 -EGGKEKNITLSTVAL-----GQDADRGLLEDLAGMGLGRFYDVTDSS 96 >gi|123718338|emb|CAJ77152.1| collagen type VI alpha 4 [Mus musculus] Length = 1451 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 21/155 (13%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D VR G ++D++IS + + + + + G T A+ ++ Sbjct: 26 DRVRFGVVQYSDKIISQFFLT--QYASMAGLSAAIDNIQQVGGGTTTGKAL----SKMVP 79 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 ++ + +Y++++TDG++T + G+ I I T + Sbjct: 80 VFQNTARI-----DVARYLIVITDGQSTDP---VAEAAQGLRDIGVNIYAIGVRDANTTE 131 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 L AS FF L I ++ I + Sbjct: 132 ------LEEIASKKMFF-IYEFDSLKSIHQEVIRD 159 >gi|62001456|gb|AAX58430.1| AvrE [Pseudomonas viridiflava] gi|62001458|gb|AAX58431.1| AvrE [Pseudomonas viridiflava] gi|62001470|gb|AAX58437.1| AvrE [Pseudomonas viridiflava] gi|62001472|gb|AAX58438.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|320016843|gb|ADW00415.1| putative tellurium resistance protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 212 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 56/172 (32%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + ++ ++++ +T + F+ + ++ + Sbjct: 21 IEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSARQAVPLT--------DLLNFKLPE 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+TA+ DA+ I + + K + +I +TDG T D +G++ Sbjct: 73 LVANGTTALGDALSLTAKCIGNEVQKTTADTKGDWRPLVFI--MTDGSPTDDWRKGLSDF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + L + + + F+ Sbjct: 131 KAART-GVVV------ACAAGHAVETKVLQEITEIVLQLDTADSSSIKAFFK 175 >gi|62001388|gb|AAX58396.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|62001466|gb|AAX58435.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|290981305|ref|XP_002673371.1| predicted protein [Naegleria gruberi] gi|284086954|gb|EFC40627.1| predicted protein [Naegleria gruberi] Length = 353 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 32/157 (20%), Positives = 59/157 (37%), Gaps = 16/157 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 IDN+ + R+G F+D+ + + V L + +K + E GST MQ Sbjct: 20 IDNLREFERLGIVLFDDKAETFLPLTI-VQDLDKKSLKETVLKIKEKGSTNFEAGMQRGI 78 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMT--IA 404 D + + ++ I+ LTD + + I K + G I + I Sbjct: 79 DLFSTLDSSDLSNSNR-------IIYLTDACPNVGGTDTLDILTKDANSGPYNIFSTFIG 131 Query: 405 FSVN-KTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 ++ + + + C ++F ST + KI Sbjct: 132 IGLDFNSDIVEELTQVRGC----NYFSVKSTEDFTKI 164 >gi|62001420|gb|AAX58412.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKKKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|156402479|ref|XP_001639618.1| predicted protein [Nematostella vectensis] gi|156226747|gb|EDO47555.1| predicted protein [Nematostella vectensis] Length = 412 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 71/236 (30%), Gaps = 25/236 (10%) Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC-----N 237 +D S D + D P +T+K + + + + N Sbjct: 153 LDSMGS--DPKPDHVVTADFDKLDPVIQTIKDKACNGIDKEQTESHSASPVDDGALKTKN 210 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK--KHLVRDALASVIRSIKKIDNVNDTV 295 ++ + + +D S K ++ + ++S + + Sbjct: 211 QNRFRSSSVTVCKAKVDVGFLLDGSGSVEFYAKGNFQRCKNFINKFVKSFMVSKDDSHF- 269 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 G F+ S+ F + H +I T + A D + S+ Sbjct: 270 --GLVLFSS--DSNVEFKFDDHYDAASITAAVNATNYPGMGTYAGKGLTLAKDDLYSAP- 324 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +++ + + ++++TDG ++ D + G+ I + N Q Sbjct: 325 -----VRSGVP--RILIVMTDGISSDDVAGP---AKALRDMGVEIFALGIGKNYDQ 370 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 19/137 (13%), Positives = 48/137 (35%), Gaps = 16/137 (11%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++ + ++S + + G F+ S+ F + H +I + Sbjct: 30 KNFINKFVKSFMVSKDDSHF---GLVLFSS--DSNVEFKFDDHYDAASITAAVNATKYPG 84 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 T + A D + S+ +++ + + ++++TDG ++ D + Sbjct: 85 MGTYAGKGLTLAKDDLYSAP------VRSGVP--RILIVMTDGISSDDVAGP---AKALR 133 Query: 395 SQGIRIMTIAFSVNKTQ 411 G+ I + N Q Sbjct: 134 DMGVEIFALGIGKNYDQ 150 >gi|114652501|ref|XP_001171019.1| PREDICTED: coagulation factor C homolog, cochlin isoform 4 [Pan troglodytes] Length = 569 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 375 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 430 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + Sbjct: 431 T--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGP--------I 480 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D + A + A GI I ++ + + + S Sbjct: 481 RESPNKNFLVIVTDGQSYDDVQGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 535 Query: 423 ASPNSFF 429 ++FF Sbjct: 536 KESHAFF 542 >gi|159900457|ref|YP_001546704.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893496|gb|ABX06576.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 831 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 57/167 (34%), Gaps = 19/167 (11%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++++ + G + +A+L VM+ + +D + + A+ A I + Sbjct: 7 TRRVAQPVAGQSLVFSAILFFVMIAFAALAIDTGEAFSRQRQQQAASTAASIAGLESMNS 66 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENN---LKKNFTDREVRDIVRDTAVEMNPRKS 127 ++ +Q I + L N N + ++ + + + Sbjct: 67 EIDGTDGA-------VQQAIRDALAANGITNAVYINGDWGNLDPSQNYYRAFYTQRGSRV 119 Query: 128 AYQVVLSSRYDLLLNPL---------SLFLRSMGIKSWLIQTKAEAE 165 Y V + N L ++F +++GI + + + +A Sbjct: 120 EYPVGSGGQVSTEFNGLRVEVRSARNTIFGQALGIDTLEVSAENKAT 166 >gi|301762312|ref|XP_002916580.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Ailuropoda melanoleuca] Length = 3529 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 50/192 (26%), Gaps = 22/192 (11%) Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 N +E S + + +SSS+ L + Sbjct: 7 NKIQSAANEACLTTSADFASSSQKLAKTPGKEAQRLVFLVDESSSVGQANFLSEL--KFV 64 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDR-----VISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ + R+ F+ + + S + + Sbjct: 65 RKLLSDFPVVPTA---TRVAIVTFSSKNNVVPRVDYISHRRAHQHKCALLSREIPAITYR 121 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A Q A + S E+ K I L+TDG + + +A Sbjct: 122 GGGTYTKGAFQQAAQILRHSRENS----------TKVIFLITDGYSNGGDPRPVAA--SL 169 Query: 394 KSQGIRIMTIAF 405 + G+ I T Sbjct: 170 RDFGVEIFTFGI 181 >gi|147776143|emb|CAN69721.1| hypothetical protein VITISV_014218 [Vitis vinifera] Length = 686 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 66/224 (29%), Gaps = 56/224 (25%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ +I+++ D R+ F+ + R Sbjct: 218 SKLSLLKRAVCFLIQNLGPSD------RLSIVSFSSTARRIFPLRR-MSDNGREAAGLAI 270 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE--- 385 G T I + ++ + +E I+LL+DG++T + + Sbjct: 271 NSLXSSGGTNIVEGLKKGVRVLEERSE---------QNPVASIILLSDGKDTYNCDNVNR 321 Query: 386 ---------------------GIAIC-------NKAKSQGIRIMTIAFSVNKTQQEKARY 417 +IC ++ + I + T F + Sbjct: 322 RQTSHCASSNPRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHTFGFGSDHDSTA---- 377 Query: 418 FLSNCA--SPNSFFEANSTHELNKIFRDRIGN--EIFERVIRIT 457 + + S +F S + F IG + + +R+T Sbjct: 378 -MHAISDESGGTFSFIESVAXVQDAFAMCIGGLLSVVAQELRLT 420 >gi|51103123|gb|AAT96266.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|109899476|ref|YP_662731.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Pseudoalteromonas atlantica T6c] gi|109701757|gb|ABG41677.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Pseudoalteromonas atlantica T6c] Length = 1364 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 53/389 (13%), Positives = 119/389 (30%), Gaps = 53/389 (13%) Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 ++ N +R + + N + Y + L + + + ++ Sbjct: 304 DSQSNNAEERGNGYQSTGSELTFNASSNDYIGLRFRNIALPQGAV-VSNAYLEFTAYQNS 362 Query: 160 TKAEAETVSRSYHKEHGVSI-----QWVIDFSRSML-DYQRDSEGQPLNCFGQPADRTVK 213 + A R+ ++ S + + +++ ++ + P+ ++ Sbjct: 363 SNNSASMTIRAANEADPNSFNNYPRYLLRNKAKTASVNWSGIERWYRNRDYQSPSVASII 422 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + + + ++ + N Y P + F ++ + Sbjct: 423 NQLVERNDWQLGNDMVFILSDFSNTRGAYTYAERPSGAAKLVIEFQGQATPGQTSTVREH 482 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK-TFAIDEN 332 + + + S + DT+ G ++ + + + G + + T+ + T Sbjct: 483 LVSKVDELSAS--GYTPIVDTLYEGVMYYG-GLDVNYGLARGNNSVSNTVRRNTRVSHRL 539 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY----------IVLLTDGE-NTQ 381 D + SSN + IVLL+DGE N Sbjct: 540 SYSGQDATLPSGCEEDNLSSSNCITQQIPQGARYISPISDRQCQVNNHIVLLSDGEANNN 599 Query: 382 DNEEGI-----AIC--------------NKAKSQG------IRIMTIAFSVNKTQQEKAR 416 + + I A C N A S+G I TI F+ N +A Sbjct: 600 HSVDEIESLLSASCTGSGGEKCGLSLVRNVADSEGSVIDSRIITHTIGFAAN----TEAN 655 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRD 443 FL+ A F++A+++ EL F+ Sbjct: 656 SFLNQIALQGGGGFYQADNSQELLGAFQS 684 >gi|325292999|ref|YP_004278863.1| hypothetical protein AGROH133_06427 [Agrobacterium sp. H13-3] gi|325060852|gb|ADY64543.1| hypothetical protein AGROH133_06427 [Agrobacterium sp. H13-3] Length = 579 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 40/300 (13%), Positives = 95/300 (31%), Gaps = 47/300 (15%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +K TG+ I L P+ +G+ + VD + + L+Q + Sbjct: 10 RFLKDSTGNIAISAGLTAPLFIGILALGVDYGYLTLQKRQLQQ---------------TA 54 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFEN-NLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + A + P++ +++Y N N +K V + + + + Y Sbjct: 55 DLAAISAAANAADPEKAVQQYFALNGMNLGVKTPNGLLTVDGLQPFDPLNEFAKSNGYAE 114 Query: 132 VLSSRYD---------------LLLNPLSLFLRSMG---IKSWLIQTKAEAETVSRSYHK 173 V+ Y+ L N + + + G S + + S K Sbjct: 115 VVKGHYEPDATLPVGQRFVENALPTNAMKVNIVEKGHIFFASAFTTPPKISAAGTASSQK 174 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 S+ + LN T S + + + + + ++ Sbjct: 175 IAAFSV----------GSRLASLDEGILNSLLGGLLGTTVSLKVMDYQALVAADVNALHV 224 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 V +L L + + + K ++ A+A++++++ K N ++ Sbjct: 225 VE---ALAVDLNLTAGTYKDVLKTEITYGKFLDALTKTTGLQPAVANILKTLAKTANKSN 281 >gi|62001474|gb|AAX58439.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|310825891|ref|YP_003958248.1| hypothetical protein ELI_0266 [Eubacterium limosum KIST612] gi|308737625|gb|ADO35285.1| predicted protein [Eubacterium limosum KIST612] Length = 838 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 43/308 (13%), Positives = 84/308 (27%), Gaps = 36/308 (11%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 + T + +T+ SI V+D S SM + + Sbjct: 88 TLEAVTGQDIKTIGGKSG-----SIVLVLDNSGSMGWGSSPTPADYARDALKEFANEFLK 142 Query: 215 YSSQNGKVGIRDEKLSPYM--VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + K+G+ + + Y+ + S VD Sbjct: 143 NGNSGNKLGLVTYSSGSGVPIYDAYDEIKYVQGYSMTENSDIFGQVVDGLQTPSGETDVQ 202 Query: 273 LVRDALASVIRSIKKIDNVNDTVRM-GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + ++ + + V GAT + R S G + T Sbjct: 203 MGIKTARDILAADTSGNPQFILVFSDGATNRSARPTSAGELG-GANISPCTFGDKIYDMT 261 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + Y+ V T+G ++ Sbjct: 262 FKFDGFDYGAQGSAVYNDG---------------------VYTTNGNVNAHTVAAVSEAL 300 Query: 392 KAKSQGIRIMTIAF---SVNKTQQEKARYFLSNCASPNSFFEAN--STHELNKIFRDRIG 446 AK QGI I ++ + ++N + + + N AS + E + + +IF + I Sbjct: 301 LAKDQGIDIYSVFYHNPALNDLEYGAGVFVMKNSASSGQYTEISPGNAGAFAEIFTE-IE 359 Query: 447 NEIFERVI 454 +I E + Sbjct: 360 KQIQESIA 367 >gi|291395817|ref|XP_002714337.1| PREDICTED: complement factor B-like [Oryctolagus cuniculus] Length = 764 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 52/186 (27%), Gaps = 22/186 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFF----NDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 A ++ I+K+ + R G + N V S + + + + D Sbjct: 289 AKRCLVNLIEKVASYGVRPRYGLVTYATYPNVLVRVSDPKSSDANWVTEKLNQISYEDHK 348 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A+ Y+ + + + + I+L+TDG + + + + N+ Sbjct: 349 LKTGTNTKRALVEVYNMMSWPGDVPP---EGWNRTRHVIILMTDGLHNMGG-DPVTVINE 404 Query: 393 AKS-------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + + + S + F L Sbjct: 405 IRDLLNIGKDRKNPREDYLDVYVFGVG-PLVEPANINALASKKENEQHVFRVKDMEHLED 463 Query: 440 IFRDRI 445 +F I Sbjct: 464 VFFQMI 469 >gi|261876471|dbj|BAI47561.1| collagen type VI alpha 1 subunit [Mesocricetus auratus] Length = 1026 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 84/226 (37%), Gaps = 38/226 (16%) Query: 236 CNKSLYYMLYPGPLD----PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 C + + P P+ P S S+ +K + D + S + IDN+ Sbjct: 15 CWVASQDIQGPKPIAYQDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRF--IDNL 72 Query: 292 ND-TVR--------MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAIND 341 D R GA ++D V + R +K G T + Sbjct: 73 RDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSG--RDELKASVDAVKYFGKGTYTDC 130 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQG 397 A++ + ++ ++L+ KY++++TDG E ++ G+ N+AK G Sbjct: 131 AIKKGLEELLIGG--------SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLG 182 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 +++ ++A + + + LS A+ +++ F A + Sbjct: 183 VKVFSVAITPDHLEPR-----LSIIATDHTYRRNFTAADWGQSRDA 223 >gi|115450663|ref|NP_001048932.1| Os03g0142500 [Oryza sativa Japonica Group] gi|108706121|gb|ABF93916.1| zinc finger family protein, putative, expressed [Oryza sativa Japonica Group] gi|113547403|dbj|BAF10846.1| Os03g0142500 [Oryza sativa Japonica Group] gi|125584872|gb|EAZ25536.1| hypothetical protein OsJ_09360 [Oryza sativa Japonica Group] gi|215712380|dbj|BAG94507.1| unnamed protein product [Oryza sativa Japonica Group] Length = 694 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 82/257 (31%), Gaps = 38/257 (14%) Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 N + S K R + KSL + PLD + Sbjct: 199 NAVTNSVEIKTYSEFPAIQKSERRKVFAILIHLKAPKSLDSVSSRAPLDLVTVLDVSGSM 258 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + K L++ A++ VI+++ D R+ F+ + Sbjct: 259 SGI-----KLSLLKRAMSFVIQTLGPND------RLSVVAFSSTAQRLFPLRRMTLTGRQ 307 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ G T I DA++ + + R KN + + I+LL+DG++T Sbjct: 308 QALQAI-SSLVASGGTNIADALKK------GAKVVKDRRRKNPVSS---IILLSDGQDTH 357 Query: 382 DNEEGIAICNKA----------KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFF 429 G A N + S ++I T F + + A S +F Sbjct: 358 SFLSGEADINYSILVPPSILPGTSHHVQIHTFGFGTDHDSAA-----MHAIAETSNGTFS 412 Query: 430 EANSTHELNKIFRDRIG 446 ++ + F +G Sbjct: 413 FIDAEGSIQDAFAQCMG 429 >gi|297526263|ref|YP_003668287.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297255179|gb|ADI31388.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 416 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 53/171 (30%), Gaps = 22/171 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A ++ ++ D +G F + V R V+ I Sbjct: 55 KIFRAKQAALGLLDILRDKD------YVGVYGFAGKFYKVLEP---VPATKRGEVERAII 105 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T I YDT+ E+ ++N + I+ +TDGE T + I Sbjct: 106 SLKLGSGTNI-------YDTLKKLVEETKKVLQNGALSLVRIIFITDGEPTVGKKNPKKI 158 Query: 390 CNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 AK G + I ++ +R + A F + L Sbjct: 159 LEMAKKLREAGASALIIGVGTEYNEKLLSRMAM---ALNGEFEHISDPASL 206 >gi|290986713|ref|XP_002676068.1| predicted protein [Naegleria gruberi] gi|284089668|gb|EFC43324.1| predicted protein [Naegleria gruberi] Length = 413 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 60/160 (37%), Gaps = 16/160 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 IDN+ D R+G F+D+ + + V L + +K + E GST M+ Sbjct: 82 IDNLKDFERLGIVLFDDKAETFLPLTI-VQDLEKKSLKERVMKITEKGSTNFEAGMKRGI 140 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIA 404 D + + ++ I+ LTD + + + K + GI I Sbjct: 141 DLFSTMDSSDLSNSNR-------IIYLTDACPNVGGTDSLDVLTKDANSGPYGILSTFIG 193 Query: 405 FSVN-KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ ++ + + C ++F S+ E KI + Sbjct: 194 IGLDFNSEIVEELTKVRGC----NYFSVKSSEEFKKILNE 229 >gi|261876473|dbj|BAI47562.1| collagen type VI alpha 2 subunit [Mesocricetus auratus] Length = 1026 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 78/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 596 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 655 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 656 AKDPKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALK 715 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AY+ +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 716 FAYNQLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 763 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I + ++ S C P + + +F D + + Sbjct: 764 IGIGDMFHETHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 803 >gi|281427229|ref|NP_001094211.1| collagen, type VI, alpha 2 [Rattus norvegicus] gi|149043684|gb|EDL97135.1| procollagen, type VI, alpha 2, isoform CRA_a [Rattus norvegicus] Length = 1027 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 78/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 597 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 656 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 657 AKDPKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALK 716 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AY+ +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 717 FAYNQLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 764 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I + ++ S C P + + +F D + + Sbjct: 765 IGIGDMFHETHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 804 >gi|22203747|ref|NP_666119.1| collagen alpha-2(VI) chain precursor [Mus musculus] gi|125987813|sp|Q02788|CO6A2_MOUSE RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|21706759|gb|AAH34414.1| Collagen, type VI, alpha 2 [Mus musculus] gi|148699895|gb|EDL31842.1| procollagen, type VI, alpha 2 [Mus musculus] Length = 1034 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 78/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 604 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 663 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 664 AKDPKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALK 723 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AY+ +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 724 FAYNQLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 771 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I + ++ S C P + + +F D + + Sbjct: 772 IGIGDMFHETHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 811 >gi|49809|emb|CAA46541.1| alpha-2 collagen [Mus musculus] Length = 1029 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 78/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 599 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 658 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 659 AKDPKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALK 718 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AY+ +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 719 FAYNQLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 766 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I + ++ S C P + + +F D + + Sbjct: 767 IGIGDMFHETHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 806 >gi|49907|emb|CAA44206.1| alpha-2 collagen type VI, subunit [Mus musculus] Length = 764 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 78/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 334 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 393 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 394 AKDPKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALK 453 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AY+ +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 454 FAYNQLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 501 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I + ++ S C P + + +F D + + Sbjct: 502 IGIGDMFHETHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 541 >gi|310657503|ref|YP_003935224.1| hypothetical protein CLOST_0189 [Clostridium sticklandii DSM 519] gi|308824281|emb|CBH20319.1| exported protein of unknown function [Clostridium sticklandii] Length = 466 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 45/128 (35%), Gaps = 14/128 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI----SDPSFSWGVHKLIRTIV 324 + ++R+ ++I + N + +G ++D + G + + Sbjct: 224 SRISILRNRAKALIDQFSGLGN----IYVGIIPYSDDAYISGTKSFVLANGTNV---NTI 276 Query: 325 KTFAIDENEMGSTAINDAMQTAY---DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 K G T DAM+ +Y S + + + + Y++LL+DG+ T Sbjct: 277 KNKIDSLTAQGMTNTGDAMRVSYYATKQFKDSPNSIDNTLPTDTKVIPYMILLSDGDPTV 336 Query: 382 DNEEGIAI 389 + Sbjct: 337 FSATTREW 344 >gi|260781663|ref|XP_002585923.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] gi|229270991|gb|EEN41934.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] Length = 2692 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 57/169 (33%), Gaps = 18/169 (10%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+D + V+ + R+G F+D + ++ S + +++ + + Sbjct: 1653 DLVKDFASGVVSEFQISTTE---TRVGVVQFSDTLRTEFFMS--SFSTKQQVLQAISDID 1707 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G+T A+ A + S+ N +++++TDG + + Sbjct: 1708 YIQGNTLTGAAITFATASSFSTP------AGNRANFPDFMIVVTDG---LSQDSVVQPAQ 1758 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 A+ QGI I + + L P + +L Sbjct: 1759 SARDQGITIFAVGV----GNEVDFATLLQITGVPEYILQVTDFSDLLAA 1803 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 61/214 (28%), Gaps = 15/214 (7%) Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + C + Q + + + + PL Sbjct: 1039 EGNTVTFGCNFGFLLSGTDNTQCQADGSWSNPLPVCIAITTPAPTQAPACNDFPLFNGTD 1098 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +D S LV+ +V+++ D R+ ++D+ ++ S Sbjct: 1099 LVFLLDGSGSVGSNN-FDLVKTFTKNVVQNF---DISETATRVAVVQYSDQFSTEFSL-- 1152 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + G T A+ ++ +S E +V++ Sbjct: 1153 NAFSTKTEVYNAIDNISYLTGGTFTGFAIDFVMQSVFTSISGERDG------YPDLLVVV 1206 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 TDG +T D + A++QG+ I + + Sbjct: 1207 TDGLSTDDVSGP---ADTARAQGVTIYAVGVGSD 1237 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 17/114 (14%), Positives = 48/114 (42%), Gaps = 15/114 (13%) Query: 295 VRMGATFFNDRVISDPSF-SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA-YDTIIS 352 R+G ++D + + S+ ++L+ ++ + + G T A+ ++ + Sbjct: 1417 TRVGVVQYSDSPTLEFNLNSFNTNELVDLAIR---NIQYQQGGTNTGQAIDFVRVNSFSA 1473 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 +N D + ++++TDG Q +++ + A++ GI + + Sbjct: 1474 NNGDR-------SDVPNVMIVVTDG---QSSDDVVGPAQTARNAGISMYAVGIG 1517 >gi|167526046|ref|XP_001747357.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774192|gb|EDQ87824.1| predicted protein [Monosiga brevicollis MX1] Length = 889 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 55/152 (36%), Gaps = 14/152 (9%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + ++MGA F+DRV F+ + +V T + G++++ +A+ Sbjct: 589 DSGVIQMGAFSFDDRVQPISRFT-----SVEVLVDTLQSAKWTGGASSLAEALSFTGARY 643 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + ++ +V+L DG + + + G+ I IA + Sbjct: 644 FQD-------LSLPADHRRILVILLDGAADDTVAQVARQATQLRQMGVTIFAIAVQDSND 696 Query: 411 QQEKARYFLSNCASPNSFF--EANSTHELNKI 440 Q++ ++ +S + + L Sbjct: 697 VQQQEAQLMAAVSSTVEYHLMRVPNMEALMDA 728 >gi|7766811|pdb|1CK4|A Chain A, Crystal Structure Of Rat A1b1 Integrin I-Domain. gi|7766812|pdb|1CK4|B Chain B, Crystal Structure Of Rat A1b1 Integrin I-Domain Length = 198 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 65/188 (34%), Gaps = 16/188 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + +A + +K++D ++G + + V + + +V I Sbjct: 17 IYPWESVIAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAANKIG 75 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T + TA + +K K +V++TDGE + DN + Sbjct: 76 RQGGLQTMTALGIDTARKEAFTEARGARRGVK------KVMVIVTDGE-SHDNYRLKQVI 128 Query: 391 NKAKSQGIRIMTIAFSVNKTQ----QEKARYFLSNCAS---PNSFFEANSTHELNKIFRD 443 + + I+ +IA + + EK + + AS FF + L I + Sbjct: 129 QDCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVK- 187 Query: 444 RIGNEIFE 451 +G IF Sbjct: 188 ALGERIFA 195 >gi|332088403|gb|EGI93521.1| von Willebrand factor type A domain protein [Shigella boydii 5216-82] Length = 575 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 268 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 269 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAAKG------FIKGGINR--- 314 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGDDNYNEA 360 >gi|284990593|ref|YP_003409147.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] gi|284063838|gb|ADB74776.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] Length = 318 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 71/186 (38%), Gaps = 25/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A + + D + +G F + + + ++ I Sbjct: 106 SRFEAMQVAAKEFVDVLP------DRINLGLVSFAGTATTVVTPTTDRGQVSTAIDN--- 156 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 STAI +A+ T+ I + + + + + IVLL+DG NT + A Sbjct: 157 --LELAESTAIGEAVFTSLTAIENF-QSSLDADGEEVPPAR-IVLLSDGYNTVGRPDTQA 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA--SPNSFFEANSTHEL 437 + + A GI + TIAF + + R L A + S+ EA S EL Sbjct: 213 V-SAALDAGIPVSTIAFGTDYGTLDLDGERVPVPVDRATLEEIADQTGGSYSEAASAAEL 271 Query: 438 NKIFRD 443 ++++D Sbjct: 272 EQVYQD 277 >gi|295698036|ref|YP_003602693.1| putative tellurium resistance protein TerY [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295060148|gb|ADF64885.1| putative tellurium resistance protein TerY [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 197 Score = 43.4 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 55/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 6 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLT--------DLLSFQMPA 57 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 58 LTASGTTSLGEALTLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 115 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + + L + + + F+ Sbjct: 116 KAART-GVVV------ACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFK 160 >gi|297191182|ref|ZP_06908580.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197718543|gb|EDY62451.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 424 Score = 43.4 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 26/121 (21%), Positives = 46/121 (38%), Gaps = 16/121 (13%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT K G T I A+ A + + +A + IVL+TDGE+T Sbjct: 114 RTEAKAAVATLAPTGWTPIGPALLGAAEDLE------------GGDATRRIVLITDGEDT 161 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + ++GI I T+ + + + R A+ ++ T EL+ Sbjct: 162 CAPLDPCEVAREIAAKGIHLVIDTLGLVPDAKTRTQLRCIAE--ATGGTYTSVQHTDELS 219 Query: 439 K 439 Sbjct: 220 D 220 >gi|321460553|gb|EFX71594.1| hypothetical protein DAPPUDRAFT_326977 [Daphnia pulex] Length = 1000 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 37/279 (13%), Positives = 93/279 (33%), Gaps = 25/279 (8%) Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 S+ + ++ ++D S++ + +D + T+ S + + Sbjct: 243 PSFKPDTSLASYHLLD---SVVHFCKDGDFHSHRSETPNKHNTMCGGVSTWTVIMRNPDF 299 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + + + +D S + + +++ + I+ + Sbjct: 300 AFYHNRPVDLDQLPAEFKIVKPTGSRFVVVMDVSDSMKQCNRIDKLGESVRAWIK--NDV 357 Query: 289 DNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + +G F+ K+ + ++K D + T I + A Sbjct: 358 PTGSQ---LGMVMFSSTAHIVSELQVISDMKIRQEMMKKVPKDLYSI--TCIGCGLDLAV 412 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + + K N + IVL+TDG N+ + + GIR++T+AF Sbjct: 413 QML---------QEKGNNKTGGIIVLVTDGRNSAGYLDISDVEEDIVKAGIRVVTVAFG- 462 Query: 408 NKTQQEKARYFLSNCASPNSFF--EANSTHELNKIFRDR 444 ++ + L++ S++ + +S+ L + F Sbjct: 463 --SEADSNIERLADVTGGKSYYIKDGDSSEALQRAFTGA 499 >gi|319784283|ref|YP_004143759.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170171|gb|ADV13709.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 407 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 37/97 (38%), Gaps = 3/97 (3%) Query: 9 FYSKKLIKSCTGHFFIITAL-LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 F K S G+F I TA ++P+M+G+ G VD++ S+ L+ + A + Sbjct: 2 FALKAFWSSERGNFAITTAFAMLPIMIGLAG-AVDLIGTSHDASQLQNSLDAAGLAIGTK 60 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK 104 + + F + + + ++ Sbjct: 61 FSPDMAA-GDVQQLGLQFFAANMSAADQQEYLGSVSA 96 >gi|148706513|gb|EDL38460.1| vitrin, isoform CRA_b [Mus musculus] Length = 643 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 35/326 (10%), Positives = 96/326 (29%), Gaps = 40/326 (12%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + QA V ++Q + +++ Q ++ + + + N Sbjct: 307 DVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQRGGLSNVGRAIS 366 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + R A V + + G + ++ + Sbjct: 367 FVTKTFFSKANGNRGGAPNVAV--------------VMVDGWPTDKVEEVSR-------V 405 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E G+++ ++ + + Q E + + + + L Sbjct: 406 ARESGINVFFITVEGAAEREKQHVVEPGFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVK 465 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ + S +D SS + +++ +A++ + + D Sbjct: 466 RVCDTDR----LACSKTCLNSADIGFVIDGSSSVGTSNFRTVLQF-VANLSKEFEISDTD 520 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+GA + F + + I+ G T+ A+Q A + + Sbjct: 521 ---TRVGAVQYT--YEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF 575 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ +K ++++TDG Sbjct: 576 KKSKPNK---------RKVMIIITDG 592 >gi|125829706|ref|XP_696164.2| PREDICTED: collagen alpha-2(VI) chain [Danio rerio] Length = 1015 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 46/253 (18%), Positives = 81/253 (32%), Gaps = 23/253 (9%) Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS----EEHFVD 260 G + + G+ G + P + C+ Y G D + FV Sbjct: 555 GPAGEPGQRGPRGDGGRDGDPGPEGDPGLTECDVMNYIRETCGCCDCEKRCGALDIVFVI 614 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDT-VRMGATFFNDRVISDPSFSWGVH 317 SS + L ++ + + I + I D ++T R+G ++ Sbjct: 615 DSSESVGLTNFTLEKNFVINTINRLGSIAKDPSSETGTRVGVVQYSHNGTFQAIRLNDSK 674 Query: 318 KLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + K G T A++ AYD +I + R K N+ +V++TD Sbjct: 675 IDSMSAFKDAVKKLEWIAGGTWTPSALKFAYDNLIRDS----RRSKANVT----VVVITD 726 Query: 377 GENTQDNEEGI--AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-NCASPNSFFEANS 433 G +++ + +C I + I Q E+ S C Sbjct: 727 GRYDPRDDDKLLNYLCTDT---SIDVNAIGIGDMFDQPEENESLKSIACRKDGRVMGMRR 783 Query: 434 THELNKI-FRDRI 445 +L F DRI Sbjct: 784 FADLVAEDFIDRI 796 >gi|73953968|ref|XP_853856.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 924 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 28/84 (33%), Gaps = 6/84 (7%) Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + NE + I+ LTDG + EE ++ G I +I Sbjct: 468 SNVNEQIEEANSGGKKFPSMIIALTDGTLMPEPYEETKIEAENSRQLGATIYSIGVM--- 524 Query: 410 TQQEKARYFLSNCASPNSFFEANS 433 + LS SP+ F ++ Sbjct: 525 --DYRRDQLLSIADSPDHVFGVDN 546 >gi|330506652|ref|YP_004383080.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] gi|328927460|gb|AEB67262.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] Length = 551 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 42/118 (35%), Gaps = 9/118 (7%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G ++D V + S V G+TA D + A + Sbjct: 413 IGLVSYSDDVYINLPISRFDLNNRSYFVGAVG-GLQAGGATATFDGIAVAMKMLE----- 466 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 E + L+ K ++ L+DGE + I + GI I TI ++ + E+ Sbjct: 467 EQLALDPKLKPKIFV--LSDGETNRG-HSLNDIRKLVEESGIPIYTIGYNADIKALEE 521 >gi|299138555|ref|ZP_07031734.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599801|gb|EFI55960.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 381 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 71/182 (39%), Gaps = 28/182 (15%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 D +S +DSS + +K V +A + +K + +++ F+ Sbjct: 136 KHEDLPVSLGILIDSSG--SMYEKSAAVNEASLDL---VKLSNPLDEAF---LVDFSSEA 187 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 D F+ + KL + + + G TA+ DA+ + D + KN Sbjct: 188 YIDQDFTNSIAKLQQGL-----AYIHTSGGTALYDAVVASADYLS----------KNAKH 232 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFS----VNKTQQEKARYFLSN 421 K+ ++++TDGE+ + + + + G I I V++++ + AR L Sbjct: 233 PKQVLLIVTDGEDNASSASLESAIRRVQDLDGPAIYCIGLLFGDDVSRSEAKHAREVLQE 292 Query: 422 CA 423 A Sbjct: 293 LA 294 >gi|296214738|ref|XP_002807270.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Callithrix jacchus] Length = 594 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 400 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 455 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + Sbjct: 456 T--YDQRMEFSFTDYNTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGP--------I 505 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D + A + A GI I ++ + + + S Sbjct: 506 RESPNKNFLVIVTDGQSYDDVQGPAAAAHDA---GITIFSVGVAWAPL--DDLKDMASKP 560 Query: 423 ASPNSFF 429 ++FF Sbjct: 561 KESHAFF 567 >gi|251779520|ref|ZP_04822440.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] gi|243083835|gb|EES49725.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 815 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 27/219 (12%), Positives = 59/219 (26%), Gaps = 53/219 (24%) Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I V+D S SM + V + N + S + Sbjct: 95 IVLVLDTSGSMNE-------------------KVGKVCTNNRGWYCKTHNSSDLYHRESL 135 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN----DT 294 + + + +H+ S K ++ A + I +K + ++ + Sbjct: 136 FYHNWINDYCEEHGKVGQHYASYSKS----TKMEELKKAANNFIDKMKDVPDLKICIVNY 191 Query: 295 VRMGAT---FFN-------------DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +N + + S + + +G T Sbjct: 192 SSEATINPCGYNGDKNSASVEEDRHHTIPNYKSLGTKFLNSNDNTLHSMINGLKALGGTN 251 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + ++ A + ++D KK IV ++DG Sbjct: 252 TGEGLRKAEYMLEQGDKDA----------KKTIVFMSDG 280 >gi|119585524|gb|EAW65120.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_c [Homo sapiens] Length = 664 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 225 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 284 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 337 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 338 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 393 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ CA Sbjct: 394 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACA 444 Query: 424 SPNSFFE 430 + +FE Sbjct: 445 NKGYYFE 451 >gi|118431306|ref|NP_147675.2| hypothetical protein APE_1031.1 [Aeropyrum pernix K1] gi|116062628|dbj|BAA80016.2| conserved hypothetical protein [Aeropyrum pernix K1] Length = 463 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 45/255 (17%), Positives = 79/255 (30%), Gaps = 23/255 (9%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWV--IDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 L+ +T ++ + ++ D SR + + Q E + Sbjct: 191 QLLSMMGAGDTSVLAFDENIEFILRIARETDVSRVLENVQGIREIMRRRSRRETRSPKGW 250 Query: 214 SYSSQNGKVGIRDEKLS-----PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + G R Y + S +LY LD S + + S V Sbjct: 251 FDGLEYGSDLERIHYSQLILPDEYFWASFSSSKLLLYRKVLDSSRGPIYVLLDKSGSMVG 310 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K R ++ R ++ + R A FF+ + + Sbjct: 311 AKIDWARAVAVALFRR-----SLAENRRFSARFFDSVTYPAIHLRPRSKPRDFLELVKYL 365 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I A++TA D I + E IVL+TDGE+ + N + + Sbjct: 366 AAVKAGGGTDITAAIKTAADDISRTPRGEQRISD--------IVLITDGED-RLNIDVVE 416 Query: 389 ICNKAKSQGIRIMTI 403 + K R+ T+ Sbjct: 417 --DSLKRSDARLHTV 429 >gi|254421496|ref|ZP_05035214.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] gi|196188985|gb|EDX83949.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] Length = 410 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 55/177 (31%), Gaps = 27/177 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 V+ A + ++ + D R+ F+ + S +K Sbjct: 55 TPLQTVKQAASRIVDRLSNRD------RISIIAFDHKAEVLIS---NELASDPQAIKRRI 105 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGI 387 G T I+D ++ + + S E + + ++LLTDGEN DN I Sbjct: 106 NSLRAGGGTCIDDGLKAGIEQLASGKEGYISQ----------LLLLTDGENEHGDNSRAI 155 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 + + A + + T+ F + L A S E F Sbjct: 156 KLADVAIGYNLTVNTLGF-----GDHWNQDVLEQIADAGGGSLSYIEHAEEAIATFG 207 >gi|54309668|ref|YP_130688.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] gi|46914106|emb|CAG20886.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] Length = 494 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 65/494 (13%), Positives = 146/494 (29%), Gaps = 87/494 (17%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G I TAL + + G+ ++ R+ ++ L A + A + + + ++ Sbjct: 5 RQQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAVTTA---NQDDK 61 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + + + + I I N + D T R+ +Q +++ Sbjct: 62 TYENQLATNYVQTYIRNIAIINDIKVERSEGIDY------YPTPDGNEEREY-FQYRVTA 114 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ-- 193 + D + LS + + + +A A Y + + I +V DFS SM + Sbjct: 115 KTDHT-SWLSSDIIPSFSPTETVANRALARNY-PIYLGDKDIDIVFVSDFSGSMKGNKIR 172 Query: 194 ----------------RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 RD E + N N + + P N Sbjct: 173 ALKDAIQAIANEILVPRDGEVEVTNRIAFVPYNMRVQEKRSNTRWCLTQLDYRPNFNGGN 232 Query: 238 KSLY-------YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL------------ 278 S Y + + S ++ + R + + + + Sbjct: 233 YSSYEDIDWSTWSTWTRNQVRDCSNGYYSCTGKKRRDARTVYAILNTSKSETGSGWYFPD 292 Query: 279 --------ASVIRSIKKIDNVNDTVRMGATFFNDRVIS----DPSFSWGVHKLIRTIVKT 326 SV ++ N ++ + S + +T + Sbjct: 293 PYSYINFPDSVAKTFTAKANNLQFQSTNQKLYSGGMCSGNFWTIPLT-----SEKTTLSP 347 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGE-----N 379 + + G T++ + + + + I++L+DG+ + Sbjct: 348 IQNNMSPDGGTSVYQGLIRGAQILEQGRPTSPSPETSAAYNSRIKMILMLSDGQEMPYVS 407 Query: 380 TQDNEEGIAICNKAKSQG--------IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 T + +CN K+Q + ++ I F + + NC N+ Sbjct: 408 TFNQLVNQGLCNTIKAQFNDSDQQLYMGVLGIEFDA------QGQQGFKNCVGQNNITNV 461 Query: 432 NSTHELNKIFRDRI 445 + +L K + I Sbjct: 462 DDVDDLIKEILEMI 475 >gi|297816770|ref|XP_002876268.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297322106|gb|EFH52527.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] Length = 672 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 36/115 (31%), Gaps = 16/115 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R Sbjct: 257 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTR-MSDAGRQQALQAV 309 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T I D ++ + E I+LL+DG +T Sbjct: 310 NSLVANGGTNIFDGLRKGAKVMEDRRERNSVAS---------IILLSDGRDTYTT 355 >gi|62001342|gb|AAX58373.1| AvrE [Pseudomonas viridiflava] gi|62001376|gb|AAX58390.1| AvrE [Pseudomonas viridiflava] gi|62001426|gb|AAX58415.1| AvrE [Pseudomonas viridiflava] gi|62001440|gb|AAX58422.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|167549689|ref|ZP_02343448.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205325282|gb|EDZ13121.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] Length = 593 Score = 43.4 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 57/166 (34%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L++ AL ++ ++ DN+ G V Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 285 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 286 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 331 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQE 413 I+ TDG+ + A+ K + +GI + T+ + + Sbjct: 332 ILFTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEA 377 >gi|320105608|ref|YP_004181198.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924129|gb|ADV81204.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 373 Score = 43.4 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 4/60 (6%) Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 YD I ++++ +H+ +K ++LLTDGE+ E + A+ + +I F Sbjct: 192 YDAIFLASDEVIHK----QPFRKALILLTDGEDNGSKESLSSAIEAAQRADTAVYSIYFK 247 >gi|224052500|ref|XP_002194907.1| PREDICTED: similar to anthrax toxin receptor 1 [Taeniopygia guttata] Length = 537 Score = 43.4 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 18/151 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S S+++ + + ++LA ++ +RM F+ R + + Sbjct: 31 VLDKSGSVKNHWTEIYSFVESLAEKF--------ISPMLRMSFIVFSSRGTTIMKLTENR 82 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + R + +E G T +++ + A + I H + I+ LTD Sbjct: 83 EAIRRGLDTL--KEELPGGDTFMHEGFKRANEQIY-------HETYGGVRTASVIIALTD 133 Query: 377 GENTQDN-EEGIAICNKAKSQGIRIMTIAFS 406 GE N+A+S G + + Sbjct: 134 GELQDAQFYYAEQEANRARSFGAIVYCVGVK 164 >gi|194218991|ref|XP_001915421.1| PREDICTED: similar to integrin, alpha D [Equus caballus] Length = 1160 Score = 43.4 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 44/111 (39%), Gaps = 7/111 (6%) Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + NE + AKK ++++TDG+ +D E + +A+ GI I Sbjct: 227 TATGILAVVNELFHSKNGARRSAKKILIVITDGQKYKDPWEYRDVIPQAERAGIIRYAIG 286 Query: 405 FSVNKTQQEKARYF--LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + + + S + F+ ++ L I ++ E++ Sbjct: 287 VGDAFQEPIARQELNTIGSAPSQDHVFKVDNFAAL-----SSIQKQLQEKI 332 >gi|331658353|ref|ZP_08359315.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] gi|331056601|gb|EGI28610.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] Length = 574 Score = 43.4 bits (100), Expect = 0.080, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 62/166 (37%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 212 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 267 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 268 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGINR--- 313 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T+ + + Sbjct: 314 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEA 359 >gi|62001318|gb|AAX58361.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.080, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|162457601|ref|YP_001619968.1| hypothetical protein sce9315 [Sorangium cellulosum 'So ce 56'] gi|161168183|emb|CAN99488.1| conserved hypothetical protein with a vWF type A domain [Sorangium cellulosum 'So ce 56'] Length = 617 Score = 43.4 bits (100), Expect = 0.081, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 46/164 (28%), Gaps = 30/164 (18%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + D+ + F+ G R +K E G T I Sbjct: 239 LVDRLDANDDFS------LVTFSSGADVR--IDDGPVGPRRAAIKATIDGIREGGGTNIG 290 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAK--KYIVLLTDGENTQ---DNEEGIAICNKAKS 395 + Y + + K ++LL+DG +E + A Sbjct: 291 QGLALGY----------AQAARPGIPEGAVKVVLLLSDGRANAGITSSERLSRLALDAFQ 340 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHEL 437 G++ T + +S AS + ++ ++ Sbjct: 341 GGVQTSTFGLGADYDG-----ALMSAIASDGAGGYYYLRDPDQI 379 >gi|312886237|ref|ZP_07745851.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301262|gb|EFQ78317.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 348 Score = 43.4 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 58/175 (33%), Gaps = 21/175 (12%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 S + D + + S + + A++ + IDN++D R+ Sbjct: 79 GSKMEEVKRKGADLMILLDVSNSMLSQDLSPNRLENAKRAISQL------IDNLHDD-RI 131 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F + + + + T + TAI A+ + N Sbjct: 132 GIIVFAGQAYVQLPITTDYS-AAKLFLNTINTNMVPTQGTAIGAAIDLGMQSFDFKNGMS 190 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K ++++TDGEN +D+ ++ N A+ + + + I + Sbjct: 191 -----------KAMIVITDGENHEDDA--VSAANHARDKDVTVNVIGVGSEEGAP 232 >gi|149440247|ref|XP_001521494.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3, partial [Ornithorhynchus anatinus] Length = 390 Score = 43.4 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 82/282 (29%), Gaps = 42/282 (14%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + S+ ++ + K+ VS + +D RS LNC Sbjct: 124 ISSLDAEATFMTNDLLGVLTKSFSGKKGHVSFKPTLDQQRS-----------CLNCTTSM 172 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + ++ Y V + P L FV S Sbjct: 173 MNGDFTITYDVVRESPANVQIVNGYFV-------HFFAPKDLPVVPKNVVFVIDVSGSMY 225 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +K+ D + F+ + + K F Sbjct: 226 GRKLVQTKEALLKILEDMKEED------YLNFILFSSEITTWKDTLIKATPENLKKAKEF 279 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---YIVLLTDGENTQDNE 384 + + G T IND + + + E N+ K+ I++LTDGE Sbjct: 280 VKNIKDEGLTNINDGLMRGIKMLNEARE-------TNVVPKRSTSLIIMLTDGEANVGEI 332 Query: 385 EGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCA 423 I ++ + + F + L A Sbjct: 333 RADKIQENVRNAIGGKFPLYNLGF-----GYDLNYNLLEKMA 369 >gi|156743215|ref|YP_001433344.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234543|gb|ABU59326.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 419 Score = 43.4 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 54/186 (29%), Gaps = 27/186 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K VR A I + D V+ F+ R + Sbjct: 60 KIDRVRRATIRAIEMLDAQDVVS------VVIFDHRTEVLIP---ATPVAKPAELADRVN 110 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T I A++ I K + ++LLTDG+ T+ + + Sbjct: 111 RVRDSGGTRIAPAIEAGLREI----------DKGPSHMVRRLILLTDGQ-TESESDCLRR 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 A + + I + ++ L A S + + ++ F++ I Sbjct: 160 AEDAGRRNVPITALGV-----GKDWNEDLLIEMANRSGGTADYIDRPEKIVDYFQNTIQR 214 Query: 448 EIFERV 453 V Sbjct: 215 AQATTV 220 >gi|306814616|ref|ZP_07448778.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] gi|305852010|gb|EFM52462.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] Length = 581 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 53/393 (13%), Positives = 125/393 (31%), Gaps = 53/393 (13%) Query: 47 SYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNF 106 ++ + + I++ P E+ S+ + T Q++ + + + Sbjct: 1 MRNKNIIMLLISSLILSGCGP---EPEDKESQQQQPSTPSDQQV--LVAQQVAIKEVEQS 55 Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 + ++ + + L R L F R+ + I A Sbjct: 56 AAAAKAAADAKALAQQEVQQYSDKQTLQGR----LKEAPTFARAAKANATHIANPGTARY 111 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN---GKVG 223 + V+ ++ FS + + + LN P V+ N Sbjct: 112 QQFDDNPVKQVAQNPLVTFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEVVNYFPSDWD 171 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLD--------------------PSLSEEHFVDSSS 263 I+D++ P ++ Y L P P + P+ + +D+S Sbjct: 172 IKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSG 231 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ L++ +L +++ +++ DN+ G + R+ + + Sbjct: 232 SMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS-----ISGSHKAE 282 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + + GST ++ AY +K + I+L TDG+ Sbjct: 283 INAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGINR---ILLATDGDFNVGI 333 Query: 384 EEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 ++ +I + K Q G+ + T+ + + Sbjct: 334 DDPKSIESMVKKQRESGVTLSTLGVGDSNYNEA 366 >gi|297467492|ref|XP_002705103.1| PREDICTED: polydom [Bos taurus] Length = 3396 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 38/133 (28%), Gaps = 20/133 (15%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDR-----VISDPSFSWGVHKLIRTIVKTFAIDEN 332 + ++ + R+ F+ + + S + + Sbjct: 108 VRKLLSDFPVVPTA---TRVAIVTFSSKNNVVPRVDYISSRRAHQHKCALLSREIPAITY 164 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T A Q A + S E+ K I L+TDG + + IA Sbjct: 165 RGGGTYTKGAFQQAAQILRHSRENS----------TKVIFLITDGYSNGGDPRPIAA--S 212 Query: 393 AKSQGIRIMTIAF 405 + G+ I T Sbjct: 213 LRDFGVEIFTFGI 225 >gi|291382819|ref|XP_002708118.1| PREDICTED: polydom [Oryctolagus cuniculus] Length = 3569 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 38/133 (28%), Gaps = 20/133 (15%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDR-----VISDPSFSWGVHKLIRTIVKTFAIDEN 332 + ++ + R+ F+ + + S + + Sbjct: 106 VRKLLSDFPVVPTA---TRVAIVTFSSKNNVVPRVDYISSRRAHQHKCALLSREIPAITY 162 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T A Q A + S E+ K I L+TDG + + IA Sbjct: 163 RGGGTYTKGAFQQAAQILRHSRENS----------TKVIFLITDGYSNGGDPRPIAA--S 210 Query: 393 AKSQGIRIMTIAF 405 + G+ I T Sbjct: 211 LRDFGVEIFTFGI 223 >gi|62001402|gb|AAX58403.1| AvrE [Pseudomonas viridiflava] gi|62001404|gb|AAX58404.1| AvrE [Pseudomonas viridiflava] gi|62001408|gb|AAX58406.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|62001390|gb|AAX58397.1| AvrE [Pseudomonas viridiflava] gi|62001392|gb|AAX58398.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|326675803|ref|XP_002665305.2| PREDICTED: collagen alpha-1(XII) chain [Danio rerio] Length = 3039 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 29/254 (11%), Positives = 72/254 (28%), Gaps = 35/254 (13%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 S S + + + C+ S L Sbjct: 62 ASSTSTSITDLTPDVDYSVSINSYDGAEESIPILGQITKCSVSAVADL------------ 109 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ S + +R + ++ + D D R+ ++ ++ S Sbjct: 110 VFLVDGSWSVGRENFRFIRSFIGAMAGAF---DIEEDKTRVAVVQYSSDTRTEFSL--NT 164 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H +++ + G+T DAM ++ + + + K +++TD Sbjct: 165 HFRRPDVLRAIKNLPYKGGNTMTGDAMDY----LVKNTFTQAAGARKG--FPKVAMIITD 218 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G + + + ++ G+ I + + L AS + + Sbjct: 219 G---KSQDPVEEYAERLRNIGVEIFVLGIKGADEDE------LKEIASRPHSKHVYNVPN 269 Query: 434 THELNKIFRDRIGN 447 +N++ + I Sbjct: 270 FDMINQVQKALITE 283 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 57/174 (32%), Gaps = 25/174 (14%) Query: 276 DALASVIRSIKKI-DNVNDTVRMGATFFNDRVISDPSFSW--GVHKLIRTIVKTFAIDEN 332 + + I + + D D V++G ++ DP W H+ ++ A Sbjct: 1186 KTIRNFIARMVGVFDIGPDRVQIGLAQYS----GDPKTEWHLNAHRTRTQLLDAVANLPY 1241 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 + G+T A+ N + ++K VL+TDG + ++ +A Sbjct: 1242 KGGNTLTGLALNYILQNNFKPN------VGMRPNSRKIGVLVTDG---KSQDDIVANSQN 1292 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 + QGI + I + L AS + L I D Sbjct: 1293 LRDQGIELYAIGVKNADENE------LRTIASDPDDIHMYNVADFSFLLDIVDD 1340 >gi|62001374|gb|AAX58389.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|326426493|gb|EGD72063.1| hypothetical protein PTSG_00082 [Salpingoeca sp. ATCC 50818] Length = 571 Score = 43.4 bits (100), Expect = 0.085, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 78/244 (31%), Gaps = 33/244 (13%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 +S L+ + + E+ +DFS S++ + + Sbjct: 70 ESELLTSTCASTFGGCRSKTENR-----RLDFSSSVMRTASTPFDDDVRQEACWTRQLDN 124 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-------PLDPSLSEEHFVDSSSLRH 266 ++ S NG + + + S +Y +YPG DP L + +S + Sbjct: 125 TFISINGGTSENTDTATKWQYVGTSSGFYRIYPGVPQQDCNAYDPRLRPWYVAATSGPKD 184 Query: 267 VI------------KKKHLVRDALASVIRSIKKIDNVNDTVR-MGATFFNDRVISDPSFS 313 ++ + DA +V+ ++ D V V A+ I S Sbjct: 185 IVIVLDRSGSMATNNRWETAMDAAETVLETLTIADFVAIVVFDTSASQVCGTTIPCGSLV 244 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + T+ A N GST A Q A+ + + E + I+ Sbjct: 245 QATADNVGTLRTLLAN-FNPDGSTNFESAFQVAFSVLKQTGERTSNCHTA-------ILF 296 Query: 374 LTDG 377 +TDG Sbjct: 297 MTDG 300 >gi|165975965|ref|YP_001651558.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876066|gb|ABY69114.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 529 Score = 43.4 bits (100), Expect = 0.085, Method: Composition-based stats. Identities = 60/526 (11%), Positives = 160/526 (30%), Gaps = 105/526 (19%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I+ +G + ++ LL +L + + ++ + L + + A+++ S Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSL-TAENNS 68 Query: 72 LEEVSSRAKNSFTFPKQK--------------IEEYLIRNFENNLKKNFTDREVRDIVRD 117 + + + + ++ +KN + + + Sbjct: 69 GRKTTDYKLGGSNPSDESFNISSEVGKRDHAIVTAFVKTFLPQTDEKNMHLTPLCKTINN 128 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQTKAEAETVSRSYHK-EH 175 T+ + + S +S + S F L+ ++ Q +++ + + Sbjct: 129 TSGKGHTSSSEVTCTVSG----TVEHKSWFPLKVGNLEVIPKQVNVASQSRAIKKNTFNI 184 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFG-----------QPADRTVKSYSSQNGKVGI 224 + + V D S SM ++E + + ++QN ++ + Sbjct: 185 PIDLMVVADLSGSMNFDLDNNEIKKTGKPSKISILKEVLVELADKTLLSEDANQNNRIYV 244 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI-- 282 L + + N +L Y + + ++ ++ + + + L+ + + + Sbjct: 245 TPFALGAEINNNNCALPYSWSVESSSRTQNIKNILNKQNSQ--YNRADLINNLVYKISTK 302 Query: 283 RSIKKIDNVNDT-VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 ++ I+ + V F + + + W + + + G+T + Sbjct: 303 ETLDNINGKQNYNVTFSKNAFCLKDMKTSNKGW-YSRSDKLEFTNYVQSIKANGATLASS 361 Query: 342 AMQTAYDT---------------------IISSNEDEVHRMKNNLEA------------- 367 + A + + N++ V NN Sbjct: 362 GVLVAANNMIRDGSRTEQLKEQTKRVILVLSDGNDEIVKGDPNNKVPFLNYTRITENLIY 421 Query: 368 ---------KKYIVL---------LTDGENTQDNEEGIAICNKA--------KSQGIRIM 401 KK + LTD TQ + +CN K + +I+ Sbjct: 422 GRQEEFSSEKKRVSFGHSTTIETYLTD---TQPKKVTDGMCNVIRDKLDTLNKDKNTKIV 478 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + F + ++ +C +++ AN L F+ IG Sbjct: 479 FVEF----GYKSTSKQAWEHCVGNGNYYSANDKASLLNSFKQAIGE 520 >gi|62001452|gb|AAX58428.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.085, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|62001446|gb|AAX58425.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.4 bits (100), Expect = 0.085, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 97/363 (26%), Gaps = 13/363 (3%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQTAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQV---VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + +++ +Q V + +L L + T+ Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELTRLADA 1473 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S +K IQ +D + + + + + T K ++ + + Sbjct: 1474 KQSEYANKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVLD 1533 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS- 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1534 NARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKASP 1593 Query: 285 -------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ D + + G L + F + G T Sbjct: 1594 GTMARVRLEPKDEMMQKIDKGTREGRITQSDIIGMLNDRDNLRIKAITVFKLAGQSDGFT 1653 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1654 TPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKKD 1713 Query: 397 GIR 399 G+ Sbjct: 1714 GME 1716 >gi|114652499|ref|XP_001171038.1| PREDICTED: coagulation factor C homolog, cochlin isoform 5 [Pan troglodytes] Length = 594 Score = 43.4 bits (100), Expect = 0.086, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 66/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 400 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 455 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + Sbjct: 456 T--YDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGP--------I 505 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D + A A GI I ++ + + + S Sbjct: 506 RESPNKNFLVIVTDGQSYDDVQGPAA---AAHDAGITIFSVGVAWAPL--DDLKDMASKP 560 Query: 423 ASPNSFF 429 ++FF Sbjct: 561 KESHAFF 567 >gi|77920224|ref|YP_358039.1| von Willebrand factor type A domain-containing protein [Pelobacter carbinolicus DSM 2380] gi|77546307|gb|ABA89869.1| von Willebrand factor type A domain protein [Pelobacter carbinolicus DSM 2380] Length = 442 Score = 43.4 bits (100), Expect = 0.086, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 67/191 (35%), Gaps = 38/191 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI--VKTF 327 K R+A +R + D + ++D V + + + I ++ Sbjct: 82 KIAKAREAAIEAVRRLSDGDLFS------LVVYDDSVETLVP-----AQPVSDIGDIEAR 130 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 GSTA+ A+ + E H ++ +VLL+DG N Sbjct: 131 IRRIRPGGSTALFGAVSQ------GAAEVRKH---SDAPYVNRVVLLSDG---LANVGPS 178 Query: 388 AICNKAK------SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 + A+ +GI + T+ + + ++ A S + + S+ +L + Sbjct: 179 RPADLARLGAALLKEGISVTTVGVGTDFNE-----DLMTQLAERSDGNHYFVESSRDLPR 233 Query: 440 IFRDRIGNEIF 450 IF +G+ + Sbjct: 234 IFAAELGDVLS 244 >gi|288818655|ref|YP_003433003.1| thiol:disulfide interchange protein [Hydrogenobacter thermophilus TK-6] gi|288788055|dbj|BAI69802.1| thiol:disulfide interchange protein [Hydrogenobacter thermophilus TK-6] gi|308752244|gb|ADO45727.1| Disulfide bond isomerase, DsbC/G [Hydrogenobacter thermophilus TK-6] Length = 247 Score = 43.0 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 53/154 (34%), Gaps = 6/154 (3%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 V++++ +I +++V+ + + + +V I Sbjct: 32 DAVKNSVKDLIPQDFTVESVSGVKEINGLCEVVIKVGAQPLVFYTDSTGNYVVAGNIISL 91 Query: 332 NEMGSTAIN---DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + + M+ + D + E E H + KK+I +TD + + Sbjct: 92 KDKKNLTRERQQEFMKVSTDQL---KELEKHVNFTYGQGKKFIYYITDPDCPFCKKSEPI 148 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I + AK G++I I F + Q + C Sbjct: 149 IEDWAKKAGVQIKVILFPLPIHPQAFGKSVALVC 182 >gi|218458530|ref|ZP_03498621.1| hypothetical protein RetlK5_03343 [Rhizobium etli Kim 5] Length = 185 Score = 43.0 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 13/78 (16%), Positives = 29/78 (37%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ G+ +T + MP++LG +++DV R S L+ A + + L Sbjct: 7 RRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGARELDGR 66 Query: 72 LEEVSSRAKNSFTFPKQK 89 + ++ Sbjct: 67 DDAITRAQTAIEKISNSA 84 >gi|156742135|ref|YP_001432264.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233463|gb|ABU58246.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 425 Score = 43.0 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 66/186 (35%), Gaps = 23/186 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 V+DA A ++ + + D FNDR + K +K Sbjct: 65 VKDAAARIVDQLGQDD------YFSLVVFNDRADVVIPAQRAIKKAD---LKAAIAQIEA 115 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T + M A + R + I+LLTDG D + I + Sbjct: 116 AGGTEMAQGMALALQEV--------QRPFLTRGISR-IILLTDGRTYGDESRCVEIARRG 166 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG--NEIFE 451 +S+GI + + E ++ + + + A ++ K+F D + + IF Sbjct: 167 QSRGIGLTALGIG--TEWNEDLLETMTASENSRAQYIAT-AQDVVKVFADEVKRLHAIFA 223 Query: 452 RVIRIT 457 + ++++ Sbjct: 224 QQVQLS 229 >gi|113867618|ref|YP_726107.1| von Willebrand factor type A domain-containing protein [Ralstonia eutropha H16] gi|113526394|emb|CAJ92739.1| von Willebrand factor (vWF) type A domain [Ralstonia eutropha H16] Length = 345 Score = 43.0 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 65/213 (30%), Gaps = 26/213 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 LD S S + +I + VR ++ + + R+G F D Sbjct: 99 ALDLSQSMDTRDFGDPSGALIPRVQAVRQVVSGFVAR----RPGD---RIGLIVFGDAPY 151 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + I T + M + + + +++ Sbjct: 152 PLAPFTLDHQLVQTLI-------------TGLLPGMAGPSTALGDAIGLGIKMFEHSEAP 198 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCASP 425 +K +++LTDG +T AK + + + TI E+ L A+ Sbjct: 199 EKVLIVLTDGNDTASRMPPERAGGIAKERKVVVHTIGIGDPNASGEEKVDLGVLQRLAAQ 258 Query: 426 N---SFFEANSTH-ELNKIFRDRIGNEIFERVI 454 FF A+ E DRI + + Sbjct: 259 TGGRYFFGADQAGLETIYATLDRITPHNQKTLS 291 >gi|312072174|ref|XP_003138945.1| hypothetical protein LOAG_03360 [Loa loa] gi|307765891|gb|EFO25125.1| hypothetical protein LOAG_03360 [Loa loa] Length = 1596 Score = 43.0 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 65/177 (36%), Gaps = 17/177 (9%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++ ++ +I I + D VR+G ++D+ + +K+ ++ + E Sbjct: 931 LKEGISMLIDEIFDLSP--DIVRVGFVEYSDKASVPVPLGYYDNKVQ--LLADISNSEQL 986 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+ I ++ A + D + ++L+T G N + + Sbjct: 987 GGTPIILRGLRAAKEQFQRHGRDN---------VSRILLLVTSGANRGN---VAFAADDL 1034 Query: 394 KSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + I + + ++ Q S+ + +S ++L + +IG + Sbjct: 1035 REHLNVSIFVLVVNASQGAQIMLNRLTSDEYTQQRVISISSANKLQEAELLQIGQAL 1091 >gi|126309704|ref|XP_001376377.1| PREDICTED: similar to complement C2 [Monodelphis domestica] Length = 822 Score = 43.0 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 46/135 (34%), Gaps = 10/135 (7%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ + + D + T IN AM Y+ + + R K+ + + I+LLTDG Sbjct: 383 EIAKKLEDLKYGDPDIGTGTNINKAMMQIYEMMNNEMAIFGGRQKDWEKIRHVIILLTDG 442 Query: 378 ENTQDNEEGIAICNKAKS---------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 ++ A+ K K + I I ++ S Sbjct: 443 KSNMGGSPTEAV-KKIKEVLNIRQERTDYLDIYAIGVGKLDVDWKELNELGSKKDGERHA 501 Query: 429 FEANSTHELNKIFRD 443 F + L ++F + Sbjct: 502 FILPDSKALLQVFEN 516 >gi|7258382|emb|CAB77598.1| putative protein [Arabidopsis thaliana] Length = 676 Score = 43.0 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 40/115 (34%), Gaps = 16/115 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 258 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTR-MSDAGRQLALQAV 310 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T I D ++ + ED + R I+LL+DG +T Sbjct: 311 NSLVANGGTNIVDGLRKGAKVM----EDRLERNSVAS-----IILLSDGRDTYTT 356 >gi|28393354|gb|AAO42101.1| unknown protein [Arabidopsis thaliana] Length = 650 Score = 43.0 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 40/115 (34%), Gaps = 16/115 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 232 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTR-MSDAGRQLALQAV 284 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T I D ++ + ED + R I+LL+DG +T Sbjct: 285 NSLVANGGTNIVDGLRKGAKVM----EDRLERNSVAS-----IILLSDGRDTYTT 330 >gi|79607904|ref|NP_974433.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645764|gb|AEE79285.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 632 Score = 43.0 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 40/115 (34%), Gaps = 16/115 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 257 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTR-MSDAGRQLALQAV 309 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T I D ++ + ED + R I+LL+DG +T Sbjct: 310 NSLVANGGTNIVDGLRKGAKVM----EDRLERNSVAS-----IILLSDGRDTYTT 355 >gi|79315048|ref|NP_001030861.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645765|gb|AEE79286.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 633 Score = 43.0 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 40/115 (34%), Gaps = 16/115 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 258 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTR-MSDAGRQLALQAV 310 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T I D ++ + ED + R I+LL+DG +T Sbjct: 311 NSLVANGGTNIVDGLRKGAKVM----EDRLERNSVAS-----IILLSDGRDTYTT 356 >gi|30694117|ref|NP_191038.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645763|gb|AEE79284.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 675 Score = 43.0 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 40/115 (34%), Gaps = 16/115 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 257 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTR-MSDAGRQLALQAV 309 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T I D ++ + ED + R I+LL+DG +T Sbjct: 310 NSLVANGGTNIVDGLRKGAKVM----EDRLERNSVAS-----IILLSDGRDTYTT 355 >gi|282858825|ref|ZP_06267970.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588394|gb|EFB93554.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 340 Score = 43.0 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 43/132 (32%), Gaps = 17/132 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I + +I + + G F + + T I Sbjct: 119 LIEDLFRIFDNDKV---GLIVFAGDAFVQLPITSDFISAKMFLDNINP-SLIGTQGTDIG 174 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ A +H ++ K IV++TDGE+ + E + +KA+ GI++ Sbjct: 175 QAINLA-----------MHSFSPTSKSGKAIVVITDGEDNEGRAE--EMASKAQKAGIQV 221 Query: 401 MTIAFSVNKTQQ 412 + + Sbjct: 222 YILGVGSTSGAE 233 >gi|62001438|gb|AAX58421.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 43.0 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 36/364 (9%), Positives = 100/364 (27%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + I L A Q + Sbjct: 1355 SNNRVRFANTAGATAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ L+ + A+A Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELSRLADA 1473 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + K IQ +D + + + + + T K ++ + + Sbjct: 1474 K-QPEYADKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + + G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKIDKGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|222055940|ref|YP_002538302.1| hypothetical protein Geob_2856 [Geobacter sp. FRC-32] gi|221565229|gb|ACM21201.1| conserved hypothetical protein [Geobacter sp. FRC-32] Length = 391 Score = 43.0 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 35/272 (12%), Positives = 81/272 (29%), Gaps = 19/272 (6%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 +++ V L + G+ +++ E L AA+ + + + + Q ++ + Sbjct: 16 IMLVVFLVIAGLAINIGYMYVSEDDLHNAAELSALAGAQAIGQQMQLSARTGTGKLKETI 75 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL-NPLSL 146 + R + +R++ + ++ + + N S Sbjct: 76 YDQVQPAARAAAIDHVSGHHQASALIEIRNSNIN--------RLTTENDLTMGFWNISSR 127 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP------ 200 G +Q + S S SI I S+ + P Sbjct: 128 TYTPGGTPVNAVQVRTRRTAESESAGLGTLGSILSKISGSQKLNYTPEAVAAIPALADAN 187 Query: 201 -LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY-YMLYPGPLDPSLSEEHF 258 C + + + P M S KS Y Y P ++S + Sbjct: 188 FSVCVDACGTECTYPNICTIQERKLSPDSSDPRMDSPTKSRYAYTSLSYPPGDTMSLSNL 247 Query: 259 VDSSSLRHVI--KKKHLVRDALASVIRSIKKI 288 + + K+ + +RD + +R ++ + Sbjct: 248 ICMGMPPKEVCGKEIYTIRDRDDNALRDMESV 279 >gi|17538702|ref|NP_499959.1| hypothetical protein C18H7.1 [Caenorhabditis elegans] gi|14573846|gb|AAF98615.2| Hypothetical protein C18H7.1 [Caenorhabditis elegans] Length = 425 Score = 43.0 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 59/174 (33%), Gaps = 16/174 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK--LIRTIVKTFAIDEN 332 +D ++ +K + ++ A F + + + K +++ ++ Sbjct: 258 KDLSKRLVSQLKIGPH---YTQVAAVTFAT--VGRTRVRFNLKKYQTQEEVLRGIDNLKS 312 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G+TAI ++ A + DE + + K +V TDG + + + Sbjct: 313 RGGTTAIGAGIEKALTQL-----DESEGARPGIATKVMVVF-TDGWSNKGP-DPEKRARD 365 Query: 393 AKSQGIRIMTIAFSV-NKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDR 444 A S G + T+A++ LS + F + L + R Sbjct: 366 AVSSGFEMYTVAYTAHTPGAVTLNNETLSAISGDVHHTFTDVTFQALIDKIKQR 419 >gi|322436659|ref|YP_004218871.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164386|gb|ADW70091.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 316 Score = 43.0 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 33/107 (30%), Gaps = 6/107 (5%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST--AINDAMQTAYDTIISSNED 356 F+ V ++ ++ R+I K GS YD + + Sbjct: 113 LISFDINVDLLSDYTNSPREIKRSIDKATINTGAGTGSVTGNSTPKGTLLYDAVYLA--- 169 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 H +K +V+LTDG + E A+ + I Sbjct: 170 -AHDKLRQEAGRKILVMLTDGGDQGSQETLKTATEAAQKANAIVYVI 215 >gi|332221825|ref|XP_003260065.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Nomascus leucogenys] Length = 684 Score = 43.0 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 25/116 (21%), Positives = 46/116 (39%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T+I ++ A+ I E+H + E +VLLTDGE+ + Sbjct: 139 PTYALGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VVLLTDGEDNTASS----- 184 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + F + ++A +SN + F+ ++ L F Sbjct: 185 CIDEVKQSGAIVH---FIALGSAADEAVIEMSNITGGSHFYASDEAQNNGLIDAFG 237 >gi|313239872|emb|CBY14719.1| unnamed protein product [Oikopleura dioica] Length = 982 Score = 43.0 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 54/146 (36%), Gaps = 18/146 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAI 329 VR + ++ + I+ R+G ++D + S G+ K I+ ++ + Sbjct: 20 EKVRQWIGKLVDTFD-IEEDGGGTRVGVVIYSDAPRMEISLGNGLGKTDLIKAVLVIYLN 78 Query: 330 DEN---------EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + E G+T ++++ A + S + + + +++LTDG Sbjct: 79 LIDFLFAQSLMYERGNTLTGESIRYASEVAFSETS-GARALSEGIN--RIMIVLTDGRAQ 135 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFS 406 + I A+ GI I + Sbjct: 136 DNVAGPAVI---AQEDGIVIYAVGVG 158 >gi|313212957|emb|CBY36854.1| unnamed protein product [Oikopleura dioica] Length = 1117 Score = 43.0 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 54/146 (36%), Gaps = 18/146 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAI 329 VR + ++ + I+ R+G ++D + S G+ K I+ ++ + Sbjct: 20 EKVRQWIGKLVDTFD-IEEDGGGTRVGVVIYSDAPRMEISLGNGLGKTDLIKAVLVIYLN 78 Query: 330 DEN---------EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + E G+T ++++ A + S + + + +++LTDG Sbjct: 79 LIDFLFAQSLMYERGNTLTGESIRYASEVAFSETS-GARALSEGIN--RIMIVLTDGRAQ 135 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFS 406 + I A+ GI I + Sbjct: 136 DNVAGPAVI---AQEDGIVIYAVGVG 158 >gi|297202051|ref|ZP_06919448.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197714313|gb|EDY58347.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 518 Score = 43.0 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 60/374 (16%), Positives = 111/374 (29%), Gaps = 65/374 (17%) Query: 31 PVMLGVGGMLVDVVRWS---YYEHALKQAAQTAII-----------TASVPLIQSLEEVS 76 PV +GV V + WS ++QA Q + + L+ +S Sbjct: 123 PVAIGVRSATVKALGWSPADVTWSQVEQAVQDGRLTYGMTDPARSNSGFSTLVSVASALS 182 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 K L F+ K D +++ ++ Sbjct: 183 GAQSALTDADVTKAAPRLKEFFK-GQKLTSGSSGWLAAAYDRRGDVDA------LLNYES 235 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 L + L++ G+ + + T +R+ VS + + + + Sbjct: 236 VLLGIPGLTVIRPRDGVITADYPLTSLRSTSARTREDVRRVSEDLRTERIQ--REITART 293 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 +P+ PA + R E P S L PS + Sbjct: 294 HRRPVVASVPPASGLDTT---------RRRELPFPGTRSVADGLLDSYENELRRPSRTVY 344 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN--------DTVRMGATFFNDRVIS 308 S S+ + ++ ALA + ++ + V +VR V+ Sbjct: 345 VLDTSGSMEGD--RLDRLKTALADLTGDFREREEVTLMPFGSQVKSVRT-------HVVK 395 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G+ ++ + G TAI +++ AYD + + + Sbjct: 396 PSDPRAGL-----DAIRDDTSALSADGDTAIYTSLEKAYDHLGAGRDAFTS--------- 441 Query: 369 KYIVLLTDGENTQD 382 IVL+TDGENT Sbjct: 442 --IVLMTDGENTAG 453 >gi|72007460|ref|XP_780292.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] gi|115939674|ref|XP_001195885.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 1500 Score = 43.0 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 55/167 (32%), Gaps = 30/167 (17%) Query: 292 NDTVRMGATFF------NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 T R+ + + RV S +K T G T A++ Sbjct: 77 ASTTRVAVISYSSCNQIHIRVNYISSPE---NKNKCTFDNDLTSVNYHPGGTCTAGALEA 133 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A ++S ++ ++LLTDG + A K KS+G++I TI Sbjct: 134 AGRDVLSHGRPGA---------QRVVMLLTDGASNDGG-PPHANAQKLKSEGVKIFTIGI 183 Query: 406 SVNKTQQEKARYFLSNCASPNSFF-----EANSTHELNKIFRDRIGN 447 K + L+ A+ + + L + +D I + Sbjct: 184 GSIKLSE------LNAIATSVDEYVYILADFGDVRNLATVVKDDIKD 224 >gi|327263661|ref|XP_003216636.1| PREDICTED: cochlin-like [Anolis carolinensis] Length = 527 Score = 43.0 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 39/111 (35%), Gaps = 13/111 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++ A F FS+ + ++ G TA +A+ + Sbjct: 382 KIAAVQFT--YDQRTEFSFTDYITKENVLAALRGIRYMSGGTATGEAISHTTRNVFGPVR 439 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 D ++ ++V+LTDG++ D A+ GI I +I + Sbjct: 440 DGGNKN--------FLVILTDGQSYDDVRGP---AVAAQQAGITIYSIGIA 479 >gi|256397309|ref|YP_003118873.1| hypothetical protein Caci_8209 [Catenulispora acidiphila DSM 44928] gi|256363535|gb|ACU77032.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 177 Score = 43.0 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 17/156 (10%), Positives = 45/156 (28%), Gaps = 4/156 (2%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ ++ G + + PV++ + +VDV A A + + Q Sbjct: 17 RRALRDDRGSLAMAVVIWAPVVVLLMAFVVDVGLLISDRTQASDYADQAARRVAQDIDQG 76 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + P K+ + + T+ ++ + + Sbjct: 77 WLKTHNVRGPNGEDPGIKVNVDPDTGDCVPDAEQYLLDNQITDTTITSCQVTGNPTEVDL 136 Query: 132 VLSSRY----DLLLNPLSLFLRSMGIKSWLIQTKAE 163 ++ R + PL + G + A Sbjct: 137 YVNPRITVTLQMQYKPLFVGFALKGDSTVTGTGSAT 172 >gi|222616410|gb|EEE52542.1| hypothetical protein OsJ_34771 [Oryza sativa Japonica Group] Length = 654 Score = 43.0 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 32/137 (23%), Positives = 55/137 (40%), Gaps = 16/137 (11%) Query: 249 LDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR- 305 LD S S V ++S + + + +++ ++ VIR + D R+ FND Sbjct: 76 LDVSGSMNDPVAAASPKSNLQGSRLDVLKASMKFVIRKLADGD------RLSIVAFNDGP 129 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V S V R+I G TA+ A++ A + +R+ Sbjct: 130 VKEYSSGLLDVSGDGRSIAGKKIDRLQARGGTALMPALEEAVKILDERQGSSRNRVG--- 186 Query: 366 EAKKYIVLLTDGENTQD 382 +I+LLTDG++T Sbjct: 187 ----FILLLTDGDDTTG 199 >gi|125975554|ref|YP_001039464.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715779|gb|ABN54271.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 536 Score = 43.0 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 71/200 (35%), Gaps = 23/200 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D S K L++ A ++ + + D V+ V GA ++ D Sbjct: 179 PPSNLVFLIDVSGSMDEPNKLPLLKSAFKLLVDELDEDDRVSIVVYAGAAG----LVLDS 234 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + K++ ++ GSTA + ++ AY D + + Sbjct: 235 TPGNEKDKILDALMN-----LEAGGSTAGAEGIKLAY--------DVAKKNFIKSGNNR- 280 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ E + + K + +GI + + F + K + Sbjct: 281 VILATDGDFNVGISSEAELVRLIEKKRDEGIFLTVLGFGTGNYKDSKMESLADK--GNGN 338 Query: 428 FFEANSTHELNKIFRDRIGN 447 + ++ E K+ + +G Sbjct: 339 YAYIDNIAEARKVLVNEMGA 358 >gi|308094406|ref|ZP_05889083.2| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308095541|ref|ZP_05906689.2| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308125336|ref|ZP_05774598.2| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|308126258|ref|ZP_05908794.2| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308089078|gb|EFO38773.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308091462|gb|EFO41157.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308109129|gb|EFO46669.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308114017|gb|EFO51557.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 400 Score = 43.0 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 63/214 (29%), Gaps = 5/214 (2%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 +++ ++LGV +D+ + L+ A TA + +V + E+V Sbjct: 1 MVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAV-VADKTEDVDQAEAAVIATLS 59 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 E A P Y + + R + +S + Sbjct: 60 SIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYV--RVAVTDMGISQY 117 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY--QRDSEGQPLNCFG 205 L ++ + A A + + + I D + ++ D R P Sbjct: 118 LSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAWGYRPPGYDPNVDMD 177 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 +K N +G + +L + + S Sbjct: 178 PSLVHELKVGDQNNTDMGPGNFQLLDFGQATGNS 211 >gi|260592519|ref|ZP_05857977.1| BatB protein [Prevotella veroralis F0319] gi|260535565|gb|EEX18182.1| BatB protein [Prevotella veroralis F0319] Length = 331 Score = 43.0 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 39/115 (33%), Gaps = 14/115 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + T I A+ A Sbjct: 131 KLGLIVFAGDAFVQLPITSDYVSAKMFLDNINP-SLIGTQGTDIGKAINLA--------- 180 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +H N + K IV++TDGE+ + E A+ +A+ +GI++ + + Sbjct: 181 --MHSFTPNTQTGKAIVVITDGEDNEGGAE--AMAKQAQEKGIKVFILGIGSTQG 231 >gi|331647928|ref|ZP_08349020.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] gi|330912098|gb|EGH40608.1| hypothetical protein ECAA86_02478 [Escherichia coli AA86] gi|331043652|gb|EGI15790.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] Length = 580 Score = 43.0 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 62/166 (37%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 273 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 274 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FVKGGVNR--- 319 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T+ + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEA 365 >gi|297626138|ref|YP_003687901.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921903|emb|CBL56463.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 321 Score = 43.0 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 56/184 (30%), Gaps = 25/184 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + S+ NV F + RT +K Sbjct: 110 RLDAAKSGAKDFVDSLPSAFNVA------LVTFAGTANVKMPPT-----TDRTQLKAAID 158 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 STAI + + T+ D + + + IVLL+DG + Sbjct: 159 AIRLAPSTAIGEGIYTSLDVLEKLAPQDPDHPDDPAPGA--IVLLSDGATNMG-RDSADA 215 Query: 390 CNKAKSQGIRIMTIAFSVNKT---------QQEKARYFLSNCA--SPNSFFEANSTHELN 438 +AK + + I TIA+ + LS A S + A+S L Sbjct: 216 ATEAKKKNVPIYTIAYGTSTGYVVENGQRQTVAVNHAELSQVAKLSGGKKYSADSMKNLQ 275 Query: 439 KIFR 442 +++ Sbjct: 276 AVYQ 279 >gi|149920875|ref|ZP_01909337.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] gi|149818274|gb|EDM77727.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] Length = 367 Score = 43.0 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 48/297 (16%), Positives = 84/297 (28%), Gaps = 42/297 (14%) Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 G+S D D G D S+ + S Sbjct: 11 GLSACAKDDDDGGNPFSANDEASSITTDGGATEDEAGDDGSTTTPMYFDLPGEESQPTAE 70 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL-ASVIRSIKKI-DNVND 293 + + + + VD S AL ++ + + + D Sbjct: 71 GATTDCANVEVDTSPTTPTVVLLVDQSGSMWDDFGGQPRWVALENTLFDPVNGVVKPLED 130 Query: 294 TVRMGATFFNDR---------VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 VR G ++ +I++ + S+G H ++ TFA + T D+++ Sbjct: 131 QVRFGLALYSSMNGSFGGECPLITEFAPSFGNH---ASLAATFASAM-PLDDTPTGDSIK 186 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--------QDNEEGIAICNKAKSQ 396 +T+ + ED K IVL TDGE + + A Sbjct: 187 AVAETLAAFPED----------GPKIIVLATDGEPDTCAVPDPQEGQPLSLEATQAAFDD 236 Query: 397 GIRIMTIAFSVNKTQQEKARYFL--------SNCASPNSFFEANSTHELNKIFRDRI 445 GIR I+ N+ + F++ + EL F I Sbjct: 237 GIRTFVISVG-NQVTDAHLQELANAGVGLPTQGAVENAPFYKTLNPAELVSAFEAVI 292 >gi|47847422|dbj|BAD21383.1| mFLJ00114 protein [Mus musculus] Length = 1188 Score = 43.0 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 41/297 (13%), Positives = 91/297 (30%), Gaps = 44/297 (14%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 +Q EA +S + W++ ++ R++ CF + Sbjct: 100 LQVPPEAVNMSLGLSLAAATNPSWLLACGPTVHHTCRENIYLTGLCFLLSSSFKQSQNFP 159 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + ++ +++ + S+ + LD + + S R + + Sbjct: 160 TAQQECPKQDQDIVFLIDGSGSISSTDFEKMLDFVKAVMSQLQRPSTRFSLMQF------ 213 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 +D R+ FN+ + + S G T Sbjct: 214 --------------SDYFRVHF-TFNNFISTSSPLS------------LLDSVRQLRGYT 246 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A++ + ++ +A K ++++TDG DN ++ A++ Sbjct: 247 YTASAIKHVITELFTTQSGAR------QDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS 300 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEIFE 451 I I E ++ L AS S F + L I +++ +IF Sbjct: 301 IIRYAIGVG-KAFYNEHSKQELKAIASMPSHEYVFSVENFDALKDI-ENQLKEKIFA 355 >gi|194226347|ref|XP_001489610.2| PREDICTED: similar to Collagen, type VI, alpha 2 [Equus caballus] Length = 1019 Score = 43.0 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 78/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERINSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AY+ +I + + R+ + V++TDG + D+ A+CN + + Sbjct: 709 FAYNQLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCN----HDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHERHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|159184836|ref|NP_354603.2| hypothetical protein Atu1609 [Agrobacterium tumefaciens str. C58] gi|159140120|gb|AAK87388.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 579 Score = 43.0 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 42/301 (13%), Positives = 98/301 (32%), Gaps = 47/301 (15%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +K TG+ I L P+ +G+ + VD + + L+Q + Sbjct: 10 RFLKDKTGNIAISAGLTAPLFIGILALGVDYGYLTLQKRQLQQ---------------TA 54 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRN-FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + A + T ++ +++Y N + +K + + + + Y Sbjct: 55 DLAAISAAANATDAEKAVQQYFALNGMDLGVKTDQGLLTAKGLQPFDPLNEFASSKGYAE 114 Query: 132 VLSSRYD---------------LLLNPLSLFLRSMG---IKSWLIQTKAEAETVSRSYHK 173 V+ Y+ L N + + + G S + + + S K Sbjct: 115 VIKGHYEPDATVPVGKRFVDNALPTNAIKVNIVEQGQIFFASAFTKPPKVSAVGTASSQK 174 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 S+ S L + L ++K Q + Sbjct: 175 IAAFSV-------GSRLASLDEGILNSLLGGLLGTTVSLKLMDYQALLAADVNALKIVEA 227 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 ++ + +L Y L +S F+D V+ K ++ A+ +++ +++K N ++ Sbjct: 228 LAIDLNLTAGTYKDVLQTEISYGKFLD------VLTKTTGLQPAVVNILNTLQKTANKSN 281 Query: 294 T 294 Sbjct: 282 V 282 >gi|38347892|ref|NP_941141.1| putative tellurium resistance protein [Serratia marcescens] gi|226807627|ref|YP_002791321.1| TerY1 [Enterobacter cloacae] gi|226809937|ref|YP_002791631.1| TerY1 [Enterobacter cloacae] gi|1354148|gb|AAC44737.1| terY [Plasmid R478] gi|38259369|emb|CAE51594.1| putative tellurium resistance protein [Serratia marcescens] gi|226425852|gb|ACO53945.1| TerY1 [Enterobacter cloacae] gi|226426163|gb|ACO54255.1| TerY1 [Enterobacter cloacae] Length = 197 Score = 43.0 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 55/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 6 IEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSARQAVPLT--------DLLSFQMPA 57 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 58 LTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 115 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A++ G+ + + L + + + F+ Sbjct: 116 KAART-GVVV------ACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFK 160 >gi|149914292|ref|ZP_01902823.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] gi|149811811|gb|EDM71644.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] Length = 597 Score = 43.0 bits (99), Expect = 0.100, Method: Composition-based stats. Identities = 50/338 (14%), Positives = 96/338 (28%), Gaps = 45/338 (13%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G I++ L VML V G+ +D +R L+ +A++ Sbjct: 9 RFVTRDDGAITILSLFLFVVMLAVAGLGIDTMRHEMARTHLQATLDSAVL---------- 58 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + ++ + I E + + N D E + A Sbjct: 59 -AGAGAPADATAADVKLIVEDYFDAADLSQYLNTIDPETDIVASLNAKS----------- 106 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +S+ +L ++ +R G+ + A A + + I +D S SM Sbjct: 107 VSASVELEMDTF--LMRLSGVDTLTTAGGATAAIAAPR------MEIVLALDVSGSMAGE 158 Query: 193 QRDSEGQPLNCF-GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + F + + ++ + S M Y +D Sbjct: 159 RLTKMKSAAKQFVTDVMSASDQGTTTISIVPYSWSVTPSDEMFEALSVDVRHNYSTCIDF 218 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN---VNDTVRMGATFFNDRVIS 308 S+ R + + +L +I D R + D Sbjct: 219 LESDFEKTAIDPARSYGQTIY---TSLTGSFGNIGIGDPTVTNTAYDRT---CYTDEYFR 272 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +S V L I GST+ + M+ A Sbjct: 273 ILPYSNSVTALHNKIDS-----LKAAGSTSTHLGMKWA 305 Score = 40.0 bits (91), Expect = 0.95, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 7/77 (9%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIF 441 ++ IC KS+GI + TIAF + + A L NCA SP+ F A + + +++ F Sbjct: 527 DDRMADICGATKSKGIVVYTIAF--EMGEFDSAADRLENCASSPSQHFNATTLN-ISQAF 583 Query: 442 RDRIGNEIFERVIRITK 458 + +R+T+ Sbjct: 584 GSI---AANVQKLRLTQ 597 >gi|87308834|ref|ZP_01090973.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] gi|87288545|gb|EAQ80440.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] Length = 616 Score = 43.0 bits (99), Expect = 0.100, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 47/150 (31%), Gaps = 18/150 (12%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P+ + +D S + +K L++ + ++ + + D V V GA Sbjct: 247 NAERPASNLVFLLDVSGSMNNARKLPLLKQGMKLLVDQLGENDKVAIVVYAGAAGM---- 302 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + ++ + GST ++ AY + +K + Sbjct: 303 -----VLNSTNGDDKSTIMEALDRLQAGGSTNGGQGIELAYQAATEN------FIKGGVN 351 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 ++L TDG+ + A + Sbjct: 352 R---VILCTDGDFNVGVTSTSDLVTMAADK 378 >gi|301609302|ref|XP_002934202.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 904 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 42/253 (16%), Positives = 77/253 (30%), Gaps = 38/253 (15%) Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 +R S S+ + + D K +P N + D +D S Sbjct: 258 NRICNSRSTWDVIMNSTDIKSTPPQADSNIPVPSFSLLQSSD--RVVTLVLDVSGSMASG 315 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + I V + +G F+ + + + R +K+ Sbjct: 316 GRIERLYQAAEVFLMQI-----VEEGSYVGILTFSTSISLLSNLVQVIENTQRKQLKSL- 369 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + N G T I ++ +++R + +VLLTDGE+ D + Sbjct: 370 LPTNAFGGTDICLGIRE---------GIKINRQYDGSSYGTELVLLTDGEDNYDTSK--- 417 Query: 389 IC-NKAKSQGIRIMTIAFSVNKT-------QQEKARYFLSNCASPNSFFEANSTHELNKI 440 C + G I IA N + FL+ + T L + Sbjct: 418 -CFPDITNSGAIIHVIALGPNAAKALETIVDMTEGLRFLAT--------DKVDTQGLIEA 468 Query: 441 FRD-RIGNEIFER 452 F G+ + Sbjct: 469 FISLTAGDGATTQ 481 >gi|296269770|ref|YP_003652402.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092557|gb|ADG88509.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 315 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 59/192 (30%), Gaps = 37/192 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVN--DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + ++A +R + + NV R A + V Sbjct: 107 RLIAAKEAAQQFVRDLPERFNVGVVAFARTAAVVISPTTDHA-------------AVTNA 153 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 T+I +A+ A D+I S + + IVLL+DG+NT Sbjct: 154 IAGLTTRPGTSIGEAVFNALDSIRSFDREAA-----TDPPPAAIVLLSDGDNTSG-RPVS 207 Query: 388 AICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCASPNSFFEANST 434 A + + + TIA+ + + + + A S Sbjct: 208 EAIEAAANAKVPVSTIAYGTPDGYVMIDNRPVQVPVNKAALQELSEG--TGGRAYTAESA 265 Query: 435 HELNKIFRDRIG 446 EL ++++ IG Sbjct: 266 SELREVYQQ-IG 276 >gi|315185579|gb|EFU19348.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 459 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 66/208 (31%), Gaps = 37/208 (17%) Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + + + + + A+ + + + D R+G FN Sbjct: 102 WDALDGTPTEDPDRMRITHAKRAIREFLPLLSERD------RVGLAVFNRTYRMIQPIVD 155 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 ++ + + + AY + S E+ + + ++ +V+L Sbjct: 156 DPALVLEKLDAIE------------RPSREQAYTELYRSMEEALTSFEEEGR-RRVLVVL 202 Query: 375 TDGEN-----TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NS 427 +DGEN + + A GI I F + + + AS Sbjct: 203 SDGENFPVDPEKSPATPGTAVDLAHRYGITCYVIHFG------TEKDRLIGDLASETGGR 256 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIR 455 F+A + EL ++ I E+V++ Sbjct: 257 VFDARNALELASVYT-----AIQEQVLQ 279 >gi|119575268|gb|EAW54873.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_b [Homo sapiens] Length = 1179 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 45/296 (15%), Positives = 83/296 (28%), Gaps = 28/296 (9%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + + + SI V + +M + F + +Q G Sbjct: 88 STATCEKLNLQTSTSIPNVTEMKTNMSLGLILTRNMGTGGFLTCGPLWAQQCGNQYYTTG 147 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + S S + P + D S+ + V++ L ++ Sbjct: 148 V----CSDISPDFQLSASFSPATQPCPSLIDVVVVCDESNSIYPW---DAVKNFLEKFVQ 200 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-IVKTFAIDENEMGSTAINDA 342 + G + + + +K IV T + T A Sbjct: 201 GLDIGPTKTQV---GLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGA 255 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 +Q A S+ A K +V++TDGE + D A+ ++ I Sbjct: 256 IQYARKYAYSAASG------GRRSATKVMVVVTDGE-SHDGSMLKAVIDQCNHDNILRFG 308 Query: 403 IAFSV----NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA N + + AS FF + L + +G +IF Sbjct: 309 IAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEK-AGTLGEQIFS 363 >gi|241672104|ref|XP_002411442.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] gi|215504093|gb|EEC13587.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] Length = 1021 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 42/129 (32%), Gaps = 14/129 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + + ++ N T R+ F++ + F K + G+T Sbjct: 73 VHAFLKGFDVAPN---TTRVAVISFSEDAVVHADFLKDPGNKCHLSRKMQGVHSANQGAT 129 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 +Q A++ S KK ++L+TDG T + + K K+ G Sbjct: 130 NTGAGLQAAWEVFQRSRPTA----------KKLLILVTDGMATMGP-DPVKKAEKLKNMG 178 Query: 398 IRIMTIAFS 406 + I Sbjct: 179 VDIFVFGIG 187 >gi|198435216|ref|XP_002126368.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1274 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 41/278 (14%), Positives = 85/278 (30%), Gaps = 24/278 (8%) Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 + + S+ + + D S S Y ++ R V + + Sbjct: 90 TSCSKDISFDNNAAIGMSIGADSSFS-NLYICGNQHTTSCPTTPLQKRMVGACYKKPMTS 148 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 +P + C K + LD S S + + + ++ Sbjct: 149 STATMFKTPCVPGCPKIILIADIMFVLDDSSSVDD-------TAFRSALNWIIQVVSYFS 201 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 I D +R+G F++ G+ K +K + + ST Sbjct: 202 SYIDSGD-----LRVGVYGFSND-DHRSGIRIGLRKWTSATLKKQIGELLNVKSTGAGTY 255 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + A + E K I+LLTDG D + A+ +GI +++ Sbjct: 256 ISHAIKETVKVFEAN-----GRKGISKEIILLTDG-GASDWWLLKGEADTARDKGIVLVS 309 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + L+ + + F+A + L+++ Sbjct: 310 VGVGTSVNN----DQLLAIAGNKSRVFQATDYNTLDEV 343 >gi|254452693|ref|ZP_05066130.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198267099|gb|EDY91369.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 173 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 59/185 (31%), Gaps = 33/185 (17%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++ G I + ++M ++L GGM VD++R+ L+ + A + A+ Sbjct: 11 FQRFRNDEDGALIIFSLMMMIMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA----- 65 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 L+++ A + + + ++ + Y+ Sbjct: 66 DLDQIMPPADVVRDYLDKAGMLHFLQGE----------------------PTVSQGINYR 103 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS---IQWVIDFSR 187 VV F I + A VS + E V+ + V+D S Sbjct: 104 VV---SAQASAPMALFFYDLPRIFTSPFSPGMTAINVSGASTAEERVTDVEVSLVLDVSS 160 Query: 188 SMLDY 192 SM Sbjct: 161 SMNSL 165 >gi|326674128|ref|XP_002664631.2| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 540 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 6/104 (5%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A+ I KNN K I++L+DGE +D + NK + + Sbjct: 419 TKTASAINHVLTDIFIP----EKGSKNNTA--KIIIVLSDGEILEDPMTLDEVLNKPQMK 472 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 G+ +I ++ A + A P ++ +S LN I Sbjct: 473 GVTRYSIGVGDGILKKPNAVKEMMQIADPGKYYSVSSYGALNDI 516 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 41/106 (38%), Gaps = 9/106 (8%) Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAK--------KYIVLLTDGENTQDNEEGIAICNKAK 394 ++ Y+ + + H + + + K I++L+DG+ D + NK + Sbjct: 228 IKQIYN-LTKTASAINHVLTDIFIPENGSKDNSAKIIIVLSDGKILGDPMTLDEVLNKPQ 286 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +G+ +I + A + A P ++ +S LN I Sbjct: 287 MKGVTRYSIGVGDGILKNLDATEEMMQIADPGKYYNVSSYRALNDI 332 >gi|315298071|gb|EFU57340.1| von Willebrand factor type A domain protein [Escherichia coli MS 16-3] Length = 581 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 62/166 (37%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 219 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 274 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 275 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGINR--- 320 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T+ + + Sbjct: 321 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEA 366 >gi|114665186|ref|XP_001171182.1| PREDICTED: similar to leukointegrin alpha d chain, partial [Pan troglodytes] Length = 129 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 40/97 (41%), Gaps = 7/97 (7%) Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 H+ AKK ++++TDG+ +D E + +A+ GI I + Sbjct: 6 HKNGARKSAKKILIVITDGQKYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTARQEL 65 Query: 419 LSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERV 453 + ++P + F+ ++ L I ++ E++ Sbjct: 66 NTISSAPPQDHVFKVDNFAAL-----GSIQKQLQEKI 97 >gi|194289639|ref|YP_002005546.1| hypothetical protein RALTA_A1531 [Cupriavidus taiwanensis LMG 19424] gi|193223474|emb|CAQ69479.1| conserved hypothetical protein, vWA domain (Von Willebrand factor, type A); putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 359 Score = 43.0 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 52/150 (34%), Gaps = 17/150 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F D F+ H+L++T++ + M + + Sbjct: 140 RIGLIVFGDAPYPLAPFTLD-HRLVQTLIA------------DLLPGMAGPSTALGDAIG 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQE 413 + +++ +K +++LTDG +T AK + + + TI +++ Sbjct: 187 LGIKMFEHSEAPEKVLIVLTDGNDTASRMPPERAGGIAKERKVVVHTIGIGDPNASGEEK 246 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIF 441 L A + +F L I+ Sbjct: 247 VDLDVLQKLAAQTGGRYFFGADQAGLETIY 276 >gi|23465165|ref|NP_695768.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] gi|23325787|gb|AAN24404.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] Length = 383 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + G T I + + +A D + S +E IVL+TDG + D+++ Sbjct: 276 ADATDASGGTDIYEGLLSALDELPSESEASQ--------YTTAIVLMTDGRSNSDHQDEF 327 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K++ + + I +I F Q K+ L S F+ + +L +FR G Sbjct: 328 ESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGR-SGDLAAVFRQAKG 381 >gi|332654605|ref|ZP_08420348.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] gi|332516569|gb|EGJ46175.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] Length = 472 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 46/138 (33%), Gaps = 23/138 (16%) Query: 274 VRDALASVIRSIKKIDNVNDT------VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + I I + + + RMG F++ ++D +K Sbjct: 73 MKLGAKTFIDLIDEATDSSQDGQIGSGSRMGVVSFSNTAVADTQLI-----TSVDALKAA 127 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + + GST DA A + K +V+ TDG NT Sbjct: 128 VDNLSAGGSTNHADAFAKAIQ-LFDPASANA----------KVMVMFTDG-NTTIGAPPA 175 Query: 388 AICNKAKSQGIRIMTIAF 405 + A++QGI I I Sbjct: 176 PVAAAARAQGIIIYCIGL 193 >gi|189220466|ref|YP_001941106.1| hypothetical protein Minf_2455 [Methylacidiphilum infernorum V4] gi|189187324|gb|ACD84509.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 340 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 46/149 (30%), Gaps = 22/149 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + S L + + D G T + A SS Sbjct: 130 RVGLVAFTKNAFIEAPLSTDYELLEEILSELSPDD-FPNGGTNFAAMLDEALQFFSSSGR 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 KK ++LL+DGE+ + + K + I +++I + + Sbjct: 189 S-----------KKMLILLSDGEDHGGGWQQRLV--DFKKESIPVLSIGIGSSNGAVIRN 235 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDR 444 S S ++ + + IF Sbjct: 236 --------SNGSLYKDYNGEPIVSIFNPA 256 >gi|149620125|ref|XP_001521750.1| PREDICTED: similar to Collagen, type VI, alpha 1 [Ornithorhynchus anatinus] Length = 1163 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 56/136 (41%), Gaps = 20/136 (14%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNED 356 GA ++D V + +K +G T + A++ + ++ Sbjct: 237 GALHYSDSVELIQGLTRMPSGQKN--LKDRVEAVQYIGKGTHTDCAIKRGIEELLIGG-- 292 Query: 357 EVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ 412 ++ + KY++++TDG E ++ G+ N+AK GI++ +IA S + Sbjct: 293 ------SHQKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSIAISPGHLEP 346 Query: 413 EKARYFLSNCASPNSF 428 LS A+ +++ Sbjct: 347 R-----LSIIATDHTY 357 >gi|160896215|ref|YP_001561797.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160361799|gb|ABX33412.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 536 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 35/161 (21%), Positives = 59/161 (36%), Gaps = 26/161 (16%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G RV G + V +A G TAI DA+ A + E Sbjct: 393 SGLVGQPARVQFAA----GDLQAASAQVLAYADSLVADGGTAIYDALTLAQQQ--ARQEL 446 Query: 357 EVHRMKNNLEAKKY--IVLLTDGENTQD-NEEGIAICNKAKSQG----IRIMTIAFSVNK 409 + +++ IVLLTDG NT + + G +R+ I F + Sbjct: 447 RA-------DPERFVSIVLLTDGANTAGRDWAAFEREQRMARDGGAPLVRVFPIIFGEAQ 499 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHE--LNKIFRDRIGNE 448 + + +A L+ F+A +T + L +F++ G + Sbjct: 500 SGEMQALAALTG----GRAFDARNTGKSGLPLVFKEIRGYQ 536 >gi|116295258|ref|NP_002194.2| integrin alpha-2 precursor [Homo sapiens] gi|21105795|gb|AAM34795.1|AF512556_1 integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [Homo sapiens] gi|119575267|gb|EAW54872.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_a [Homo sapiens] gi|151556518|gb|AAI48597.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|162319056|gb|AAI56716.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|168278403|dbj|BAG11081.1| integrin alpha-2 precursor [synthetic construct] Length = 1181 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 45/296 (15%), Positives = 83/296 (28%), Gaps = 28/296 (9%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + + + SI V + +M + F + +Q G Sbjct: 88 STATCEKLNLQTSTSIPNVTEMKTNMSLGLILTRNMGTGGFLTCGPLWAQQCGNQYYTTG 147 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + S S + P + D S+ + V++ L ++ Sbjct: 148 V----CSDISPDFQLSASFSPATQPCPSLIDVVVVCDESNSIYPW---DAVKNFLEKFVQ 200 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-IVKTFAIDENEMGSTAINDA 342 + G + + + +K IV T + T A Sbjct: 201 GLDIGPTKTQV---GLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGA 255 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 +Q A S+ A K +V++TDGE + D A+ ++ I Sbjct: 256 IQYARKYAYSAASG------GRRSATKVMVVVTDGE-SHDGSMLKAVIDQCNHDNILRFG 308 Query: 403 IAFSV----NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA N + + AS FF + L + +G +IF Sbjct: 309 IAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEK-AGTLGEQIFS 363 >gi|281416565|ref|ZP_06247585.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281407967|gb|EFB38225.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939671|gb|ADU73705.1| Protein of unknown function DUF3520 [Clostridium thermocellum DSM 1313] Length = 538 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 71/200 (35%), Gaps = 23/200 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D S K L++ A ++ + + D V+ V GA ++ D Sbjct: 181 PPSNLVFLIDVSGSMDEPNKLPLLKSAFKLLVDELDEDDRVSIVVYAGAAG----LVLDS 236 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + K++ ++ GSTA + ++ AY D + + Sbjct: 237 TPGNEKDKILDALMN-----LEAGGSTAGAEGIKLAY--------DVAKKNFIKSGNNR- 282 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ E + + K + +GI + + F + K + Sbjct: 283 VILATDGDFNVGISSEAELVRLIEKKRDEGIFLTVLGFGTGNYKDSKMESLADK--GNGN 340 Query: 428 FFEANSTHELNKIFRDRIGN 447 + ++ E K+ + +G Sbjct: 341 YAYIDNIAEARKVLVNEMGA 360 >gi|254420639|ref|ZP_05034363.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] gi|196186816|gb|EDX81792.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] Length = 613 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 65/208 (31%), Gaps = 23/208 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P L+ VD S K L + A+ I ++ D ++ T++ + Sbjct: 241 PQGEQRPLNLTFLVDVSGSMRSPDKLDLAKQAMNLAIDRLRPQDTLS------VTYYAEG 294 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + G KL ++ G TA M AY D+ Sbjct: 295 AGTTLQPTPGDQKLK---MRCAVASLRASGGTAGATGMTNAY--------DQAQASFARD 343 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNK-AKSQGIRIM-TI-AFSVNKTQQEKARYFLSNC 422 + + ++ TDG+ + + + A+ +G + ++ F Q + + Sbjct: 344 KVNRILMF-TDGDFNVGVTDNKRLEDYVAEKRGTGVYLSVYGFGRGNYQDARMQTIAQAG 402 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIF 450 ++ + ++F F Sbjct: 403 NGVAAY--VGDLRDARRLFGPMFDKGAF 428 >gi|75750453|ref|YP_319892.1| hypothetical protein ATV_gp61 [Acidianus two-tailed virus] gi|123849288|sp|Q3V4Q4|Y892_ATV RecName: Full=Putative VWFA domain-containing protein ORF892 gi|74474836|emb|CAI59910.1| hypothetical protein [Acidianus two-tailed virus] Length = 892 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 33/73 (45%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + + G T I A+ A I + D +++ +L + ++LLTDGE+ ++ Sbjct: 780 VLGSMKFGGTNIGSAVLYALKNIDKPDSDYDRKLRESLRKTRTLILLTDGEDEIPDDIAR 839 Query: 388 AICNKAKSQGIRI 400 I + K + + Sbjct: 840 EINSLKKKNKVEL 852 >gi|294673503|ref|YP_003574119.1| BatB/BatC protein [Prevotella ruminicola 23] gi|294471951|gb|ADE81340.1| putative BatB/BatC protein [Prevotella ruminicola 23] Length = 566 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 38/127 (29%), Gaps = 15/127 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 +D+ + ++G F + + + + T I A+ Sbjct: 121 NLVDHFTND-KIGLLVFAGDAFVQLPITSDYVSA-KMFLSSIDPSMMATQGTDIARAIDM 178 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A H K I+++TDGE+ + + AK G+R+ + Sbjct: 179 A-----------THSFTQEEGIGKAIIVITDGEDHEG--GALESAEAAKKAGMRVYVLGV 225 Query: 406 SVNKTQQ 412 + Sbjct: 226 GSTQGAP 232 >gi|90413889|ref|ZP_01221875.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] gi|90325073|gb|EAS41583.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] Length = 714 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 47/353 (13%), Positives = 101/353 (28%), Gaps = 37/353 (10%) Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV----VLSSRYDLLLN 142 K + E N + N E + + + R + + V++ RY + + Sbjct: 152 KASLVEQHRPNIFSTQVANIAPDESVTVEIEYQEAVLYRDGEFSLRFPTVVAPRY-IPVV 210 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 PL+ I + ++ + T+ S + + + + + S Q+ Sbjct: 211 PLNKVPDVNEI-TPFLRDLQDDPTLPFSLNIDLNAGLPIAVINTPSHAFTQQKLSEDHYI 269 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 D + + + Y +L P + S+ Sbjct: 270 LSLIQPDIADRDVVLSWRPKATDLPSTALFTQHVEGQGYGLLLTMPPQVNHQVNSTTSSA 329 Query: 263 ------------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 S + + AL ++ ++ D+ N FN + Sbjct: 330 LFHQSVTFVLDISGSMYGESIEQAKQALRYGLQQLQPEDSFN------IVTFNHEAMLYS 383 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 V T F + G T + A++ A+ + H N+ Sbjct: 384 EQLLPVTSSTITRALRFVDGLDADGGTEMAAALKAAFSI-------KTHDQLNSTRWLNQ 436 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSN 421 IV +TDG ++ A+ + + Q + R+ T+ R + Sbjct: 437 IVFITDGSVGNES----ALFDLIEQQLVDRRLFTVGIGSAPNSYFMTRAAMKG 485 >gi|87199929|ref|YP_497186.1| hypothetical protein Saro_1912 [Novosphingobium aromaticivorans DSM 12444] gi|87135610|gb|ABD26352.1| conserved hypothetical protein [Novosphingobium aromaticivorans DSM 12444] Length = 435 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 68/212 (32%), Gaps = 3/212 (1%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L+++ G+ ++ A P ++ G VD +W ++ ++ AA A ++A+ + Sbjct: 15 FRRLLRARAGNATLLMAFGAPALIATAGFAVDTAQWYLWKREMQYAADQAALSAAYSKSK 74 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + + +L +VR +A Sbjct: 75 GISTTAYETHAVQEYNANLQLVTFSDTPTVSLAAYNGGTNNSVVVRASATRELAFSG--- 131 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 +VL + ++ + + S +I T A A+ S + S S Sbjct: 132 IVLGKPTTVSVSAQATYTAGATYTSCIIATNATADGAITIGGSSILKSGCGIAALSNSTN 191 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 + D + A +++ V Sbjct: 192 AIKVDGSPTIDVNYVLAAGGIDDWFNTNTDDV 223 >gi|88857796|ref|ZP_01132439.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] gi|88820993|gb|EAR30805.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] Length = 608 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 53/168 (31%), Gaps = 21/168 (12%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D S + K L++ +L + + + + D+V V GA Sbjct: 243 NLVFLLDVSGSMNAPDKLPLLKSSLTMLTKQLDENDSVAIVVYAGAAGLVLPA------- 295 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ + + GST ++ AY I S N + + ++L Sbjct: 296 --TKGNEYQVISNALNNLSAGGSTNGAQGIELAYQ-IASQNFKKEGINR--------VIL 344 Query: 374 LTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 TDG+ + + + GI + T+ F Sbjct: 345 ATDGDFNVGMSSVDALKKLIANKRKTGIALTTLGFGQGNYNDGLMEQL 392 >gi|325927536|ref|ZP_08188772.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] gi|325542075|gb|EGD13581.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] Length = 501 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 23/204 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S K L++ +L ++R ++ D R+ + + Sbjct: 130 PPANLVFLVDVSGSMDAPDKLPLLQSSLKLLVRQLRAQD------RITLVTYAGNISVVL 183 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + G + + GSTA ++ AY ++ + + N Sbjct: 184 PPTPGDQQGR---IVEAIDALQSGGSTAGASGIELAYK---AAQQGYLRGGINR------ 231 Query: 371 IVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ ++ + + + GI + T+ F A + Sbjct: 232 ILLATDGDFNVGVTNFDQLKGMVAEKRRSGIALSTLGFGTGNYNDNLMEQLAD--AGDGA 289 Query: 428 FFEANSTHELNKIFRDRIGNEIFE 451 + ++ E K+ +G + Sbjct: 290 YAYIDTALEARKVLTHELGATLAT 313 >gi|16765642|ref|NP_461257.1| hypothetical protein STM2315 [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|167992650|ref|ZP_02573747.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|197262795|ref|ZP_03162869.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|16420855|gb|AAL21216.1| putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|197241050|gb|EDY23670.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205329241|gb|EDZ16005.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|261247522|emb|CBG25349.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267994407|gb|ACY89292.1| hypothetical protein STM14_2853 [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301158873|emb|CBW18386.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|312913305|dbj|BAJ37279.1| hypothetical protein STMDT12_C23360 [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] gi|321222984|gb|EFX48055.1| hypothetical protein SEE_04302 [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] gi|323130645|gb|ADX18075.1| Putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] gi|332989248|gb|AEF08231.1| hypothetical protein STMUK_2345 [Salmonella enterica subsp. enterica serovar Typhimurium str. UK-1] Length = 593 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L+R AL ++ ++ DN+ G V Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 285 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 286 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 331 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 332 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 389 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 390 YSYIDSLSEAQKVLKD 405 >gi|290996921|ref|XP_002681030.1| predicted protein [Naegleria gruberi] gi|284094653|gb|EFC48286.1| predicted protein [Naegleria gruberi] Length = 353 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 31/157 (19%), Positives = 58/157 (36%), Gaps = 16/157 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 IDN+ + R+G F+ + + + V L + +K + E GST MQ Sbjct: 20 IDNLREFERLGIVLFDHKAETLLPLTI-VQDLDKKSLKETVLKIEEQGSTNFEAGMQRGI 78 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMT--IA 404 D + + ++ I+ LTD + + I K + G I + I Sbjct: 79 DLFSTLDSSDLSNSNR-------IIYLTDACPNVGGTDTLDILTKDANSGPYNIFSTFIG 131 Query: 405 FSVN-KTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 ++ + + + C ++F ST + KI Sbjct: 132 IGLDFNSDIVEELTRVRGC----NYFSVRSTEDFTKI 164 >gi|26248659|ref|NP_754699.1| hypothetical protein c2813 [Escherichia coli CFT073] gi|91211565|ref|YP_541551.1| hypothetical protein UTI89_C2553 [Escherichia coli UTI89] gi|117624462|ref|YP_853375.1| hypothetical protein APECO1_4292 [Escherichia coli APEC O1] gi|227887329|ref|ZP_04005134.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|300983332|ref|ZP_07176546.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|301049025|ref|ZP_07196011.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|26109064|gb|AAN81267.1|AE016763_226 Hypothetical protein yfbK [Escherichia coli CFT073] gi|91073139|gb|ABE08020.1| hypothetical protein YfbK [Escherichia coli UTI89] gi|115513586|gb|ABJ01661.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|227835679|gb|EEJ46145.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|294491429|gb|ADE90185.1| von Willebrand factor type A domain protein [Escherichia coli IHE3034] gi|300299173|gb|EFJ55558.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|300408590|gb|EFJ92128.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|307554335|gb|ADN47110.1| von Willebrand factor type A domain protein [Escherichia coli ABU 83972] gi|307626191|gb|ADN70495.1| hypothetical protein UM146_05450 [Escherichia coli UM146] gi|315285863|gb|EFU45301.1| von Willebrand factor type A domain protein [Escherichia coli MS 110-3] gi|315292192|gb|EFU51544.1| von Willebrand factor type A domain protein [Escherichia coli MS 153-1] gi|323952065|gb|EGB47939.1| von Willebrand protein type A [Escherichia coli H252] gi|323956039|gb|EGB51792.1| von Willebrand protein type A [Escherichia coli H263] Length = 580 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 62/166 (37%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 273 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 274 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGINR--- 319 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T+ + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEA 365 >gi|262197272|ref|YP_003268481.1| hypothetical protein Hoch_4090 [Haliangium ochraceum DSM 14365] gi|262080619|gb|ACY16588.1| Myxococcales GC_trans_RRR domain protein [Haliangium ochraceum DSM 14365] Length = 602 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 46/128 (35%), Gaps = 13/128 (10%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--- 377 R+ + G T I + ++ ++ + + R + ++LL+DG Sbjct: 225 RSEMHALVDTLQPGGGTNIYEGLERGFEIAKEARVNHPDRAQR-------VILLSDGLAT 277 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 E D+ IA+ G+ + T+ + E R A +F+ + Sbjct: 278 EGITDSASIIALSEAFIEGGMGLTTVGVGASFN-VELMRGLAERGA--GNFYFVEDPEAV 334 Query: 438 NKIFRDRI 445 ++F + + Sbjct: 335 REVFTEEL 342 >gi|218559186|ref|YP_002392099.1| hypothetical protein ECS88_2420 [Escherichia coli S88] gi|218365955|emb|CAR03699.1| conserved hypothetical protein [Escherichia coli S88] Length = 580 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 62/166 (37%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 273 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 274 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGINR--- 319 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T+ + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEA 365 >gi|124942|sp|P17301|ITA2_HUMAN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|33907|emb|CAA34894.1| unnamed protein product [Homo sapiens] Length = 1181 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 45/296 (15%), Positives = 83/296 (28%), Gaps = 28/296 (9%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + + + SI V + +M + F + +Q G Sbjct: 88 STATCEKLNLQTSTSIPNVTEMKTNMSLGLILTRNMGTGGFLTCGPLWAQQCGNQYYTTG 147 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + S S + P + D S+ + V++ L ++ Sbjct: 148 V----CSDISPDFQLSASFSPATQPCPSLIDVVVVCDESNSIYPW---DAVKNFLEKFVQ 200 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-IVKTFAIDENEMGSTAINDA 342 + G + + + +K IV T + T A Sbjct: 201 GLDIGPTKTQV---GLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGA 255 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 +Q A S+ A K +V++TDGE + D A+ ++ I Sbjct: 256 IQYARKYAYSAASG------GRRSATKVMVVVTDGE-SHDGSMLKAVIDQCNHDNILRFG 308 Query: 403 IAFSV----NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFE 451 IA N + + AS FF + L + +G +IF Sbjct: 309 IAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEK-AGTLGEQIFS 363 >gi|119493582|ref|ZP_01624246.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452572|gb|EAW33755.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 414 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 49/157 (31%), Gaps = 25/157 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A +I + D R+ F+ R +K Sbjct: 59 LETVKKAAGELIDRLNPGD------RISVVVFDHRAKVLIP---NQDIDDPESIKKQINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-IAI 389 G T+I++ ++ + + + K ++ + LLTDGEN + + + Sbjct: 110 LRTSGGTSIDEGLKLGIEELG--------KGKVERISQAF--LLTDGENEHGDNNRCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 A + + ++ F + + L A Sbjct: 160 AKLATDYNLTLNSLGF-----GNDWNQDILEKIADEG 191 >gi|317483399|ref|ZP_07942390.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915154|gb|EFV36585.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 401 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + G T I + + +A D + S +E IVL+TDG + D+++ Sbjct: 294 ADATDASGGTDIYEGLLSALDELPSESEASQ--------YTTAIVLMTDGRSNSDHQDEF 345 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K++ + + I +I F Q K+ L S F+ + +L +FR G Sbjct: 346 ESSYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGR-SGDLAAVFRQAKG 399 >gi|145549918|ref|XP_001460638.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124428468|emb|CAK93241.1| unnamed protein product [Paramecium tetraurelia] Length = 562 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 67/220 (30%), Gaps = 18/220 (8%) Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + + F D + K + CNK L+ + F Sbjct: 322 RRVENFEWDFDPKQNIDQKKQLKDKLIKIWQEYGQDICNKLQKSYQNIQQLNHNRVHYIF 381 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT-VRMGATFFNDRVISDPSFSWGVH 317 + SS ++ + I+ IK+ D V + + FN + ++ Sbjct: 382 ILDSSESMNKDWTD-IKKGVREFIKKIKEKDQVENKEFWISLILFNKEQTTL------IN 434 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +KT G T ++ A + + N ++ I+ TDG Sbjct: 435 SKRARDIKTKFKMNFLGGGTNFGKPIKKAINLVKKDNTSDLF----------LILFYTDG 484 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + +E + N + + +I IA + + Q K Sbjct: 485 KAAIPEQELKKMQNLEEEKRKKIHLIACTHDSNTQNKVLN 524 >gi|313238340|emb|CBY13422.1| unnamed protein product [Oikopleura dioica] Length = 345 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 26/113 (23%), Positives = 40/113 (35%), Gaps = 17/113 (15%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T A++ A + + K I+L+TDG+ T I K K Sbjct: 24 GKTNTGGALERAQQML----------AEGRPSVPKIILLITDGDATDKERLDAQI-EKLK 72 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDRIG 446 I I TI + E L+ A+ F +E ++KI +G Sbjct: 73 KSNILIYTIGVGDLIDRNE-----LNRIATDEDFVYETRDFDSISKIKSSLLG 120 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 53/149 (35%), Gaps = 21/149 (14%) Query: 278 LASVIRSIKKI-DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + + ++ + D + V++G ++ V SD + G V+ + G Sbjct: 159 AKNFVANVSSVFDLRSGDVQVGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGD 218 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-AICNKAKS 395 T A++ I ++N E K ++ +TDG T + + A + Sbjct: 219 THTGTALRY----ISTNNRWRE-------EVPKILIFVTDG--TPQDRAIVPAAARSLRD 265 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +G+RI I + L AS Sbjct: 266 KGVRIFAIGVGNAVESE------LKEIAS 288 >gi|260856317|ref|YP_003230208.1| hypothetical protein ECO26_3261 [Escherichia coli O26:H11 str. 11368] gi|260868996|ref|YP_003235398.1| hypothetical protein ECO111_3021 [Escherichia coli O111:H- str. 11128] gi|300903656|ref|ZP_07121573.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|301303269|ref|ZP_07209394.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|257754966|dbj|BAI26468.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257765352|dbj|BAI36847.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|300404332|gb|EFJ87870.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|300841443|gb|EFK69203.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|315255206|gb|EFU35174.1| von Willebrand factor type A domain protein [Escherichia coli MS 85-1] gi|323156423|gb|EFZ42578.1| von Willebrand factor type A domain protein [Escherichia coli EPECa14] gi|323176802|gb|EFZ62392.1| von Willebrand factor type A domain protein [Escherichia coli 1180] Length = 584 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 222 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 277 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 278 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAAKG------FIKGGINR--- 323 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 324 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 369 >gi|149918749|ref|ZP_01907236.1| batB protein [Plesiocystis pacifica SIR-1] gi|149820350|gb|EDM79766.1| batB protein [Plesiocystis pacifica SIR-1] Length = 421 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 34/135 (25%), Positives = 49/135 (36%), Gaps = 22/135 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD------- 348 R+G F + + L + EN G TAI A+ + D Sbjct: 147 RVGVVIF-AGAARSFPVTSDMGVLSLFLAHADPRTENP-GGTAIGKALDKSIDLLVAVRR 204 Query: 349 --------TIISSNEDEV---HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 + EDE EA + IVLLTDGE+T E +A +A+ G Sbjct: 205 DDSGARADQVEGEGEDESGAPEAAPALSEADQVIVLLTDGEDTVGRPEEVAA--RAEQLG 262 Query: 398 IRIMTIAFSVNKTQQ 412 IRI T+ + + Sbjct: 263 IRIYTVGIGSDSGEP 277 >gi|300786826|ref|YP_003767117.1| hypothetical protein AMED_4949 [Amycolatopsis mediterranei U32] gi|299796340|gb|ADJ46715.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 326 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 72/202 (35%), Gaps = 33/202 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S N+ + +G F + + + +I+ I Sbjct: 109 TRLQAAQEAATSF------ARNMTPGINLGLISFAGTATVLVNPTTDRNGVIKAIENLKL 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 STA + + A ++ S + + + IVL++DG E+ Sbjct: 163 AQ-----STATGEGIFAALQSVESFSSLV--GGADGPPPAR-IVLMSDGKQTVPEDLYAA 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKARYF------LSNCA--SPNSFFEAN 432 G AK G+ I +I+F +K + L A S F++A Sbjct: 215 RGGYTAAQAAKQAGVPISSISFGTTHGSVTIDDKPQPVSVDDESLREIARLSGGDFYKAA 274 Query: 433 STHELNKIF---RDRIGNEIFE 451 S EL K++ ++IG E+ + Sbjct: 275 SAEELKKVYADLGEQIGYELKD 296 >gi|271966806|ref|YP_003341002.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] gi|270509981|gb|ACZ88259.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] Length = 315 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 63/190 (33%), Gaps = 33/190 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + + + +G F S + V Sbjct: 107 RITAAKEAAQKFVEDLP------ERFNVGVVAFARSASVVVSPT-----TDHQAVSASLG 155 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + TAI +A+ ++++ + + + + IVLL+DG+NT Sbjct: 156 NLTTRAGTAIGEAV---FNSLDAVRSFDQQAVTD--PPPAAIVLLSDGDNTSG-RSVAEA 209 Query: 390 CNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCASPNSFFEANSTHE 436 + A S + I TIA+ + + + +EA S + Sbjct: 210 VDAAMSARVPISTIAYGTQEGTVSIDGRDVNVPVNKATLQTLSEG--TSGRAYEAESGSQ 267 Query: 437 LNKIFRDRIG 446 L +++ ++IG Sbjct: 268 LREVY-EQIG 276 >gi|260828797|ref|XP_002609349.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] gi|229294705|gb|EEN65359.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] Length = 421 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 49/162 (30%), Gaps = 18/162 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D D R+G F+ + + ++ + G T + + Sbjct: 58 VDAFDIAADFTRVGVVQFSSFFTEEFPLDR--YSDKASLKQAIGNIPQRGGGTLLGQVIN 115 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTI 403 +T + + + + + + VL+TDG + +A + ++ GI +I Sbjct: 116 YLVNTSFTEAK-GARPLSDGIP--RIAVLMTDGSAHDNPTTVLAPAIDALRASGIIAFSI 172 Query: 404 AFSVNKTQQEKARYFLSNCASP-------NSFFEANSTHELN 438 R L A ++ + +L Sbjct: 173 GV-----GPSVNRDQLEAVAGDTDRVFLVGAYSVIDDIRDLL 209 >gi|325688744|gb|EGD30753.1| von Willebrand factor type A [Streptococcus sanguinis SK115] Length = 551 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 79/225 (35%), Gaps = 15/225 (6%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D + + + + + K+ S K+ + + S+ + Sbjct: 313 NDATKQEVLKKFSSYVLEGNNQSKATSYGFNKLDDYSFEEQTTDGNLLMSMQNLWKKNKN 372 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + FV S + + ++ +L + + +N+ ++G ++D V + Sbjct: 373 NSQPIVGVFVTDVSGSMDGEPMNNLKKSL------LNSLQYINEENQIGLVSYSDDVTIN 426 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 ++ ++ + G TA D A I+ +MK N A+ Sbjct: 427 VPID-TMNSTQKSYFTSAIKGLTPSGGTATYDGTLVAVKMILD-------KMKENPGARP 478 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 I +L+DG+ T E + K+ GI + TI ++ + + K Sbjct: 479 VIFVLSDGQ-TNGGYEFERVEPIIKALGITVNTIGYNADLKELTK 522 >gi|126732236|ref|ZP_01748037.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] gi|126707318|gb|EBA06383.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] Length = 710 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 53/174 (30%), Gaps = 21/174 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P ++ +D+S K L++ + ++ + + D V+ G+ RV+ Sbjct: 349 PPMNLVFLIDTSGSMQDANKLPLLKQSFRLMLGQLGEEDMVSIVTYAGSAG---RVLEP- 404 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 R + GSTA +Q AY T D Sbjct: 405 -----TKASDRQTILDALDRLEAGGSTAGQAGLQQAYATATEMARDGAVSR--------- 450 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 ++L TDG+ D ++ + G + + F + + Sbjct: 451 VILATDGDFNVGISDPDDMKDYIETQRGTGTYLSVLGFGRGNLDDATMQALAQH 504 >gi|304406204|ref|ZP_07387861.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304344788|gb|EFM10625.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 762 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 46/261 (17%), Positives = 78/261 (29%), Gaps = 31/261 (11%) Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP----GPLDPSLSEEH 257 N P S +N ++ + ++S Sbjct: 298 NTDEYPKVNVYFSLYDENNQLVEDMNPVKTAFTVKEGDKETKNASFSKLTEKPQAISTNL 357 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 +D S K V+DA + N +G F+D + Sbjct: 358 VIDVSDSMSEDNKLTKVKDAATQFLSHASFASNDV----VGLMSFSD----ASNIRQSDF 409 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +K+ G TA+ +A+ A N++E KY+V+ TDG Sbjct: 410 TTEIESIKSSIAGMQTSGCTALYEALNQAVSN----------TAYNSVEGSKYVVVFTDG 459 Query: 378 ENTQDN----EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 +NT + + N A G+ I I + Q+ A + N Sbjct: 460 KNTICDGTNWVSPSTVINNALQWGVPIYAIGVEEDADLQQIAEQ-----TNGQYHVLGND 514 Query: 434 THELNKIFRDRIGNEIFERVI 454 +LN I+ D N+ + VI Sbjct: 515 FTDLNAIYSDIYTNKKKQYVI 535 >gi|52549995|gb|AAU83844.1| cell surface protein [uncultured archaeon GZfos34G5] Length = 1357 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 68/175 (38%), Gaps = 23/175 (13%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSW-GVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D++ D R+G FN S G + + +K ++ + T ++ MQ A Sbjct: 1001 LDHLEDDDRLGLVLFNTGAELAEPVSLIGAKNMQK--LKGDVLEISATDGTRLSAGMQMA 1058 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTD-----GENTQDNEEGIAICNKAKSQGIRIM 401 + E N E + I+ LTD G+ ++++ G+ N K+ + Sbjct: 1059 TELYDEFLE------VNQSEYENRIIFLTDAMPNLGQTSEESLLGMTEANANKN----VY 1108 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 T + + +++ +++ +S + F++R+ +E V + Sbjct: 1109 TTFIGIGVDFNTELVEYITKIRG-ANYYSVHSAKQ----FKERMDDEFEYMVTPL 1158 >gi|222623880|gb|EEE58012.1| hypothetical protein OsJ_08791 [Oryza sativa Japonica Group] Length = 759 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 41/118 (34%), Gaps = 16/118 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 332 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVY 385 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I + ++ I + KN + + I+LL+DG++T Sbjct: 386 -SLTSNGGTNIAEGLRKGSKVIED------RQAKNPVCS---IILLSDGQDTYTVSPT 433 >gi|168465984|ref|ZP_02699854.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195631159|gb|EDX49719.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 593 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L+R AL ++ ++ DN+ G V Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 285 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 286 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 331 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 332 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 389 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 390 YSYIDSLSEAQKVLKD 405 >gi|161612960|ref|YP_001586925.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161362324|gb|ABX66092.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 593 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L+R AL ++ ++ DN+ G V Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 285 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 286 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 331 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 332 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 389 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 390 YSYIDSLSEAQKVLKD 405 >gi|23016188|ref|ZP_00055947.1| hypothetical protein Magn03010637 [Magnetospirillum magnetotacticum MS-1] Length = 408 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 21/157 (13%), Positives = 45/157 (28%), Gaps = 20/157 (12%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++L G I+ + M M+G + + W + L+ A A + + L Sbjct: 8 RRLCTGTEGAVAIMVGIGMTAMIGFLALGTETGLWYAAKRNLQSVADAAALGGAFELGSG 67 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 A + + + + + +V Sbjct: 68 SNSSVISAAAIQDAGRNGFQATGGATIAVHTPPASGKYAGNPQMVEVSVS---------- 117 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 P +L ++ +KS + +A A+T S Sbjct: 118 ----------QPTTLLFSALFLKSLQVNARAVAKTGS 144 >gi|325845103|ref|ZP_08168414.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] gi|325488845|gb|EGC91243.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] Length = 315 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 39/267 (14%), Positives = 71/267 (26%), Gaps = 43/267 (16%) Query: 165 ETVSRSYHKEHGVSIQWVIDFS--RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 E + + S + ++D S S + LN + + Sbjct: 49 EESTEGDEGQKDDSYEDLLDESVRTSESNKLETVIQTNLNQSLFVTSASNVKPQIDFTYL 108 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 GI + E V S K +++A I Sbjct: 109 GITPINPMQGQEFTVRYKLTPNPFQHNISKPKEIVLVLDGSGSMSGTKLTNLKNAAKDFI 168 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT---------------- 326 +K +DN+ + F+ +P G K+ T + Sbjct: 169 DRLKGVDNLK----VAIVVFSSNATINPISVSGTTKIKSTDKSSESSIPNYKTLQNEYFL 224 Query: 327 ---------FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + G T D ++ A + K + A K I+L++DG Sbjct: 225 DINDSRLITMINNIDAQGGTNTGDGLRKAEYLLS---------QKGDSVANKTIILMSDG 275 Query: 378 ENT---QDNEEGIAICNKAKSQGIRIM 401 T E G+ + K + I Sbjct: 276 LPTYYSGSTESGVNYYKEIKDDVVGIF 302 >gi|226226934|ref|YP_002761040.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090125|dbj|BAH38570.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 565 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/125 (16%), Positives = 52/125 (41%), Gaps = 14/125 (11%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 +R ++ + + R+ F R + G + + ++++ Sbjct: 128 VRRLRAMSPAD---RVALIAFAGRSYILTPLT-GDDGALELFLDNLDPGVVGQAGSSLSR 183 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A++ + +++S+ + +VLL+DGE+ E+ + ++A S+GI ++ Sbjct: 184 AIRQGSELLLASDGSA----------DRALVLLSDGESFDSAEDIESAASEAGSKGISVV 233 Query: 402 TIAFS 406 T+ F Sbjct: 234 TVGFG 238 >gi|183981216|ref|YP_001849507.1| hypothetical protein MMAR_1194 [Mycobacterium marinum M] gi|183174542|gb|ACC39652.1| conserved hypothetical protein [Mycobacterium marinum M] Length = 772 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 54/174 (31%), Gaps = 22/174 (12%) Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 SL + P +D S K R A ++ + D Sbjct: 290 EGTWSLTLVPPAEPSSAPRDVVVVLDRSGSMGGW-KMVAARRAAGRIVDMLDAGD----- 343 Query: 295 VRMGATFFNDRVISDPSFSWGV---HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R F+DR+ + P+ G+ R ++ G T + + A + + Sbjct: 344 -RFCVLAFDDRIETPPAMPDGLVPASDRNRFAASSWLGSLRSRGGTVMAQPLTNAVEMLA 402 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 S ED +VL+TDG+ + ++ ++ RI + Sbjct: 403 DSGEDRQAS----------VVLVTDGQISGEDHLLRSLAPVVGRT--RIYCVGV 444 >gi|282864727|ref|ZP_06273782.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282560666|gb|EFB66213.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 424 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 73/216 (33%), Gaps = 21/216 (9%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT-VRMGATFF-- 302 P L P + V S I + + A + + + +R + Sbjct: 34 PSALSPKVELVLDVSGSMRTRDIDGQSRMSAAKQAFNDVLDAVPEEVQLGIRTLGANYPG 93 Query: 303 NDRVIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 +DR + D + V L RT KT G T I A+ A D + Sbjct: 94 DDRKVGCKDTKQLYPVGPLDRTEAKTAVATLAPTGWTPIGPALLGAADDL---------- 143 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYF 418 + +A + IVL+TDGE+T + + ++GI I T+ N +++ Sbjct: 144 --DGGDATRRIVLITDGEDTCGPLDPCEVARDIAARGIHLVIDTLGLVPNAKIRQQLTCI 201 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 A+ ++ EL+ + + + Sbjct: 202 AE--ATGGTYTAVQHADELSGRVKQLVDRAAEPTIT 235 >gi|218690433|ref|YP_002398645.1| hypothetical protein ECED1_2737 [Escherichia coli ED1a] gi|218427997|emb|CAR08918.2| conserved hypothetical protein [Escherichia coli ED1a] Length = 580 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 62/166 (37%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 273 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 274 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGINR--- 319 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T+ + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEA 365 >gi|301768895|ref|XP_002919864.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Ailuropoda melanoleuca] Length = 1127 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 73/196 (37%), Gaps = 19/196 (9%) Query: 236 CNKSLYYMLYPGPLDPSLSEEH-FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 KS G D + + F+ SS I +D + S+ + ++ V Sbjct: 28 GTKSNLLARKNGLQDSTCFIDVVFIVDSSESSKIILFDKQKDFVDSLSDRVFQLTPVRSL 87 Query: 295 ---VRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 +++ A F+ V DPSFS W + + VK+ T A+ A + Sbjct: 88 KYDIKLAALQFSSSVQIDPSFSSWKDLQTFKQRVKSM---NFIGQGTFSYYAISNATGLL 144 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + + K +L+TDG + N + +I A++ GI +TI S Sbjct: 145 ---------KREGRKDGVKVALLMTDGIDHPKNPDVQSISEDARTAGILFITIGLSTVVN 195 Query: 411 QQEKARYFLSNCASPN 426 + + +S +S Sbjct: 196 EAK--LRLISGDSSSE 209 >gi|281350503|gb|EFB26087.1| hypothetical protein PANDA_008525 [Ailuropoda melanoleuca] Length = 961 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 77/220 (35%), Gaps = 20/220 (9%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 ++++ G D S K L + F+ SS I Sbjct: 2 KTRTFTRHRYDSGEVDRTPLCLKSSSAKRKLPALTVDS--TCFIDVVFIVDSSESSKIIL 59 Query: 271 KHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS-WGVHKLIRTIVKT 326 +D + S+ + ++ V +++ A F+ V DPSFS W + + VK+ Sbjct: 60 FDKQKDFVDSLSDRVFQLTPVRSLKYDIKLAALQFSSSVQIDPSFSSWKDLQTFKQRVKS 119 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 T A+ A + + + + K +L+TDG + N + Sbjct: 120 M---NFIGQGTFSYYAISNATGLL---------KREGRKDGVKVALLMTDGIDHPKNPDV 167 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +I A++ GI +TI S + + +S +S Sbjct: 168 QSISEDARTAGILFITIGLSTVVNEAK--LRLISGDSSSE 205 >gi|313230659|emb|CBY18875.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 63/184 (34%), Gaps = 19/184 (10%) Query: 272 HLVRDALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ L +I ++ I + V M + + + ++R I + Sbjct: 237 EHVKRTLGLMIDNLCDGISPDTNRVAMLRYSSDVKEDLNFIEGSNEPTVMRNIQRLKYKP 296 Query: 331 ENE--MGSTAINDAMQTAYDTIISS-----NEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + GST AM A TI +S N +K E +V++TDGE+ + Sbjct: 297 ITDDRHGSTYTAHAMDKALKTIFTSEAGWRNGTTEDGIKVRTE----VVIITDGESNDPD 352 Query: 384 EEGIAICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 E K K GI++ + K + + + S F S +L F Sbjct: 353 ETFTIQGQKVKYDEYGIKVYALGVGDIKKDEIRQLTSMD----DESIFYLMSWKDL-AAF 407 Query: 442 RDRI 445 I Sbjct: 408 NRII 411 >gi|308472813|ref|XP_003098633.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] gi|308268233|gb|EFP12186.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] Length = 395 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 65/184 (35%), Gaps = 15/184 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 D L VD+S + RD + ++ R I R+G +N + + Sbjct: 37 SDIWLDVVVVVDNSQRVNKRSFVSSTRDTINNIFRE-ASIP----RTRVGFVTYNSQATT 91 Query: 309 DPSF-SWGVHKLIRTIVKTFAIDEN--EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + ++ V D N + I + A + + + H Sbjct: 92 NADLNKFKSYGDLQQGVYNSYNDMNLSPEKTPYIGTGLIAAGELLQIQGSADGHVN---- 147 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 K I T N + +++ N KS GI I+TIA + + + L+ ASP Sbjct: 148 HPKVIIAYAT-ALNGTGLLDPLSVANTLKSAGITIITIAVDTDDNGVIEKQ--LAPLASP 204 Query: 426 NSFF 429 + F Sbjct: 205 GAAF 208 >gi|297685993|ref|XP_002820555.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Pongo abelii] Length = 150 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 53/137 (38%), Gaps = 23/137 (16%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV-----KTFAI 329 +DAL +++ ++ D+ + F++R+ W H + T K + Sbjct: 24 KDALFTILHDLRPQDHFS------IIGFSNRIKV-----WKDHLISVTPDSIRDGKVYIH 72 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T IN A+Q A + N+ H + IV LTDG+ T + I Sbjct: 73 HMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKI 128 Query: 390 CNKAKSQG---IRIMTI 403 N + + I T+ Sbjct: 129 LNNTREAARGQVCIFTL 145 >gi|73958318|ref|XP_547049.2| PREDICTED: similar to integrin alpha X precursor [Canis familiaris] Length = 1149 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 21/121 (17%), Positives = 50/121 (41%), Gaps = 11/121 (9%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A++ + + S+++ K ++++TDG+ D+ + A Sbjct: 213 GGLTHTATAIRIVTNELFSASKGARKDA------SKILIVITDGQKKGDSLGYEDVIPMA 266 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEIF 450 ++ GI + + + + L++ AS S F+ + L I ++++ +IF Sbjct: 267 EAAGIIRYAVGVGTAFQKMQSWKE-LNDIASKPSHEYIFKVENFDALRDI-QNQLKEKIF 324 Query: 451 E 451 Sbjct: 325 A 325 >gi|324006620|gb|EGB75839.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] Length = 580 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 62/166 (37%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 273 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 274 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGINR--- 319 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T+ + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEA 365 >gi|302532683|ref|ZP_07285025.1| predicted protein [Streptomyces sp. C] gi|302441578|gb|EFL13394.1| predicted protein [Streptomyces sp. C] Length = 248 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 51/142 (35%), Gaps = 21/142 (14%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVI---KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P P + V S + I + + A V+ ++ D V +R + Sbjct: 115 PAKEPPKVELVLDVSGSMRANDIDGQSRMAAAKQAFNEVLDAVP--DEVRLGIRTLGATY 172 Query: 303 --NDRVIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 +DR + D + V + RT KT G T I A+Q A + N Sbjct: 173 PGDDRALGCKDTKQLYPVGTVNRTEAKTAVATLAPTGWTPIGPALQAAAQDLEGGNAT-- 230 Query: 359 HRMKNNLEAKKYIVLLTDGENT 380 + IVL+TDGE+T Sbjct: 231 ----------RRIVLITDGEDT 242 >gi|297624820|ref|YP_003706254.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297166000|gb|ADI15711.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 802 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 52/170 (30%), Gaps = 26/170 (15%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F+DR + R + G T A + A D +++ Sbjct: 408 LGFITFSDRPEWVFRPRQATLQGKREMTAAILN-VAPQGGTIFEPAYREALDVLMAQEAA 466 Query: 357 EVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIA-----ICNKAKSQGIRIMTIAFSVN 408 H +++LTDG + T G A + + GI TIA + Sbjct: 467 VKH-----------VIVLTDGKFADGTGPFSRGPAPDFGRLAALGRRSGITTSTIAIG-D 514 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ A ++EA L +IF + + + Sbjct: 515 GADPQQLTTIAR--AGGGRYYEALDVSTLPRIFT---TEALSATRSLLRE 559 >gi|290995572|ref|XP_002680357.1| predicted protein [Naegleria gruberi] gi|284093977|gb|EFC47613.1| predicted protein [Naegleria gruberi] Length = 269 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 66/191 (34%), Gaps = 30/191 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K +V+ LA + +K D R+ F+ + + F+ +++ R+ K Sbjct: 64 SKIEMVKSTLAFMFDQLKPTD------RIALVEFDSNISTSLQFT-NMNESGRSKAKQVV 116 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + T ++ A+ ++ + + + ++L TDG + Sbjct: 117 SNIRAGSCTNLSGAL---FEGLRLIGQRTNANEVTS------LLLFTDGLANEGITNTNE 167 Query: 389 ICNKAK-------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNK 439 I K + + T F + L++ + + ++ +T ++ K Sbjct: 168 IVKKMTTMIHEEIRTNLTVFTFGFGTDTDA-----NMLTSISQAGNGLYYFLQTTDDIPK 222 Query: 440 IFRDRIGNEIF 450 F + IG I Sbjct: 223 AFGNVIGGLIS 233 >gi|256005895|ref|ZP_05430841.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|255990131|gb|EEU00267.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] Length = 524 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 71/200 (35%), Gaps = 23/200 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D S K L++ A ++ + + D V+ V GA ++ D Sbjct: 167 PPSNLVFLIDVSGSMDEPNKLPLLKSAFKLLVDELDEDDRVSIVVYAGAAG----LVLDS 222 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + K++ ++ GSTA + ++ AY D + + Sbjct: 223 TPGNEKDKILDALMN-----LEAGGSTAGAEGIKLAY--------DVAKKNFIKSGNNR- 268 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ E + + K + +GI + + F + K + Sbjct: 269 VILATDGDFNVGISSEAELVRLIEKKRDEGIFLTVLGFGTGNYKDSKMESLADK--GNGN 326 Query: 428 FFEANSTHELNKIFRDRIGN 447 + ++ E K+ + +G Sbjct: 327 YAYIDNIAEARKVLVNEMGA 346 >gi|258652510|ref|YP_003201666.1| hypothetical protein Namu_2300 [Nakamurella multipartita DSM 44233] gi|258555735|gb|ACV78677.1| conserved hypothetical protein [Nakamurella multipartita DSM 44233] Length = 320 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 69/197 (35%), Gaps = 26/197 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A S +D++ V +G F S + RT+ K Sbjct: 107 RLDAAKEAAQSF------VDDLTPGVNLGIVSFAGIATVLVSPT-----TDRTVAKQAID 155 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 TA +A+ ++ TI ++ + IVL+TDG+ T E A Sbjct: 156 GLTLDERTATGEAIISSLQTIELFSKTLPPDGTDTGPPPARIVLMTDGKRTVGRTEQDAA 215 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYF---------LSNCA--SPNSFFEANSTHELN 438 +A G+ + IAF + + A S F +A S EL Sbjct: 216 -QRAADAGVPVSVIAFGTDNGSITVNDEVIPVPLDTEAMQQIAQISGGDFHQAASAEELK 274 Query: 439 KIF---RDRIGNEIFER 452 I+ ++IG E ER Sbjct: 275 SIYAQLGEQIGYETKER 291 >gi|260771476|ref|ZP_05880401.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] gi|260613602|gb|EEX38796.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] Length = 407 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 29/214 (13%), Positives = 65/214 (30%), Gaps = 11/214 (5%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + I+ G +I + M V++ V +D+ L+ + A + ++ Sbjct: 8 RGIRKQRGLVVVIVTIAMLVLIAVAAFAIDINHAMMNRTKLQNSVDAA--ALAAAIVLDK 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR-----DIVRDTAVEMNPRKS 127 + ++A K+ N + EV+ + D+ +P Sbjct: 66 DGTEAQADTIARSTLTKMSTAAGNAELTLDVSNVVNVEVQFSNDPTVFPDSGYSSSPDGD 125 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y V+ ++ DL F R MG+ L + + + +++ D + Sbjct: 126 RYVRVVINQLDLE---SFFFARVMGVTKRLTASAVAGPSPGGNACNIVPMAVCEG-DDAG 181 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 + + PA + G Sbjct: 182 TNGYDAGVVYALKIADQNDPAMGSGNFQLLDFGS 215 >gi|229005450|ref|ZP_04163163.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] gi|228755812|gb|EEM05144.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] Length = 474 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 78/193 (40%), Gaps = 22/193 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-----WGVHKLIRTIV 324 K + ++A+ + + + NV+ V G ND S + + K +T Sbjct: 199 KMDIAKEAIQQFVSDLPEAVNVSLRVY-GHKGSNDEKDKTASCGAIENIYTLQKYDQTTF 257 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + +G T + +A++ + +T S+ E++ K I +++DG T Sbjct: 258 RQSLDGFQPVGWTPLAEAIKKSTETFQSAKEND----------KNIIYIVSDGVETCGG- 306 Query: 385 EGIAICNKAKSQGIR-IMTI-AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + K + I+ IM I F V+ +++ + S + ANS EL F+ Sbjct: 307 NPVEEAQKVSNSNIKPIMNIIGFQVDHEAEKQLKEIAE--VSKGKYVLANSAKELQDQFK 364 Query: 443 DRIGNEIFERVIR 455 + G +I R ++ Sbjct: 365 ET-GKDITSRRLK 376 >gi|78048779|ref|YP_364954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037209|emb|CAJ24954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 602 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 71/206 (34%), Gaps = 27/206 (13%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF--NDRVIS 308 P + VD S K L++ +L ++R ++ D R+ + N V+ Sbjct: 231 PPANLVFLVDVSGSMDAPDKLPLLQSSLKLLVRQLRAQD------RITLVTYAGNTSVVL 284 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 P+ +++ I GSTA ++ AY ++ + + N Sbjct: 285 PPTPGDQQGRIVEAID-----ALQSGGSTAGASGIELAYK---AAQQGYLRGGINR---- 332 Query: 369 KYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 I+L TDG+ ++ + + + GI + T+ F A Sbjct: 333 --ILLATDGDFNVGVTNFDQLKGMVAEKRRSGIALSTLGFGTGNYNDNLMEQLAD--AGD 388 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 ++ ++ E K+ +G + Sbjct: 389 GAYAYIDTALEARKVLTHELGATLAT 414 >gi|71028596|ref|XP_763941.1| thrombospondin-related protein [Theileria parva strain Muguga] gi|68350895|gb|EAN31658.1| thrombospondin-related protein [Theileria parva] Length = 552 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 42/277 (15%), Positives = 87/277 (31%), Gaps = 26/277 (9%) Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 + F ++ I S L + + + + EH V+ + +D ++ D+ L Sbjct: 89 NSFNKNSRISSVLPMETLDRLSEAITRSSEHPVTFE-ALDGGSVVVTNNSDTFSIKLYPS 147 Query: 205 GQPADRT------------VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + T + + N ++ + ++ LY P Sbjct: 148 LPGLNLTPGMLPTNKPNSHINFTGNHNEHALLKHALHDLSTSNYDRGLYPDGIKKPSSYC 207 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV---ISD 309 E I K+ + + + +K I + V + F+ + IS Sbjct: 208 HRELDLTILVDESSSIMKEEWEK-LIPFLKSLVKSISISPNYVHLSVVTFSTSIRWLISF 266 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + S L ++ + G T A+ + + + A K Sbjct: 267 LNPSGKDENLALRVIDELKNSKPVFGFTFTGQALNFITEAVY--------QFGARQNAPK 318 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 I+L+TDG +TQ N A + G+ I+ + Sbjct: 319 AIILITDGSSTQPNVTSQASA-MLREAGVTILVVGVG 354 >gi|228997913|ref|ZP_04157515.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] gi|228761788|gb|EEM10732.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] Length = 474 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 78/193 (40%), Gaps = 22/193 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-----WGVHKLIRTIV 324 K + ++A+ + + + NV+ V G ND S + + K +T Sbjct: 199 KMDIAKEAIQQFVSDLPEAVNVSLRVY-GHKGSNDEKDKTASCGAIENIYTLQKYDQTTF 257 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + +G T + +A++ + +T S+ E++ K I +++DG T Sbjct: 258 RQSLDGFQPVGWTPLAEAIKKSTETFQSAKEND----------KNIIYIVSDGVETCGG- 306 Query: 385 EGIAICNKAKSQGIR-IMTI-AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + K + I+ IM I F V+ +++ + S + ANS EL F+ Sbjct: 307 NPVEEAQKVSNSNIKPIMNIIGFQVDHEAEKQLKEIAE--VSKGKYVLANSAKELQDQFK 364 Query: 443 DRIGNEIFERVIR 455 + G +I R ++ Sbjct: 365 ET-GKDITSRRLK 376 >gi|115535038|ref|NP_509469.2| hypothetical protein K09E2.1 [Caenorhabditis elegans] gi|90568060|gb|AAC46572.2| Hypothetical protein K09E2.1 [Caenorhabditis elegans] Length = 915 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 49/133 (36%), Gaps = 12/133 (9%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAIN 340 I+ ++ + D VR+G +++ ++ + S + I+ T Sbjct: 57 IKIVRDLPIHEDAVRVGLVQYSESAKTEFNLS--KYSERNDIIAHMETLTFMQVEDTRTG 114 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ A + I N + + I++ TDG + ++ + +G++I Sbjct: 115 VALNKADEEIFDFNGGARLKAT------RLIIIFTDGLSM---DKPSKAAKALRRKGVKI 165 Query: 401 MTIAFSVNKTQQE 413 TI+ + E Sbjct: 166 YTISVNSIGFIPE 178 >gi|62180885|ref|YP_217302.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62128518|gb|AAX66221.1| putative von Willebrand factor, vWF type A domain [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322715363|gb|EFZ06934.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 593 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 66/196 (33%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L+R AL ++ ++ DN+ G V Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 285 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K Sbjct: 286 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGANR--- 331 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 332 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 389 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 390 YSYIDSLSEAQKVLKD 405 >gi|119358219|ref|YP_912863.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355568|gb|ABL66439.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 337 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 52/159 (32%), Gaps = 27/159 (16%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ + + TA + A+ A + E ++ +K IVLL+D Sbjct: 149 HAAFEALLGMVSTELVSDQGTAFDSALNLA-MRLFERTEPPGDV--KEVQGEKVIVLLSD 205 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-KTQQEKARY----------FLSNCASP 425 GEN N A+ + K G+ + TI + + +SP Sbjct: 206 GENHSGNF--RAVADALKQSGVSVFTIVLGKPLPAAIPLGQSSGVKKDAAGKIVKTRSSP 263 Query: 426 -----------NSFFEANSTHELNKIFRDRIGNEIFERV 453 +FF+A+ + +RI + Sbjct: 264 ETMRRLAGDSGGTFFDASEDDAVYDRVAERISTLVSTSR 302 >gi|327412874|emb|CAX67882.1| putative Von Willebrand factor, type A [Salmonella bongori] Length = 325 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 58/181 (32%), Gaps = 22/181 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V++++ + + + R+G F + S L I + Sbjct: 118 TRLQAVQESVRKFVAA-------RQSDRIGLVIFASQAWPFAPVSEDKQALQTRINQLS- 169 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + N EA K +LLTDG +T Sbjct: 170 -PGMVGQQTAIGDALGVGVKLLDN---------TTNTEASKLAILLTDGNDTASQLAPAL 219 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCA--SPNSFFEANSTHELNKIFRDR 444 A + +++ TIAF + + + N A + + A ++ Sbjct: 220 AAQLAAAHHVQVHTIAFGDINSTGDDKVDLPLMQNIAQITGGQSWTAANSGAALDSVWKE 279 Query: 445 I 445 I Sbjct: 280 I 280 >gi|261415412|ref|YP_003249095.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371868|gb|ACX74613.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326806|gb|ADL26007.1| BatB protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 342 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 37/117 (31%), Gaps = 13/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ + + + + T + A++ +S Sbjct: 129 RVGLVAFSGEAQVMVPLTLDYGTVQMVLRELNPGWLMP--GTNLESAIRKGMTLFKNSGG 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 H + ++L++DGE + A +A GI+I TI + Sbjct: 187 ASQHSV---------MILMSDGEELEAAAVNAAK--EAAEFGIKIYTIGIGSREGVP 232 >gi|73960093|ref|XP_855328.1| PREDICTED: similar to calcium activated chloride channel 4 [Canis familiaris] Length = 938 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 63/187 (33%), Gaps = 26/187 (13%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S + + + + A + ++ I+N + G F+ + + Sbjct: 310 LDKSGSMNGFNRLNRMNQAAKHFL--LQTIENGSWV---GMVHFDSTAYIKSNLIQIISS 364 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 R + ++ G T+I +++A+ I IVLLTDGE Sbjct: 365 KERNNL-LESLPTTANGGTSICAGIKSAFQVI---------GEIYPQIDGSEIVLLTDGE 414 Query: 379 NTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE- 436 + C + K G I IA + ++A +S N FF ++ Sbjct: 415 DN-----TAKNCIGEVKQSGAIIHLIALGPSA---DQAVIEMSTITGGNHFFASDEAQNN 466 Query: 437 -LNKIFR 442 L F Sbjct: 467 GLIDAFG 473 >gi|239908012|ref|YP_002954753.1| hypothetical protein DMR_33760 [Desulfovibrio magneticus RS-1] gi|239797878|dbj|BAH76867.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 451 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 44/355 (12%), Positives = 109/355 (30%), Gaps = 54/355 (15%) Query: 42 DVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENN 101 D+ R S + L+ A +A + S+ L ++ + L + + Sbjct: 17 DLGRVSVEQSRLQNAVDSAALAGSLQL------PDDPDVSTGAVTAAATQNLLANDADA- 69 Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 T + + + V +S+ + + + +GI Sbjct: 70 ----------------TGILVESGGATRSVCVSAEAKVEMT----LSQVIGI-------- 101 Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 + + + + + + V+D + SM + + + S +S K Sbjct: 102 GDQTVTAEACAGYNDIELVMVLDATGSMKGTPIANVKEAATNLVNLIMPSSSS-TSTRSK 160 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 +G+ + + + G + + +++ LR K Sbjct: 161 IGLVPFQGKVRIDGNDPVTAEANPDGVGPGCRNADGTLNNGKLRTEYSKTTT-------- 212 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK--TFAIDENEMGSTAI 339 K + G + +D+ S S + I+ T T I Sbjct: 213 -----KTNIFYGYTLSGVSTTSDKTCSGMSPIRALSSDKSAILSNITALNAGQVTSGTII 267 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 ++ ++ + + + + +K +++LTDG+ T+D G + + +K Sbjct: 268 SEGIKWGRHVLTPTAPYVE--GSTDTKVRKIMIVLTDGD-TEDGRCGGSYASASK 319 >gi|328880283|emb|CCA53522.1| putative secreted protein [Streptomyces venezuelae ATCC 10712] Length = 424 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 51/136 (37%), Gaps = 16/136 (11%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT KT G T I A+ A + +A + IVL+TDGE+T Sbjct: 113 RTEAKTAVASLAPTGWTPIGPALLGAAKDLE------------GGDATRRIVLITDGEDT 160 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + ++GI I T+ + +++ A+ ++ + T +L+ Sbjct: 161 CAPLDPCQVAREIAAKGIHLVIDTLGLVPDAKTRQQLTCIAE--ATGGTYTSVHRTEDLS 218 Query: 439 KIFRDRIGNEIFERVI 454 + R + V Sbjct: 219 RRVRQLVDRAADPVVT 234 >gi|265766730|ref|ZP_06094559.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] gi|263253107|gb|EEZ24583.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] Length = 608 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 72/205 (35%), Gaps = 23/205 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + +D S + ++ LV+ +L ++ ++++ D V + GA Sbjct: 240 PTENLPASNLVFLIDVSGSMYGPERLDLVKSSLKLLVNNLREKDKVAIVIYSGAAG---- 295 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + ++ + GSTA + + AY + Sbjct: 296 ----EKLA-STPGSDKQKIREAIDELEASGSTAGGEGIMLAYKI-------AQKNFISGG 343 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + I+L TDG+ ++E + + + GI + + + + + K + Sbjct: 344 NNR--IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDNKMQTLAEK- 400 Query: 423 ASPNSFFEANSTHELNKIFRDRIGN 447 + ++ E N++ + G Sbjct: 401 -GNGNHAYIDNLQEANRVLVNEFGA 424 >gi|288921031|ref|ZP_06415322.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347549|gb|EFC81835.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 401 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 39/124 (31%), Gaps = 4/124 (3%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISS 353 R+G F + KL+ + +G DA+ ++ + Sbjct: 127 RIGLVTFAGSAGLLVPPTDDTDKLLAALKSLTTSRGTAIGQGILTSIDAIAEVDPSVPPT 186 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 D A IV+LTDG NT + +A ++ +R+ TI F Sbjct: 187 GADVP-GGTGGEYAADVIVVLTDGANTVG-VDPRTAAGEAATRRLRVYTIGFGTTTPAPM 244 Query: 414 KARY 417 Sbjct: 245 VCDS 248 >gi|281179360|dbj|BAI55690.1| conserved hypothetical protein [Escherichia coli SE15] Length = 580 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 62/166 (37%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 273 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 274 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGINR--- 319 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T+ + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEA 365 >gi|47229708|emb|CAG06904.1| unnamed protein product [Tetraodon nigroviridis] Length = 990 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 33/252 (13%), Positives = 78/252 (30%), Gaps = 28/252 (11%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ + E + + +++ S G SP + Y Sbjct: 156 LNWTQALEKVFMENSQEDPSLLWQAFGSATGVTRYYPATPWKSPDKIDLYDVRRRPWYIQ 215 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ D+ + R FN++ Sbjct: 216 GASSPKDMVILVDVSGSVSGLT-LKLIKASVMEMLDTLSD-DDYVNVAR-----FNEKAE 268 Query: 308 SDPSFSWGVHKLI------RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + L+ + I K G+T A++ +++ Sbjct: 269 AVVPC---FKHLVQANVRNKKIFKDAVQQMQAKGTTDYKSGFHFAFNQLLNKT------N 319 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 K I+L TDG + + + ++ +R+ T + + ++ Sbjct: 320 VPRANCNKIIMLFTDGGEDRAQDVFMQY--NWPNKTVRVFTFSVGQHNYDVTPLQWI--A 375 Query: 422 CASPNSFFEANS 433 C + +FE S Sbjct: 376 CTNKGYYFEIRS 387 >gi|198417199|ref|XP_002122571.1| PREDICTED: similar to MGC81791 protein, partial [Ciona intestinalis] Length = 847 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 51/152 (33%), Gaps = 17/152 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 VR ++ ++ + D R+G + + + + ++++ Sbjct: 54 VRKWVSDLVATFDIGP---DYTRVGVVVYAEEPEMAIAL--NQYTDRDSLIQAVGNITYL 108 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T A++ + S + ++LTDG + +A Sbjct: 109 NGNTRTGKAIRFMNEESFSIANGARDIEFG---YNRLAIVLTDGRAQDNVFNPSL---EA 162 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 ++ GI++ + S ++ L+ AS Sbjct: 163 QNNGIQLYAVGVSTAVVEE------LNEIASD 188 >gi|260837292|ref|XP_002613639.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] gi|229299025|gb|EEN69648.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] Length = 240 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 50/182 (27%), Gaps = 22/182 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 FV S + V+ L S++ G ++ + + H Sbjct: 7 FVVDGSSSIPADEFEKVKTFLNSIVGHFDIGPTATQV---GVVQYSSSPQQEFAL--NAH 61 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK--KYIVLLT 375 + ++ + T A+ A D N K +V +T Sbjct: 62 SSLVSLQQAITNIIIIGRGTNTGSALTFA--------RDVALTAANGARPGLPKIVVTMT 113 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DG +E+ + ++ G+ I + + S + F A+ Sbjct: 114 DGA---SSEDVLTPSQNLRNDGVITFAIGVTSRASD----WQVEEIAGSLDRVFTASDFD 166 Query: 436 EL 437 L Sbjct: 167 AL 168 >gi|62001322|gb|AAX58363.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 36/364 (9%), Positives = 101/364 (27%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1355 SNNRVRFANTAGVTAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ L+ + A+A Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELSRLADA 1473 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + K IQ +D + + + + + T K ++ + + Sbjct: 1474 K-QPEYADKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + + G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKIDKGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|62001360|gb|AAX58382.1| AvrE [Pseudomonas viridiflava] gi|62001394|gb|AAX58399.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 36/364 (9%), Positives = 101/364 (27%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1355 SNNRVRFANTAGVTAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ L+ + A+A Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELSRLADA 1473 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + K IQ +D + + + + + T K ++ + + Sbjct: 1474 K-QPEYADKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + + G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKIDKGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|110636839|ref|YP_677046.1| outer membrane protein [Cytophaga hutchinsonii ATCC 33406] gi|110279520|gb|ABG57706.1| possible outer membrane protein [Cytophaga hutchinsonii ATCC 33406] Length = 1313 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 38/188 (20%), Positives = 71/188 (37%), Gaps = 25/188 (13%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ P + P S + L ++A A+++ + D T F Sbjct: 79 VVNPAAVKPKSISLVLTIDISESMQKQYMPLAKNAAAAIVNKLPL-----DISECAVTSF 133 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 ND + F+ KL+++I G T N I SN + +K Sbjct: 134 NDVSFINTDFTRDRFKLLQSIQTLV-----PAGGTDYNKGF-------IKSNAGGLDILK 181 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 L +K ++ LTDG + I +AKS G ++ I + + E+ + ++ Sbjct: 182 KGLH-EKVLIFLTDG---YGDVNPTEIIQQAKSIGAKVYVITLGM--SAPEELKRIVT-- 233 Query: 423 ASPNSFFE 430 A+ S++E Sbjct: 234 ATNGSYYE 241 >gi|296453244|ref|YP_003660387.1| von Willebrand factor type A (vWA) domain-containing protein [Bifidobacterium longum subsp. longum JDM301] gi|296182675|gb|ADG99556.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. longum JDM301] Length = 565 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + G T I + + +A D + S +E IVL+TDG + D+++ Sbjct: 458 ADATDASGGTDIYEGLLSALDELPSESEASQ--------YTTAIVLMTDGRSNSDHQDEF 509 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K++ + + I +I F Q K+ L S F+ + +L +FR G Sbjct: 510 ESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGR-SGDLAAVFRQVKG 563 >gi|302557483|ref|ZP_07309825.1| secreted protein [Streptomyces griseoflavus Tu4000] gi|302475101|gb|EFL38194.1| secreted protein [Streptomyces griseoflavus Tu4000] Length = 417 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 46/136 (33%), Gaps = 16/136 (11%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT KT G T I A+ A D + + + K IVL++DGE+T Sbjct: 107 RTEAKTAVATLAPTGWTPIGPALLKAADDL------------DGGDGSKRIVLISDGEDT 154 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + ++GI I T+ N + A+ ++ EL Sbjct: 155 CAPLDPCEVAREIAARGIGLTIDTLGLVPNAKLSRQLSCIAE--ATGGTYTSVEHQDELT 212 Query: 439 KIFRDRIGNEIFERVI 454 + + V Sbjct: 213 DRVNELVDRAAEPVVT 228 >gi|62001312|gb|AAX58358.1| AvrE [Pseudomonas viridiflava] gi|62001314|gb|AAX58359.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 36/364 (9%), Positives = 101/364 (27%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1355 SNNRVRFANTAGVTAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ L+ + A+A Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELSRLADA 1473 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + K IQ +D + + + + + T K ++ + + Sbjct: 1474 K-QPEYADKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + + G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKIDKGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|156933960|ref|YP_001437876.1| hypothetical protein ESA_01786 [Cronobacter sakazakii ATCC BAA-894] gi|156532214|gb|ABU77040.1| hypothetical protein ESA_01786 [Cronobacter sakazakii ATCC BAA-894] Length = 197 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 44/124 (35%), Gaps = 10/124 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 6 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLT--------DLLSFQMPA 57 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 58 LTASGTTSLGEALTLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 115 Query: 391 NKAK 394 A+ Sbjct: 116 KAAR 119 >gi|256852741|ref|ZP_05558111.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|256711200|gb|EEU26238.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|315030743|gb|EFT42675.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4000] Length = 1105 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 58/335 (17%), Positives = 103/335 (30%), Gaps = 31/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + S D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGSQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N + MG ++ ++ + G ++ +K + G T A Sbjct: 299 TLADSGITNNININMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 357 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 358 LRDAGDMLATPNGH-----------KKVIVLLTDG 381 >gi|294781746|ref|ZP_06747079.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] gi|294481856|gb|EFG29624.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] Length = 229 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 54/174 (31%), Gaps = 14/174 (8%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + DA +I+ + + + F + V ++ + Sbjct: 32 KIENLYDATNEMIKVFSDAVSKEKIIDIAIITFGENVELHTPYT-----SVVDFKSRGLN 86 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + A++ A D I + + +VL++DG +E Sbjct: 87 PFLASGMTPLGTALRMAKDMIEDKE------TTPSNIYRPAVVLVSDG---VPTDEWRGP 137 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + K+ G F+V + S +FF A + ++ F+ Sbjct: 138 LDNFKNNGRSSKCQRFAVAIGNDADNQMLKSFAECNENFFIAENVSDIVDKFKQ 191 >gi|238027555|ref|YP_002911786.1| membrane protein [Burkholderia glumae BGR1] gi|237876749|gb|ACR29082.1| Membrane protein [Burkholderia glumae BGR1] Length = 620 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 1/69 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G F ++T + M VM+ V GML D+ + L++ A A + L + + Sbjct: 22 RRERGAFAMMTIIFMTVMIAVLGML-DIGNVFFQRRDLQRIADMAALAGVQRLDATCSQA 80 Query: 76 SSRAKNSFT 84 A S Sbjct: 81 PVSASRSAA 89 >gi|62001410|gb|AAX58407.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 36/364 (9%), Positives = 101/364 (27%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1355 SNNRVRFANTAGVTAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ L+ + A+A Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELSRLADA 1473 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + K IQ +D + + + + + T K ++ + + Sbjct: 1474 K-QPEYADKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSLHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + + G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKIDKGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +EGIA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEGIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|153840568|ref|ZP_01993235.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149745769|gb|EDM56899.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] Length = 187 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 56/176 (31%), Gaps = 3/176 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ G ++ ++++ ++LGV +D+ + L+ A TA + +V + E+V Sbjct: 7 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAV-VADKTEDV 65 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 E A P Y + + Sbjct: 66 DQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYV-- 123 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 R + +S +L ++ + A A + + + I D + ++ D Sbjct: 124 RVAVTDMGISQYLSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVED 179 >gi|330816722|ref|YP_004360427.1| Membrane protein [Burkholderia gladioli BSR3] gi|327369115|gb|AEA60471.1| Membrane protein [Burkholderia gladioli BSR3] Length = 622 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 47/156 (30%), Gaps = 6/156 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G F ++ A+ + V+ + G+L DV L+Q A A + + Sbjct: 22 RRERGSFAVMAAVFLVVIAAIFGVL-DVGNTYLQRRDLQQIADMAAAAGVQRVDNLCVQA 80 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + A NS T + + + +T +P + V Sbjct: 81 PTSATNSATVNGLNTSQGDTIAVTCGRWDPTVNPAPSYYLANTNTSGDPNRLQLNAV--- 137 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + + + +++ + A A + Sbjct: 138 --QVNVTRQVRHMFVGPLQTVHATSTARATAIDVFS 171 >gi|127512721|ref|YP_001093918.1| vault protein inter-alpha-trypsin subunit [Shewanella loihica PV-4] gi|126638016|gb|ABO23659.1| Vault protein inter-alpha-trypsin domain protein [Shewanella loihica PV-4] Length = 776 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 39/143 (27%), Gaps = 12/143 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + A+ + + + D N F+ V S + F Sbjct: 419 IAQAKSAILNALAGLGSQDTFN------VIAFDSSVRSLSPVALSATAANLGKANLFVQS 472 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAI 389 G T + A+ A S +K K +V +TDG + + + Sbjct: 473 LEADGGTEMAPALLRALSQPESGVSSISSAVKPERL--KQVVFITDGAVGNEASLFALIA 530 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ 412 N + + + T+ Sbjct: 531 ANIGRQR---LFTVGIGAAPNGY 550 >gi|94732992|emb|CAK03801.1| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1056 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 79/250 (31%), Gaps = 24/250 (9%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ + E + +++ S G +P + Y Sbjct: 148 LNWTQALERVFIENSRDDPSLLWQAFGSATGVTRYYPAAPWRAPDKIDLYDVRRRPWYIQ 207 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ D+ + R FN++ Sbjct: 208 GASSPKDMVILVDVSGSVSGLT-LKLIKASVTEMLDTLSD-DDYVNVAR-----FNEKAE 260 Query: 308 SDPSFSWG----VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + I K G+T A++ +++ Sbjct: 261 AVVP-CFDHLVQANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNKT------NVP 313 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I+L TDG +D + I ++ +R+ T + + ++ C+ Sbjct: 314 RANCNKIIMLFTDG--GEDRAQDIFEQYNWPNKTVRVFTFSVGQHNYDVTPLQWI--ACS 369 Query: 424 SPNSFFEANS 433 + +FE S Sbjct: 370 NKGYYFEIRS 379 >gi|147902754|ref|NP_001082889.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Danio rerio] gi|94732178|emb|CAK04720.1| novel protein similar to vertebrate calcium channel voltage-dependent alpha 2 delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1052 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 79/250 (31%), Gaps = 24/250 (9%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ + E + +++ S G +P + Y Sbjct: 134 LNWTQALERVFIENSRDDPSLLWQAFGSATGVTRYYPAAPWRAPDKIDLYDVRRRPWYIQ 193 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ D+ + R FN++ Sbjct: 194 GASSPKDMVILVDVSGSVSGLT-LKLIKASVTEMLDTLSD-DDYVNVAR-----FNEKAE 246 Query: 308 SDPSFSWG----VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + I K G+T A++ +++ Sbjct: 247 AVVP-CFDHLVQANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNKT------NVP 299 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I+L TDG +D + I ++ +R+ T + + ++ C+ Sbjct: 300 RANCNKIIMLFTDG--GEDRAQDIFEQYNWPNKTVRVFTFSVGQHNYDVTPLQWI--ACS 355 Query: 424 SPNSFFEANS 433 + +FE S Sbjct: 356 NKGYYFEIRS 365 >gi|298247107|ref|ZP_06970912.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297549766|gb|EFH83632.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 550 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 62/183 (33%), Gaps = 22/183 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTV---RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++++ + K + T FN + + P SW V ++ Sbjct: 384 DQLKESSELLFDQTKARQYLLQTHPQDLTSVMVFNSDIAAGPDGSWTVEGNDPQKMRGLY 443 Query: 329 IDEN---EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T + +Q + + + + + N K+ I+++TDG++ + N Sbjct: 444 DNIQAREPDGGTNMYACLQRSVE---------LFKQQPNENRKRLIIVMTDGQSEKGN-G 493 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I S G+ ++++AF + + S + + R+ Sbjct: 494 VDQIIQSVASLGVPVISVAFGSDADVTQLNE------ISTATHGSVTKKDNMVDAMREAT 547 Query: 446 GNE 448 G + Sbjct: 548 GYK 550 >gi|198435896|ref|XP_002123489.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1595 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 57/175 (32%), Gaps = 24/175 (13%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH------KLIRTIVKTF 327 V+ L ++RS D G ++ + + G + + Sbjct: 436 VKRWLRDIVRSFNLGVTEQDV---GVVVYSKKATTSTVVDLGFSDYDSDGHTKKQEMTKI 492 Query: 328 AIDENEMGSTAIND-AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-E 385 G T A + A + + + K+ +AKK I+LLTDG T N + Sbjct: 493 LKKLAYEGGTTYTGYAFKLANEMLTGN--------KSRPDAKKMIILLTDGATTAANTLQ 544 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + +++ + I+ + + + +FF EL K+ Sbjct: 545 LKEELDVSRAANVMILAVGV-----GKFNQTELIQIAGDRKNFFAVTKFSELEKV 594 >gi|256374530|ref|YP_003098190.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918833|gb|ACU34344.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 550 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 63/173 (36%), Gaps = 17/173 (9%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK-LIRTIVKTFAIDEN 332 ++ ++ +V+ + + R+G F D + + G + + A Sbjct: 70 IKTSINAVVGDVVSASGGD--YRLGLVTFKDSINVVTGLAAGNAGTVTGYVTNVLAASGG 127 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNN---LEAKKYIVLLTDGENTQDNE----- 384 A ++A++TA ++ + + A+K++VL+TD ++ Sbjct: 128 GGEPEASDEALRTAV-SLRPAAGIPQNADFTGPWRSNARKFVVLVTDARPGGFDDAFTAA 186 Query: 385 ---EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEAN 432 A+ N A + G+++ + + + + N A + + +A Sbjct: 187 DQASATAVANSALAAGVKLSAVYVPTSPSMTPTIAPIMQNYATTTSGVYVQAQ 239 >gi|205353430|ref|YP_002227231.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205273211|emb|CAR38174.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] Length = 499 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 59/166 (35%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L++ AL ++ ++ DN+ G V Sbjct: 136 PPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 191 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ + E +K + Sbjct: 192 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR--- 237 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQE 413 I+L TDG+ + A+ K + +GI + T+ + + Sbjct: 238 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEA 283 >gi|303240541|ref|ZP_07327057.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] gi|302591943|gb|EFL61675.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] Length = 305 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 13/55 (23%), Positives = 25/55 (45%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 L KS G F+ A+++ V++ + VDV ++ + L A + + L Sbjct: 6 LFKSKKGTTFVFFAIILTVIVAFAALSVDVGVIAFEKAKLSNTVDAAALAGAQEL 60 >gi|125975609|ref|YP_001039519.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|256003656|ref|ZP_05428645.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281416621|ref|ZP_06247641.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|125715834|gb|ABN54326.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|255992447|gb|EEU02540.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281408023|gb|EFB38281.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939730|gb|ADU73764.1| von Willebrand factor type A [Clostridium thermocellum DSM 1313] Length = 565 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 25/117 (21%), Positives = 45/117 (38%), Gaps = 9/117 (7%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + V +G ++ V + + R++ G+TA DA+ A + Sbjct: 423 SSDVSIGLVSYSTDVNINLPIA-KFDLNQRSLFVGAVESLAAGGNTATFDAIIVATKML- 480 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + + KN AK + +L+DG I + K+ GI I TI ++ N Sbjct: 481 -----KEEKAKNP-NAKLMLFVLSDGVTNYG-HSLNDIKDMMKTFGIPIYTIGYNAN 530 >gi|282900974|ref|ZP_06308907.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] gi|281194065|gb|EFA69029.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] Length = 487 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 61/188 (32%), Gaps = 27/188 (14%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P + +D+SS K V+ A + ++ + ++ F V Sbjct: 44 KPTTNPQAIVLLIDTSSSMSDGKLTE-VKTAASQFMQRRNL-----EVDQIAVVNFGSEV 97 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + ++ I+ E GST + + + TA + + Sbjct: 98 ATPAPLTNDIN-----ILNNAINQLLENGSTPMGEGIDTAQGQLQA------------TT 140 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS---VNKTQQEKARYFLSNCA 423 K I+L TDG N ++ GI+++ +A N Q L A Sbjct: 141 LNKNIILFTDGIPDDPNFA-YNSALSVRNAGIKLIAVATGGADTNYLTQITGDRSLVFYA 199 Query: 424 SPNSFFEA 431 + F +A Sbjct: 200 NSGQFDQA 207 >gi|128485808|ref|NP_001076062.1| integrin alpha-M [Ovis aries] gi|124127039|gb|ABM92271.1| CD11b [Ovis aries] Length = 1152 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 72/207 (34%), Gaps = 20/207 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S ++ +++V+ +K + ++D Sbjct: 144 CPQQDSDIAFLIDGSGSIDPVDFDRMKKFVSTVMSRFQKSKTL-----FALMQYSD--DF 196 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII-SSNEDEVHRMKNNLEA 367 F++ K G T ++ + SS+ H + Sbjct: 197 RTHFTFNDFKRNSDPELLVRPIGQLFGRTHTATGIRKVVRELFHSSSGARNHAL------ 250 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--- 424 K ++++TDGE D E + +A +GI I + +K+R L AS Sbjct: 251 -KIMIVITDGEKYLDPLEYRDVIPEADRKGIIRYVIGVG-DAFNSKKSRKELDTIASKPP 308 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFE 451 + F+ N+ L I ++++ +IF Sbjct: 309 ADHVFQVNNFEALKTI-QNQLQEKIFA 334 >gi|156257452|gb|ABU63134.1| microneme 1 precursor [Eimeria tenella] Length = 675 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 41/127 (32%), Gaps = 16/127 (12%) Query: 278 LASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT----IVKTFAIDEN 332 + I + + VR+G F R W + T + Sbjct: 33 VRQFIEDFVNSMPISPEDVRVGLITFATR----SKVRWNLSDPKATNPSLAISAARSLSY 88 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T + +Q A + +N +NN+ K ++++TDG + ++ + Sbjct: 89 STGVTYTHYGLQDAKKLLYDTNA----GARNNVP--KLVLVMTDGASNLPSQ-TRSSAAA 141 Query: 393 AKSQGIR 399 + G Sbjct: 142 LRDAGAI 148 >gi|2707733|gb|AAD03350.1| microneme protein precursor Etmic-1 [Eimeria tenella] Length = 712 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 41/127 (32%), Gaps = 16/127 (12%) Query: 278 LASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT----IVKTFAIDEN 332 + I + + VR+G F R W + T + Sbjct: 69 VRQFIEDFVNSMPISPEDVRVGLITFATR----SKVRWNLSDPKATNPSLAISAARSLSY 124 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T + +Q A + +N +NN+ K ++++TDG + ++ + Sbjct: 125 STGVTYTHYGLQDAKKLLYDTNA----GARNNVP--KLVLVMTDGASNLPSQ-TRSSAAA 177 Query: 393 AKSQGIR 399 + G Sbjct: 178 LRDAGAI 184 >gi|218192066|gb|EEC74493.1| hypothetical protein OsI_09963 [Oryza sativa Indica Group] Length = 641 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 61/181 (33%), Gaps = 21/181 (11%) Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 N + + S K R + KSL + PLD + Sbjct: 202 NAVTKSVEIKTYSEFPAIQKSERRKVFAILIHLKAPKSLDSVSSRAPLDLVTVLDVSGSM 261 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + K L++ A++ VI+++ D R+ F+ + Sbjct: 262 SGI-----KLSLLKRAMSFVIQTLGPND------RLSVVAFSSTAQRLFPLRRMTLTGRQ 310 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ G T I DA++ + + R KN + + I+LL+DG++T Sbjct: 311 QALQAI-SSLVASGGTNIADALKK------GAKVVKDRRRKNPVSS---IILLSDGQDTH 360 Query: 382 D 382 Sbjct: 361 S 361 >gi|301607027|ref|XP_002933125.1| PREDICTED: collagen alpha-1(VII) chain-like [Xenopus (Silurana) tropicalis] Length = 2671 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 50/130 (38%), Gaps = 17/130 (13%) Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F++ H+ +V+ + G+T ++ A D ++ N+ K Sbjct: 68 TEFTFTTHRNGTELVQAIRNLGYKGGNTRTGTGLRYAADNFFGPT-----IIRPNVP--K 120 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NS 427 +L+TDG++ D + + KSQGI++ + +R ++P + Sbjct: 121 VAILITDGKSQDDVDPPTQ---RLKSQGIKMFAVGIK-----NADSRELTRVASTPTEDF 172 Query: 428 FFEANSTHEL 437 FF N L Sbjct: 173 FFYVNDFRIL 182 >gi|197118196|ref|YP_002138623.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087556|gb|ACH38827.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 331 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 52/167 (31%), Gaps = 33/167 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R + L + + + TA+ DA+ + + Sbjct: 132 RIGLVAFAGRPYPAAPLTSDHQWLQGIVERLDTNSVED--GTALGDAILAGVNRLRQRPA 189 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 + + ++L+TDG N E AK+ GIR+ I + Sbjct: 190 E-----------GRALILITDGRNNAG-AEPQLAAQAAKALGIRVHAIGIGSRGSAVIPV 237 Query: 411 ------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + L A + +FEA L+++F + Sbjct: 238 PSPLGGTIYRRLDADLDAATLKGVAEITGGRYFEAGDATVLSRVFAE 284 >gi|94969085|ref|YP_591133.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551135|gb|ABF41059.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 349 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 58/162 (35%), Gaps = 23/162 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F++ V F+ + ++ R I + +TA+ D + A ++ Sbjct: 159 RLSVYAFSETVEEIVPFTSDLRRIDRGISE-----IIAGSATAMYDTIFLASKALMK--- 210 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI---AFSVNKTQQ 412 + +K +VL+TDG +T + A + +I + + + Sbjct: 211 ---------HDGRKVMVLITDGGDTFSSTSYEQAARAATQSETLLYSIIVVPVANSAGRD 261 Query: 413 EKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFER 452 + L + + A L+ F+ +I +E+ + Sbjct: 262 TGGEHALIQISQDTGGKHYYATDMGSLDVAFK-QISDELRTQ 302 >gi|321475776|gb|EFX86738.1| hypothetical protein DAPPUDRAFT_307863 [Daphnia pulex] Length = 829 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 72/444 (16%), Positives = 133/444 (29%), Gaps = 90/444 (20%) Query: 50 EHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 + ++ Q+A+ QS +V++ A+N+ F E + F N + R Sbjct: 63 KEKAQEEYQSAVDAG-----QSAAQVTANARNANQFTVSVNIEPEQKIFFNLTYEELLSR 117 Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQTKAEAETVS 168 + V V ++ L +N +++ LR + I E VS Sbjct: 118 RKGIYEQAIHVTPGSVVPKMSVRVNIFETLPINKITVPQLRGNDNEIAQIIRTNETTVVS 177 Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 Y I+ D + Q D + + + G++ + D Sbjct: 178 VVYEPTEAEQIKMSKDGLQGQFVVQYDVDRSSIE--------------KKGGEIHVVDGY 223 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + V P L FV +S + + A+ S++ ++K Sbjct: 224 FVHFFV-----------PADLPTLPKHVIFVLDTSGSMAGTRIEQTKQAMNSILDQLRKD 272 Query: 289 DNVNDTVR--MGATFFN--------DRVISDPS--------------------------- 311 +++ V G T ++ D + Sbjct: 273 EDIFSVVEFSSGVTEWDLRKPYKGPDHYYFNSPPEETTEDATAVPQNNESEVKFGPYDDI 332 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 ++ V + K F + ST INDA+ A +++ + I Sbjct: 333 LAYPVTEQSVKRAKEFVAAMDVTSSTNINDALLLALK--------NSQSVQSRVRLTPII 384 Query: 372 VLLTDGENTQDNEEGIAICNKAKS----QGIRIMTIAFSVNKTQQEKARYFLSNCASPN- 426 + LTDGE T + I + + I +AF FL+ +S N Sbjct: 385 IFLTDGEPTASVTDTTEILKNVRKGNSDDVVSIFCLAF-----GTGTDYQFLTKISSQNR 439 Query: 427 ----SFFEANSTHELNKIFRDRIG 446 +EA K F D + Sbjct: 440 GFARKIYEAADATLQLKGFFDEVA 463 >gi|172037673|ref|YP_001804174.1| hypothetical protein cce_2760 [Cyanothece sp. ATCC 51142] gi|171699127|gb|ACB52108.1| unknown [Cyanothece sp. ATCC 51142] Length = 423 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 58/158 (36%), Gaps = 29/158 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVKT 326 K V++A ++ + D R+ F+ R VI + ++ + I + Sbjct: 65 KPIKTVKEAAMRLVDGLGASD------RLSVVAFDHRAKVIVPNQPVDDIERVKQAIER- 117 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEE 385 G T+I++ M+ + +D V + I LLTDGEN DNE Sbjct: 118 ----LKPEGGTSIDEGMKLGIKEVALGKDDRVSQ----------IFLLTDGENEHGDNER 163 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + + A I + T+ F + L + A Sbjct: 164 CLKLAQVAAEYNITVNTLGF-----GNHWNQDVLESIA 196 >gi|207857733|ref|YP_002244384.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|206709536|emb|CAR33881.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] Length = 596 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 67/196 (34%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L++ AL ++ ++ DN+ G V Sbjct: 233 PPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 288 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 289 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 334 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 335 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 392 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 393 YSYIDSLSEAQKVLKD 408 >gi|149200158|ref|ZP_01877182.1| batB protein [Lentisphaera araneosa HTCC2155] gi|149136799|gb|EDM25228.1| batB protein [Lentisphaera araneosa HTCC2155] Length = 621 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 37/116 (31%), Gaps = 14/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R G F+ + + ++ + G T I A+ A Sbjct: 133 RFGLITFSRIANIECPLTSEPDMVLLYL-SDLNSSLLPGGGTNIAAALDHA--------- 182 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + K N + +VLL+DGE D + + + I + I+ K + Sbjct: 183 --QKQFKENERDSRVVVLLSDGE--TDGNKWRESLEALQKKKIPVNVISLGDPKRE 234 >gi|15451571|gb|AAK98695.1|AC069158_7 Hypothetical protein protein containing a von Willebrand factor type A domain [Oryza sativa Japonica Group] Length = 714 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 41/118 (34%), Gaps = 16/118 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 287 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVY 340 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I + ++ I + KN + + I+LL+DG++T Sbjct: 341 -SLTSNGGTNIAEGLRKGSKVIED------RQAKNPVCS---IILLSDGQDTYTVSPT 388 >gi|115449371|ref|NP_001048450.1| Os02g0806700 [Oryza sativa Japonica Group] gi|47497349|dbj|BAD19389.1| zinc finger-like [Oryza sativa Japonica Group] gi|113537981|dbj|BAF10364.1| Os02g0806700 [Oryza sativa Japonica Group] gi|215701428|dbj|BAG92852.1| unnamed protein product [Oryza sativa Japonica Group] Length = 723 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 41/118 (34%), Gaps = 16/118 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 296 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVY 349 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I + ++ I + KN + + I+LL+DG++T Sbjct: 350 -SLTSNGGTNIAEGLRKGSKVIED------RQAKNPVCS---IILLSDGQDTYTVSPT 397 >gi|226951529|ref|ZP_03821993.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] gi|226837721|gb|EEH70104.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] Length = 536 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 58/189 (30%), Gaps = 23/189 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S K LV+ L + ++ D V + Sbjct: 172 PPANLVFLVDVSGSMSAADKLPLVKQTLRILTEQLRAQDKVT------IITYASGEKLVL 225 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + G+TA A+Q AY + +KN + Sbjct: 226 EP---TSGEQKEKILAVINGLRAGGATAGEQAIQLAYKQAEKA------FVKNGINR--- 273 Query: 371 IVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + + + + GI + T+ F ++ A + Sbjct: 274 ILLATDGDFNVGITDFSTLKGMVAEKRKSGISLTTLGFGTGNYNEQLMEQLAD--AGDGN 331 Query: 428 FFEANSTHE 436 + ++ +E Sbjct: 332 YSYIDNKNE 340 >gi|126336627|ref|XP_001380264.1| PREDICTED: similar to Inter-alpha trypsin inhibitor, heavy chain 3 [Monodelphis domestica] Length = 894 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 69/219 (31%), Gaps = 23/219 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L FV S +K ++AL ++ +KK D +N Sbjct: 265 NGYFVHFFAPQNLPVVPKNVVFVIDVSGSMSGRKLVQTKEALLKILSDVKKDDFLN---- 320 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVK-----TFAIDENEMGSTAINDAMQTAYDTII 351 F+ V +W + + T F G T IND + + + Sbjct: 321 --FILFSSDVR-----TWKENLVPATPENLKAAEEFVHQIQATGGTNINDGLLRGIEMVN 373 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 + E ++ I++LTDGE I ++ + + F + Sbjct: 374 KAREMGTVLDRSTS----IIIMLTDGEANVGESRVEKIQENVRNAIGGKYPLYNLGFGYD 429 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 R L N +E + + + F D + N Sbjct: 430 VNYNFLERMALENNGVARRIYEDSDANLQMQGFYDEVAN 468 >gi|330447624|ref|ZP_08311272.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328491815|dbj|GAA05769.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 2023 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 36/283 (12%), Positives = 81/283 (28%), Gaps = 18/283 (6%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV-----KSYSSQNGK 221 + Y + V++ V+D S SM + + E L+ T+ Sbjct: 1473 IGDVYIESKNVNVNLVLDISGSMAMREVNPENIHLDNGHTSGKLTLVDRDGVKEHFDFNS 1532 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + L S Y P + + D V+ + ++++ + Sbjct: 1533 IDELSGILGLKFGSPLGFSVYSPELNITFPDGTTQTINDVYKDIGVVSRIDSAKESINHI 1592 Query: 282 IRSIKKIDNVNDT--VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + + + N ++ F +V F W K I + +G + Sbjct: 1593 VENYGETLNHEQLSNLKFSLITFASQVDGVKEFHWDFGKNTLVTSDGQTIADYLVGVEPL 1652 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--------QDNEEGIAICN 391 + T ++ N + + +TDG +T + Sbjct: 1653 KGPLS---QTDYDTSLVAALNGFTNTDTSNIVYFVTDGMDTIGTNGQQFNKDWVVSQTGG 1709 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 I+ +A +A L+ AS ++ +++ Sbjct: 1710 NIDKYKPTIVPVAIGGTINNTPEAHDILNQIASLGQGYKPDNS 1752 >gi|302346570|ref|YP_003814868.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302151211|gb|ADK97472.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 331 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 40/116 (34%), Gaps = 14/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + T I A+Q + ++ Sbjct: 131 KIGLIVFAGDAFVQLPITSDYVSAKMFLDNINP-SLIGTQGTDIGKALQLSMNSFT---- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 N + K I+L+TDGE+ + E + +A+S+GIR+ + + Sbjct: 186 -------PNSKVGKAIILITDGEDNEGGAE--EMAKQAQSKGIRVFILGVGSTEGA 232 >gi|94971019|ref|YP_593067.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94553069|gb|ABF42993.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 391 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 40/145 (27%), Gaps = 4/145 (2%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + A + + + F+ V + KL R I + Sbjct: 119 AEKTASEKFLDDM-LAQPKDQAF---LIHFDREVELMTDLTSSKDKLHRGIGELETSGPP 174 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 S+ + + + + + +K IV+LTDGE+ E Sbjct: 175 SQSSSDDGQRHRRGGTQLYDAIYLAASEILQKQQGRKAIVVLTDGEDRGSKETLTDAVEA 234 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARY 417 A+ + I F + Q Sbjct: 235 AQRADAIVYAIYFKGEQEQSRWGNG 259 >gi|109129708|ref|XP_001116037.1| PREDICTED: integrin alpha-D-like, partial [Macaca mulatta] Length = 101 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 41/97 (42%), Gaps = 7/97 (7%) Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 H+ AKK ++++TDG+ +D E + +A+ GI I + + Sbjct: 6 HKNGARKSAKKILIVITDGQKYKDPLEYRDVIPQAEKAGIIRYAIGVGRAFQEPTARQEL 65 Query: 419 LSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERV 453 + ++P + F+ ++ L I ++ E++ Sbjct: 66 NTIGSAPPQDHVFKVDNFAAL-----SSIQKQLQEKI 97 >gi|326444122|ref|ZP_08218856.1| von Willebrand factor, type A [Streptomyces clavuligerus ATCC 27064] Length = 519 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 58/184 (31%), Gaps = 29/184 (15%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV-----------K 325 AL + + D+ R G R + +G L R V + Sbjct: 348 ALRTTFDGLSGADDS----RTGKFARFHRGETLTVLRFGGKVLERRTVTYRGERDLERLR 403 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISS-NEDEVHRMKNNLEAKKYIVLLTDGENTQD-N 383 + E +TAI + AY T ED + IVL+TDG N + Sbjct: 404 SLVASEGFDTTTAIWSGLDAAYRTAAGMLREDPARPLS--------IVLMTDGRNNAGMS 455 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANSTHELNKIFR 442 + +A++ + ++ + + A A+ +A L F+ Sbjct: 456 LDAFLRAQRARTGPVA--SVRTYAVRYGEADADELGRAARATGGRLLDATE-RPLLDAFK 512 Query: 443 DRIG 446 + G Sbjct: 513 EMRG 516 >gi|294815777|ref|ZP_06774420.1| von Willebrand factor [Streptomyces clavuligerus ATCC 27064] gi|294328376|gb|EFG10019.1| von Willebrand factor [Streptomyces clavuligerus ATCC 27064] Length = 568 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 58/184 (31%), Gaps = 29/184 (15%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV-----------K 325 AL + + D+ R G R + +G L R V + Sbjct: 397 ALRTTFDGLSGADDS----RTGKFARFHRGETLTVLRFGGKVLERRTVTYRGERDLERLR 452 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISS-NEDEVHRMKNNLEAKKYIVLLTDGENTQD-N 383 + E +TAI + AY T ED + IVL+TDG N + Sbjct: 453 SLVASEGFDTTTAIWSGLDAAYRTAAGMLREDPARPLS--------IVLMTDGRNNAGMS 504 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANSTHELNKIFR 442 + +A++ + ++ + + A A+ +A L F+ Sbjct: 505 LDAFLRAQRARTGPVA--SVRTYAVRYGEADADELGRAARATGGRLLDATE-RPLLDAFK 561 Query: 443 DRIG 446 + G Sbjct: 562 EMRG 565 >gi|332872319|ref|XP_003319171.1| PREDICTED: collagen alpha-2(VI) chain isoform 2 [Pan troglodytes] Length = 828 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|332872317|ref|XP_003319170.1| PREDICTED: collagen alpha-2(VI) chain isoform 1 [Pan troglodytes] Length = 918 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|332872315|ref|XP_531504.3| PREDICTED: collagen alpha-2(VI) chain isoform 3 [Pan troglodytes] Length = 1019 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|297287371|ref|XP_001099130.2| PREDICTED: collagen alpha-2(VI) chain-like isoform 3 [Macaca mulatta] Length = 1029 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 599 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 658 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 659 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 718 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 719 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 766 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 767 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 806 >gi|296232327|ref|XP_002807820.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Callithrix jacchus] Length = 1018 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|115527066|ref|NP_478054.2| collagen alpha-2(VI) chain isoform 2C2a precursor [Homo sapiens] Length = 918 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|115527070|ref|NP_478055.2| collagen alpha-2(VI) chain isoform 2C2a' precursor [Homo sapiens] Length = 828 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|115527062|ref|NP_001840.3| collagen alpha-2(VI) chain isoform 2C2 precursor [Homo sapiens] gi|125987812|sp|P12110|CO6A2_HUMAN RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor Length = 1019 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|41350923|gb|AAH65509.1| Collagen, type VI, alpha 2 [Homo sapiens] gi|190690005|gb|ACE86777.1| collagen, type VI, alpha 2 protein [synthetic construct] gi|190691377|gb|ACE87463.1| collagen, type VI, alpha 2 protein [synthetic construct] Length = 1019 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|114557515|ref|XP_524757.2| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Pan troglodytes] Length = 919 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 45/116 (38%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+I ++ A+ I E+H + E ++LLTDGE+ + Sbjct: 376 PTYPRGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VLLLTDGEDNTASS----- 421 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + IA + ++A +S + F+ ++ L F Sbjct: 422 CIDEVKQSGAIVHFIALGRDA---DEAVIEMSKITGGSHFYVSDEAQNNGLIDAFG 474 >gi|312139258|ref|YP_004006594.1| type ii secretion system integral membrane subunit [Rhodococcus equi 103S] gi|311888597|emb|CBH47909.1| putative type II secretion system integral membrane subunit [Rhodococcus equi 103S] Length = 622 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 60/199 (30%), Gaps = 27/199 (13%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR-------MGATFFNDR 305 E S + ++ + + +K+ +D VR +G + Sbjct: 72 REIEVRQQPGSEQDIVLAIDVSGGMSGPALDDVKRA--ASDFVRQAPAGAHIGIVAISST 129 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + L+R I G++AI D++ TA + + Sbjct: 130 PQVLSELTTDSEDLLRRID-----GLKAGGNSAIADSVVTAAEMLER-----------GE 173 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 A ++LLTDG +T + + + + S +T + S Sbjct: 174 AANNILLLLTDGADTSSAHSMSELPSVLSRSRASLYAVQMSTPETNSALLQQVARE--SR 231 Query: 426 NSFFEANSTHELNKIFRDR 444 + A T L I++ Sbjct: 232 GQYASAGDTAALGAIYQSA 250 >gi|196230799|ref|ZP_03129660.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] gi|196225140|gb|EDY19649.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] Length = 1545 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 41/303 (13%), Positives = 79/303 (26%), Gaps = 28/303 (9%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 I +QT A A + + + M D + +N F Sbjct: 1101 IPQPEVQTSANAFSTFSLNVSDVSFKLAAASLEQGHMPDPASVRSEEFINAFDYRDPEPS 1160 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 R + P P L+ +D S + + Sbjct: 1161 PGAPLAFVTERARYPFAQNRDLLRFAVKTAAAGRQPGRP-LNIVLLLDRSGSMERADRVN 1219 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG--VHKLIRTIVKTFAID 330 +VR+AL+ + + ++ D ++ + W V V + Sbjct: 1220 IVREALSVLAKHLQPQDKLS-----------IVTFARTPHLWADAVAGDKVHDVIARVNE 1268 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI- 389 G T + A+ AY+ H + I+ TDG + A+ Sbjct: 1269 ITPEGGTNLEAALDLAYE--------TAHHHFAVDSTNRVILF-TDGAANLGDVNPDALT 1319 Query: 390 --CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + QGI + + N + + N+ + F +I Sbjct: 1320 KKVEAQRKQGIALDCFGIGWEGYNDDLLEQLTRN--ADGRYGFINTPEDAAANFATQIAG 1377 Query: 448 EIF 450 + Sbjct: 1378 ALQ 1380 >gi|197118219|ref|YP_002138646.1| hypothetical protein Gbem_1835 [Geobacter bemidjiensis Bem] gi|197087579|gb|ACH38850.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 356 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 37/90 (41%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KLI+S G + A+L+ +M G G+ VD L+ AA A + + L Sbjct: 6 KLIRSEKGMVIVYVAILLMMMFGFLGLAVDGGHLFKVRGELQNAADAAALKGAWHLYTRP 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNL 102 + + + + + ++ + N +N Sbjct: 66 TDPTQLPTLQWEVARFQAQQMITENSSDNT 95 >gi|332206625|ref|XP_003252399.1| PREDICTED: collagen alpha-1(XXVIII) chain [Nomascus leucogenys] Length = 1129 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 65/174 (37%), Gaps = 18/174 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + S+ I ++ +++ A F+ V DP FS Sbjct: 50 VFIVDSSESSKIVLFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + + + + K + Sbjct: 110 SWKDLQTFKQKVKSMNLIGQ---GTFSYYAISNATRLL---------KREGRKDGVKVAL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 L+TDG + N + +I A+ GI +TI S + + +S +S Sbjct: 158 LMTDGIDHPKNPDVQSISEDARISGISFITIGLSTVVNEAK--LRLISGDSSSE 209 >gi|298246130|ref|ZP_06969936.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297553611|gb|EFH87476.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 412 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 28/141 (19%), Positives = 51/141 (36%), Gaps = 22/141 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND--RVISDPSFSWGVHKLIRTIVKTF 327 K V++A+ VI ++ D + F+D +VI + + I + Sbjct: 60 KLRNVKEAVKMVIDRLEPSD------YISVVIFDDSAQVIIPSMPANDPVGMKAAIDR-- 111 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T ++ M I S E + N + ++LLTDG D + Sbjct: 112 ---IQDAGGTTMSLGM------IQSLGELRRWNIPNAVSR---MILLTDGVTYGDTDRCR 159 Query: 388 AICNKAKSQGIRIMTIAFSVN 408 + A + GI I + + Sbjct: 160 QLARDAAAAGISIYPLGIGAD 180 >gi|291290987|ref|NP_001167519.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 3 [Mus musculus] Length = 1150 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 NKGYYFE 454 >gi|291290985|ref|NP_001167518.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 1 [Mus musculus] Length = 1156 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 NKGYYFE 454 >gi|291290989|ref|NP_001167520.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 4 [Mus musculus] gi|187957756|gb|AAI58059.1| Cacna2d2 protein [Mus musculus] Length = 1147 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 NKGYYFE 454 >gi|149018615|gb|EDL77256.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Rattus norvegicus] Length = 1157 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 NKGYYFE 454 >gi|148689241|gb|EDL21188.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Mus musculus] Length = 1149 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 229 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 288 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 289 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 341 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 342 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 397 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 398 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 448 Query: 424 SPNSFFE 430 + +FE Sbjct: 449 NKGYYFE 455 >gi|40737976|gb|AAR89454.1| voltage-gated calcium channel alpha2-delta2 subunit [Mus musculus] Length = 1186 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 267 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 326 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 327 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 379 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 380 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 435 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 436 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 486 Query: 424 SPNSFFE 430 + +FE Sbjct: 487 NKGYYFE 493 >gi|161353447|ref|NP_064659.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 2 [Mus musculus] gi|81892698|sp|Q6PHS9|CA2D2_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Protein ducky; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|38614142|gb|AAH56389.1| Cacna2d2 protein [Mus musculus] gi|148689240|gb|EDL21187.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Mus musculus] Length = 1154 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 NKGYYFE 454 >gi|291290991|ref|NP_001167521.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 5 [Mus musculus] gi|26336631|dbj|BAC31998.1| unnamed protein product [Mus musculus] Length = 1148 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 NKGYYFE 454 >gi|15553133|gb|AAL01650.1|AF247141_1 voltage-dependent calcium channel alpha-2-delta-2 mutant subunit 2 [Mus musculus] Length = 1084 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 156 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 215 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 216 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 268 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 269 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 324 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 325 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 375 Query: 424 SPNSFFE 430 + +FE Sbjct: 376 NKGYYFE 382 >gi|12044402|gb|AAG47846.1|AF247139_1 voltage-dependent calcium channel alpha-2-delta-2 subunit [Mus musculus] Length = 1156 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 NKGYYFE 454 >gi|28212250|ref|NP_783182.1| voltage-dependent calcium channel subunit alpha-2/delta-2 [Rattus norvegicus] gi|81871226|sp|Q8CFG6|CA2D2_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|27450706|gb|AAO14653.1|AF486277_1 calcium channel alpha-2 delta-2 subunit [Rattus norvegicus] Length = 1157 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 NKGYYFE 454 >gi|26006175|dbj|BAC41430.1| mKIAA0558 protein [Mus musculus] Length = 1098 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 176 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 235 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 236 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 288 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 289 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 344 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 345 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 395 Query: 424 SPNSFFE 430 + +FE Sbjct: 396 NKGYYFE 402 >gi|325673437|ref|ZP_08153128.1| type II secretion system protein [Rhodococcus equi ATCC 33707] gi|325555458|gb|EGD25129.1| type II secretion system protein [Rhodococcus equi ATCC 33707] Length = 623 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 60/199 (30%), Gaps = 27/199 (13%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR-------MGATFFNDR 305 E S + ++ + + +K+ +D VR +G + Sbjct: 73 REIEVRQQPGSEQDIVLAIDVSGGMSGPALDDVKRA--ASDFVRQAPTGAHIGIVAISST 130 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + L+R I G++AI D++ TA + + Sbjct: 131 PQVLSELTTDSEDLLRRID-----GLKAGGNSAIADSVVTAAEMLER-----------GE 174 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 A ++LLTDG +T + + + + S +T + S Sbjct: 175 AANNILLLLTDGADTSSAHSMSELPSVLSRSRASLYAVQMSTPETNSALLQQVARE--SR 232 Query: 426 NSFFEANSTHELNKIFRDR 444 + A T L I++ Sbjct: 233 GQYASAGDTAALGAIYQSA 251 >gi|224097862|ref|XP_002311085.1| predicted protein [Populus trichocarpa] gi|222850905|gb|EEE88452.1| predicted protein [Populus trichocarpa] Length = 713 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 43/118 (36%), Gaps = 16/118 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ S + ++ Sbjct: 279 TKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFSLRRMSDAGRQHALQA-V 331 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I + ++ + E R KN + + I+LL+DG++T Sbjct: 332 NSLVANGGTNIAEGLRK------GAKVMEERREKNPVAS---IILLSDGQDTYTVSGS 380 >gi|218191772|gb|EEC74199.1| hypothetical protein OsI_09355 [Oryza sativa Indica Group] Length = 723 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 41/118 (34%), Gaps = 16/118 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 296 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVY 349 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I + ++ I + KN + + I+LL+DG++T Sbjct: 350 -SLTSNGGTNIAEGLRKGSKVIED------RQAKNPVCS---IILLSDGQDTYTVSPT 397 >gi|113475004|ref|YP_721065.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166052|gb|ABG50592.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 412 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 55/161 (34%), Gaps = 29/161 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVKT 326 + V+ A ++ +K+ D R+ F+ + VI + I K Sbjct: 57 RPLETVKQAAVQLVEKLKEGD------RLSVVAFDHQAQVIVPNQMINDSASIKGKINKL 110 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G TAI+ ++ + + ++ ++ +I LTDGEN + + Sbjct: 111 R-----ASGGTAIDKGLKLGIEELNKGRKES--------ISQAFI--LTDGENEHGDNDL 155 Query: 387 -IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + + A I + ++ F + + L A Sbjct: 156 CLKLAKLATDYNITLNSLGF-----GDDWNQDVLEKIADAG 191 >gi|323650180|gb|ADX97176.1| inter-alpha (globulin) inhibitor h4 isoform 1 [Perca flavescens] Length = 354 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 75/236 (31%), Gaps = 18/236 (7%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + + + + + P L FV S +K R AL ++ Sbjct: 50 VNRDTSLGDIKTSTGYFVHHFAPSSLSRIPKNVVFVIDQSGSMSGRKMQQTRIALIHILN 109 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + D+ G F+ + +K KTFA + + G+T IN A+ Sbjct: 110 DLAEDDHF------GLITFDSSIFHWKRELVQANKKNLESAKTFARNIQDRGTTDINAAV 163 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS---QGIRI 400 + + H + + I+LLTDG+ T I + + + Sbjct: 164 ------LEGARMLNAHPREGSAS---IIILLTDGDPTSGETNLERIQSNVRRDIADKFPL 214 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + F + + + L N +E + K F D + + V I Sbjct: 215 YCLGFGHDVNFEFLEKMSLQNNGVARRIYEDSDADLQLKGFYDEVATPLLTDVTMI 270 >gi|322688246|ref|YP_004207980.1| hypothetical protein BLIF_0055 [Bifidobacterium longum subsp. infantis 157F] gi|320459582|dbj|BAJ70202.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 362 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + G T I + + +A D + S +E IVL+TDG + D+++ Sbjct: 255 ADATDASGGTDIYEGLLSALDELPSESEASQ--------YTTAIVLMTDGRSNSDHQDEF 306 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K++ + + I +I F Q K+ L S F+ + +L +FR G Sbjct: 307 ESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGR-SGDLAAVFRQVKG 360 >gi|301767378|ref|XP_002919104.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Ailuropoda melanoleuca] Length = 1011 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 78/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 587 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 646 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 647 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALK 706 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AY+ +I + + R+ + V++TDG + D+ A+CN + + Sbjct: 707 FAYNQLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCN----HDVTVTA 754 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 755 IGIGDMFHERHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 794 >gi|324508797|gb|ADY43711.1| Vitrin [Ascaris suum] Length = 658 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 32/254 (12%), Positives = 84/254 (33%), Gaps = 25/254 (9%) Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 + K+ S+ ++ D +++ E Q L+ + S + N +G+ + Sbjct: 365 KGKEKDFATSVD--LNRCGPEGDIKKNDEIQQLSVRDVLKELNESSSHTLNEDIGVERKP 422 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHF---VDSSSLRHVIKKKHLVRDALASVIRSI 285 + + + P + + + +DS+ ++ + + Sbjct: 423 DAGIDNDPDINNLIAETSTPEEITTNFTRIPLPIDSNCQVDLMFIIDRSESVENEFQKQL 482 Query: 286 K---------KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + + VR+ A F + FS+ K +++ E+ GS Sbjct: 483 QFAVDLVKRMSTSDFASRVRVAAISFYSKA--KLEFSFDEFKEQSKVLEALLQIEHIGGS 540 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T+ + A D I + M +VL++DG + ++ + ++ ++ Sbjct: 541 TSAVSGVNLAVDEIKKAGRSNARHM---------VVLISDGNSQDTWDKVLEAADRLRAI 591 Query: 397 GIRIMTIAFSVNKT 410 + + S + Sbjct: 592 DADVYAVTVSHDYY 605 >gi|168229682|ref|ZP_02654740.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|194468558|ref|ZP_03074542.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194454922|gb|EDX43761.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|205335570|gb|EDZ22334.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] Length = 596 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L+R AL ++ ++ DN+ G V Sbjct: 233 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 288 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 289 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 334 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 335 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 392 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 393 YSYIDSLSEAQKVLKD 408 >gi|62001310|gb|AAX58357.1| AvrE [Pseudomonas viridiflava] Length = 1719 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 35/364 (9%), Positives = 100/364 (27%), Gaps = 15/364 (4%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + ++ A + + I L A Q + Sbjct: 1355 SNNRVRFANTAGVTAYARAQIN-LGHTDQAAAAQPATTTQPSAAAPTPAAVAQPGAQASA 1413 Query: 109 REVRDIVRDTAVEMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + +++ +Q V +++ L+ + A+A Sbjct: 1414 FGPNATITASIDSKTTKRTKFQTKEGVAMTTAELTKLSKQLDGAFKDKPAQKELSRLADA 1473 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + K IQ +D + + + + + T K ++ + + Sbjct: 1474 K-QPEYADKTPKEKIQAHLDGLNKLFEDRSPNNSAQKAALLSLSRATTKHDAAVDKHSVL 1532 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + ++ + + S ++ ++ +I ++ ++ + + S Sbjct: 1533 DNARHESNYTNLSRLDEQSVASKIMSMLGSMHSPSNAENIAKLIAEEPNLKSLIGQMKAS 1592 Query: 285 --------IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ D + + G + L + F + G Sbjct: 1593 PGTMARVRLEPKDEMMQKIDKGTRDGSITQKDIIGMLNDRDNLRIKAITVFKLAGQSDGF 1652 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + + ++ N+ N + + DGE ++ +E IA K Sbjct: 1653 TTPTPLLSASSSAAVNVNKTLAKINFNYGEDQNRPSSFSVDGEMSRPTKEVIATAQALKK 1712 Query: 396 QGIR 399 G+ Sbjct: 1713 DGME 1716 >gi|7495466|pir||T32949 hypothetical protein C05G6.3 - Caenorhabditis elegans Length = 341 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 56/146 (38%), Gaps = 14/146 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK--LIRTIVKTFAIDEN 332 +D ++ +K + ++ A F + + + K +++ ++ Sbjct: 133 KDLSKRLVSQLKIGPH---YTQVAAVTFAT--VGRTRVRFNLKKYQTQEEVLRGIDNLKS 187 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G+TAI ++ A + DE + + K +V TDG + + + Sbjct: 188 RGGTTAIGAGIEKALTQL-----DESEGARPGIATKVMVVF-TDGWSNKGP-DPEKRARD 240 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYF 418 A S G + T+A++ +T+Q F Sbjct: 241 AVSSGFEMYTVAYTRVETRQTYCTNF 266 >gi|167041757|gb|ABZ06500.1| putative NHL repeat protein [uncultured marine microorganism HF4000_010L19] Length = 1148 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 52/373 (13%), Positives = 111/373 (29%), Gaps = 51/373 (13%) Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI-KSWLIQTKAEA 164 + I +N + YQ LS + + +N S L K+ +++ Sbjct: 159 YVGYNNSYISLYERSSLNHVSTPYQSSLSYSWGMSVNSASNKLVIADYVKNRVVELNVSG 218 Query: 165 ETVSRSYHKEHGVS---------IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 ++S S S D S ++ Y D + F +Y Sbjct: 219 SSLSYSQATSASYSSSNGYFRRPTDAAYDSSGNI--YALDLYNNRIQKFNSSL-----TY 271 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL--RHVIKKKHL 273 ++ G + PY + + S + + ++ ++ + Sbjct: 272 QAKTGSWSVSAGFRYPYGMHIDSSDNIYVTDFYNYAVRKYDTSLNETATYGGGAGTRLDA 331 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK-------- 325 + + ++ N + GA + WG IR + Sbjct: 332 AKKVIKKIVS--------NTDLTSGA--------NFGLMEWGTRHNIRVKISDTGAKTIY 375 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T G T + AM A + S N + Y+++++DG + Sbjct: 376 TNVDGVYASGGTDLAKAMSIARNYFTSGQV----ANWNLSCSVNYLIVISDGYWSGHTT- 430 Query: 386 GIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 ++I + K+ I+ + F++ + + + A++ EL D Sbjct: 431 VLSIAEQIKNVYNIKTFAVGFALGGANSNYSTLATKGGTTSPLY--ASNQSELLAKLTDA 488 Query: 445 IGNEIFERVIRIT 457 I I ++ T Sbjct: 489 IKQAISGKLTFTT 501 >gi|290474520|ref|YP_003467400.1| putative tellurium resistance protein (Hypothetical) [Xenorhabdus bovienii SS-2004] gi|289173833|emb|CBJ80615.1| Putative tellurium resistance protein (Hypothetical) [Xenorhabdus bovienii SS-2004] Length = 223 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 67/194 (34%), Gaps = 22/194 (11%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-NDRVISDPS 311 L D+S K + L ++ S K + +++ F N++ + Sbjct: 15 LPVIVLADTSGSMASDGKIEALNQGLKDMLLSFKDESRLRAEIQVSVITFGNNKAEVNLP 74 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + L++ I + GST + A+ A I + + + K + Sbjct: 75 LA--PAHLLQDF-----IPLSAEGSTPLGGALSLASQMIEDKS------VIPSRAYKPVV 121 Query: 372 VLLTDGENTQDNEEG-IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--F 428 VL++DG D E + + +S ++ + + LS A+ Sbjct: 122 VLVSDGYPNDDWEGPFSSFASGERSSKATRFAMSIGGDADE-----DMLSEFANDPEAPL 176 Query: 429 FEANSTHELNKIFR 442 F A + ++ + FR Sbjct: 177 FRAENASDIRRFFR 190 >gi|198426251|ref|XP_002120495.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 1059 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 33/253 (13%), Positives = 77/253 (30%), Gaps = 18/253 (7%) Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE-HFV 259 + + T + S + + + + + S P + F+ Sbjct: 556 TDSNTYDSTCTFQCDQSNGYSLYPSGHSRTTCLSNSSWSWASPCCSRPCLNHSVIDMIFI 615 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 SS ++++ + VI + + R+ +N R H Sbjct: 616 LDSSSSVGSDNWDVMKNFVRDVINLLSITETG---TRVSIFRYNRRPDRQSQILLNDHIG 672 Query: 320 IRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT--D 376 + + + G T I +A+ A D I+ +++N V+LT D Sbjct: 673 DKQGLLSALDQMPYNGRGTWIGNALNHAKDFIL--------QLRNGDRPDVIDVVLTITD 724 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 G ++D+ ++ + + QG + + + L S + S Sbjct: 725 GR-SKDDVSVVS--EELRKQGALTYAVGVIPPNGKGPREEELLKIAGSTDRVKITTSFEG 781 Query: 437 LNKIFRDRIGNEI 449 F + +++ Sbjct: 782 TKDHFEKLMSDDL 794 >gi|160858159|emb|CAP19999.1| collagen type VI alpha 5 [Homo sapiens] Length = 527 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 16/100 (16%), Positives = 43/100 (43%), Gaps = 11/100 (11%) Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A++ A + + E H + K+ ++++TDGE + D+++ + +++GI I Sbjct: 3 ALKHA-NALF----TEEHGSRIKQNVKQMLIVITDGE-SHDHDQLNDTALELRNKGITIF 56 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + + N+ ++ +L +F Sbjct: 57 AVGV-----GKANQKELEGMAGNKNNTIYVDNFDKLKDVF 91 >gi|240140254|ref|YP_002964732.1| hypothetical protein MexAM1_META1p3745 [Methylobacterium extorquens AM1] gi|240010229|gb|ACS41455.1| hypothetical protein MexAM1_META1p3745 [Methylobacterium extorquens AM1] Length = 441 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 51/146 (34%), Gaps = 24/146 (16%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 +I AL ++G+ G +D R++ L+ A ++ +L+ V S +++ Sbjct: 1 MIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAG----GNALKLVVSSSESIV 56 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 Q I+ D V V + + + A QV+ Sbjct: 57 GLTTQTIQAEAKAGA---------DAPVSIQVTVASDKTSVEARAEQVI----------- 96 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSR 169 F +G+ S I +A A V R Sbjct: 97 KLTFGAFVGMASIPISARARASVVGR 122 >gi|227547429|ref|ZP_03977478.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212076|gb|EEI79972.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 362 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + G T I + + +A D + S +E IVL+TDG + D+++ Sbjct: 255 ADATDASGGTDIYEGLLSALDELPSESEASQ--------YTTAIVLMTDGRSNSDHQDEF 306 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K++ + + I +I F Q K+ L S F+ + +L +FR G Sbjct: 307 ESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGR-SGDLAAVFRQVKG 360 >gi|125535226|gb|EAY81774.1| hypothetical protein OsI_36948 [Oryza sativa Indica Group] Length = 633 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 27/115 (23%), Positives = 45/115 (39%), Gaps = 14/115 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTF 327 + +++ ++ +IR + D R+ FND V S V R+I Sbjct: 101 SRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKK 154 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G TA+ A++ A + DE N +I+LLTDG++T Sbjct: 155 IDRLQARGGTALMPALEEAVKIL-----DERQGGSRNHVG--FILLLTDGDDTTG 202 >gi|88800880|ref|ZP_01116434.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] gi|88776393|gb|EAR07614.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] Length = 555 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 45/357 (12%), Positives = 96/357 (26%), Gaps = 34/357 (9%) Query: 109 REVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL--------NPLSLFLRSMGIKSWLIQT 160 E R I D+A ++ S +V + Y L NP R K+ + Sbjct: 41 EETRAIETDSADQVFLAASKSRVEVQESYVLPSSTPIIPMPNPPVSENRENYPKTPISPI 100 Query: 161 KAEAETVSRSYHKEHGVSIQWVIDF---SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + A ++ + + + +N F + + Sbjct: 101 RQVATDPVSTFSTDVDTASYTNARRFLNQGMRPPADSIRVEEFINYFDYALPAPDTTNTP 160 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 +P SL P L+ +D S + K L++ + Sbjct: 161 IQISTERTQTPWNPQTELVRVSLQSYRSDFKTLPPLNLVFLLDVSGSMNSPDKLPLMQRS 220 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ ++ D V V G + V+ + + G T Sbjct: 221 FNLLVSQLRPQDRVAIAVYAGQSG---VVLEP------TSGDQKAQINQAINQLRAGGGT 271 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAK 394 + + AY D + + TDG+ A+ + + Sbjct: 272 HGSAGIHLAY--------DLAQANYLPDGINRIFI-GTDGDFNVGTTSLTELKALIERKR 322 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 G+ + + F ++ + + +S E K+F ++ + Sbjct: 323 EAGVFLSVLGFGTGNYNDALMEELSNH--GNGTAYYLDSYQEARKLFATQLAATLQT 377 >gi|307944861|ref|ZP_07660199.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307772075|gb|EFO31298.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 862 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 67/231 (29%), Gaps = 35/231 (15%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI---KKKHLVRDALASV 281 + S +++ C S + F S S+ I K + RDA Sbjct: 11 KTAFNSVFVLFCALSPIGSDLKAEESQRNTMIIFDASGSMWGQIGGKSKIEIARDA---- 66 Query: 282 IRSIKKIDNVNDTVRMGATFFNDR-------VISDPSFSWGVHKLIRTIVKTFAIDENEM 334 K T ++G + R + + G + T Sbjct: 67 FAEAKTA-WDAGTGQVGLIAYGHRRKGDCRDIETLVPMGSGSGAD----ISTRINSIRPK 121 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ +++ A + E +VL +DG T N + + + + Sbjct: 122 GKTPLSQSVRLAAQELQYREEAAT------------VVLFSDGIET-CNADPCLLAEELE 168 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 GI T + R L A + ++F+A+ L Sbjct: 169 RDGID-FTAHVIGFGIGSDADRKKLQCIAENTGGTYFDADDAGSLKDALGQ 218 >gi|301612325|ref|XP_002935678.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Xenopus (Silurana) tropicalis] Length = 524 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 81/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + + + + S G +P + Y Sbjct: 155 LNWTDALEDVFIQNRLEDPTLLWQVFGSATGVTRYYPATPWRAPSKIDLYDVRRRPWYIQ 214 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + F+++ Sbjct: 215 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVMEMLDTLS----DDDYVTVA--SFHEKAD 267 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F V +R ++K + G+T + A+ + +++ + Sbjct: 268 PVSCFRQLVQANVRNKKVIKEAVQEMVARGTTDYKAGFEYAFSQLQNTSITRANCN---- 323 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E NK +R+ T + + ++ CA Sbjct: 324 ---KMIMMFTDGGEDRVQDVFEKYNWPNKT----VRVFTFSVGQHNYDVTPLQWM--ACA 374 Query: 424 SPNSFFE 430 + +FE Sbjct: 375 NKGYYFE 381 >gi|198245970|ref|YP_002216383.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940486|gb|ACH77819.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|326624134|gb|EGE30479.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 593 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L+R AL ++ ++ DN+ G V Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 285 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ + E +K + Sbjct: 286 TAGNNTTAIKAAIDN-----LDTYGSTGGEAGLRLAYE------QAEKGFIKGGVNR--- 331 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 332 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 389 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 390 YSYIDSLSEAQKVLKD 405 >gi|326628521|gb|EGE34864.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 9] Length = 596 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 58/166 (34%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L++ AL ++ ++ DN+ G V Sbjct: 233 PPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 288 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 289 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 334 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQE 413 I+L TDG+ + A+ K + +GI + T+ + + Sbjct: 335 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEA 380 >gi|126341668|ref|XP_001379927.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2568 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 71/214 (33%), Gaps = 21/214 (9%) Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 SQ R + + C ++ + + +DS S + ++ Sbjct: 1859 SQFSHPIERLRRCTLCYDKCFPNVCAEEQIDQVHSYVDAIFLLDS-SRHVRGDEFEQMKA 1917 Query: 277 ALASVIRSIK-KIDNVNDTV--RMGATFFN--------DRVISDPSFSWGVHKLIRTIVK 325 L+SV+ + D V R+ ++ F + + + + Sbjct: 1918 FLSSVLDNFDISADPEQSDVGDRLALLSYSPRDFSIRRTSTPVKKEFDFTTYSSKALMRR 1977 Query: 326 TFAIDENEMGSTA-INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 ++ A + A+Q + + S + H KN K I ++T GEN ++ + Sbjct: 1978 HIHNSLKQLHGEAYVGHALQWTTNMVFS---NTFHSRKN-----KVIFVITAGENNENKD 2029 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 I +AK QG I I+ + + E Sbjct: 2030 VLRKISLRAKCQGYVIFVISLGSTREEIEDVASL 2063 >gi|116625273|ref|YP_827429.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228435|gb|ABJ87144.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 64/167 (38%), Gaps = 22/167 (13%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +S S+R K AL ++ + ++ FND D Sbjct: 73 DNSGSMRDKRGKVEAAALAL------VRDSNPDDEVF---IVNFNDEAFLDNPHGKDFTT 123 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 I+ + + + G TA+ DA++ + D + +++ H+ K K +V++TDG Sbjct: 124 NIKEM-EEALTRIDSRGGTAMRDALRMSIDHV----KEKAHKDK------KVLVVVTDGN 172 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE--KARYFLSNCA 423 + + ++ + I + ++ ++E +A+ L A Sbjct: 173 DNSSVVSLENLVKASQQSEVLIYGVGLLGDEERREAQRAQRALKALA 219 >gi|94969532|ref|YP_591580.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551582|gb|ABF41506.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 430 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 44/229 (19%), Positives = 80/229 (34%), Gaps = 28/229 (12%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + + V N + D ++ +D+S K+ V A +++++ Sbjct: 166 TSFTVYENGEPQQITSFRHEDIPVALGVVIDNSGSMRD--KRPAVNAATINLVKASN--- 220 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 ++ FND D ++ V KL + K G TA+ DA+ + Sbjct: 221 PEDEVF---VVNFNDDYYLDQDYTDSVAKLKEALEKY-----ETRGGTALYDAVLAS--- 269 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVN 408 H MK KK + ++TDGE+ K + G I TI Sbjct: 270 -------NAHLMKAPKLEKKVLFIVTDGEDDASLNTLEQTIRKVQQENGPTIYTIGILDE 322 Query: 409 KTQ-QEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERVI 454 + +A+ L A F S E+++I +I ++I + Sbjct: 323 TGGHKRRAQRALREMAESTGGVAFFPQSLDEVSRI-TQQIAHDIRNQYT 370 >gi|197250621|ref|YP_002147271.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|197214324|gb|ACH51721.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] Length = 598 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L+R AL ++ ++ DN+ G V Sbjct: 235 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 290 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 291 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 336 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 337 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 394 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 395 YSYIDSLSEAQKVLKD 410 >gi|20091285|ref|NP_617360.1| hypothetical protein MA2454 [Methanosarcina acetivorans C2A] gi|19916409|gb|AAM05840.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 551 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 26/116 (22%), Positives = 42/116 (36%), Gaps = 17/116 (14%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G +++ V D + R D G TA D + A I Sbjct: 414 IGLVSYSNDVNIDLPIA-KFDLNQRASFAGAVNDLQAGGGTATFDGIAVAMKMIQ----- 467 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQ----DNEEGIAICNKAKSQGIRIMTIAFSVN 408 E N+ I +L+DGE + ++ +GI + GI I TI ++ N Sbjct: 468 EQRAADPNIRP--VIFVLSDGETNKGHPLNDIKGI-----VEDTGIPIYTIGYNAN 516 >gi|148253705|ref|YP_001238290.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] gi|146405878|gb|ABQ34384.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] Length = 409 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 21/166 (12%), Positives = 55/166 (33%), Gaps = 2/166 (1%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 F ++ + G +I A+ +L G VD + S + L+++ A++ Sbjct: 8 ATFRNLCRRFGWASDGQISMIFAIASIPILVSVGAAVDFAKSSDVKAQLQKSIDAAVLAG 67 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 V + +++S+ A + + + +F +N + T + M Sbjct: 68 VVR--PNDQQISTAAAVFSGAYRGRFDTAATASFASNTDGSLTGTATTSVKTSFLNVMGT 125 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 +++ L + + + + A+ + S Sbjct: 126 SALGVTASATAKAGAQAQSSVCILLVSTVNAQSLLVNSGAQLNAPS 171 >gi|294651171|ref|ZP_06728503.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] gi|292822924|gb|EFF81795.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] Length = 446 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 58/189 (30%), Gaps = 23/189 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S K LV+ L + ++ D V + Sbjct: 172 PPANLVFLVDVSGSMSAADKLPLVKQTLRILTEQLRAQDKVT------IITYASGEKLVL 225 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + GSTA A+Q AY + +KN + Sbjct: 226 EP---TSGEQKEKILAVINGLRARGSTAGEQAIQLAYKQAEKA------FVKNGINR--- 273 Query: 371 IVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + + + + GI + T+ F ++ A + Sbjct: 274 ILLATDGDFNVGITDFNTLKGMVAEKRKSGISLTTLGFGTGNYNEQLMEQLAD--AGDGN 331 Query: 428 FFEANSTHE 436 + ++ +E Sbjct: 332 YSYIDNKNE 340 >gi|300772296|ref|ZP_07082166.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] gi|300760599|gb|EFK57425.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] Length = 212 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 59/171 (34%), Gaps = 17/171 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +I S+++ +T + F+ + + +D Sbjct: 21 IEAVKNGVQVMISSLRQNPQAIETAFLSVITFDSSARQLIPLT--------DLGAFQMVD 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A++ + I + K + + +I +TDG T D + G+ Sbjct: 73 IRATGTTSLGEALKVVSNCIDNEVAKTTSESKGDWKPLVFI--MTDGIPTDDWQSGLREF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 K K+ TIA + L + + ++K F Sbjct: 131 QKRKTA----YTIACAAGSGADT---SVLKQITENVVSLDTADSQSISKFF 174 >gi|126724455|ref|ZP_01740298.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] gi|126705619|gb|EBA04709.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] Length = 354 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 39/125 (31%), Gaps = 18/125 (14%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T I ++ AY + S +E H +K ++LLTDG D Sbjct: 234 IQPGGTTNIYGPLREAYGWL-SESERTDH--------QKAVILLTDGRANDDAASESQTL 284 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 K T + + + R N S A + + + + + Sbjct: 285 -AMKDDA---YTFVYYMGDSDDRWLRSLADNYFSGGGHVSAQ-----LERYFNVVSDAYS 335 Query: 451 ERVIR 455 + + Sbjct: 336 AQTVL 340 >gi|147677783|ref|YP_001211998.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] gi|146273880|dbj|BAF59629.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] Length = 312 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 52/172 (30%), Gaps = 39/172 (22%) Query: 13 KLIKSCTGHF--------FIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 KL+K+ G+F ++ M + G ++ D+ + L A + Sbjct: 5 KLLKNLLGNFLNNQNGLAAVMLCAGMAALFGFAALVTDIGLLAAKRQQLINTMDAAALAG 64 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + L + + A +Y +N ++ ++ Sbjct: 65 AQELPDNPAQAVQVA-----------RDYAGKNG----------------FAPDSLNISI 97 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + ++ R + ++F R +GI S + + A + + Sbjct: 98 SGDNRTISVAGREVVN----TIFARVLGIYSKTVSAGSSASVQGLTSCQGVA 145 >gi|90409149|ref|ZP_01217268.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] gi|90309757|gb|EAS37923.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] Length = 226 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 48/136 (35%), Gaps = 20/136 (14%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + VD + + LV+ L + + R+G F D ++ + Sbjct: 99 NEVDMRLDGQSVTRLSLVKSLLKKFVAT-------RQGDRLGLILFADHAYLQTPLTFDL 151 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + I +T TAI +++ A + R N ++ ++LLTD Sbjct: 152 KTIAQRIEETQIGLVGIR--TAIGESIAIA-----------IKRFVKNKNKQRILILLTD 198 Query: 377 GENTQDNEEGIAICNK 392 G NT + +K Sbjct: 199 GSNTAGRIKPCKQPDK 214 >gi|311268950|ref|XP_003132277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Sus scrofa] Length = 1153 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 42/247 (17%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N+ + T FSV + + CA Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPNRTGR----VFT--FSVGQHNYDVTPPAXXGCA 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 TKGYYFE 454 >gi|229546230|ref|ZP_04434955.1| pilus subunit protein [Enterococcus faecalis TX1322] gi|229308754|gb|EEN74741.1| pilus subunit protein [Enterococcus faecalis TX1322] Length = 1103 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 58/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + S D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGSQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|237667682|ref|ZP_04527666.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] gi|237656030|gb|EEP53586.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] Length = 1336 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 54/136 (39%), Gaps = 16/136 (11%) Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT--FFNDRVISD 309 S++ + DS K+ + ++ + I DN +R+G + ++ Sbjct: 517 SMNFNFYNDSIPYNEKDKRIYSLKQSAKQFINKFNNKDN----IRIGIIPYSYYSGYANN 572 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 ++ + + + + G+T D ++ A +++++ + KK Sbjct: 573 IKQLTEINDNNKKSYENYIDNIKVEGATNQGDGIREAGKMLLNTDGNS----------KK 622 Query: 370 YIVLLTDGENTQDNEE 385 Y++L+TDGE T E Sbjct: 623 YVILITDGEATAITIE 638 >gi|323137991|ref|ZP_08073065.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] gi|322396710|gb|EFX99237.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] Length = 296 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 30/82 (36%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L++ G +I A M+ + G VD R L+QA +++A+V Sbjct: 17 RTLLRDRNGAVAVIFAFATLPMVLLIGGAVDYSRAIGARSNLQQALDAGVLSAAVKGGNP 76 Query: 72 LEEVSSRAKNSFTFPKQKIEEY 93 +R NS P Sbjct: 77 DSGQLARYLNSNMSPGGAATNV 98 >gi|311268952|ref|XP_003132278.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Sus scrofa] Length = 1146 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 42/247 (17%), Positives = 80/247 (32%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 228 LNWTEALENVFIENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 287 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 340 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + +SN + Sbjct: 341 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN---- 396 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N+ + T FSV + + CA Sbjct: 397 ---KMIMMFTDGGEDRVQDVFEKYNWPNRTGR----VFT--FSVGQHNYDVTPPAXXGCA 447 Query: 424 SPNSFFE 430 + +FE Sbjct: 448 TKGYYFE 454 >gi|291395815|ref|XP_002714336.1| PREDICTED: complement component 2 isoform 2 [Oryctolagus cuniculus] Length = 613 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 21/140 (15%), Positives = 42/140 (30%), Gaps = 12/140 (8%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN-LEAKKYI 371 S + ++I ++ D T I A+ A + ++++ + E + I Sbjct: 181 SRDMTEVINSLENAKYTDHENGTGTNIYKALN-AVNIMMNNQMQRLGMETTGWQEIRHAI 239 Query: 372 VLLTDGENTQDNEEGIAICNKAKS---------QGIRIMTIAFSVNKTQQEKARYFLSNC 422 +LLTDG++ + K + I I + S Sbjct: 240 ILLTDGKSNMGG-SPKPAVDSIKDVLNIKEKRNDYLDIYAIGVGKLDVDWRELNELASKK 298 Query: 423 ASPNSFFEANSTHELNKIFR 442 F L ++F Sbjct: 299 DGERHAFILQDARALQQVFE 318 >gi|291395813|ref|XP_002714335.1| PREDICTED: complement component 2 isoform 1 [Oryctolagus cuniculus] Length = 744 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 21/140 (15%), Positives = 42/140 (30%), Gaps = 12/140 (8%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN-LEAKKYI 371 S + ++I ++ D T I A+ A + ++++ + E + I Sbjct: 312 SRDMTEVINSLENAKYTDHENGTGTNIYKALN-AVNIMMNNQMQRLGMETTGWQEIRHAI 370 Query: 372 VLLTDGENTQDNEEGIAICNKAKS---------QGIRIMTIAFSVNKTQQEKARYFLSNC 422 +LLTDG++ + K + I I + S Sbjct: 371 ILLTDGKSNMGG-SPKPAVDSIKDVLNIKEKRNDYLDIYAIGVGKLDVDWRELNELASKK 429 Query: 423 ASPNSFFEANSTHELNKIFR 442 F L ++F Sbjct: 430 DGERHAFILQDARALQQVFE 449 >gi|156382099|ref|XP_001632392.1| predicted protein [Nematostella vectensis] gi|156219447|gb|EDO40329.1| predicted protein [Nematostella vectensis] Length = 298 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 68/225 (30%), Gaps = 17/225 (7%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + D + P+ + SY + E+ P C + Sbjct: 46 LQDILQGEACPPVFQYKSHQLCAGASYLCNSDDECSPGERCCPQENECPLKCRKTIKKKS 105 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--V 306 ++ S R ++ ++ + D + +G F+ + Sbjct: 106 CPIDIAFLLDASGSIGRRSWEEIKNFVKSI------VDMCDISDQGTHVGIITFSTDPVI 159 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 GV I + + G T I+ A+ A ++ + + Sbjct: 160 DIAFDKYKGVEMNAVNIKRDIDELRRKKGYTFIDKALTLADKSLFT------QEAGMRED 213 Query: 367 AKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVN 408 ++K VL++DG T+D N K +G+++ T+ + Sbjct: 214 SQKVAVLMSDGIQTKDRGPFTPTDIAANPLKMKGVQVYTVGIGAD 258 >gi|48428050|sp|Q864V9|CFAB_GORGO RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690187|gb|AAM10005.1| complement factor B precursor [Gorilla gorilla] Length = 764 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 60/217 (27%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 263 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 317 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V S + + + + D T A+Q Y + ++ Sbjct: 318 PKIWVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 374 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 375 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDHKNPREDYLDVYVFGVG-P 432 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 433 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 469 >gi|238913524|ref|ZP_04657361.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] Length = 596 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L+R AL ++ ++ DN+ G V Sbjct: 233 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 288 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 289 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 334 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 335 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 392 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 393 YSYIDSLSEAQKVLKD 408 >gi|161524898|ref|YP_001579910.1| hypothetical protein Bmul_1725 [Burkholderia multivorans ATCC 17616] gi|160342327|gb|ABX15413.1| conserved hypothetical protein [Burkholderia multivorans ATCC 17616] Length = 626 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 50/160 (31%), Gaps = 8/160 (5%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + G ++ A+ + + + V G +DV + L++ A A I + + Sbjct: 10 RGKSRQRGSVAVMAAIWVMIAIVVLG-AIDVGNLYFQRRNLQRIADMAAIASVESMTDQC 68 Query: 73 EEVSSRAKNSFTFPKQKIE-EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + +S A + +Y +++ D T +N V Sbjct: 69 SQQNSPAMMAAQSNALANGFDYRASGQTLSIECGRWDTSATPYFNSTFTPLNA------V 122 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 +S + L F G + + A+ ++ Sbjct: 123 SVSVTQQVPYIFLGRFFGKSGSTGATVAAFSTAKAINIDS 162 >gi|156977400|ref|YP_001448306.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156528994|gb|ABU74079.1| hypothetical protein VIBHAR_06187 [Vibrio harveyi ATCC BAA-1116] Length = 515 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 40/283 (14%), Positives = 93/283 (32%), Gaps = 40/283 (14%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 + LL+P+M G V+ R+ L+ +A+ A + ++ ++ + + Sbjct: 22 MGLLLVPIM-GFTFWAVEGTRYVQESSRLRDSAEAAAMAVTIE-----DQPGAARALATK 75 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + + + + N D+ + Y V + +D Sbjct: 76 YVENYVRDIKSTNLSAQRFYQAEDKGTGALEYI----------QYTVNARTTHD------ 119 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHK---EHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + S I S+ Q + +++R Y ++ + I +V DFSRSM D S + + Sbjct: 120 -SWFASSFIPSFDKQQELAGRSLARKYPAYLGDNNIDIVFVSDFSRSMNDKWGSSWNKKI 178 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + D+ + ++ V P D + S++ Sbjct: 179 DDLKTAIDQISNNILCKS---------TRQEYVDGEWKDVC---DEPGDDTTSDKLLNRV 226 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + ++ + +V A+ + + + +ND Sbjct: 227 GFVPFNVRTREIVAGNRANATSQLSYKNGYKAYL--SPYSYND 267 >gi|149636044|ref|XP_001506552.1| PREDICTED: similar to collagen type XX alpha 1 [Ornithorhynchus anatinus] Length = 1500 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 29/135 (21%), Positives = 46/135 (34%), Gaps = 16/135 (11%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 LVR+ LAS+I D + +G + ++ DP W ++K A+ Sbjct: 254 LVREFLASLISPFNIA---RDKISIGLSQYS----GDPRTEWDLNKFASKDKVLEAVRNL 306 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE-AKKYIVLLTDGENTQDNEEGIAICN 391 + + E + A K ++LLTDG + +E A Sbjct: 307 RYKGGNTFTGL-----ALTHVLEQNLKLEAGPRPEADKIVILLTDG---KSQDEANAAAQ 358 Query: 392 KAKSQGIRIMTIAFS 406 K GI I I Sbjct: 359 ALKDLGISIFAIGVK 373 >gi|301622626|ref|XP_002940637.1| PREDICTED: integrin alpha-L-like [Xenopus (Silurana) tropicalis] Length = 1031 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 50/291 (17%), Positives = 95/291 (32%), Gaps = 41/291 (14%) Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 +E+ V +K ++ ++ S G+P +C+ P Sbjct: 85 SESCDVIIGANKTDASNLGLTLEVDPSHNKSIVCGPGKPRDCYQTPY------------- 131 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH--FVDSSSLRHVIKKKHLVRDALA 279 YM+ + + L PG D +E F+ S + ++V++ + Sbjct: 132 -----TNGECYMIDSSLATSKELNPGHQDCQKAEVDLCFMVDGSSSMGEIEVNIVKEFMK 186 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 +VI+S++ +V A F+ F++ + R A G T Sbjct: 187 NVIKSLE----NETSVHFAAIQFSTH--PKTEFTFADFQKDRNPDVLLANYRLLKGFTNT 240 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEA--KKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q D I + K KK +V+L DGE+T +++ KA Sbjct: 241 YKAIQYTLDRIFTE--------KYGSRPSAKKVLVILADGEST--DDDTTKAIEKADKAR 290 Query: 398 IRIMTIAFSVNKTQQEKARYF---LSNCASPNSFFEANSTHELNKIFRDRI 445 + I N ++ + F+A L + +I Sbjct: 291 VSRYIIGVGQNFKTEDLEAFVSWPAKEHTRTIEKFDAQQLTILFAELQRKI 341 >gi|284922261|emb|CBG35346.1| putative lipoprotein [Escherichia coli 042] Length = 588 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 61/166 (36%), Gaps = 21/166 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ G + R+ Sbjct: 226 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPS 281 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + GST ++ AY +K + Sbjct: 282 -----ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAAKG------FIKGGINR--- 327 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 I+L TDG+ ++ +I + K Q G+ + T + + Sbjct: 328 ILLATDGDFNVGIDDPKSIESMVKKQRESGVSLSTFGVGDSNYNEA 373 >gi|94501046|ref|ZP_01307570.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] gi|94426793|gb|EAT11777.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] Length = 867 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 43/222 (19%), Positives = 73/222 (32%), Gaps = 41/222 (18%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 P S +D S ++L AL +++ + D + G F V Sbjct: 42 PTSKSSDVRVLIDMSGSMKDNDPENLRIPALNLIVQLLP------DGSQAGVWTFGQWVN 95 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + V+ RT K A N G T I +AM+ A + + ++ E H Sbjct: 96 MLIPPA-EVNSEWRTNAKEKAKMINSFGLRTNIGEAMERATWKLAADSDFEQHA------ 148 Query: 367 AKKYIVLLTDG-----ENTQDNEEGIAICN----------KAKSQGIRIMTIAFSVNKTQ 411 +LLTDG + ++ + K+ G++I TIA S Sbjct: 149 -----ILLTDGIVDIAADDDPQKDNKNEAERQRILTDVLSEYKNLGVKIHTIALS----- 198 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 + L A + ++ +L K F + E Sbjct: 199 NAADKVLLEKLALETGGMAEVVENSEQLVKAFLNAFDKAAPE 240 >gi|332262981|ref|XP_003280535.1| PREDICTED: integrin alpha-M-like [Nomascus leucogenys] Length = 997 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 70/181 (38%), Gaps = 18/181 (9%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++ +++V+ +KK + ++ F+ + + L++ I++ F Sbjct: 169 MKEFVSTVMEQLKKSKTLFSLMQYSEEFWTHFTFKEFQDNPNPRSLVKPIMQLF------ 222 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T ++ + + + K +V++TDGE D + +A Sbjct: 223 -GRTHTATGIRKVVRELFNITQGARKNAF------KILVVITDGEKFGDPLGYEDVIPEA 275 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIF 450 +G+ I + +R L+ AS + F+ N+ L I ++++ +IF Sbjct: 276 DREGVIRYVIGVG-DAFHSMNSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLREKIF 333 Query: 451 E 451 Sbjct: 334 A 334 >gi|239814531|ref|YP_002943441.1| hypothetical protein Vapar_1524 [Variovorax paradoxus S110] gi|239801108|gb|ACS18175.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 409 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 16/113 (14%), Positives = 37/113 (32%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 +G I AL++ +LG G+ +D R + L+ A + ++A+ L + + + Sbjct: 8 HRQSGAVIITVALVLLFLLGFMGIALDFGRLFIVKTELQTALDSCALSAAQELDGAGDAL 67 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + T + + + T + + A Sbjct: 68 TRATSAGKTAADLNKINFQGEATGLAQTEVVFSDSLIGTYSHTFTPIANARYA 120 >gi|119629723|gb|EAX09318.1| collagen, type VI, alpha 2, isoform CRA_c [Homo sapiens] Length = 1019 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|119629721|gb|EAX09316.1| collagen, type VI, alpha 2, isoform CRA_a [Homo sapiens] Length = 828 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|119629722|gb|EAX09317.1| collagen, type VI, alpha 2, isoform CRA_b [Homo sapiens] Length = 918 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|13603394|gb|AAA52056.2| type VI collagen alpha 2 chain precursor [Homo sapiens] Length = 1019 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 30/227 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 589 LTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAI 648 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D ++T R+G ++ + + + E G T A++ Sbjct: 649 AKDPKSETGTRVGVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALK 708 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AYD +I + + R+ + V++TDG + D+ A+C+ + + + Sbjct: 709 FAYDRLIKESRRQKTRV--------FAVVITDGRHDPRDDDLNLRALCD----RDVTVTA 756 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNE 448 I ++ ++ S C P + + +F D + + Sbjct: 757 IGIGDMFHEKHESENLYSIACDKPQ---QVRNMT----LFSDLVAEK 796 >gi|330465656|ref|YP_004403399.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328808627|gb|AEB42799.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 410 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 64/207 (30%), Gaps = 21/207 (10%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI---KKKHLVRDALASVIRSIKKIDNVNDT 294 S P P + V S I + + + A V+ ++ D Sbjct: 3 ASADDTAEPVTEPPRVQLVLDVSGSMRATDIDGRSRISVAQQAFGEVVDALP--DETQLG 60 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID----ENEMGSTAINDAMQTAYDTI 350 +R+ + + + + A G T + A++ A + Sbjct: 61 IRVLGATYRGEDKQQGCLDTQQIVPVGPVNRERAKAAVATLRPTGFTPVGLALREAAKDL 120 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + IVL+TDGE+T + + + +QG ++ + Sbjct: 121 ------------GGGTTARRIVLITDGEDTCAPPDPCQVARELAAQGTTLVVDTLGLAPD 168 Query: 411 QQEKARYFLSNCASPNSFFEANSTHEL 437 ++ + + A+ ++ A S +L Sbjct: 169 EKVRRQLLCIAAATGGTYTAATSAEDL 195 >gi|303241521|ref|ZP_07328022.1| viral A-type inclusion repeat-containing protein [Acetivibrio cellulolyticus CD2] gi|302590939|gb|EFL60686.1| viral A-type inclusion repeat-containing protein [Acetivibrio cellulolyticus CD2] Length = 1061 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 54/379 (14%), Positives = 119/379 (31%), Gaps = 32/379 (8%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L K G + A+++ VML + G+LVD R E ++ + A + L+ Sbjct: 3 LFKKNEGAITVYLAIILSVMLILTGVLVDGARARTAEAQVQSTTEAAANSLLANYNNILK 62 Query: 74 E---VSSRAKNSFTFPKQKIEEYLIRNFENNL---KKNFTDREVRDIVRDTAVEMNPRKS 127 E + S ++N+ ++++ YL RN L K N +D + + + Sbjct: 63 EWFGLMSLSENNSDVLEEELMYYLNRNLMTELGAEKANLSDESWNYVKKFLGSDNKYNDV 122 Query: 128 AY---------QVVLSSRYDLLLNPLS-----LFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + +V Y+L + + +++ ++ + + + + Sbjct: 123 NFIDMYDYRIEEVSACPMYNLSESAVLRSQIVEYMKYRAPEALGEEFLEKINVLKSFKKQ 182 Query: 174 EHGVSIQWVIDFS-----RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 +S + ID S + + D E L D+ K KV + E Sbjct: 183 SEALSSKLEIDRSLNEIKKELEKLSNDIEKVNLYDAKVLEDKLKKVCEKTTEKVLLEKEC 242 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI-KK 287 N Y + D + ++ + + ++ +S Sbjct: 243 EYLKEQKKNAEEELNNYKKSESYKKELKSLSDQLAAAADQSEREAILAEIGNLCKSYEDV 302 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I + ++ + L + I + + E + + T Sbjct: 303 IKTAKEAYDRACDNYSSCEDEAKEL---IKDLNKYIDEYYGYSETALAHANKVMGLST-- 357 Query: 348 DTIISSNEDEVHRMKNNLE 366 I+ ED ++K + Sbjct: 358 -DILKQIEDMELQLKGDTS 375 >gi|221123691|ref|XP_002160228.1| PREDICTED: similar to CnPolydom [Hydra magnipapillata] Length = 954 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 73/237 (30%), Gaps = 27/237 (11%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 K ++ G + +M + + + + S + S + Sbjct: 18 GGQKRCINKRPLTGPIPDTQLEFMNCWSNEVEHGASDLVILLDSSGSMYSTGSFQGVTMT 77 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + +++ + I + +G N ++ + K F Sbjct: 78 GFDIGKTFINALLSKVH-ISFNATRIAIGTFGTNHKIDINFILRPDYSMHKCKFKKDFEK 136 Query: 330 DENEMGSTAINDAMQTA---YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDN 383 G T + A+Q + + + S+ + R K N + ++LL+DGE N Sbjct: 137 IRIYGGMTNLRGALQDSLNIFRELDSNPDTHKKRHKTN----RVVILLSDGEGNVMDNPN 192 Query: 384 EEGIAICNKAKSQG-----------IRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 G+ + + + TI + ++A L A+ + F Sbjct: 193 GRGVTHNDGLARNPHDIAHNLRLGLVEVYTIGV---TSAPDRA--VLEGLATEKNLF 244 >gi|149923516|ref|ZP_01911918.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] gi|149815646|gb|EDM75176.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] Length = 716 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 53/158 (33%), Gaps = 19/158 (12%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ------TAYDTIISSNE 355 ++D + P F + + T + +Q AY +N+ Sbjct: 416 YDDGQANPPEFD---APTFSHMPQCAGNGFCSGSGTYTHLGLQLVKDYQQAYSGSTMNND 472 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--CNKAKSQGIRIMTIAFSVNKTQQE 413 + + + +L+TDG+ + + + GI I F + Sbjct: 473 MAPYPTADETLY--FNILITDGQYNGYSTNAQVQGELEEMYNSGITTYVIGFG-DGVDTA 529 Query: 414 KARYFLSNCA-----SPNSFFEANSTHELNKIFRDRIG 446 A+ L N A S N++++AN+ EL + G Sbjct: 530 AAQAQLQNMAQWGSGSQNNYYDANNQTELEQALTTIFG 567 >gi|297680998|ref|XP_002818254.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Pongo abelii] Length = 1125 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 65/174 (37%), Gaps = 18/174 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + S+ I ++ +++ A F+ V DP FS Sbjct: 50 VFIVDSSESSKIVLFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + + + + K + Sbjct: 110 SWKDLQTFKQKVKSMNLIGQ---GTFSYYAISNATRLL---------KREGRKDGVKVAL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 L+TDG + N + +I A+ GI +TI S + + +S +S Sbjct: 158 LMTDGIDHPRNPDVQSISEDARISGISFITIGLSTVVNEAK--LRLISGDSSSE 209 >gi|297620568|ref|YP_003708705.1| hypothetical protein wcw_0325 [Waddlia chondrophila WSU 86-1044] gi|297375869|gb|ADI37699.1| putative membrane protein [Waddlia chondrophila WSU 86-1044] Length = 374 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 80/248 (32%), Gaps = 24/248 (9%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 Y + ++ L + Y++ S + F + I K L Sbjct: 74 HYFVEKAPEVKKETPLQQVSIPSEGIAIYLVLDQSGSMSEEVKVFRKT------ITKMDL 127 Query: 274 VRD-ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 +++ V+ + ++ MG F + +I + K + Sbjct: 128 LKEVTKGFVLGNKQEGLTGRPQDMMGLVTFARGAQVLAPLTLDHQAIIDQLSKLQYTTDL 187 Query: 333 EMGSTAINDAMQTAYDTIISS---NEDEVHRMKNNLEAKK-YIVLLTDGENTQD------ 382 E TAI A+ + I ++ E+ K K ++L+TDG + Sbjct: 188 EQDGTAIGYAIYKTANLIAATRHYAEELEGAGKPAYTIKNSIMILVTDGLQAPNPLDQGK 247 Query: 383 ---NEEGIAICNKAKSQGIRIMTIAF--SVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 N E + AK G+++ I + + R + + F+ +++ Sbjct: 248 EFRNVELLDAAVYAKKLGVKVYIINVEPRIASEEFSAHRLLMKKITELTGGRFYMVDNSL 307 Query: 436 ELNKIFRD 443 L+ I+ + Sbjct: 308 NLSSIYSE 315 >gi|229816811|ref|ZP_04447093.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] gi|229785827|gb|EEP21941.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] Length = 1185 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 49/173 (28%), Gaps = 17/173 (9%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + S G D+ S + + + P D L + + + Sbjct: 455 NNKSIVKTDGGDGDQYTLNLTASGDSTSSTVTTATPADIVLVMDKSGSMNENNRDANAQK 514 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 +D ++ V+M F+ F+ V ++ A+ N Sbjct: 515 AAKDLAKKLLTGTNSKLPPEQQVQMAVVTFSTEASLKQKFTTNVSEINN------AVRGN 568 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T A++ A D KK+I+ L+DG T Sbjct: 569 PDGGTNWEAALKQANDMQGGRRG-----------VKKHIIFLSDGNPTYRTTS 610 >gi|254480861|ref|ZP_05094107.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038656|gb|EEB79317.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 726 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 31/132 (23%), Positives = 49/132 (37%), Gaps = 16/132 (12%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 T + A + G T I D++ A H + K I+L++ Sbjct: 333 TANRQATAINRTADALSAYGYTPIADSLTLA-----------GHDLLAIDAQKHMIILIS 381 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMT--IAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 DGE T A+ +S GI + T I F ++ T Q++ + S A +F+A Sbjct: 382 DGEETCGGF-PAAVAANLRSLGIDLQTHVIGFDLDATAQQQMQAIAS--AGGGQYFDAAD 438 Query: 434 THELNKIFRDRI 445 EL I Sbjct: 439 GDELGASLMRVI 450 >gi|312133570|ref|YP_004000909.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311772822|gb|ADQ02310.1| Hypothetical protein BBMN68_1309 [Bifidobacterium longum subsp. longum BBMN68] Length = 362 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + G T I + + +A D + S +E IVL+TDG + D+++ Sbjct: 255 ADATDASGGTDIYEVLLSALDELPSESEASQ--------YTTAIVLMTDGRSNSDHQDEF 306 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K++ + + I +I F Q K+ L S F+ + +L +FR G Sbjct: 307 ESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGR-SGDLAAVFRQAKG 360 >gi|320105612|ref|YP_004181202.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924133|gb|ADV81208.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 335 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 81/216 (37%), Gaps = 23/216 (10%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN-----DTVRMGATF 301 G LD + E + + ++ + ++ D S+ + + + + G Sbjct: 90 GLLDDNRPPERVIKFTQQTNLPLRVGVLMDTSGSIRQRFQFEQDAATEFFLQVLHRGDAA 149 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F + + ++ G TA DA + ++ D++ + Sbjct: 150 FVMGFDVKTDLAQDYTNSV-DLLNQAIHKLRPGGGTAFFDA-------LYTTCRDQMLTL 201 Query: 362 KNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K + ++ +++++DG + Q + I +C +A++ + TI+ +++ T ++ A Sbjct: 202 KESNTVRRALIVVSDGHDNQSRAQENDAIKMCQRAET---IVYTISTNISPT-KDAADEV 257 Query: 419 LSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFER 452 L A+ F N ++ F I E+ + Sbjct: 258 LRRIADATGGRVFFPNRIEDVANGFHS-IEEELRSQ 292 >gi|126660809|ref|ZP_01731904.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] gi|126617906|gb|EAZ88680.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] Length = 416 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 47/139 (33%), Gaps = 20/139 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V++A ++ S+ D R+ F+ + V Sbjct: 57 KPIKTVKEAAIRLVESLGSGD------RLSVVAFDHKAKVIVP---NQPIDDIKTVNQQI 107 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGI 387 G T I++ M+ + +D V + I LLTDGEN DNE + Sbjct: 108 QRLEPAGGTCIDEGMKLGIKEVALGKDDRVSQ----------IFLLTDGENEHGDNERCL 157 Query: 388 AICNKAKSQGIRIMTIAFS 406 + A I + T+ F Sbjct: 158 KLAQVAAEYNITLNTLGFG 176 >gi|322690259|ref|YP_004219829.1| hypothetical protein BLLJ_0067 [Bifidobacterium longum subsp. longum JCM 1217] gi|320455115|dbj|BAJ65737.1| conserved hypothetical protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 380 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + G T I + + +A D + S +E IVL+TDG + D+++ Sbjct: 273 ADATDASGGTDIYEGLLSALDELPSESEASQ--------YTTAIVLMTDGRSNSDHQDEF 324 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K++ + + I +I F Q K+ L S F+ + +L +FR G Sbjct: 325 ESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGR-SGDLAAVFRQVKG 378 >gi|301625572|ref|XP_002941978.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like, partial [Xenopus (Silurana) tropicalis] Length = 476 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 64/183 (34%), Gaps = 17/183 (9%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK--TF 327 K +A ++ + + D+ G F+D+V + V + I+K F Sbjct: 5 KIKQTYEAFLKILADLPEEDHF------GILIFDDKVDKWQNTL--VKAVPDNIIKAKQF 56 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IN A+ A + +++ +++ + I+ L+DGE T Sbjct: 57 VSKISARGGTDINKALLAAVKMLKNTSRNKL----LPKISTSIILFLSDGEPTSGVTNHN 112 Query: 388 AICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I N K + + F + + L N +E + + F + Sbjct: 113 EIINNVKKANERQTTLYCLGFGNDVDFNFLEKMALENGGLARRIYEDSDAALQLQGFYNE 172 Query: 445 IGN 447 + N Sbjct: 173 VAN 175 >gi|291229809|ref|XP_002734863.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 2065 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 34/156 (21%), Positives = 65/156 (41%), Gaps = 26/156 (16%) Query: 280 SVIRSIKKIDNVN-DTVRMGATFFND-----RVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + IR I I +++ D R+ ++D R I S G +K T + ++ E Sbjct: 77 NFIREISTIFSMSPDEARVSVVTYSDSSKIVRQIDYIGSSVGKNKC--TFLGELSLIRYE 134 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A++ A + + + +VLLTDG++T+ + + I + Sbjct: 135 AGWTDTKGALEEADRVLQHARSGA----------NRLVVLLTDGQSTEG--DPVGIATRI 182 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 +++GIRI+ I + L++ A+ F Sbjct: 183 RNKGIRIVAIGVGNVNMDE------LTSIATAQYVF 212 >gi|257082947|ref|ZP_05577308.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] gi|256990977|gb|EEU78279.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] Length = 1148 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 50/328 (15%), Positives = 98/328 (29%), Gaps = 25/328 (7%) Query: 52 ALKQAAQTAIITASVP--LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 L A + + + + P L+ E + + +E + KN + Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTVPENTKETNKNDSAP 179 Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 E + T NP A + R +L L ++ + ++ + Sbjct: 180 EKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNV 239 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 H+ + + D Q G P N + + E Sbjct: 240 LNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETT 291 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 +P + L L VD S + + V+ + + ++ Sbjct: 292 TPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSG 349 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N+ + MG ++ ++ + G ++ +K + G T A++ A D Sbjct: 350 ITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKALRDAGDM 407 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + N KK IVLLTDG Sbjct: 408 LATPNGH-----------KKVIVLLTDG 424 >gi|85859136|ref|YP_461338.1| hypothetical protein SYN_01495 [Syntrophus aciditrophicus SB] gi|85722227|gb|ABC77170.1| hypothetical exported protein [Syntrophus aciditrophicus SB] Length = 364 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 75/247 (30%), Gaps = 26/247 (10%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 F K ++ G +I AL M V LG+ M VDV ++ L AA ++ Sbjct: 8 FGFK--LRERKGAVAVIVALAMTVFLGIAAMAVDVGHIMVVKNELHNAADA----DALAR 61 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 L + S T P + ++ Sbjct: 62 ANLLYAHTPSGFTSATPPTPDWAAA---------------ESAASTIDPANKSDGVTLTS 106 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI-QWVIDFSR 187 Y+V + ++L NP L +S+ + + A + R +G SI W F Sbjct: 107 YEVE-TGYWNLDQNPAGLQPKSITPGTRDV---AAVKVTVRRVDGTNGGSIRHWFGAFVG 162 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 ++ + + G T+ + SP + S Y+ Sbjct: 163 NLTSNASATAIAICSSPGTAKPGTLVPMAIPLWIAKRASHYNSPSNLLTIGSAYHYDAYA 222 Query: 248 PLDPSLS 254 P++P Sbjct: 223 PVNPDGD 229 >gi|323700441|ref|ZP_08112353.1| hypothetical protein DND132_3035 [Desulfovibrio sp. ND132] gi|323460373|gb|EGB16238.1| hypothetical protein DND132_3035 [Desulfovibrio desulfuricans ND132] Length = 389 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 61/167 (36%), Gaps = 9/167 (5%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + ++ + +G ++ +L M ++G+ + VD+ R AL+ AA + + Sbjct: 1 MRTHFRR-PDNESGFATVMVSLCMAALMGLTALAVDLGRAYLKRSALQTAADAGALAGAN 59 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L+ + +++ + + + + + K TD ++ + E P + Sbjct: 60 SLLAAGKDLDKLRLIVTNYTTRNLTD------ADGPAKALTDADIVFLRDGVPDEEQPNQ 113 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 V LS + F +++G + I + A K Sbjct: 114 VEVTVTLSGERENAFP--LYFGKAVGKPAMDIVVTSRAGLAGMCSSK 158 >gi|306824220|ref|ZP_07457590.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309801684|ref|ZP_07695804.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|304552423|gb|EFM40340.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221626|gb|EFO77918.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 967 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 53/417 (12%), Positives = 130/417 (31%), Gaps = 49/417 (11%) Query: 48 YYEHALKQAAQTAIITASVPLIQ---SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK 104 E +A ++ + P + ++ P + + + Sbjct: 111 QSEEQKAGSADEPVMELATPSEAQPATTSATPTQKPTGTENPTTVERSVQSDDDDADTVA 170 Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQ-VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 N + + + + ++ ++Y+ ++ S+ L + + G ++ ++ Sbjct: 171 NQNEAKDDETKDNADKTVHLGIASYRGMLKSASAGLSTPEHTKSIEYQGNGAYTLKLDVT 230 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLD--------YQRDSEGQPLNCFGQPADRTVKSY 215 + S S + I V+D S SM D + + +N F +T + Sbjct: 231 GKDASTSTTDTTPIDIALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTI 290 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 N KV + K + + + + + S + + ++ + L++ Sbjct: 291 EDDNNKVKVALVKYANQIGTATGADGC-----RISNSRQSDTGNCTQIVQELTTDAGLLK 345 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 ++ + + + V + + + G + ++ + N Sbjct: 346 TSVNGLQAAGATYADAAMEV------------AQQALAGGRAGAKKYVIFFTDGEPNHWS 393 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 D + ++ + +KN Y + + DG N + + NK Sbjct: 394 GFD---------DDVANAAIKKSQELKNAGT-TVYSIGIFDGANPSASVSSASNANK--- 440 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 S N R AS + ++ A+S +L +IF D I I E+ Sbjct: 441 -----FMHGISSNYPNATGYRSLGDR-ASGDYYYSASSATQLAQIFND-IQKTITEK 490 >gi|291569722|dbj|BAI91994.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 213 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 16/141 (11%), Positives = 44/141 (31%), Gaps = 15/141 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S S+ + V + +++ ++ +T + F + Sbjct: 11 VLDCSGSMCG--EPIEAVNQGIKALVAELQSEPYAIETAYLSVITFESTAQQVFPLT--- 65 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++K + G+T++ DA++ + K + + + L+TD Sbjct: 66 -----ELMKFQPPVLSAGGTTSLGDALKLLTQCFDKEVKKASDTQKGDWKP--LVFLMTD 118 Query: 377 GENTQDNEEGIAICNKAKSQG 397 G + ++ K + Sbjct: 119 G---MPTDTWEKAADELKQKK 136 >gi|149410435|ref|XP_001512838.1| PREDICTED: similar to Coch-5B2 gene product [Ornithorhynchus anatinus] Length = 692 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 17/162 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ D +A V+ + + D A F Sbjct: 498 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-DFVARVVETFEISDIGTKV---AAVQF 553 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + D Sbjct: 554 T--YDQRTEFSFTDYTTKENVLAVIRQIRYMSGGTATGDAVAFTVRNVFGPLRDS----- 606 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 K ++V+LTDG++ D A A GI + +I Sbjct: 607 ---PNKNFLVVLTDGQSYDDVRGPAA---AAHKAGITVFSIG 642 >gi|330901495|gb|EGH32914.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 218 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 32/95 (33%), Gaps = 13/95 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 135 RVGLILFGSQAFVQAPLTYD-RRTVRVWLDEARIGI-AGKNTALGDAIGLALKRLRMRPA 192 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +VL+TDG N + I Sbjct: 193 TS-----------RALVLVTDGANNAGQIDPITAA 216 >gi|116253186|ref|YP_769024.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115257834|emb|CAK08932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 797 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 56/435 (12%), Positives = 110/435 (25%), Gaps = 57/435 (13%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNS-----FTFPKQKIEEYLIRNFENNLKKNFTDR 109 A + I+ + A+ K + E N N N Sbjct: 127 SAVDALKMQIGERFIEGQIKPRQEAREIYEQAKAEGKKTALLEQQRPNIFTNQVANIGPG 186 Query: 110 EVRDIVRDTAVEMNPRKSAYQV----VLSSRYD-LLLNPLSLFLRSMGIKS--------- 155 E + + ++ + + V++ RY+ + F G + Sbjct: 187 ETIVVQIEYQQTIHQSGGEFSLRFPMVVAPRYNPAPIVQTVEFNNGAGFATPRDPVENRD 246 Query: 156 ------------WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 I + + + S +D S+ + S Sbjct: 247 KIEAPVLDPRENARINPVSLTVDLRAGFPLGDVKSSFHAVDVSQDGDQARTISLKADTVP 306 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 + + T K+ + G+ E + P+ E FV +S Sbjct: 307 ADKDFELTWKAAPGKMPSAGLFREVIDGKTYLLAFVTPPTAPDTAAPPAKREVVFVIDNS 366 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 R +LA I + D+ + +R + Sbjct: 367 GSMSGPSIEQARQSLALAISKLNP-DDRFNVIRFD----DTMTDYFKGLVAATPDNREKA 421 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + + G T + A+Q A + + + + +V LTDG N Sbjct: 422 IA-YVRGLTADGGTEMLPALQAA---LRNQGPVATGAL-------RQVVFLTDGA--IGN 468 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKT------QQEKARYFLSNCASPNSFFEANSTHEL 437 E + A R+ T+ E R + S + A+ EL Sbjct: 469 ERQLFQEITANRSDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQV--ASRMGEL 526 Query: 438 NKIFRDRIGNEIFER 452 ++ +I Sbjct: 527 FAKLQNPAMTDITAT 541 >gi|198426244|ref|XP_002124558.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 430 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 75/232 (32%), Gaps = 29/232 (12%) Query: 207 PADRTVKSYSSQNGKVGIRDEK--LSPYMVSCNKSLY---YMLYPGPLDPSLSEEHFVDS 261 P ++ + +NG D S C+ L + PS E Sbjct: 168 PPPHICQALNFKNGSASCTDGNNLSSNCTFECDADLVLHPANVSSSQCTPSGWIEPPPCC 227 Query: 262 SSLRHVIKKKHLVR--------------DALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + L + + +A ++ +++ ++R+GA +N RV Sbjct: 228 TKLCPPFARTDAIIVVDSSSSVKRPNWDKMIAFIVNMLRQFTIDQSSLRIGAFRYNRRVH 287 Query: 308 SDPS-FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 +D + ++ G T A+ + ++ + N + Sbjct: 288 TDTQILLNQFNNDKDGLLAAIQSLPYNGGGTNTGRAIAHVTNVMLKAE------NGNRPD 341 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + +VL+TDG QD + ++ ++ G + +A + + ++ Sbjct: 342 VQDLVVLITDGRA-QDRVDLVSA--DLRATGAVVFVVAVILPGSTIRLSQML 390 >gi|46190338|ref|ZP_00121620.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440236|ref|YP_001955317.1| hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] gi|189428671|gb|ACD98819.1| Hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] Length = 380 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 49/119 (41%), Gaps = 13/119 (10%) Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + G T I + + +A D + S +E IVL+TDG + D+++ Sbjct: 273 ADATDASGGTDIYEGLLSALDELPSESEASQ--------YTTAIVLMTDGRSNSDHQDEF 324 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K++ + + I +I F Q K+ L S F+ + +L +FR G Sbjct: 325 ESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGR-SGDLAAVFRQVKG 378 >gi|194225621|ref|XP_001916184.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Equus caballus] Length = 3570 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 38/133 (28%), Gaps = 20/133 (15%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDR-----VISDPSFSWGVHKLIRTIVKTFAIDEN 332 + ++ + R+ F+ + + S + + Sbjct: 106 VRKLLSDFPVVPTA---TRVAIVTFSSKNNVVPRVDYISSRRAHQHKCALLSQEIPAITY 162 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T A Q A + S E+ K I L+TDG + + +A Sbjct: 163 RGGGTYTKGAFQQAAQILRHSRENS----------TKVIFLITDGYSNGGDPRPVAA--S 210 Query: 393 AKSQGIRIMTIAF 405 + G+ I T Sbjct: 211 LRDFGVEIFTFGI 223 >gi|48428051|sp|Q864W1|CFAB_PONPY RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690183|gb|AAM10003.1| complement factor B precursor [Pongo pygmaeus] Length = 764 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 52/191 (27%), Gaps = 29/191 (15%) Query: 274 VRDALASVIRSIKK--IDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTF 327 + L ++I + + R G + V S + + + + Sbjct: 289 AKKCLVNLIEKVASYGVKP-----RYGLVTYATYPKIWVKVSEPDSSNADWVTKQLNEIN 343 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 D T A+Q Y + ++ + + I+L+TDG + + I Sbjct: 344 YEDHKLKSGTNTKKALQAVYSMMSWPDDIPP---EGWNRTRHVIILMTDGLHNMGG-DPI 399 Query: 388 AICNKAKS-------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + ++ + + + Q S + F+ Sbjct: 400 TVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-PLVNQVNINALASKKDNEQHVFKVKDM 458 Query: 435 HELNKIFRDRI 445 L +F I Sbjct: 459 ENLEDVFFQMI 469 >gi|242034231|ref|XP_002464510.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] gi|241918364|gb|EER91508.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] Length = 647 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 22/113 (19%), Positives = 45/113 (39%), Gaps = 16/113 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI ++ D R+ F+ + + +++ + Sbjct: 191 TKLELLKQAMGFVIDNLGPRD------RLCVVSFSSGANRLMRLAR-MSDAGKSLARRAV 243 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 G T I +A++ A I ++ +HR +VLL+DG++T Sbjct: 244 QSLAAGGGTNIGEALRRAAKVI----DERMHRNAVAS-----VVLLSDGQDTY 287 >gi|288802179|ref|ZP_06407619.1| BatB protein [Prevotella melaninogenica D18] gi|288335146|gb|EFC73581.1| BatB protein [Prevotella melaninogenica D18] Length = 331 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 40/116 (34%), Gaps = 14/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + T I A+Q + ++ Sbjct: 131 KIGLIVFAGDAFVQLPITSDYVSAKMFLDNINP-SLIGTQGTDIGKALQLSMNSFT---- 185 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 N + K I+L+TDGE+ + E + +A+S+GIR+ + + Sbjct: 186 -------PNSKVGKAIILITDGEDNEGGAE--EMAKQAQSKGIRVFILGVGSTEGA 232 >gi|290957869|ref|YP_003489051.1| hypothetical protein SCAB_34031 [Streptomyces scabiei 87.22] gi|260647395|emb|CBG70500.1| putative membrane protein [Streptomyces scabiei 87.22] Length = 534 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 59/328 (17%), Positives = 95/328 (28%), Gaps = 46/328 (14%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + LI +S + L R F K D ++ Sbjct: 184 SGFSTLISVASGLSGAQSALTDADVARATPRLKRFFA-GQKLTSGSSGWLATAYDRRGDV 242 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + Y+ VL SR DL + G+ + + A T + V Sbjct: 243 DA-LLNYESVLKSRPDLTV-----IRPRDGVVTADYPLSSLASTGTD------------V 284 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG-KVGIRDEKLSPYMVSCNKSLY 241 D R + D R + Q L V S G R E P S L Sbjct: 285 RDDVRRLTDALRTPDVQRLITERTLRRPVVASVPPAAGLDTTRRRELPFPGSRSVAVGLL 344 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK-----KIDNVNDTVR 296 PS + S S+ + ++ AL + + + V+ Sbjct: 345 DAYENDLRRPSRTVYVLDTSGSMEGD--RLDRLKTALTELTGDFRDREEVTLMPFGSDVK 402 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 V+ G+ + K + G TAI +++ AY+ + + + D Sbjct: 403 ----SVRTHVVRPADPKAGLDGIRADTRK-----LSAAGETAIYTSLRRAYEHLGAVDRD 453 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNE 384 IVL+TDGENT+ Sbjct: 454 TFTS----------IVLMTDGENTEGAS 471 >gi|5726289|gb|AAD48398.1|AF127035_1 calcium-activated chloride channel protein 2 [Homo sapiens] Length = 917 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 44/116 (37%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T+I ++ A+ I E+H + E ++LLTDGE+ + Sbjct: 376 PTYPLGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VLLLTDGEDNTASS----- 421 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + F ++A +S + F+ ++ L F Sbjct: 422 CIDEVKQSGAIVH---FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFG 474 >gi|329940639|ref|ZP_08289920.1| putative secreted protein [Streptomyces griseoaurantiacus M045] gi|329300700|gb|EGG44597.1| putative secreted protein [Streptomyces griseoaurantiacus M045] Length = 421 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 43/121 (35%), Gaps = 16/121 (13%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT KT + G T I A+ A D + N K IVL++DGE+T Sbjct: 114 RTEAKTAVATLSPTGWTPIGPALLKAADDLDGGN------------GSKRIVLISDGEDT 161 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + ++GI I T+ + + A+ ++ +L Sbjct: 162 CAPLDPCEVAREIAAKGIGLTIDTLGLVPDVKLNRQLSCIAE--ATGGTYTSVEHRDQLT 219 Query: 439 K 439 Sbjct: 220 D 220 >gi|327271908|ref|XP_003220729.1| PREDICTED: collagen alpha-1(XX) chain-like [Anolis carolinensis] Length = 1480 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 53/138 (38%), Gaps = 22/138 (15%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 L+R+ LAS++ D +R+G T ++ + + +++ Sbjct: 265 LIREFLASLVAPFNVA---MDKIRVGLTQYSS--DPRTEWDLNTYATRDEVLEALRSLRY 319 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK----KYIVLLTDGENTQDNEEGIA 388 + G+T A+ H +K + A+ K I+LLTDG++ D Sbjct: 320 KGGNTFTGLALTHV----------LEHNLKADTGARSEAPKLIILLTDGKSQDDANPPAQ 369 Query: 389 ICNKAKSQGIRIMTIAFS 406 + K+ GI+I + Sbjct: 370 V---LKNMGIQIFAVGVK 384 >gi|15608619|ref|NP_215997.1| hypothetical protein Rv1481 [Mycobacterium tuberculosis H37Rv] gi|31792676|ref|NP_855169.1| hypothetical protein Mb1517 [Mycobacterium bovis AF2122/97] gi|148661274|ref|YP_001282797.1| hypothetical protein MRA_1491 [Mycobacterium tuberculosis H37Ra] gi|148822701|ref|YP_001287455.1| hypothetical protein TBFG_11510 [Mycobacterium tuberculosis F11] gi|167968028|ref|ZP_02550305.1| hypothetical membrane protein [Mycobacterium tuberculosis H37Ra] gi|215403336|ref|ZP_03415517.1| hypothetical protein Mtub0_06533 [Mycobacterium tuberculosis 02_1987] gi|215411140|ref|ZP_03419948.1| hypothetical protein Mtub9_07385 [Mycobacterium tuberculosis 94_M4241A] gi|215426820|ref|ZP_03424739.1| hypothetical protein MtubT9_10680 [Mycobacterium tuberculosis T92] gi|215430374|ref|ZP_03428293.1| hypothetical protein MtubE_06801 [Mycobacterium tuberculosis EAS054] gi|215445676|ref|ZP_03432428.1| hypothetical protein MtubT_06934 [Mycobacterium tuberculosis T85] gi|218753198|ref|ZP_03531994.1| hypothetical protein MtubG1_07054 [Mycobacterium tuberculosis GM 1503] gi|219557390|ref|ZP_03536466.1| hypothetical protein MtubT1_08827 [Mycobacterium tuberculosis T17] gi|253799469|ref|YP_003032470.1| hypothetical protein TBMG_02500 [Mycobacterium tuberculosis KZN 1435] gi|254231712|ref|ZP_04925039.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|254364352|ref|ZP_04980398.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|254550498|ref|ZP_05140945.1| hypothetical protein Mtube_08557 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|260186427|ref|ZP_05763901.1| hypothetical protein MtubCP_10429 [Mycobacterium tuberculosis CPHL_A] gi|260204765|ref|ZP_05772256.1| hypothetical protein MtubK8_10713 [Mycobacterium tuberculosis K85] gi|289447084|ref|ZP_06436828.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289554729|ref|ZP_06443939.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289569506|ref|ZP_06449733.1| membrane protein [Mycobacterium tuberculosis T17] gi|289574162|ref|ZP_06454389.1| membrane protein [Mycobacterium tuberculosis K85] gi|289745232|ref|ZP_06504610.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289750042|ref|ZP_06509420.1| membrane protein [Mycobacterium tuberculosis T92] gi|289753564|ref|ZP_06512942.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289757593|ref|ZP_06516971.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|289761639|ref|ZP_06521017.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|294993225|ref|ZP_06798916.1| hypothetical protein Mtub2_01637 [Mycobacterium tuberculosis 210] gi|297634047|ref|ZP_06951827.1| hypothetical protein MtubK4_07987 [Mycobacterium tuberculosis KZN 4207] gi|297731033|ref|ZP_06960151.1| hypothetical protein MtubKR_08072 [Mycobacterium tuberculosis KZN R506] gi|298524990|ref|ZP_07012399.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|306775670|ref|ZP_07414007.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|306779490|ref|ZP_07417827.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|306784220|ref|ZP_07422542.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|306788587|ref|ZP_07426909.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|306792930|ref|ZP_07431232.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|306797308|ref|ZP_07435610.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|306803189|ref|ZP_07439857.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|306967588|ref|ZP_07480249.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|306971779|ref|ZP_07484440.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|307079498|ref|ZP_07488668.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|307084057|ref|ZP_07493170.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|313658366|ref|ZP_07815246.1| hypothetical protein MtubKV_08092 [Mycobacterium tuberculosis KZN V2475] gi|54040185|sp|P64856|Y1517_MYCBO RecName: Full=UPF0353 protein Mb1517 gi|54042534|sp|P64855|Y1481_MYCTU RecName: Full=UPF0353 protein Rv1481/MT1528 gi|166979870|sp|A5U2I5|Y1491_MYCTA RecName: Full=UPF0353 protein MRA_1491 gi|3261503|emb|CAA16011.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium tuberculosis H37Rv] gi|31618266|emb|CAD96184.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97] gi|124600771|gb|EAY59781.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|134149866|gb|EBA41911.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|148505426|gb|ABQ73235.1| putative membrane protein [Mycobacterium tuberculosis H37Ra] gi|148721228|gb|ABR05853.1| hypothetical membrane protein [Mycobacterium tuberculosis F11] gi|253320972|gb|ACT25575.1| membrane protein [Mycobacterium tuberculosis KZN 1435] gi|289420042|gb|EFD17243.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289439361|gb|EFD21854.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289538593|gb|EFD43171.1| membrane protein [Mycobacterium tuberculosis K85] gi|289543260|gb|EFD46908.1| membrane protein [Mycobacterium tuberculosis T17] gi|289685760|gb|EFD53248.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289690629|gb|EFD58058.1| membrane protein [Mycobacterium tuberculosis T92] gi|289694151|gb|EFD61580.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289709145|gb|EFD73161.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|289713157|gb|EFD77169.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|298494784|gb|EFI30078.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|308215767|gb|EFO75166.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|308327531|gb|EFP16382.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|308330994|gb|EFP19845.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|308334816|gb|EFP23667.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|308338604|gb|EFP27455.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|308342306|gb|EFP31157.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|308350100|gb|EFP38951.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|308354737|gb|EFP43588.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|308358644|gb|EFP47495.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|308362622|gb|EFP51473.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|308366304|gb|EFP55155.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|323719929|gb|EGB29041.1| membrane protein [Mycobacterium tuberculosis CDC1551A] gi|326903107|gb|EGE50040.1| membrane protein [Mycobacterium tuberculosis W-148] gi|328459217|gb|AEB04640.1| membrane protein [Mycobacterium tuberculosis KZN 4207] Length = 335 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 59/183 (32%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA +A+ TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR-----TATGEAIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E + A S + + A + EL ++ + +IG E Sbjct: 242 ISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|119593590|gb|EAW73184.1| chloride channel, calcium activated, family member 4, isoform CRA_a [Homo sapiens] Length = 917 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 44/116 (37%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T+I ++ A+ I E+H + E ++LLTDGE+ + Sbjct: 376 PTYPLGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VLLLTDGEDNTASS----- 421 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + F ++A +S + F+ ++ L F Sbjct: 422 CIDEVKQSGAIVH---FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFG 474 >gi|119593591|gb|EAW73185.1| chloride channel, calcium activated, family member 4, isoform CRA_b [Homo sapiens] Length = 918 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 44/116 (37%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T+I ++ A+ I E+H + E ++LLTDGE+ + Sbjct: 377 PTYPLGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VLLLTDGEDNTASS----- 422 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + F ++A +S + F+ ++ L F Sbjct: 423 CIDEVKQSGAIVH---FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFG 475 >gi|109731121|gb|AAI13690.1| Chloride channel accessory 4 [Homo sapiens] gi|109731369|gb|AAI13688.1| Chloride channel accessory 4 [Homo sapiens] gi|313883598|gb|ADR83285.1| chloride channel accessory 4 [synthetic construct] Length = 917 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 44/116 (37%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T+I ++ A+ I E+H + E ++LLTDGE+ + Sbjct: 376 PTYPLGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VLLLTDGEDNTASS----- 421 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + F ++A +S + F+ ++ L F Sbjct: 422 CIDEVKQSGAIVH---FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFG 474 >gi|150036262|ref|NP_036260.2| calcium-activated chloride channel regulator 4 [Homo sapiens] gi|205831469|sp|Q14CN2|CLCA4_HUMAN RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel family member 4; Short=hCLCA4; AltName: Full=Calcium-activated chloride channel protein 2; Short=CaCC-2; Short=hCaCC-2; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|37182063|gb|AAQ88834.1| CLCA4 [Homo sapiens] gi|56203696|emb|CAI22170.1| chloride channel, calcium activated, family member 4 [Homo sapiens] Length = 919 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 44/116 (37%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T+I ++ A+ I E+H + E ++LLTDGE+ + Sbjct: 376 PTYPLGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VLLLTDGEDNTASS----- 421 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + F ++A +S + F+ ++ L F Sbjct: 422 CIDEVKQSGAIVH---FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFG 474 >gi|168817956|ref|ZP_02829956.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205345018|gb|EDZ31782.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320086747|emb|CBY96519.1| Inter-alpha-trypsin inhibitor heavy chain H3 Inter-alpha-inhibitor heavy chain 3; ITI heavy chain H3; ITI-HC3; Flags: Precursor [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 604 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 67/196 (34%), Gaps = 23/196 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S ++ L++ AL ++ ++ DN+ G V Sbjct: 241 PPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALAS 296 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + I + GST ++ AY+ E +K + Sbjct: 297 TAGNNTTAIKAAIDN-----LDAYGSTGGEAGLRLAYEQ------AEKGFIKGGVNR--- 342 Query: 371 IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + A+ K + +GI + T+ + + + Sbjct: 343 ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNGN 400 Query: 428 FFEANSTHELNKIFRD 443 + +S E K+ +D Sbjct: 401 YSYIDSLSEAQKVLKD 416 >gi|156382097|ref|XP_001632391.1| predicted protein [Nematostella vectensis] gi|156219446|gb|EDO40328.1| predicted protein [Nematostella vectensis] Length = 286 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 79/255 (30%), Gaps = 36/255 (14%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY--MVSCNK 238 +D + + + + KS+ ++N + G + Y C Sbjct: 1 MSLDK---LSKTCPSKRYSQVPKYLKIQYPVKKSFLAKNIRQGGACPPVFQYKSYQQCTG 57 Query: 239 SLYYMLYPGPL---DPSLSEEHFVDSSSLRHVIKK-----KHLVRDALASVIRS------ 284 Y D + R IKK + DA AS+ R Sbjct: 58 VSYLCNSDDECSSPDERCCPQENDCPLKCRKTIKKSCPIDIAFLLDASASMGRRTWGKIK 117 Query: 285 ------IKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + D + +G F+ + GV I + + G Sbjct: 118 NYVKSIVDMGDISDQGTHVGIITFSTDPVIDIPFDKYKGVKMNAVNIKRDIDELRRKKGY 177 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN---EEGIAICNKA 393 T I+ A+ A ++ + +++K VL++DG T+D I N+ Sbjct: 178 TFIDKALTLADKSLFT------QEAGMREDSQKVAVLMSDGIQTKDRGPFTPTIIAANRL 231 Query: 394 KSQGIRIMTIAFSVN 408 K +G+++ T+ + Sbjct: 232 KMKGVQVYTVGIGAS 246 >gi|16124454|ref|NP_419018.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|221233138|ref|YP_002515574.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] gi|13421322|gb|AAK22186.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|220962310|gb|ACL93666.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] Length = 626 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 49/329 (14%), Positives = 89/329 (27%), Gaps = 41/329 (12%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 ++++L + G I ALL + + L+DV R S ++ A A + A+ Sbjct: 18 FARRLRRDDRGAIAIQFALLALPLSILLFGLLDVGRLSLQRRQMQDALDAATLMAARSTA 77 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 S ++ D + + + S + Sbjct: 78 TSSADL----------------------------DTTGDAAFLAEIAGMNLGLTASSSTF 109 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++R + + +S A +E V S + E V+D + SM Sbjct: 110 SAGTNNRVIGTATATLRPIIANLWQSGNFTVTASSEVVRASKNLEIA----LVLDITGSM 165 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + R ++ + L PY N Y Sbjct: 166 GNGTRIADLKVAAADLVDVL------VRDTQTPFYSKMALVPYSAGVNVGATYADAVRGP 219 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS-IKKIDNVNDTVRMGATFFNDRVIS 308 P + +S I A+ + + D V T G T ND++ Sbjct: 220 VPVKTITGAAWASGSARSITGITRANPAVVTASGHGLSTGDYVYITGVRGMTSVNDKIYR 279 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGST 337 K+ T + G T Sbjct: 280 VT--RSDPDKVSLNSTNTSSASNYTNGGT 306 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 8/83 (9%) Query: 376 DGENTQDNEEGIAICNKAKS--QGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEA 431 D N ++ +C+ K+ I + T+ F+V Q + FL+NCAS ++F A Sbjct: 546 DATNGSSFDQAAELCDSIKASANDITLYTVGFTVGNDQTAR--NFLTNCASSTDKAYFPA 603 Query: 432 NSTHELNKIFRDRIGNEIFERVI 454 + EL F+ I EI I Sbjct: 604 TGS-ELKASFQ-AIAQEISNLRI 624 >gi|301609300|ref|XP_002934201.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 919 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 41/253 (16%), Positives = 73/253 (28%), Gaps = 38/253 (15%) Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 +R S S+ + + D K +P N L + +D S Sbjct: 256 NRICNSRSTWDVIMNSTDIKATPPQADSN--LPVPTFTLLQSSDRVVTLVLDVSGSMASD 313 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + I V + +G F+ + R +K F Sbjct: 314 GRIGRLYQAAEVFVMQI-----VEEGSHVGIVSFSTSTTVLSKLVQVIDDTQRNHLK-FL 367 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + +G T I ++ + H + IVLLTDGE+ + + Sbjct: 368 LPKTAVGGTNICAGIR------EGIKVNNQH---DGSSYGTEIVLLTDGEDNYNT----S 414 Query: 389 IC-NKAKSQGIRIMTIAFSVNKT-------QQEKARYFLSNCASPNSFFEANSTHELNKI 440 +C + GI + IA N FL+ + L Sbjct: 415 LCFPDISNSGIIVHFIALGPNPNPNLETIVDMTGGLRFLAT--------DKVDAQGLIDA 466 Query: 441 FRD-RIGNEIFER 452 F G+ + Sbjct: 467 FSSLTAGDGATTQ 479 >gi|296206127|ref|XP_002750076.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Callithrix jacchus] Length = 946 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 72/214 (33%), Gaps = 24/214 (11%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P LDP FV S K +A+ +++ ++ D+ + Sbjct: 292 NGYFVHFFAPDNLDPIPKNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFS---- 347 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 FN + + + K K + G T IN+A+ A + +N Sbjct: 348 --VVDFNHNIRTWRNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNM 405 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQE 413 + + I+L++DG+ T + I K I + ++ + Sbjct: 406 GLLDPNSVSL----IILVSDGDPTVGELKLSKIQKNVKENIRDNISLFSLG-----MGFD 456 Query: 414 KARYFLSNCASPNS------FFEANSTHELNKIF 441 FL ++ N + +++ +L K + Sbjct: 457 VDYDFLKRLSNENRGIAQRIYGNQDTSSQLRKFY 490 >gi|73971950|ref|XP_532030.2| PREDICTED: similar to sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Canis familiaris] Length = 3569 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 38/133 (28%), Gaps = 20/133 (15%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDR-----VISDPSFSWGVHKLIRTIVKTFAIDEN 332 + ++ + R+ F+ + + S + + Sbjct: 106 VRKLLSDFPVVPTA---TRVAIVTFSSKNNVVPRVDYISHRRAHQHKCALLRREIPAIAY 162 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T A Q A + S E+ K I L+TDG + + +A Sbjct: 163 RGGGTYTKGAFQQAAQILRHSRENS----------TKVIFLITDGYSNGGDPRPVAA--S 210 Query: 393 AKSQGIRIMTIAF 405 + G+ I T Sbjct: 211 LRDFGVEIFTFGI 223 >gi|94312593|ref|YP_585802.1| hypothetical protein Rmet_3661 [Cupriavidus metallidurans CH34] gi|93356445|gb|ABF10533.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] gi|222832771|gb|EEE71248.1| predicted protein [Populus trichocarpa] Length = 575 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 12/81 (14%), Positives = 32/81 (39%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G ++ ALL+ + + +DV Y + L++ A ++ + L + ++ ++ A Sbjct: 15 GTVSLMAALLIAAIGVAALVSLDVGFVFYTQRQLQKLVDVAALSGAQQLKSADDQATTNA 74 Query: 80 KNSFTFPKQKIEEYLIRNFEN 100 + + + N Sbjct: 75 NVLSSVTSAAAQNGYTKAVAN 95 >gi|198417365|gb|ACH87900.1| ancillary protein 1 [Streptococcus pyogenes] Length = 1042 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 43/313 (13%), Positives = 85/313 (27%), Gaps = 76/313 (24%) Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 K+ + I V+D S SM D E + G + + Sbjct: 436 KKQPLDILVVVDRSGSMQDGIGSIEKYKYW---KYKYDEYYHIWRNAGTIYFDN------ 486 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL---ASVIRSIKKID 289 L P + + + S+ IK+ V+DAL +++ I+ Sbjct: 487 ----------YLGPRYQPDTYTYYDYQSKESVPFGIKRDQAVKDALIGSTGLLQKFLDIN 536 Query: 290 NVNDTVRMGA---TFFNDRVISDPSFSWGV--------HKLIRTIVKTFAIDENEM---- 334 N +G + W ++K + N Sbjct: 537 PQNQLAVVGFQGSVAYRYYDEKPERTPWNTIMYQPSKSTSKDADVLKDWETSSNLSRDSL 596 Query: 335 -----GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--------ENTQ 381 T + A+ A + + + HR K +V ++DG +N + Sbjct: 597 SYEDRNGTNYHAALLKADEKLQKVA-NNGHR--------KIMVFISDGVPTFYFGADNYR 647 Query: 382 DNEEGIAICNKAKSQG---------------IRIMTIAFSVNKTQQEKARY--FLSNCAS 424 ++ N SQ + I ++ S + + L + Sbjct: 648 SGNGTVSDSNIINSQKGSKLAIDEFKNKYPNLSIYSLGVSKDINSDTSSSSPVVLKYLSG 707 Query: 425 PNSFFEANSTHEL 437 + + T +L Sbjct: 708 DDYYSGITDTEQL 720 >gi|159900441|ref|YP_001546688.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893480|gb|ABX06560.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 978 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 49/141 (34%), Gaps = 19/141 (13%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ V + G + + + A G ++D++ A + + Sbjct: 455 FDSAVQNQYGPVAGSEREVAQG-EIIARGVTGGGGINVHDSLVAAGNVL----------- 502 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 K ++I+LL DG ++Q E + + ++ + GI TIA FL+N Sbjct: 503 KGRNAPIRHIILLADGSDSQQQENAVRLTDEHRRLGITTSTIAI-----GNGGDVGFLNN 557 Query: 422 CASP--NSFFEANSTHELNKI 440 A F L I Sbjct: 558 VAVAGGGRHFLVEDALSLPDI 578 >gi|239993926|ref|ZP_04714450.1| inter-alpha-trypsin inhibitor domain-containing protein [Alteromonas macleodii ATCC 27126] Length = 586 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 62/220 (28%), Gaps = 15/220 (6%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 +S G L F Q + + + + + + S + Y ++ P Sbjct: 247 SESTGNYLVTFNQSNIKMDRDIWLEWQPSPSSAPQAAIFTESKGQHDYALVMLMPPQVKS 306 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI-DNVNDTVRMGATFFNDRVISDPSF 312 + D + + + S++ D +++ R FN+ Sbjct: 307 QDLQDFDRDITFVIDTSGSMGGRPIVDAKESLQLAIDRLSEKDRFNVVAFNNDTTRLFET 366 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + F N G T + A+ A + + K +V Sbjct: 367 SVEGTTRNKQYARDFVKHLNAGGGTEMAPALNAALKRTTTK------------DFIKQVV 414 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 +TDG + I N+ R+ T+ Sbjct: 415 FITDGAVGNEAALFSQIKNELGDA--RLFTVGIGSAPNSY 452 >gi|309362046|emb|CAP28695.2| hypothetical protein CBG_09096 [Caenorhabditis briggsae AF16] Length = 516 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 80/245 (32%), Gaps = 14/245 (5%) Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG--PLDPSLSEEHFVDS 261 + A + S Q +KL P + + P + V Sbjct: 196 YVAMAALANYTQSEQKIVTNQNLQKLEPQFEKYHGTEICEQQPTCVKGSDKPLDLALVVD 255 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 +S + + V+ +I + VR+ ++ + F+ ++ Sbjct: 256 ASESLDHLFSDQKKFLVDRVLGNINIHP---EAVRVALITYSGQAFVHFKFNSFLYGNNT 312 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ + G+TA N A+ A+D + S + R K +V LTDG Sbjct: 313 SVQGFVKNIRSIKGTTATNVALMDAFDLLTSKDPSTGIR---EGVPKMALV-LTDG---H 365 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + K ++ GI ++ ++ + E ++ S F + HE F Sbjct: 366 SHRSPKDMAEKMRAAGIIMIAVSVTPRPLVDEAELRLIAG--SEKRAFTPPNLHEFESEF 423 Query: 442 RDRIG 446 +G Sbjct: 424 MKYVG 428 >gi|309362033|emb|CAP28676.2| hypothetical protein CBG_09117 [Caenorhabditis briggsae AF16] gi|309362045|emb|CAP28694.2| hypothetical protein CBG_09097 [Caenorhabditis briggsae AF16] Length = 787 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 80/245 (32%), Gaps = 14/245 (5%) Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG--PLDPSLSEEHFVDS 261 + A + S Q +KL P + + P + V Sbjct: 196 YVAMAALANYTQSEQKIVTNQNLQKLEPQFEKYHGTEICEQQPTCVKGSDKPLDLALVVD 255 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 +S + + V+ +I + VR+ ++ + F+ ++ Sbjct: 256 ASESLDHLFSDQKKFLVDRVLGNINIHP---EAVRVALITYSGQAFVHFKFNSFLYGNNT 312 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ + G+TA N A+ A+D + S + R K +V LTDG Sbjct: 313 SVQGFVKNIRSIKGTTATNVALMDAFDLLTSKDPSTGIR---EGVPKMALV-LTDG---H 365 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + K ++ GI ++ ++ + E ++ S F + HE F Sbjct: 366 SHRSPKDMAEKMRAAGIIMIAVSVTPRPLVDEAELRLIAG--SEKRAFTPPNLHEFESEF 423 Query: 442 RDRIG 446 +G Sbjct: 424 MKYVG 428 >gi|121637412|ref|YP_977635.1| hypothetical protein BCG_1543 [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224989887|ref|YP_002644574.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] gi|166979775|sp|A1KIS1|Y1543_MYCBP RecName: Full=UPF0353 protein BCG_1543 gi|254800546|sp|C1ANC7|Y1518_MYCBT RecName: Full=UPF0353 protein JTY_1518 gi|121493059|emb|CAL71530.1| Probable membrane protein [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224773000|dbj|BAH25806.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] Length = 335 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 58/183 (31%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA +A+ TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR-----TATGEAIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPIST 241 Query: 403 IAFSVNKT---------QQEKARYFLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F + A S + + A + EL ++ + +IG E Sbjct: 242 ISFGTPYGFVEIDDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|268572089|ref|XP_002641231.1| Hypothetical protein CBG09097 [Caenorhabditis briggsae] gi|268572157|ref|XP_002641249.1| Hypothetical protein CBG09117 [Caenorhabditis briggsae] Length = 772 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 80/245 (32%), Gaps = 14/245 (5%) Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG--PLDPSLSEEHFVDS 261 + A + S Q +KL P + + P + V Sbjct: 196 YVAMAALANYTQSEQKIVTNQNLQKLEPQFEKYHGTEICEQQPTCVKGSDKPLDLALVVD 255 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 +S + + V+ +I + VR+ ++ + F+ ++ Sbjct: 256 ASESLDHLFSDQKKFLVDRVLGNINIHP---EAVRVALITYSGQAFVHFKFNSFLYGNNT 312 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ + G+TA N A+ A+D + S + R K +V LTDG Sbjct: 313 SVQGFVKNIRSIKGTTATNVALMDAFDLLTSKDPSTGIR---EGVPKMALV-LTDG---H 365 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + K ++ GI ++ ++ + E ++ S F + HE F Sbjct: 366 SHRSPKDMAEKMRAAGIIMIAVSVTPRPLVDEAELRLIAG--SEKRAFTPPNLHEFESEF 423 Query: 442 RDRIG 446 +G Sbjct: 424 MKYVG 428 >gi|268572085|ref|XP_002641230.1| Hypothetical protein CBG09096 [Caenorhabditis briggsae] Length = 564 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 80/245 (32%), Gaps = 14/245 (5%) Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG--PLDPSLSEEHFVDS 261 + A + S Q +KL P + + P + V Sbjct: 196 YVAMAALANYTQSEQKIVTNQNLQKLEPQFEKYHGTEICEQQPTCVKGSDKPLDLALVVD 255 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 +S + + V+ +I + VR+ ++ + F+ ++ Sbjct: 256 ASESLDHLFSDQKKFLVDRVLGNINIHP---EAVRVALITYSGQAFVHFKFNSFLYGNNT 312 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ + G+TA N A+ A+D + S + R K +V LTDG Sbjct: 313 SVQGFVKNIRSIKGTTATNVALMDAFDLLTSKDPSTGIR---EGVPKMALV-LTDG---H 365 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + K ++ GI ++ ++ + E ++ S F + HE F Sbjct: 366 SHRSPKDMAEKMRAAGIIMIAVSVTPRPLVDEAELRLIAG--SEKRAFTPPNLHEFESEF 423 Query: 442 RDRIG 446 +G Sbjct: 424 MKYVG 428 >gi|313203639|ref|YP_004042296.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442955|gb|ADQ79311.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 346 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 43/127 (33%), Gaps = 15/127 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 K +D++ND ++G F + + + + TAI A+ Sbjct: 121 KLVDDMNDD-KVGLVVFAGDAYTQLPITVDYVSAKMFLSNISP-ELVPRQGTAIGSALDL 178 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A + + +E K I+L+TDGEN +D+ I A I + I Sbjct: 179 AIKSFGAKSEA-----------GKAIILITDGENHEDDA--IGAAKLAAENNIIVNVIGM 225 Query: 406 SVNKTQQ 412 Sbjct: 226 GKTDGAP 232 >gi|317419330|emb|CBN81367.1| Integrin alpha-M [Dicentrarchus labrax] Length = 1058 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 67/203 (33%), Gaps = 33/203 (16%) Query: 246 PGPLDPSLSE--EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 P LD SE F+ S + ++D + +++ S + D T F Sbjct: 51 PSSLDECRSEADIAFLLDGSGSVASQDFTKMKDFVKNLVNSFQGKD----------TKFA 100 Query: 304 DRVISDPSF------SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 S+ ++ ++ I + G T A++ + + + Sbjct: 101 IAQFSNAPLVHYYFDTFDINNWRTQIDRIR----QLTGGTYTAAAIEHVVNNVFDPSRGS 156 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 L KK ++++TDGE + D + + A+ + I I + + Sbjct: 157 R------LNVKKVLIVITDGE-SHDRRDLPYAASLAQGKNIVRFAIGVGGAFSNVAAKQE 209 Query: 418 FLSNCASP---NSFFEANSTHEL 437 L AS + F ++ L Sbjct: 210 -LDTIASDPPASHVFRVDNFGAL 231 >gi|303246180|ref|ZP_07332461.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302492576|gb|EFL52447.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 329 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 44/157 (28%), Gaps = 32/157 (20%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R + L + + + TA+ DA+ A + + Sbjct: 129 RIGLVAFGSRAYVVLPPTDDRAALTQALSRLSVGA--AGRRTAMGDAVGLAVKQLDRAPG 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------K 409 + +V+ DG + + A ++GI + T+ + Sbjct: 187 LA-----------RLVVVFGDGLSNAGEVRPVEAAKAAAARGIAVFTVGVGGDGPAPFLV 235 Query: 410 TQQEKARYFLSNCA-------------SPNSFFEANS 433 + + A S +FF A Sbjct: 236 NHPLLGQEIVRENAAVDTAALTELAALSGGAFFRAED 272 >gi|159027742|emb|CAO89612.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 416 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 30/139 (21%), Positives = 50/139 (35%), Gaps = 19/139 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V+ A S+I S+ D R+ F+ R + T++++ Sbjct: 57 KPLETVKKAALSLIESLGVND------RLSVIAFDHRAKVILPSQ--SRQDDLTLIRSKI 108 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGI 387 G TAI++ ++ + +I LLTDGEN DN+ + Sbjct: 109 QQLRAGGGTAIDEGIKLGIQE----------SSSGSKGYVSHIFLLTDGENEHGDNQRCL 158 Query: 388 AICNKAKSQGIRIMTIAFS 406 + A GI + T F Sbjct: 159 KLAAVAAEYGITLNTFGFG 177 >gi|220897449|emb|CAX15333.1| complement component 2 (within H-2S) [Mus musculus] Length = 623 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 60/198 (30%), Gaps = 15/198 (7%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF---- 312 + + +S K + + + ++ I + V + F + + S Sbjct: 126 YLLLDASQSVTEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSER 182 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T + + Y + S + E + I+ Sbjct: 183 SQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTII 242 Query: 373 LLTDGENTQDNEEGIAIC--------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ + A+ + + + I I ++ S Sbjct: 243 LLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDG 302 Query: 425 PNSFFEANSTHELNKIFR 442 F L +IF Sbjct: 303 ERHAFILQDAKALQQIFE 320 >gi|148694788|gb|EDL26735.1| complement component 2 (within H-2S), isoform CRA_b [Mus musculus] Length = 753 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 60/198 (30%), Gaps = 15/198 (7%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF---- 312 + + +S K + + + ++ I + V + F + + S Sbjct: 263 YLLLDASQSVTEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSER 319 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T + + Y + S + E + I+ Sbjct: 320 SQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTII 379 Query: 373 LLTDGENTQDNEEGIAIC--------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ + A+ + + + I I ++ S Sbjct: 380 LLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDG 439 Query: 425 PNSFFEANSTHELNKIFR 442 F L +IF Sbjct: 440 ERHAFILQDAKALQQIFE 457 >gi|148694789|gb|EDL26736.1| complement component 2 (within H-2S), isoform CRA_c [Mus musculus] Length = 809 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 60/198 (30%), Gaps = 15/198 (7%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF---- 312 + + +S K + + + ++ I + V + F + + S Sbjct: 263 YLLLDASQSVTEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSER 319 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T + + Y + S + E + I+ Sbjct: 320 SQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTII 379 Query: 373 LLTDGENTQDNEEGIAIC--------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ + A+ + + + I I ++ S Sbjct: 380 LLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDG 439 Query: 425 PNSFFEANSTHELNKIFR 442 F L +IF Sbjct: 440 ERHAFILQDAKALQQIFE 457 >gi|148694790|gb|EDL26737.1| complement component 2 (within H-2S), isoform CRA_d [Mus musculus] Length = 755 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 60/198 (30%), Gaps = 15/198 (7%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF---- 312 + + +S K + + + ++ I + V + F + + S Sbjct: 263 YLLLDASQSVTEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSER 319 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T + + Y + S + E + I+ Sbjct: 320 SQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTII 379 Query: 373 LLTDGENTQDNEEGIAIC--------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ + A+ + + + I I ++ S Sbjct: 380 LLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDG 439 Query: 425 PNSFFEANSTHELNKIFR 442 F L +IF Sbjct: 440 ERHAFILQDAKALQQIFE 457 >gi|83745131|ref|NP_001032954.1| collagen alpha-1(XXVIII) chain precursor [Mus musculus] gi|123789585|sp|Q2UY11|COSA1_MOUSE RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor gi|83423286|emb|CAI67593.1| collagen, type XXVIII [Mus musculus] gi|189442117|gb|AAI67245.1| Collagen, type XXVIII, alpha 1 [synthetic construct] Length = 1141 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 58/154 (37%), Gaps = 16/154 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + S+ I ++ +++ A F+ V DP S Sbjct: 50 VFILDSSESSKIVLFDNQKDFVDSLSEKIFQLTPGRSLKYDIKLAALQFSSSVQIDPPLS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + + + + K + Sbjct: 110 SWKDLRTFKQRVKSLNLIGQ---GTFSYYAISNATRLL---------KREGRKDGVKVAL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 L+TDG + + + +I A+ GI +T+ S Sbjct: 158 LMTDGIDHPKSPDVQSISEDARILGISFITVGLS 191 >gi|157951694|ref|NP_038512.2| complement C2 precursor [Mus musculus] gi|3986766|gb|AAC84162.1| C2 [Mus musculus] gi|15029737|gb|AAH11086.1| Complement component 2 (within H-2S) [Mus musculus] gi|74147034|dbj|BAE27452.1| unnamed protein product [Mus musculus] gi|148694787|gb|EDL26734.1| complement component 2 (within H-2S), isoform CRA_a [Mus musculus] gi|220897448|emb|CAX15332.1| complement component 2 (within H-2S) [Mus musculus] Length = 760 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 60/198 (30%), Gaps = 15/198 (7%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF---- 312 + + +S K + + + ++ I + V + F + + S Sbjct: 263 YLLLDASQSVTEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSER 319 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T + + Y + S + E + I+ Sbjct: 320 SQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTII 379 Query: 373 LLTDGENTQDNEEGIAIC--------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 LLTDG++ + A+ + + + I I ++ S Sbjct: 380 LLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDG 439 Query: 425 PNSFFEANSTHELNKIFR 442 F L +IF Sbjct: 440 ERHAFILQDAKALQQIFE 457 >gi|290769918|gb|ADD61688.1| putative protein [uncultured organism] Length = 570 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 48/304 (15%), Positives = 94/304 (30%), Gaps = 31/304 (10%) Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 M +++ + T A + + + ID SM+ + + D Sbjct: 106 MNVQTSPLSTFAADVDTASYTQIRSAIENGYDIDP--SMVRIEEMLNYFHYDYPLPKDDE 163 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 Y+ D KL+ S+ P+ + +D S K Sbjct: 164 KFAVYTEYMDCPWNEDTKLALV------SMNTQAIDFKSAPASNLVFLIDVSGSMFDDNK 217 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ AL + ++ + D V+ G +D V+ GV + + Sbjct: 218 LPLVQQALTMLAENLTEKDRVSIVTYAG----SDEVVLQ-----GVSGDDYHEISSAIEG 268 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGI 387 GST + ++TAY + + +K ++L TDG+ + Sbjct: 269 LEAYGSTNGSAGIETAY------ALAKKYFIKGGNNR---VILCTDGDLNVGLTSEGQLE 319 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + K G+ + T + K ++ +S E K D +G Sbjct: 320 KLITEKKDSGVFLSTFGVGYGNYKDNKLELLADK--GNGNYAYIDSMFEAKKALVDELGA 377 Query: 448 EIFE 451 + Sbjct: 378 NMVT 381 >gi|253701051|ref|YP_003022240.1| von Willebrand factor A [Geobacter sp. M21] gi|251775901|gb|ACT18482.1| von Willebrand factor type A [Geobacter sp. M21] Length = 331 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 52/167 (31%), Gaps = 33/167 (19%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F R + L + + + TA+ DA+ + + + Sbjct: 132 RIGLVAFAGRPYPAAPLTSDHQWLQGVVDRLDTGAVED--GTALGDAILSGVNRLRRRPA 189 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VNK 409 + + ++L+TDG N E AK+ GIR+ I + Sbjct: 190 ES-----------RALILITDGRNNAG-AEPQLAAQAAKALGIRVHAIGIGSRGSAVIPV 237 Query: 410 TQQEKAR-----------YFLSNCA--SPNSFFEANSTHELNKIFRD 443 L A + +FEA L+++F + Sbjct: 238 PSPLGGTIYRRLDAELDAATLKGVAELTGGRYFEAGDATVLSRVFAE 284 >gi|156744078|ref|YP_001434207.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235406|gb|ABU60189.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 429 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 69/184 (37%), Gaps = 27/184 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI---SDPSFSWGVHKLIRTIVKT 326 K + R AL ++I + + NV +R + R SD + L R + Sbjct: 191 KTVIARQALIALINRLPETTNVA--LRT----YGHRRADDCSDTELIQALAPLQRDALIA 244 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + A ++ +D +E IVL++DG+ T + Sbjct: 245 RINAIRPVNGGRTPIA-----QSLADMAQDLA-----GIEGNVLIVLVSDGDETCGG-DP 293 Query: 387 IAICN--KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN--SFFEANSTHELNKIFR 442 +A + +A + +RI I F V +QE+ R L A ++F+A++ +L Sbjct: 294 VATASMLRAANSQLRISVIGFDV---EQEEWRRRLEGIAVAGGGAYFDASNAEQLADALD 350 Query: 443 DRIG 446 I Sbjct: 351 QAIA 354 >gi|326326039|ref|YP_004250848.1| hypothetical protein VIBNI_0107 [Vibrio nigripulchritudo] gi|323669090|emb|CBJ93137.1| Protein of unknown function (exported) [Vibrio nigripulchritudo] Length = 1081 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 51/164 (31%), Gaps = 26/164 (15%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ S +D + + R F+ S + + ++K+ + G T I Sbjct: 334 IVGSKHLVDRLKEVDRGAVIDFDSTAQLLQSLTDN-----KAVIKSALDLIDASGGTDIG 388 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 D + A + ++ +VLLTDG + ++ + K Sbjct: 389 DGVSKALEEFANARSASDWA----------VVLLTDGSGSYNHALTTELVQK-------- 430 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFR 442 I + + + + N+ EL ++F Sbjct: 431 -NIRVLGITMGSGANQSLIRGISDSTYGIYQHVNTADELIEVFE 473 >gi|320352629|ref|YP_004193968.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121131|gb|ADW16677.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 577 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 71/238 (29%), Gaps = 22/238 (9%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + +N F R + G Y+++ L Sbjct: 136 GHLPPVGAVRIEEMINYFTYAYPRPIGKAPFALGAEVGPSPFHRDYLLARIGLAAKDLAK 195 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 L PS + +D S K L++ AL V+R + D V V GA + V Sbjct: 196 EHLPPS-NLVFLIDVSGSMQDGNKLPLLKQALPLVVRQLGARDRVALVVYAGA---DSVV 251 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + GST + ++TAY+ S +K Sbjct: 252 LPPTP------GDRQQEILAALDQLQAGGSTHASSGIRTAYELARKS------FIKGGNN 299 Query: 367 AKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 ++L +DG+ +E + + + GI + + + + Sbjct: 300 R---VILASDGDFNVGVTSRDELTRLIEEERKDGIYLTVLGLGMGNYHDDTMEVLADK 354 >gi|120436991|ref|YP_862677.1| von Willebrand factor type A domain-containing protein [Gramella forsetii KT0803] gi|117579141|emb|CAL67610.1| secreted protein containing von Willebrand factor type A domain [Gramella forsetii KT0803] Length = 592 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 71/199 (35%), Gaps = 23/199 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D S K L++ A + ++++ D ++ V G++ V+ Sbjct: 226 PASNLVFLLDVSGSMGQQNKLPLLKSAFKLLTNNLREQDKISIVVYAGSSG---VVLEP- 281 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 +T ++ + GSTA + ++ AY +D + NN Sbjct: 282 -----TSGDQKTKIEEALDKLSAGGSTAGGEGIELAYKI----AKDNFIKNGNNR----- 327 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ A+ + K + GI + + F + + K + Sbjct: 328 VILATDGDFNVGLSSDKAMEDLIKEKRESGIFLTALGFGMGNYKDSKLESLAQ--TGNGN 385 Query: 428 FFEANSTHELNKIFRDRIG 446 +S E ++ G Sbjct: 386 HAYIDSMQEAQRVLETEFG 404 >gi|115486675|ref|NP_001068481.1| Os11g0687100 [Oryza sativa Japonica Group] gi|77552567|gb|ABA95364.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113645703|dbj|BAF28844.1| Os11g0687100 [Oryza sativa Japonica Group] Length = 633 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 34/137 (24%), Positives = 55/137 (40%), Gaps = 16/137 (11%) Query: 249 LDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR- 305 LD S S V ++S + + + +++ ++ VIR + D R+ FND Sbjct: 76 LDVSGSMNDPVAAASPKSNLQGSRLDVLKASMKFVIRKLADGD------RLSIVAFNDGP 129 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V S V R+I G TA+ A++ A + DE N Sbjct: 130 VKEYSSGLLDVSGDGRSIAGKKIDRLQARGGTALMPALEEAVKIL-----DERQGSSRNH 184 Query: 366 EAKKYIVLLTDGENTQD 382 +I+LLTDG++T Sbjct: 185 VG--FILLLTDGDDTTG 199 >gi|325267447|ref|ZP_08134103.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] gi|324981088|gb|EGC16744.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] Length = 238 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 63/176 (35%), Gaps = 20/176 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A+ +++ + + + + + + F +V F+ + Sbjct: 31 KIDNLNKAVENMLDTFAQEEKMETEILVSVITFGGKVDLHVPFT--------KASQVQWH 82 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + A++ A I + + IVL++DG+ T N A+ Sbjct: 83 GLQVNGDTPMGTALKMAKAMIEDKE------TTPSRAYRPTIVLVSDGQPTDGNIWKQAM 136 Query: 390 CN---KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + +S M +A + + + F+ A F A + +L++ F+ Sbjct: 137 ADFISEGRSSKCDRMAMAIGHDA-DETVLKRFIEGTAHD--LFYAENAGQLHEFFQ 189 >gi|291436333|ref|ZP_06575723.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] gi|291339228|gb|EFE66184.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] Length = 424 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 44/136 (32%), Gaps = 16/136 (11%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT K G T I ++ A D + N K IVL++DGE+T Sbjct: 115 RTEAKAAVATLTPTGWTPIGPSLLKAADDLEGGN------------GSKRIVLISDGEDT 162 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + ++GI I T+ N + A+ ++ EL Sbjct: 163 CAPLDPCEVAREIAAKGIGLTIDTLGLVPNAKLSRQLSCIAE--ATGGTYASVEHQDELT 220 Query: 439 KIFRDRIGNEIFERVI 454 + + V Sbjct: 221 DRVNELVDRAAEPVVT 236 >gi|282877522|ref|ZP_06286340.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300346|gb|EFA92697.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 345 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 19/127 (14%), Positives = 42/127 (33%), Gaps = 15/127 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 +DN + ++G F + + G T I A++ Sbjct: 121 NLVDNFTND-KIGLVVFAGEAFVQLPITSDYVSAKMFLQNADPSLITTQG-TNIAQAIRL 178 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + + ++ + I+L+TDGE+ + E + +A+ +GI + + Sbjct: 179 SMSSFTQQDK-----------VGRAIILITDGEDHEG--EALEAAKEARKKGINVYILGV 225 Query: 406 SVNKTQQ 412 K Sbjct: 226 GETKGAP 232 >gi|239928001|ref|ZP_04684954.1| secreted protein [Streptomyces ghanaensis ATCC 14672] Length = 417 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 44/136 (32%), Gaps = 16/136 (11%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT K G T I ++ A D + N K IVL++DGE+T Sbjct: 108 RTEAKAAVATLTPTGWTPIGPSLLKAADDLEGGN------------GSKRIVLISDGEDT 155 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + ++GI I T+ N + A+ ++ EL Sbjct: 156 CAPLDPCEVAREIAAKGIGLTIDTLGLVPNAKLSRQLSCIAE--ATGGTYASVEHQDELT 213 Query: 439 KIFRDRIGNEIFERVI 454 + + V Sbjct: 214 DRVNELVDRAAEPVVT 229 >gi|225174955|ref|ZP_03728952.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] gi|225169595|gb|EEG78392.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] Length = 841 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 58/161 (36%), Gaps = 30/161 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+DR+ V T + + G T A++ A++ + Sbjct: 77 YLGVLAFDDRIEELVPLQ-QVADNKGTFKEAVEGNLVPRGFTDYVGALEEAFEQL----- 130 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT-----QDNEEGIA--------ICNKAKSQGIRIMT 402 H ++ +A++ +V LTDGE ++++E + + + + G+ + Sbjct: 131 ---HSVETG-DARQVVVFLTDGEPNPHLDARNDDEFMEGYLGELWDLTGEYAAAGVPVYP 186 Query: 403 IAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 +AFS E L A + F +L F Sbjct: 187 VAFS-----DEVGPEVLEQIAGHTGADFVLMPDPGDLVVTF 222 >gi|111020122|ref|YP_703094.1| hypothetical protein RHA1_ro03133 [Rhodococcus jostii RHA1] gi|110819652|gb|ABG94936.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 34/131 (25%), Positives = 47/131 (35%), Gaps = 22/131 (16%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-----QDNEEGIAICN 391 TA +A+ TA I + + IVL +DG+ T D Sbjct: 166 TATGEAIFTALQAIDTLAGVLG---GGSTPPPARIVLESDGKQTVPADLNDPRGAFTAAR 222 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHELNKI 440 AK QG+ I TI+F + L A S SFF A S EL Sbjct: 223 LAKEQGVPISTISFGTTHGAIDLNGSHIPVPVDDESLRRIAELSGGSFFTATSADELQAS 282 Query: 441 FRD---RIGNE 448 +++ +IG E Sbjct: 283 YQNLQQQIGYE 293 >gi|75907530|ref|YP_321826.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701255|gb|ABA20931.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 418 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 66/232 (28%), Gaps = 30/232 (12%) Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD---SSSLRHVIKKKHL 273 N + + D L S + L + G + + S + + Sbjct: 2 KVNLQPVLNDANLDAQQPSSQRQLAISISAGAEPQDRTVPLNLCLILDHSGSMNGRPLEI 61 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 V+ A ++ +K D R+ F+ R +K Sbjct: 62 VKQAAIRLVDRLKTGD------RLSVVAFDHRAKVLVP---NQVIDNPEQIKKQINRLAA 112 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-IAICNK 392 G TAI++ ++ + + ++ + + LLTDGEN + + Sbjct: 113 DGGTAIDEGLRLGIEELAKGKKETISQAF----------LLTDGENEHGDNNRCLKFAQL 162 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 A + + T+ F N + L A S + F Sbjct: 163 AAGYNLTLNTLGFGDNWN-----QDVLEKIADAGLGSLSYIQKAEQAVDEFG 209 >gi|307565332|ref|ZP_07627825.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307346001|gb|EFN91345.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 566 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 40/128 (31%), Gaps = 18/128 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ DN +G F + + + T I A+ Sbjct: 124 LRSFDNDK----VGLIVFAGDAFVQLPITSDFISA-KMFLNDINPSLIGTQGTDIGKAIN 178 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A +H +A K I+++TDGE+ + E A+ KA+ G I + Sbjct: 179 LA-----------MHSFSPTSKAGKAIIIITDGEDNEGGAE--AMAKKAQEAGFHIYILG 225 Query: 405 FSVNKTQQ 412 + Sbjct: 226 IGSTSGAE 233 >gi|302669471|ref|YP_003829431.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302393944|gb|ADL32849.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 561 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 60/180 (33%), Gaps = 15/180 (8%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S + + FV S + ++++L S ++ I +G Sbjct: 369 SAQKVWKTNKNGTRPTIAVFVTDISGSMNGTRIKSLKNSLLSTMQYIDSSS------YIG 422 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++D+V + + +K D + G TA DA+ ++ + Sbjct: 423 LVSYSDKVYINLPIAQFDNKQRAYF-SGAVKDLDVGGQTATYDAVLVGMQMLMEKS---- 477 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K +A + +L+DG E I GI + TI + + + +E + Sbjct: 478 ---KEVPDANMMLFVLSDGAQNAG-FELKRITPIVGGLGISVYTIGYEMTDSDKEDLKAL 533 >gi|320158392|ref|YP_004190770.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] gi|319933704|gb|ADV88567.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] Length = 442 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 50/368 (13%), Positives = 104/368 (28%), Gaps = 54/368 (14%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 IK G II ++PV++ + + + + + +AA+ A + + E+ Sbjct: 19 IKKQQGVAGIIFMGMLPVLVIIMVFSMQMTQRHMAHAKITEAAEVASLALIASPKEGDEK 78 Query: 75 VSSRAKNSFTFP-KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 A+ E + R F N + + D V+ + + Sbjct: 79 NQEYAQKIVDHYIPDNKGEVVARVF--NRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDS 136 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 Y+ L+ MG T + + I ++ID S SM++ Sbjct: 137 WISYNDGEMGLTKDFEVMG-------------TSTSRKFLPQPLDIYFIIDMSGSMVNPW 183 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S + +R V R + + + K Sbjct: 184 GGSGKTKYDVVADTINRIVDDLREFKTDRKSRVAVIGFHHTAVKK------------VGR 231 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 F SS + ++ + KI + ND N + D + Sbjct: 232 QRTAFDYSSYRTPSAT--------VNNMFTA-PKIHSRND-------SSNIKTFEDIPLT 275 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + + ++ II + + +L ++ +L Sbjct: 276 EDYDAFLTKFNSSNYYA--------SRYGLTESWQGIIGAAQMAEQA--TDLNPEQVFIL 325 Query: 374 LTDGENTQ 381 L+DG + Sbjct: 326 LSDGRDGD 333 >gi|170041024|ref|XP_001848278.1| sushi [Culex quinquefasciatus] gi|167864620|gb|EDS28003.1| sushi [Culex quinquefasciatus] Length = 2239 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 52/172 (30%), Gaps = 35/172 (20%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--------VISDPSF 312 S ++ + V+ L+ S + R+ F+ + IS Sbjct: 146 SVGRQNFASEIKFVKKLLSDFNVSY-------NYTRVAVITFSSQKKIFRHIDQISQSVE 198 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 L+ V A G T A++ A + ++ D KK I Sbjct: 199 DNDKCLLLNYQVPRIA---FSGGGTYTYGALKEAEEIFKNARLDS----------KKIIF 245 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 L+TDG + + +A K + I +I Q L AS Sbjct: 246 LITDGFSNGRDPIPLAG-RLKKDNNVVIYSIGI------QSGNYAELHAIAS 290 >gi|156741348|ref|YP_001431477.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232676|gb|ABU57459.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 972 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 69/192 (35%), Gaps = 37/192 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + L ++A+ + ID ++G F+D +L + A+ Sbjct: 432 RLDLAKEAVYQASLGLTPID------QVGLVVFDDAANWVLPL----QRLPSVVEIERAL 481 Query: 330 DENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I ++ A + S++ H ++LLTDG + + IA Sbjct: 482 GSFGIGGGTNIRPGIEQAAQALASADAKVKH-----------VILLTDGIAESNYSDLIA 530 Query: 389 ICNKAKSQGIRIMTIAFS--VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF-RDRI 445 + ++ G+ I T+A N + A A + ++ +IF ++ I Sbjct: 531 ---QMRAAGVTISTVAIGEDANPNLVDVAN------AGGGRSYRVTRIEDVPRIFLQETI 581 Query: 446 ---GNEIFERVI 454 G +I E I Sbjct: 582 IAAGRDIVEERI 593 >gi|194016356|ref|ZP_03054970.1| YwmC [Bacillus pumilus ATCC 7061] gi|194011829|gb|EDW21397.1| YwmC [Bacillus pumilus ATCC 7061] Length = 233 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 58/178 (32%), Gaps = 25/178 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-----NDRVISDPSFSWGVHKLIRTI 323 +K L + + +S+ + +RM N GV+ + Sbjct: 59 RKFDLAKQEVFKFAQSL----PKDAKIRMSLFGSEGNNKNSGKAQSCEVIRGVYGVQPYE 114 Query: 324 VKTFAIDEN---EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++F N G T I A++ A N H + L+TDGE T Sbjct: 115 KESFENSLNELGPNGWTPIARALEHAKQADEQLNNGTKH----------IVYLITDGEET 164 Query: 381 QDNEEGIAICNKAKSQGIRIMT-IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + S+G ++ I N + + + A +++A++ E+ Sbjct: 165 CGGDPVKVAKELHNSKGSTVVNVIGLDFNDGYEGQLKQVAK--AGKGHYYQASTGKEM 220 >gi|71984286|ref|NP_498819.2| C-type LECtin family member (clec-160) [Caenorhabditis elegans] gi|47117847|sp|P34393|CL160_CAEEL RecName: Full=C-type lectin domain-containing protein 160; Flags: Precursor gi|28894818|gb|AAK84522.2|L11247_8 C-type lectin protein 160, confirmed by transcript evidence [Caenorhabditis elegans] Length = 639 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 65/173 (37%), Gaps = 8/173 (4%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 +V+ + +++ + N+ V++G ++D+ S ++ T Sbjct: 307 MVKAEINTLVGQMSLDPNIQKHVQVGLIKYSDKAEVVFKPSDYTNEDEFTEDLWSDPRLE 366 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 ++ + + + + + M+N + +K IV+ N + N++ I Sbjct: 367 DVDEKSDEVNLHLG---LQQAAKMTA-SMRNGV--RKVIVVYAASYNDEGNDDARQIAAN 420 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + G I+T+AF ++ + + ASP F + L + D + Sbjct: 421 IRETGYAIITVAFVEPESSNLVMK--IGEIASPRMNFTSFRDDLLVEQMEDAL 471 >gi|145632220|ref|ZP_01787955.1| hypothetical protein CGSHi3655_07194 [Haemophilus influenzae 3655] gi|229844730|ref|ZP_04464869.1| hypothetical protein CGSHi6P18H1_03949 [Haemophilus influenzae 6P18H1] gi|144987127|gb|EDJ93657.1| hypothetical protein CGSHi3655_07194 [Haemophilus influenzae 3655] gi|229812444|gb|EEP48134.1| hypothetical protein CGSHi6P18H1_03949 [Haemophilus influenzae 6P18H1] Length = 212 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 53/172 (30%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR+ L ++ ++++ +T + F+ + + ++ Sbjct: 21 IESVRNGLQMLVSALRQDPYALETAYLSVITFDSQAKQVTPLT--------ELMSFQLPT 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T++ A+ D I + +K + + + LL+DG T D ++GI Sbjct: 73 IEASGLTSMGGALSLLTDCINREVQKGSAEVKGDWKP--VVFLLSDGVPTDDLQKGINAL 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K T F L + + + F+ Sbjct: 131 RTVK-------TGTFVACAAGAGADTNVLKQITESVVSLDTTDANSIKAFFK 175 >gi|7145102|gb|AAA36225.2| MHC serum complement factor B [Homo sapiens] Length = 677 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 176 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKLASYGVKP-----RYGLVTYATX 230 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T +A+Q Y + ++ Sbjct: 231 XXIWVKVSEAVSSNADWVTKQLNEINYEDHKLKSGTNTEEALQAVYSMMSWPDDVPP--- 287 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICN-------------KAKSQGIRIMTIAFSVN 408 + + I+L+TDG + + I + + + + + Sbjct: 288 EGWNRTRHVIILMTDGLHNMGG-DPITVIDXXXXXXYIGKDRKNPREDYLDVYVFGVG-P 345 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 346 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 382 >gi|24373750|ref|NP_717793.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella oneidensis MR-1] gi|24348130|gb|AAN55237.1|AE015661_7 inter-alpha-trypsin inhibitor domain protein [Shewanella oneidensis MR-1] Length = 760 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 70/223 (31%), Gaps = 19/223 (8%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 + Q +++ E +QW ++ S + + + G+ D ++ + Sbjct: 286 TLDEQQGGLTASLANRVRAERDFVLQWRLEQGVSPVAWVFNQNGKTHQTQAASDDGSMVN 345 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 ++ I + L + + G L+ +D+S Sbjct: 346 TAASTSASNIDNYSLVMVLPPK------VEASGQLNLPRELILVIDTSGSMAGDSIIQA- 398 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++AL +R +K D+ N FN V I + F Sbjct: 399 KNALRYALRGLKAQDSFN------IIEFNSDVSLLSPVPLPATAENLAIARQFVNRLQAD 452 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 G T ++ A++ A + N ++ ++ +TDG Sbjct: 453 GGTEMSLALEAA-----LPKQRPSRAASENNVLQQ-VIFMTDG 489 >gi|254819550|ref|ZP_05224551.1| hypothetical protein MintA_06484 [Mycobacterium intracellulare ATCC 13950] Length = 335 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 58/183 (31%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA + + TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNRDSTKAALDKLQFADR-----TATGEGIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDKPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E L A S + + A S EL ++ + +IG E Sbjct: 242 ISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAASLQELKAVYATLQQQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|254481786|ref|ZP_05095029.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] gi|214037915|gb|EEB78579.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] Length = 686 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 45/138 (32%), Gaps = 11/138 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A S+ R+++ + + R FN + + + + Sbjct: 333 IKQAKGSLTRALRHLGPND---RFNVIEFNSSHRALFQHAVPASHHNLQLASEYVRHLEA 389 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T + A+Q A + +E A + ++ +TDG ++ I + Sbjct: 390 SGGTEMMPALQLALKLPGAQDELR------PEPALRQVIFITDGAVGNESALFEHIVDSL 443 Query: 394 KSQGIRIMTIAFSVNKTQ 411 G R+ T+ Sbjct: 444 --GGSRLFTVGIGSAPNA 459 >gi|118096709|ref|XP_001233876.1| PREDICTED: similar to tumor suppressor candidate 4 [Gallus gallus] Length = 1208 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 42/115 (36%), Gaps = 15/115 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + K G+T + A+D + +SN + K I++ TDG Sbjct: 124 NVASKVFKEDVQGMVVKGTTDYKAGFEYAFDQLQNSNITRANCN-------KMIMMFTDG 176 Query: 378 --ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 + QD E NK +R+ T + + ++ CA+ +FE Sbjct: 177 GEDRVQDVFEKYNWPNKT----VRVFTFSVGQHNYDVTPLQWM--ACANKGYYFE 225 >gi|111025338|ref|YP_707758.1| hypothetical protein RHA1_ro08556 [Rhodococcus jostii RHA1] gi|110824317|gb|ABG99600.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 34/131 (25%), Positives = 46/131 (35%), Gaps = 22/131 (16%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-----QDNEEGIAICN 391 TA +A+ TA I + IVL +DG+ T D Sbjct: 166 TATGEAIFTALQAIDT---LAGVVGGGGTPPPARIVLESDGKQTVPTDLNDPRGAFTAAR 222 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEANSTHELNKI 440 AK QG+ I TI+F + L A S SFF A S EL Sbjct: 223 LAKEQGVPISTISFGTTHGAIDLNGSHIPVPVDDESLRRIAELSGGSFFTATSADELQAS 282 Query: 441 FRD---RIGNE 448 +++ +IG E Sbjct: 283 YQNLQQQIGYE 293 >gi|3024062|sp|P97279|ITIH2_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1694690|dbj|BAA13939.1| inter-alpha-trypsin inhibitor heavy chain 2 [Mesocricetus auratus] Length = 946 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 71/214 (33%), Gaps = 17/214 (7%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P LDP FV S K +A+ +++ ++ D + Sbjct: 292 NGYFVHFFAPENLDPIPKNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTED------Q 345 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 FN V + + K T K + G T IN+A+ A I NE Sbjct: 346 FSVVDFNHNVRTWRNDLVSATKTQITDAKRYIEKIQPSGGTNINEALLRA---IFILNEA 402 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMT--IAFSVNKTQ 411 M N IVL++DG+ T + I K I + + I F V+ Sbjct: 403 SNLGMLNPDSVS-LIVLVSDGDPTVGELKLSKIQKNVKQNIQDNISLFSLGIGFDVDYDF 461 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 ++ A + T K F +++ Sbjct: 462 LKRLSNENRGIA--QRIYGNRDTSSQLKKFYNQV 493 >gi|327278404|ref|XP_003223952.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Anolis carolinensis] Length = 1162 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 49/123 (39%), Gaps = 12/123 (9%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A++ + +S + + + ++++TDG T D + + Sbjct: 213 QVGGWTETATAIRRVVRELFTSQKGSRNGAT------RILIVITDGVKT-DRLQYSQVIP 265 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNE 448 +AK GI I + + A+ L + AS F + + L I +D++ ++ Sbjct: 266 EAKQAGIIRYAIGVGDAFSSPD-AKRELDDIASEPKAEHIFTVYNFNALRGI-QDQLKDK 323 Query: 449 IFE 451 IF Sbjct: 324 IFA 326 >gi|326675264|ref|XP_002665076.2| PREDICTED: collagen alpha-1(XXVIII) chain-like [Danio rerio] Length = 1046 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 47/137 (34%), Gaps = 15/137 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ +++ V + + + + + T A+ A I Sbjct: 87 TRLALIYYSSSVHINQ--HFNDWQDLDVFLDQLEDASYIGQGTYSTYAISNATQLFI--- 141 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 R + + +L+TDG + N + + + +AKS I+I I S+ Sbjct: 142 -----RETSGQSV-RVSLLMTDGSDHPRNPDIMTVVAEAKSHNIKIFAIGLSMRAMDSNS 195 Query: 415 ARYFLSNCASP--NSFF 429 A+ L AS +F Sbjct: 196 AK--LRAVASSPAQQYF 210 >gi|255602535|ref|XP_002537872.1| conserved hypothetical protein [Ricinus communis] gi|223514758|gb|EEF24510.1| conserved hypothetical protein [Ricinus communis] Length = 120 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 29/58 (50%), Gaps = 3/58 (5%) Query: 4 DTKFIFYSKKLI---KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 S++L ++ +G F I+ AL++P+M+ + G +D+ R + L+ A Sbjct: 6 SRSVRVQSRRLRPGRRAESGAFAIMAALVLPIMIAMLGFAIDLSRVYNRKVELQSVAD 63 >gi|156383644|ref|XP_001632943.1| predicted protein [Nematostella vectensis] gi|156220006|gb|EDO40880.1| predicted protein [Nematostella vectensis] Length = 982 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 73/219 (33%), Gaps = 23/219 (10%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 S V C + P P+D ++ + S + +K + + + + Sbjct: 63 SECGVKCTAPVAATSCPIPIDLAMVLDS-SGSIGKKDWVKLLEFTKSVVDAY-----SVS 116 Query: 290 NVNDTVRMGATFFNDRVISDPSFSW--GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 V G ++ D +F GV + K I + T ++ A++ A Sbjct: 117 EEATHV--GVITYSTEATLDIAFDKYSGVEMNSVNLKKDIDIIPQKNNLTFMDKALELAN 174 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIA 404 + E M+ N K+ + LTDG T + + K K +GI + T+ Sbjct: 175 SVLF----TEARGMRPNK--KQVCLFLTDGIQTFDQGPYTKPSIVSQKLKDRGIDVYTVG 228 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + LS + + A + EL ++ Sbjct: 229 VGDDV----DLFELLSISSGDKYTYSAKNFDELQAKVQE 263 >gi|314948817|ref|ZP_07852188.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] gi|313644760|gb|EFS09340.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] Length = 1129 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 51/333 (15%), Positives = 100/333 (30%), Gaps = 30/333 (9%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + I +++ +FP + N K + + + +D Sbjct: 89 DGYAYQVNSGKITLEISSNTKQTIDLSFPIDPALYHSQANKLIVDNKEYDIIDETENKKD 148 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 T V + P+ + S + ++P +L S+ S E + Sbjct: 149 TDVSV-PKPDEIEEESSKENENSVSPFTLPTLSLPAVSVPSNQTIPTEYTTDDQGTYPKA 207 Query: 178 SIQW-----VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S Q V+D + ++ +N + + SY G D + Y Sbjct: 208 SWQPTGNTNVLDHQG---NKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKY 264 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + +Y + VD S + + V+ + + ++ Sbjct: 265 AKETSTPGLFDVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLA 324 Query: 287 KIDNVNDTVRMGATFFNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D + MG ++ S + G ++ VK+ G T A++ Sbjct: 325 D-SGITDKINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSI-TPSRTNGGTFTQKALR 382 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 A + N KK IVLLTDG Sbjct: 383 DAGSRLSVPNGH-----------KKVIVLLTDG 404 >gi|148537043|dbj|BAF63430.1| Ca(2+)-activated chloride channel splicing variant [Rattus norvegicus] Length = 514 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 64/219 (29%), Gaps = 21/219 (9%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + N + + S + + + ++ + Sbjct: 243 LNSVVEFCTEKTHNTEAPNLQNKICNGRSTWDVIKESADFQQAPPMRGTEAPPPPTFSLL 302 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D S + + A + I + +++ +G F+ Sbjct: 303 KSRQRVICLVLDKSGSMDTEDRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSTAQI 357 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + T + + +G T+I ++ + I SS++ Sbjct: 358 QNYLI-KITNTGDYKKITGNLPQQAVGGTSICRGLEAGFQAITSSDQSTSGSE------- 409 Query: 369 KYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFS 406 IVLLTDGE ++ I+ C K G I TIA Sbjct: 410 --IVLLTDGE-----DDLISSCFEVVKHSGAVIHTIALG 441 >gi|311254858|ref|XP_001927013.2| PREDICTED: calcium-activated chloride channel regulator 4 [Sus scrofa] Length = 910 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 26/134 (19%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+I ++ A++ + ++ ++ + + IVLLTDGE+ Sbjct: 358 PTTASGGTSICSGIRRAFEVV--------RKLYSHTDGSE-IVLLTDGEDN-----TAGA 403 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHE---LNKIFRD- 443 C ++ K G I IA + + + A+ F A E L F Sbjct: 404 CVDEVKQSGAIIHFIAL-----GPSADKAVIEMSTATGGVHFYATDEAENNGLIDAFGAL 458 Query: 444 RIGN-EIFERVIRI 456 GN +I ++ +++ Sbjct: 459 ASGNTDISQQSLQL 472 >gi|296170658|ref|ZP_06852233.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295894647|gb|EFG74381.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 59/183 (32%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + R + K D TA + + TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNRDSTKRALDKLQFADR-----TATGEGIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDAPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E L A S + + A + EL ++ + +IG E Sbjct: 242 ISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAATLQELKSVYATLQQQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|167919151|ref|ZP_02506242.1| hypothetical protein BpseBC_11405 [Burkholderia pseudomallei BCC215] Length = 594 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 12 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 70 Query: 75 V 75 Sbjct: 71 P 71 >gi|297694868|ref|XP_002824689.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Pongo abelii] Length = 482 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 67/187 (35%), Gaps = 19/187 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 M+ S++ +D SS + ++ + ++++ ++ + D ++ A F Sbjct: 288 MMCSKTCYNSVNIAFLIDGSSSVGDSNFRLML-EFVSNIAKTFEISDIGA---KIAAVQF 343 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 FS+ + ++ G TA DA+ + Sbjct: 344 T--YDQRTEFSFTDYSTKENVLAAIRNIRYMSGGTATGDAISFTVRNVFGP--------I 393 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++V++TDG++ D + A + A GI + ++ + + + S Sbjct: 394 RESPNKNFLVIVTDGQSYDDVQGPAAAAHDA---GITVFSVGVAWAPL--DDLKDMASKP 448 Query: 423 ASPNSFF 429 ++FF Sbjct: 449 KESHAFF 455 >gi|332262934|ref|XP_003280513.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Nomascus leucogenys] Length = 1164 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 61/177 (34%), Gaps = 18/177 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + +VI ++ F++R + ++ + + A G T Sbjct: 174 VRAVISQFQR--PSTQF---SLMQFSNRFETHF--TFEKFRRSSNPLSLLASVRQLGGLT 226 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q + ++ K ++++TDG+ D+ + + A + G Sbjct: 227 YTATAIQKVVHQLFHASYGARRDAT------KILIVITDGKKEGDSLDYKHVIPMADAAG 280 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEIFE 451 I I + Q + L++ A S F+ L I ++++ +IF Sbjct: 281 IIRYAIGVG-SAFQNTNSWKELNDIASKPSQEHIFKVEDFDALKDI-QNQLKEKIFA 335 >gi|111224529|ref|YP_715323.1| hypothetical protein FRAAL5146 [Frankia alni ACN14a] gi|111152061|emb|CAJ63786.1| hypothetical protein FRAAL5146 [Frankia alni ACN14a] Length = 209 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 56/178 (31%), Gaps = 15/178 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + D+L ++ + ++ V + R+ F+D + + + + + Sbjct: 10 LEALNDSLPALQKEMQSNPTVGEIARISIVTFSDVGRTVVPLC--------DLAEVYLPE 61 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T A Q I K + + ++DGE+ + A Sbjct: 62 LMVEGGTNFAAAFQETRRAIEGGLRSLP---KGTPIYRPVVFFMSDGEHQAPG-DWTAAL 117 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF--EANSTHELNKIFRDRIG 446 N + + R + + A+ SF +A+ ++ +I IG Sbjct: 118 NDLRDRSWR-FAPEVVAFGFGDQVNVDSIRRIATRFSFLARDADPATQVREIMNALIG 174 >gi|86147465|ref|ZP_01065777.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] gi|85834758|gb|EAQ52904.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] Length = 421 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 30/70 (42%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K G I + + V L V + VD+ + L+ A +A + A+ L S ++ Sbjct: 10 KKQGGLVVIFVTIALLVFLAVSALAVDINHMLVNKTRLQNAVDSAALAAATILDNSKDQA 69 Query: 76 SSRAKNSFTF 85 + A+ + T Sbjct: 70 AVSAEVTSTL 79 >gi|296168869|ref|ZP_06850541.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896486|gb|EFG76136.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 327 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 66/202 (32%), Gaps = 34/202 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +A + + N +G F + TI + Sbjct: 111 RLKAAEEAASQFAAQLTPGIN------LGLVGFAGTPYLLVPP---TPQHQATIDALKKL 161 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN------ 383 D + TA +A+ TA + ++ + IVLL+DG + + Sbjct: 162 DFADS--TATGEAIFTALHAVSATA-----ITGGDTPPPARIVLLSDGRENKPSNPSDPH 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYF--------LSNCA--SPNSFFEAN 432 + A+ +G+ I TI+F + E + + A S + A Sbjct: 215 DGVYTAARLARDEGVPISTISFGTKTGEIEMDGQRVAVPVSTDQMKTIAKLSGGQSYTAG 274 Query: 433 STHELNKIFRDRIGNEIFERVI 454 + ELNK + + I +I R + Sbjct: 275 NLAELNKSY-NAIEKDIGYRTV 295 >gi|167902891|ref|ZP_02490096.1| hypothetical protein BpseN_11585 [Burkholderia pseudomallei NCTC 13177] Length = 593 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 11 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 69 Query: 75 V 75 Sbjct: 70 P 70 >gi|146162766|ref|XP_001010021.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|146146274|gb|EAR89776.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 783 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 54/169 (31%), Gaps = 11/169 (6%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + M Y + ++ + RT N + D+ SC + Sbjct: 20 KIKPMHSYIQINQQSDIVMPVMIDIRTQTEKPLSNQHIQNDDDLTK----SCTPTSSQAS 75 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 +D + FV S+ KK + ++ + ++I + + D RM FN+ Sbjct: 76 CNIDIDRQPLDLIFVIDLSISMRGKKMNQLKKTICNLINFLNEND------RMALIGFNN 129 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + S + + + MG T I M A + SS Sbjct: 130 SAQNLFPLSHLTQQNKKKVT-QILNSILPMGLTNITAGMMEAIKQLESS 177 >gi|242042271|ref|XP_002468530.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] gi|241922384|gb|EER95528.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] Length = 686 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 46/137 (33%), Gaps = 18/137 (13%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P S S+R K LV+ A+ VI S+ D R+ F+ Sbjct: 160 DAPRAPLDLVTVLDVSGSMRWD--KLALVKQAMGFVIGSLGPHD------RLSVVSFSSG 211 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + +++ G T I + ++TA + + HR + Sbjct: 212 ARRVTRLLR-MSHTGKSLATEAVESLRAGGGTNIAEGLRTAAKVL----GERRHRNAVSS 266 Query: 366 EAKKYIVLLTDGENTQD 382 ++LL+DG + Sbjct: 267 -----VILLSDGHDNYS 278 >gi|225874357|ref|YP_002755816.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] gi|225791485|gb|ACO31575.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] Length = 333 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 45/141 (31%), Gaps = 8/141 (5%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI----VK 325 K H + L S++ + V FN V S K + + V Sbjct: 62 KVHASKQFLNSLLSAHASGSAPRVFV----VQFNRDVDLLEDPSASASKAQQALSQVGVA 117 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 F D N + + + + + H + N +K IV+L+DG + Sbjct: 118 QFHGDSNADSTANKGRHAKASGAALYDAIYLATHNVLNTTAGRKVIVVLSDGIDQGSKTT 177 Query: 386 GIAICNKAKSQGIRIMTIAFS 406 A+ G+ + I F Sbjct: 178 LNGAVEAAQRAGVAVYAIYFK 198 >gi|159037814|ref|YP_001537067.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157916649|gb|ABV98076.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 427 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 45/113 (39%), Gaps = 12/113 (10%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K G T + A+++A + A++ IVL+TDGE+T Sbjct: 116 KAAVATLRPTGFTPVGLALRSAAQDLG-----------TGSTARR-IVLITDGEDTCAPP 163 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + + +QG +++ + ++ + + A+ ++ A S EL Sbjct: 164 DPCEVARELAAQGTKLVVDTLGLAPDEKVRRQLLCIAAATGGTYTAAQSADEL 216 >gi|315501784|ref|YP_004080671.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408403|gb|ADU06520.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 430 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 68/199 (34%), Gaps = 21/199 (10%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT-VRMGATFFND 304 P P + V S I + + A + + + + +R+ + Sbjct: 35 PVAEPPKVELVLDVSGSMRARDIDGRSRISVAQQAFNEVVDALPDETQLGIRVLGATYRG 94 Query: 305 RVISDPSFSWGVHKLI------RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + +++ RT K G T + A+++A + Sbjct: 95 KDKKQGCL--DTQQIVPVGPVDRTQAKAAVAGLRPTGFTPVGLALRSAAQDLG------- 145 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 A++ IVL+TDGE+T + + + +QG R++ + ++ + + Sbjct: 146 ----TGSTARR-IVLITDGEDTCAPPDPCEVARELAAQGTRLVVDTLGLAPDEKVRKQLL 200 Query: 419 LSNCASPNSFFEANSTHEL 437 A+ ++ A S EL Sbjct: 201 CIAGATGGTYTAAQSADEL 219 >gi|302865239|ref|YP_003833876.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568098|gb|ADL44300.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 429 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 68/199 (34%), Gaps = 21/199 (10%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT-VRMGATFFND 304 P P + V S I + + A + + + + +R+ + Sbjct: 35 PVAEPPKVELVLDVSGSMRARDIDGRSRISVAQQAFNEVVDALPDETQLGIRVLGATYRG 94 Query: 305 RVISDPSFSWGVHKLI------RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + +++ RT K G T + A+++A + Sbjct: 95 KDKKQGCL--DTQQIVPVGPVDRTQAKAAVAGLRPTGFTPVGLALRSAAQDLG------- 145 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 A++ IVL+TDGE+T + + + +QG R++ + ++ + + Sbjct: 146 ----TGSTARR-IVLITDGEDTCAPPDPCEVARELAAQGTRLVVDTLGLAPDEKVRKQLL 200 Query: 419 LSNCASPNSFFEANSTHEL 437 A+ ++ A S EL Sbjct: 201 CIAGATGGTYTAAQSADEL 219 >gi|255262383|ref|ZP_05341725.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] gi|255104718|gb|EET47392.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] Length = 634 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 58/182 (31%), Gaps = 21/182 (11%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 L P L+ +D+S + K L+ A ++ ++ D V+ G+ Sbjct: 270 TLPDVSEHPPLNLVFLIDTSGSMNQPDKLPLLISAFRLMLSELRPEDEVSIITYAGSAG- 328 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 +V++ S R + + GSTA ++ AY + +ED Sbjct: 329 --QVLAPTPAS------DRATILAALNRLSAGGSTAGQAGLRQAYAIAAAMSEDGEIAR- 379 Query: 363 NNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 ++L TDG+ D + + G + + F + Sbjct: 380 --------VILATDGDFNVGLNDPDTLKDYITTRRDSGTYLSVLGFGRGNLNDAVMQSLA 431 Query: 420 SN 421 N Sbjct: 432 QN 433 >gi|257884610|ref|ZP_05664263.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] gi|257820448|gb|EEV47596.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] Length = 1107 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 54/350 (15%), Positives = 108/350 (30%), Gaps = 31/350 (8%) Query: 41 VDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 +D+V + K+ + I +++ +FP + N Sbjct: 51 IDLVENYLSDVERKEG-DGYAYQVNSGKITLEISSNTKQTIDLSFPIDPALYHSQANKLI 109 Query: 101 NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT 160 K + + + +DT V + P+ + S + ++P +L S+ S Sbjct: 110 LDNKEYDIIDETENKKDTDVSV-PKVDEIEEESSKENENSVSPFTLPTLSLPAVSVPSNQ 168 Query: 161 KAEAETVSRSYHKEHGVSIQW-----VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 E + S Q V+D + ++ +N + + SY Sbjct: 169 TIPTEYTTDDQGTYPKASWQPTGNTNVLDHQG---NKNGTNQWDGINSWNGDPNDRTHSY 225 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIK 269 G D + Y + + +Y + VD S + Sbjct: 226 IEYGGTGNQADYAIRKYAKETSTPGLFDVYLNARGNVQKDITPLDLVLVVDWSGSMNDNN 285 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP--SFSWGVHKLIRTIVKTF 327 + V+ + + ++ + D + MG ++ S + G ++ VK+ Sbjct: 286 RIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSI 344 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 G T A++ A + + N KK IVLLTDG Sbjct: 345 -TPSRTNGGTFTQKALRDAGNMLSVPNGH-----------KKVIVLLTDG 382 >gi|31873561|emb|CAD97767.1| hypothetical protein [Homo sapiens] Length = 363 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 36/117 (30%), Gaps = 8/117 (6%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---- 389 T A+ + Y + + E + I+LLTDG++ A+ Sbjct: 5 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 64 Query: 390 ----CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 N+ ++ + I I + S F T L+++F Sbjct: 65 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFE 121 >gi|168705263|ref|ZP_02737540.1| hypothetical protein GobsU_37375 [Gemmata obscuriglobus UQM 2246] Length = 987 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 38/334 (11%), Positives = 89/334 (26%), Gaps = 37/334 (11%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ--SLEEVSSRAKNSFTFPKQKI 90 + G + VD+ + + A A + + L + + A + Sbjct: 27 LFGFVALAVDLGMLAVSRTQSQNGADVAALVGTRTLNNRDGVAYNNLPAAVTAAQASVTS 86 Query: 91 EEYLIRNFENNLKKNF--TDREVRDIVRDTAVEMNPRKSAYQVVLSS--------RYDLL 140 +L NF + + V+ + + + + R L Sbjct: 87 NPHLSTNFVSGEVSKMEVGQYLYDPTSQTFQVQNWTQVTGGGAMSAPGGNSWTAMRVTLG 146 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 ++ + F+R G+ S A A I +V+D + SM + G Sbjct: 147 VSQPTYFMRVFGVNSMPSGAVATA--------VYRPRDIAFVLDMTGSMAFSSTFNSGNA 198 Query: 201 LNCFG-------QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 P Y++ ++ D + + ++ + + PG Sbjct: 199 QGSSSDYQSMNPDPLVPKAGHYTTVQSRIVAADNLANSSNEALPRNNFTITTPGGPPIVR 258 Query: 254 SEEHF-VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 S + + + V D A+++ + + GA + + Sbjct: 259 SYYYDPSNWGTPSTVASPVTTKGDGSANLLPAFHRWSPPE----SGADS-----DNYIAP 309 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 ++ + + M A Sbjct: 310 TYNFAGYNAFHKGNETTPQGPTPAPDTYGTMTDA 343 >gi|1399245|gb|AAB03226.1| integrin alpha-M [Rattus norvegicus] Length = 205 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 60/177 (33%), Gaps = 18/177 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 +++V+ +K + ++D F++ K G T Sbjct: 1 VSTVMEQFQKSKTL-----FSLMQYSD--EFRTHFTFNXFKRNPDPKSHVRPIRQLNGRT 53 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 ++ + K +V++TDGE D + +A+ G Sbjct: 54 KTASGIRKVVRELFQKINGARDNA------AKILVVITDGEKFGDPLNYEDVIPEAEEAG 107 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 I I N + ++R L AS + F+ ++ LN I R+++ +IF Sbjct: 108 IIRYVIGVX-NAFHKPQSRRELDTIASKPAGDHVFQVDNFEALNTI-RNQLQEKIFA 162 >gi|326674126|ref|XP_003200076.1| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 940 Score = 41.5 bits (95), Expect = 0.33, Method: Composition-based stats. Identities = 23/104 (22%), Positives = 42/104 (40%), Gaps = 6/104 (5%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A+ I + KNN +K I++L+DG+ D + NK + + Sbjct: 253 TKTASAINHVLTDIFIP----ENGSKNNS--EKIIIVLSDGKILGDPMTLDEVLNKTQMK 306 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 G+ +I + + A ++ A P ++ +S LN I Sbjct: 307 GVTRYSIGVGDGILKNKDAIKEMTQIADPGKYYSVSSYGALNDI 350 >gi|307291074|ref|ZP_07570959.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] gi|306497728|gb|EFM67260.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] Length = 1103 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 58/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKIIVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + S D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGSQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|308375589|ref|ZP_07444436.2| membrane protein [Mycobacterium tuberculosis SUMu007] gi|308345800|gb|EFP34651.1| membrane protein [Mycobacterium tuberculosis SUMu007] Length = 327 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 59/183 (32%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA +A+ TA Sbjct: 122 ADELTPGINLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR-----TATGEAIFTAL 176 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 177 QAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPIST 233 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E + A S + + A + EL ++ + +IG E Sbjct: 234 ISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQIGYE 293 Query: 449 IFE 451 + Sbjct: 294 TIK 296 >gi|297488656|ref|XP_002697119.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] gi|296474935|gb|DAA17050.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] Length = 1192 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 79/247 (31%), Gaps = 26/247 (10%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKSLYYMLYPG 247 L++ E + Q + + S G +P + Y Sbjct: 274 LNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQ 333 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 VD S + L++ ++ ++ ++ +D V + FN++ Sbjct: 334 GASSPKDMVIIVDVSGSVSGLT-LKLMKTSVCEMLDTLS----DDDYVNVA--SFNEKAQ 386 Query: 308 SDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 F+ V +R + K G+T + A+D + + N + Sbjct: 387 PVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNPNITRANCN---- 442 Query: 366 EAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K I++ TDG + QD E N + +R+ T + + ++ C Sbjct: 443 ---KMIMMFTDGGEDRVQDVFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWM--ACT 493 Query: 424 SPNSFFE 430 + +FE Sbjct: 494 NKGYYFE 500 >gi|313235273|emb|CBY10837.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 83/265 (31%), Gaps = 27/265 (10%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 +++ ++ +S ++ +D+ R + L F + Sbjct: 90 NTTKTAALKNSISNKFNVDWDRLKYEELTIPTNSILALFLFIDQLNEFPIPYSIDEQDAL 149 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSE---EHFVDSSSLRHVIKKKHLVRDALASVI 282 +++ V K L + S+ FV S LV+ +I Sbjct: 150 YTRITHNTVDGFKDKVDELDQNTQNECTSKALDIVFVVDESGSVGPDNFDLVKQ---FLI 206 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + + + R+ ++ D S + +I I +E G T D Sbjct: 207 DYAQDSNIAANATRIAIRTYSTYSDLDFSLNDFKTSNIIFEINNL----VHESGGTNTAD 262 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ + + + V K +V +TDG++ D + A + K+ I Sbjct: 263 AITNGLNDFGNDRSESV----------KIMVTITDGQSNYDRVK--AAADLLKADPRNIQ 310 Query: 402 TIAFSVNKTQQEKARYFLSNCASPN 426 + A ++ + L A+ + Sbjct: 311 SFAIGIDGANMAE----LQAIATTD 331 >gi|304437812|ref|ZP_07397761.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369169|gb|EFM22845.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 255 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 49/147 (33%), Gaps = 14/147 (9%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + ++ + +I+ D+ + F++ F+ I + Sbjct: 61 KSRIQELQKGIEMFFEAIRTDILAADSAEISIVTFDNEAKCLLDFAN--------IERQT 112 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-- 385 + G TA+ + + A D + + + ++ K + ++VL++DGE D E Sbjct: 113 VPQLHANGLTAMGEGVNLALDLL--AQRKKEYQDKGVDYYQPWLVLMSDGEPNGDPTELR 170 Query: 386 --GIAICNKAKSQGIRIMTIAFSVNKT 410 + + + + I Sbjct: 171 RATQRVTELVNAGKLTVFPIGIGSEPG 197 >gi|167816001|ref|ZP_02447681.1| hypothetical protein Bpse9_12727 [Burkholderia pseudomallei 91] Length = 588 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 6 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 64 Query: 75 V 75 Sbjct: 65 P 65 >gi|157694070|ref|YP_001488532.1| hypothetical protein BPUM_3319 [Bacillus pumilus SAFR-032] gi|157682828|gb|ABV63972.1| hypothetical protein YwmC [Bacillus pumilus SAFR-032] Length = 233 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 59/178 (33%), Gaps = 25/178 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-----NDRVISDPSFSWGVHKLIRTI 323 +K L + + +S+ + +RM N GV+ + Sbjct: 59 RKFDLAKQEVFKFAQSL----PKDAKIRMSLFGSEGNNKNSGKAQSCEVIRGVYGVQPYE 114 Query: 324 VKTFAIDEN---EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++F N G T I A++ A T N H + L+TDGE T Sbjct: 115 KESFENSLNGLGPNGWTPIARALEHAKQTDEQLNNGTKH----------IVYLITDGEET 164 Query: 381 QDNEEGIAICNKAKSQGIRIMT-IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + S+G ++ I N + + + A +++A++ E+ Sbjct: 165 CGGDPVKVAKELHNSKGSTVVNVIGLDFNDGYEGQLKQVAK--AGKGHYYQASTGKEM 220 >gi|312200955|ref|YP_004021016.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311232291|gb|ADP85146.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 618 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 54/171 (31%), Gaps = 30/171 (17%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + V +G ND V G +V G T + Sbjct: 471 LDGTKPYRELVPVGLM--NDEV--------GTGTREEALVAA-VNGLKAKGGTGLYATAL 519 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-------CNKAKSQG 397 A++++ + + + K N +VLLTDG+N Sbjct: 520 AAFESLSAQYQPD----KPNQ-----VVLLTDGQNDDPTSSMTLTQLIATLKAEYNPKAP 570 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + I+TI + + + + A+ + + A + + ++ + + + Sbjct: 571 VHIITIGYGADADMDALRQI---SAATGSKTYPAQDPNSIFQVMVNALTDR 618 >gi|116625363|ref|YP_827519.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228525|gb|ABJ87234.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 317 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 34/270 (12%), Positives = 82/270 (30%), Gaps = 25/270 (9%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + S+L Q ++ +V+ +++ + K + Sbjct: 9 AGSLLLAQDQLPSIKVDVDVVSILTSVRDKRGALIPSLQKEDFTI---LEDGKPQPIKYF 65 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 D L+ VD S + + + R A + R + + ++ + G + Sbjct: 66 TKETDLPLTIGMLVDVSGSQRNL--IDIERSAASQFFREVLRKKDLAFLIMFGEETELLQ 123 Query: 306 VISDPS--FSWGVHK--LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + + G++ + + MG Y +++NE Sbjct: 124 DYTGSPRLLTEGLNHLEVSSGVSGIHPGPVPTMGGPRGTVLYDAVY---LAANEKLK--- 177 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI------AFSVNKTQQEKA 415 +K IV++TDG + A+ I +I A+ Sbjct: 178 --GEVGRKVIVVITDGVDQGSRMSRNQAIEAAQKSDCVIYSIDYSDPRAYGPFNMVGGGG 235 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRD 443 L + + ++ + H L+++F++ Sbjct: 236 EGELRKMSDETGGRVYKVDRRHTLDQVFKE 265 >gi|319783082|ref|YP_004142558.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168970|gb|ADV12508.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 704 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 45/395 (11%), Positives = 109/395 (27%), Gaps = 34/395 (8%) Query: 50 EHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 L++ + S +L +S K+ T ++ + Sbjct: 126 RSELEKDKKADADVESRDAADALVAPASPQKSESTVASGAAQQNALERGVLAEPAPPAPT 185 Query: 110 EVRDIVRDTAVEMNPRKS---------AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT 160 + + R Q + P + R K+ + Sbjct: 186 GEFALDGAVSAPSTSRARMPAESKLMAPQQPSTLPADQMQPQPENRD-RIEDFKTNPVHA 244 Query: 161 KAEAETVSRSYHKEHGVSIQWVID--FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 E + S + S +V ++ + +N F + + Sbjct: 245 ALEDPVSTFSIDVDTA-SYSFVRSSLKQGTLPQVDTVRVEEMINYFPYDWKGPESASTPF 303 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 N V + + + + ++ P + +D S K L++ A Sbjct: 304 NSTVSVMPTPWNTHTRLMHVAIKGFDVKPTEQPKANLVFLIDVSGSMDEPDKLPLLKSAF 363 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ +K D ++ G + + + + + GSTA Sbjct: 364 RLLVSKLKADDTISIVTYAG--------DAGTVLE-PTKASQKDKILSAIDNLTPGGSTA 414 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKS 395 ++ AY S +K+ + ++L TDG+ D+++ + K + Sbjct: 415 GEAGIKEAYRLAQKS------FVKDGVNR---VMLATDGDFNVGQSDDDDLKRLIEKERK 465 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 G+ + F + + N ++ + Sbjct: 466 TGVFLSVFGFGRGNLNDQMMQTIAQNGNGTAAYID 500 >gi|313892786|ref|ZP_07826367.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] gi|313442717|gb|EFR61128.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] Length = 230 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 59/186 (31%), Gaps = 17/186 (9%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + DA +I + + + F V ++ ++ + Sbjct: 32 KIDSLYDATIDMIETFSAAQAKEQVIDVAIITFGTHVELHTKYT-----PVKDLQAKGIC 86 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-A 388 + G T + A++ A D I + + + +VL++DG D + + Sbjct: 87 KFSASGLTPMGTALRMAKDMIEDK------DVTPSRIYRPAVVLVSDGAPNDDWKSPMDK 140 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 N +S + +A + PN+ A ++++ F+ I Sbjct: 141 FINDGRSAKCQRFAVAIGND----ADRSILERFTQDPNAVLFAEDAKDISEQFK-TISMS 195 Query: 449 IFERVI 454 I + Sbjct: 196 ISTMAV 201 >gi|242373747|ref|ZP_04819321.1| possible cell wall associated fibronectin-binding protein [Staphylococcus epidermidis M23864:W1] gi|242348567|gb|EES40169.1| possible cell wall associated fibronectin-binding protein [Staphylococcus epidermidis M23864:W1] Length = 4172 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 39/330 (11%), Positives = 110/330 (33%), Gaps = 13/330 (3%) Query: 44 VRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK--QKIEEYLIRNFENN 101 + + A KQA A+ + +I + + + + + ++ L + Sbjct: 598 GNYINADQAKKQAYDNAV-QNAQNIINGTNQPTINKGDVSNATQSVKATKDALDGDHRLE 656 Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS-SRYDLLLNPLSLFLRSMGIKSWLIQT 160 KN ++E+ ++ + K+A + + L +MG I Sbjct: 657 EAKNHANQEITNLQNLNNAQKEAEKNAVNTSPTLEQVQQNLQTAQNLDSAMGALKQSIAN 716 Query: 161 KAEAETVSRSYHKEHGV--SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 K + ++ S+ +++ + + + + ++++ D E N + Sbjct: 717 KDQVKSESKYLNEDPTIKANYDNAVQRAENIINATHDPELNKANIEQATQAVQQAEQALH 776 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 + D+ + + +L +E +S+S + +K + L Sbjct: 777 GVEKLTHDKTTASNELDGLTNLTDAQREK------LKEQINNSTSRDDIKQKIEQAKQ-L 829 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +++ +K+ D V + +FN+ ++ + + I + + N T Sbjct: 830 NNLMDQLKQQVGHKDDVHSSSNYFNEDPDKKKAYDDAIKRAEEIIKNSTDPNLNPQDITN 889 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + A D + + + + +N Sbjct: 890 ALNQINHAKDNLHGDDLLQHQKDTSNQSID 919 >gi|260813584|ref|XP_002601497.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] gi|229286794|gb|EEN57509.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] Length = 260 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 49/151 (32%), Gaps = 17/151 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 ++G ++ R + S + ++ G T A++ + Sbjct: 40 TQVGIIQYSTRPRQEFSM--NSFQTKESLSTAIENVNYMAGGTLTGRAIRY----VTKYG 93 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 E + + K ++L+TDG +++ +A+ +GI + I S Q Sbjct: 94 FGESDGARPGIP--KIVILVTDG---VSSDDIEQPALEAQQKGISLYAIGVSGYDMDQ-- 146 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 L AS N L R+ + Sbjct: 147 ----LERIASNNRTLAVAENFNLLDSLRNTL 173 >gi|167824377|ref|ZP_02455848.1| hypothetical protein Bpseu9_11953 [Burkholderia pseudomallei 9] Length = 589 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 7 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 65 Query: 75 V 75 Sbjct: 66 P 66 >gi|226196351|ref|ZP_03791933.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|254297788|ref|ZP_04965241.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157807412|gb|EDO84582.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|225931568|gb|EEH27573.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 602 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 20 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 75 V 75 Sbjct: 79 P 79 >gi|76811718|ref|YP_333349.1| hypothetical protein BURPS1710b_1950 [Burkholderia pseudomallei 1710b] gi|237812074|ref|YP_002896525.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254188638|ref|ZP_04895149.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254261132|ref|ZP_04952186.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|76581171|gb|ABA50646.1| putative membrane protein [Burkholderia pseudomallei 1710b] gi|157936317|gb|EDO91987.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|237503192|gb|ACQ95510.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254219821|gb|EET09205.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 602 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 20 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 75 V 75 Sbjct: 79 P 79 >gi|126316414|ref|XP_001380743.1| PREDICTED: similar to integrin alpha 2 subunit [Monodelphis domestica] Length = 1214 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 71/223 (31%), Gaps = 22/223 (9%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S + L + F+D + + +++ ++ +D D ++G Sbjct: 187 SPSFQLLNSFSPAVQTCPSFIDVVVVCDESNSIYPWSAVKNFLVKFVQGLDIGPDKTQVG 246 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDE 357 + + + S K +VK + G T A+Q A + S Sbjct: 247 LIQYGNYPRVVFNMS--TFKTKEEMVKATSQTIQHGGDLTNTFKAIQFAREFAYSEASG- 303 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAFSV----NKTQ 411 A K +V++TDGE+ + +E I CN I IA N Sbjct: 304 -----GRPSATKVMVVVTDGESHDGSFLKEVIGQCN---DDNILRFGIAVLGYLNRNALD 355 Query: 412 QEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFE 451 + + AS FF + L + +G IF Sbjct: 356 TKNLIKEIKAIASTPTERYFFNVSDEDALLEK-AGTLGERIFS 397 >gi|78189842|ref|YP_380180.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172041|gb|ABB29137.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 334 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 40/130 (30%), Gaps = 7/130 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + I I R G F + + T++ A + E T + A Sbjct: 116 QEIAAISQNVQGGRRGLLIFAASPLLHCPLTTDRDG-FATLLNMAAPELIEEQGTRLQPA 174 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 A +NE + IVLL+DGE+ N + + Q +++ Sbjct: 175 FALASTIFDVANESNAASTRGVQV----IVLLSDGEDHDSNVQ--RAAQQLAKQSVQLFV 228 Query: 403 IAFSVNKTQQ 412 I K Sbjct: 229 IGVGSLKPSP 238 >gi|213961827|ref|ZP_03390093.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] gi|213955616|gb|EEB66932.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] Length = 607 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 69/199 (34%), Gaps = 23/199 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D+S K L++ + ++ +++ D R+ + + Sbjct: 248 PPSNIVFLIDTSGSMDEPNKMPLLKASFKLLLDNLRPED------RIAIVVYASQTGIAL 301 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + D GSTA +QTAY+ + K N Sbjct: 302 P---STPAKEKEKISKVIDDLVASGSTAGGAGLQTAYEVAEKN-----FLPKGNNR---- 349 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ +E + + ++ GI I + + + + + A + + Sbjct: 350 IILATDGDFNVGISSRDELQRLVEEKRNNGIYISVLGYGMGNYRDDMAETIANK--GNGN 407 Query: 428 FFEANSTHELNKIFRDRIG 446 + ++ E K+ + G Sbjct: 408 YAYIDNFTEAKKVLVNEFG 426 >gi|300869050|ref|ZP_07113652.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] gi|300332961|emb|CBN58846.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] Length = 411 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 55/158 (34%), Gaps = 29/158 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW--GVHKLIRTIVKT 326 + V+ A ++ +K D R+ F+ + + + + I K Sbjct: 57 RPLDTVKQAAGRLVDRLKPGD------RLSVIAFDHKAKVIVPNQFIDDPGSIKKQIDKL 110 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G TAI++ ++ + + + K+ ++ + LLTDGEN + Sbjct: 111 R-----SSGGTAIDEGLKLGIEEMG--------KGKSETVSQAF--LLTDGENEHGDNNR 155 Query: 387 -IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + + A + + ++ F + + L A Sbjct: 156 CLKLAKLAADYNMTLNSLGF-----GDDWNQDILEKIA 188 >gi|156400007|ref|XP_001638792.1| predicted protein [Nematostella vectensis] gi|156225915|gb|EDO46729.1| predicted protein [Nematostella vectensis] Length = 974 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 51/151 (33%), Gaps = 18/151 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ +D R+G ++ R I F +I + T E + + A++ Sbjct: 101 VRNLDMPKTGARVGTVIYSKRSIVLFDFKDNKTDIILQL-NTIKYREKPNRALSTGQALE 159 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + KK +L+T GE +QD+ I K G+ I T+ Sbjct: 160 LAQKRLFQNARRNS---------KKIAILVT-GEKSQDDV--IIPSKLMKDSGVIIYTVG 207 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + L + AS S+ Sbjct: 208 VGEGYFLPQ-----LESIASSPSYVYTEKVK 233 >gi|27378609|ref|NP_770138.1| hypothetical protein blr3498 [Bradyrhizobium japonicum USDA 110] gi|27351757|dbj|BAC48763.1| blr3498 [Bradyrhizobium japonicum USDA 110] Length = 445 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 11/77 (14%), Positives = 31/77 (40%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L++S G T + + ++G + + W + AA +A + ++ L + Sbjct: 2 RNLLRSRQGSAAFATVIALVPLIGAVALGAEAGSWYVTHQHAQNAADSAAYSGALRLSCT 61 Query: 72 LEEVSSRAKNSFTFPKQ 88 + + ++ K+ Sbjct: 62 MAGAACGTQSVDYLAKE 78 >gi|17231851|ref|NP_488399.1| hypothetical protein alr4359 [Nostoc sp. PCC 7120] gi|17133495|dbj|BAB76058.1| alr4359 [Nostoc sp. PCC 7120] Length = 418 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 67/232 (28%), Gaps = 30/232 (12%) Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD---SSSLRHVIKKKHL 273 N + + D L S + L + G + + S + + Sbjct: 2 KVNLQPVLNDANLDAQQPSSQRQLAISISAGAEPQDRTVPLNLCLILDHSGSMNGRPLEI 61 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 V+ A ++ +K D R+ F+ R +K Sbjct: 62 VKQAAIRLVDRLKTGD------RLSVVAFDHRAKVLVP---NQVIDNPEQIKKQISRLAA 112 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAICNK 392 G TAI++ ++ + + ++ + + LLTDGEN DN + Sbjct: 113 DGGTAIDEGLRLGIEELAKGKKETISQAF----------LLTDGENEHGDNSRCLKFAQL 162 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 A + + T+ F N + L A S + F Sbjct: 163 AAGYSLTLNTLGFGDNWN-----QDILEKIADAGLGSLSYIQKPEQAVDEFG 209 >gi|257876693|ref|ZP_05656346.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] gi|257810859|gb|EEV39679.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] Length = 1195 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 47/121 (38%), Gaps = 15/121 (12%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND--RVISDPSFSWGV 316 VD S + + + V+ + + I+ + D+V MG ++ + + G Sbjct: 361 VDWSGSMNEMGRIAEVKKGVDRFLNQIEG-SGIQDSVYMGYVGYSSDGSNYQNKTCQLGK 419 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ +++ E G T ++ A D + + N KK IVLLTD Sbjct: 420 FSEVKETIRSM-TPETAAGGTFTQRGLRQAGDMLSTQNGH-----------KKVIVLLTD 467 Query: 377 G 377 G Sbjct: 468 G 468 >gi|297203405|ref|ZP_06920802.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197711494|gb|EDY55528.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 421 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 45/126 (35%), Gaps = 16/126 (12%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 V L RT KT + G T I A+ + S K IVL++ Sbjct: 109 VSTLDRTEAKTQVATLSPTGWTPIGPALLKSAGDFTDSAS------------SKRIVLIS 156 Query: 376 DGENTQDNEEGIAICNK--AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 DGE+T + + + AK G+ I T+ N +++ A+ ++ Sbjct: 157 DGEDTCAPLDPCEVAREIGAKGIGLTIDTLGLVPNTKMRKQLSCIAE--ATGGTYTSVEH 214 Query: 434 THELNK 439 EL Sbjct: 215 ADELTD 220 >gi|326923641|ref|XP_003208043.1| PREDICTED: anthrax toxin receptor 1-like [Meleagris gallopavo] Length = 597 Score = 41.1 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 51/150 (34%), Gaps = 18/150 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 S S+R+ + + ++LA ++ +RM F+ R + + Sbjct: 93 LDKSGSVRNHWTEIYSFVESLAEKF--------ISPMLRMSFIVFSSRGTTIMKLTENRE 144 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + R + E G T +++ + A + I H + I+ LTDG Sbjct: 145 AIRRGL--EILQHEVPGGDTFMHEGFKRANEQIY-------HETYGGVRTASVIIALTDG 195 Query: 378 ENTQDN-EEGIAICNKAKSQGIRIMTIAFS 406 E N+A+S G + + Sbjct: 196 ELQDVQFYYAEQEANRARSFGAIVYCVGVK 225 >gi|224051386|ref|XP_002199700.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 565 Score = 41.1 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 56/366 (15%), Positives = 111/366 (30%), Gaps = 43/366 (11%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 +V + + S A N+ Q + + KN V V Sbjct: 172 GAVKVQTLPGQESYPAVNANGIQSQVLSRWASSFSVTAGAKNAALEAVGRSVSTARPSTG 231 Query: 124 PR--------------KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 R K+ ++ Y++ +L +G + ++ E V Sbjct: 232 KRPKKTLDKKAGNKDCKADIAFLIDGSYNIGQRRFNLQKNFVGKVAVMLGIGTEGPHVGV 291 Query: 170 SYHKEHG-VSIQW--------VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 EH + V+ + + +S + SY Sbjct: 292 VQASEHPKIEFYLKNFTATKEVLFAIKELGFRGGNSNTVIDFSLAVCRNNGFFSYQMPTW 351 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 K +V S ML S++ +D SS I + ++ + +++ Sbjct: 352 ---FGTTKYVKPLVQKLCSHEQMLCSKTCYNSVNIGFLIDGSSSIGEINFRLML-EFVSN 407 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V ++ + D + A F + F+ H ++ + G TA Sbjct: 408 VAKAFEISDIGSKV---AAVQFTYNQRKEFGFT--DHVTKEKVLSAIHNIQYMSGGTATG 462 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 DA+ T+ +D K ++++LTDG++ D A AK GI + Sbjct: 463 DAISFTTRTVFGPVKDG--------PNKNFLIVLTDGQSYDDVTGPAA---AAKKAGITV 511 Query: 401 MTIAFS 406 ++ + Sbjct: 512 FSVGVA 517 >gi|328545070|ref|YP_004305179.1| von Willebrand factor type A [polymorphum gilvum SL003B-26A1] gi|326414812|gb|ADZ71875.1| von Willebrand factor type A [Polymorphum gilvum SL003B-26A1] Length = 552 Score = 41.1 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 64/190 (33%), Gaps = 34/190 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G+ I ++ G+ +D+ R + L+ AA Sbjct: 5 RFLSDRRGNIAIAFGSFAFLLTAGSGVGIDMSRVVTEKSRLQSAAD-------------- 50 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S T+ ++I ++ F+ +T E + R+ V + Sbjct: 51 ATALAANYKSGTYTAEQIRQHAEAYFDGL----YTAPERGSVSRNVTVGDGT------IS 100 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + + F +G++ AE++ + S+ + V+D S SM Sbjct: 101 VEAGVTMP----TFFAPLLGVEEISFAVMAESKVGTASFD------VVLVLDNSGSMAGS 150 Query: 193 QRDSEGQPLN 202 + + Q + Sbjct: 151 RMTTLKQAAS 160 >gi|149601522|ref|XP_001515070.1| PREDICTED: similar to integrin alpha 2 subunit, partial [Ornithorhynchus anatinus] Length = 329 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 62/185 (33%), Gaps = 19/185 (10%) Query: 276 DALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +A+ + + + +D ++G + + + + +T ++N Sbjct: 127 NAVKNFLEKFVQGLDIGPTKTQVGLIQYGNEPRVVFNM-NKFKTKEEMVQETSRTNQNGG 185 Query: 335 GSTAINDAMQTAYDTIIS-SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 T A++ A S ++ H K +V++TDGE + D + ++ Sbjct: 186 SLTNTFKAIEFARQNAFSLASGGRPHAT-------KVMVVVTDGE-SHDGSNLKKVIDQC 237 Query: 394 KSQGIRIMTIAFSV----NKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIG 446 + I IA N+ + + AS FF + L + +G Sbjct: 238 EQDNIIRFGIAVLGYLLRNELDTKNLITEIKAIASSPTEKYFFNVSDEAALLEK-AGTLG 296 Query: 447 NEIFE 451 IF Sbjct: 297 ERIFS 301 >gi|149371022|ref|ZP_01890617.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] gi|149355808|gb|EDM44366.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] Length = 325 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 37/117 (31%), Gaps = 14/117 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + + + D TAI++A+Q + + Sbjct: 109 RVGIIGYAGSAFPQVPITTDFASTKLFLNSMD-TDMVSSQGTAISEAVQMSTTYFDDEEQ 167 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +++DGE+ + N E + A GI++ I K Sbjct: 168 KN-----------RVLFVISDGEDHEGNFE--SAIENATENGIKVYAIGVGTEKGGP 211 >gi|90407194|ref|ZP_01215382.1| type IV pilin biogenesis protein, putative [Psychromonas sp. CNPT3] gi|90311770|gb|EAS39867.1| type IV pilin biogenesis protein, putative [Psychromonas sp. CNPT3] Length = 1116 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 44/318 (13%), Positives = 85/318 (26%), Gaps = 82/318 (25%) Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE----- 256 N + V + K D+ + Y ++ KS Y Sbjct: 123 NKYSWAPLAWVYQTEIYDCKADQDDKNNTEYPINNKKSPYMKKNKKNKKNKKIFTSPDAV 182 Query: 257 ---------HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +++ + + + + ++ + ++I S +D G + FN + Sbjct: 183 TLYTANYYYWYMNKRASNNTRSRVDIAKEVITTLISSTPSVD-------FGLSIFNSKGS 235 Query: 308 SDPSF----SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + +T + + + +Y N K Sbjct: 236 GGRIIHRVKERDKPEKEQLVASINNLKIESSTNTPLLETVFESYLYYAGLNVKSGKTAKG 295 Query: 364 NLEA--------------------KKYIVLLTDGENT----------------QDNEEGI 387 + K Y+VL+TDG+ + N Sbjct: 296 SEPIRDMKAEDAGVYTSPFKTCQEKSYLVLMTDGDPSSPYALQKEIGSLIKRDNPNSNLK 355 Query: 388 AICNK------------------AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNS 427 I N A+ Q I TI F + T A L A Sbjct: 356 GIANSYMPVLTQWMFNNDINPKLAEEQNITTYTIGFG-DITSDTDATDLLKKTAELGGGK 414 Query: 428 FFEANSTHELNKIFRDRI 445 +F A++ L K F+ I Sbjct: 415 YFPASNASALQKAFKKMI 432 >gi|77919348|ref|YP_357163.1| hypothetical protein Pcar_1750 [Pelobacter carbinolicus DSM 2380] gi|77545431|gb|ABA88993.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380] Length = 421 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 34/97 (35%), Gaps = 2/97 (2%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP--LIQS 71 L K G ++ A+L+ + LG+ + +DV + L+ AA + + L Sbjct: 8 LKKDQNGAVIVLVAILLILFLGIAALAIDVYHVYVVRNELQNAADAGALAGARELYLESG 67 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + + + I E + N N D Sbjct: 68 ASVNPNANVIANNTAIENISEDVPVEVNYNAAANTGD 104 >gi|224057976|ref|XP_002299418.1| predicted protein [Populus trichocarpa] gi|222846676|gb|EEE84223.1| predicted protein [Populus trichocarpa] Length = 595 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 59/192 (30%), Gaps = 36/192 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ +I+++ D R+ F+ + R + Sbjct: 170 KLILLKRAVNFIIQNLGPSD------RLSIVTFSSSARRILPL-RTMSGSGREDAISVVN 222 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE----- 384 + G T I ++ + E H + I+LL+DG +TQ + Sbjct: 223 SLSATGGTNIVAGLRKGVRVLE---ERRQHNSVAS------IILLSDGCDTQSHSTHNRL 273 Query: 385 --------EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANST 434 A +++ I T F ++ + + S +F S Sbjct: 274 EYLKLIFPSNNASGEESRQPTFPIHTFGFGLDHDSAA-----MHAISDVSGGTFSFIESI 328 Query: 435 HELNKIFRDRIG 446 L F IG Sbjct: 329 DILQDAFARCIG 340 >gi|171910783|ref|ZP_02926253.1| Protein containing von Willebrand factor (vWF) type A domain [Verrucomicrobium spinosum DSM 4136] Length = 917 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 40/129 (31%), Gaps = 18/129 (13%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 VD S K LV+ +L + + D V G + VI Sbjct: 544 LVDVSGSMDEPDKLPLVKQSLRMLTERLSTKDRVAIVTYAG----STAVILPS-----TA 594 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ + GST ++ AY+ + + H K + ++L TDG Sbjct: 595 GTEKSRIIEAIDGLGAGGSTNGAGGIRLAYE------QAQQHFQKEGVNR---VILCTDG 645 Query: 378 ENTQDNEEG 386 + Sbjct: 646 DFNVGISSP 654 >gi|149546336|ref|XP_001514218.1| PREDICTED: similar to Procollagen, type VI, alpha 2 [Ornithorhynchus anatinus] Length = 1023 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 69/196 (35%), Gaps = 18/196 (9%) Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS----EEHFVD 260 G P + ++ + G G P + C+ Y G D + FV Sbjct: 562 GPPGEPGLRGPPGRPGPEGEPGPPGDPGLTECDVMTYVRETCGCCDCEKRCGAVDIVFVI 621 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDT-VRMGATFFNDR-VISDPSFSWGV 316 SS L ++ + +V+ + I D ++T R+G ++ Sbjct: 622 DSSESIGFTNFSLEKNFVINVVSRLGAIAKDPKSETGARVGVVQYSHEGTFEAIQLDDER 681 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + E G T A+Q AY+ +I + R K K + V++TD Sbjct: 682 IDSLSSFKEAVKNLEWIAGGTWTPSALQFAYNKLIKES----RRKKT----KVFAVVITD 733 Query: 377 G--ENTQDNEEGIAIC 390 G + ++ A+C Sbjct: 734 GRHDPRDNDANLRALC 749 >gi|311254860|ref|XP_003125977.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Sus scrofa] Length = 874 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 26/134 (19%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+I ++ A++ + ++ ++ + + IVLLTDGE+ Sbjct: 322 PTTASGGTSICSGIRRAFEVV--------RKLYSHTDGSE-IVLLTDGEDN-----TAGA 367 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHE---LNKIFRD- 443 C ++ K G I IA + + + A+ F A E L F Sbjct: 368 CVDEVKQSGAIIHFIAL-----GPSADKAVIEMSTATGGVHFYATDEAENNGLIDAFGAL 422 Query: 444 RIGN-EIFERVIRI 456 GN +I ++ +++ Sbjct: 423 ASGNTDISQQSLQL 436 >gi|296269618|ref|YP_003652250.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092405|gb|ADG88357.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 223 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 25/128 (19%), Positives = 38/128 (29%), Gaps = 13/128 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS--W 314 + V S + L + R I V D R+G F+DR S Sbjct: 9 YLVCDESYSMEGPPLDAINQELPEIYREIASNPVVADRARLGIIGFSDRAEVLLPLSDLN 68 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 VH + + G T A TI + + + + L Sbjct: 69 DVHSIPQ---------LAPRGGTNYGAAFALLKSTIEQDVQALKQA--GHRPYRPCVFFL 117 Query: 375 TDGENTQD 382 TDG+ T + Sbjct: 118 TDGQPTYE 125 >gi|146329087|ref|YP_001209787.1| type IV fimbrial tip adhesin [Dichelobacter nodosus VCS1703A] gi|146232557|gb|ABQ13535.1| type IV fimbrial tip adhesin [Dichelobacter nodosus VCS1703A] Length = 1272 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 38/240 (15%), Positives = 73/240 (30%), Gaps = 36/240 (15%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD----YQRDSEGQPLNCFGQPADR 210 S +Q +E+ + K I +ID S SM D + +S + P Sbjct: 52 SLNLQNFSESSGGNGFPAK-----IMLMIDDSGSMADSPMVFGGNSSSLYCPSYSWPVSS 106 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI-- 268 + S Y G S S+ ++++ Sbjct: 107 NGYMNFVNKRNQLGLPLIIPSDEQLAVGSNCYFCMDGTKPTMSSANSVSCSNGHQNLMLI 166 Query: 269 ----------KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 K + R+AL ++R+ + D N++ PS Sbjct: 167 PTPAIPGNEASKFDIARNALVKILRN----KDYKDKFYWSFISLNNKTTQKPSP---YTL 219 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 +I+ + G T A+ ++ + E ++ K YI++L+DG+ Sbjct: 220 DSYSIINNQMSILSPSGGTPSTSRYMDAFTHVVKNME--------HVCQKNYIIVLSDGD 271 >gi|163857470|ref|YP_001631768.1| putative lipoprotein [Bordetella petrii DSM 12804] gi|163261198|emb|CAP43500.1| putative lipoprotein [Bordetella petrii] Length = 582 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 66/211 (31%), Gaps = 27/211 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + VD+S K L++ AL ++ ++ D V G Sbjct: 218 PAANLVFLVDTSGSMAERDKLPLIKGALKQLVAQLRPQDRVAIVTYAGQASMTLD----- 272 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + GST + AY + +K + Sbjct: 273 ----STPGDQKARINAAIDELRAAGSTNGGAGLDLAY------AQAAKGFVKGGVNR--- 319 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L +DG+ + + +K Q GI + T+ A A S Sbjct: 320 ILLASDGDFNVGATDLEDLKDKIARQRQGGIALTTLGVGGGNFNDALAMQLAD--AGNGS 377 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + +S E K+ + ++ ++ I + Sbjct: 378 YHYLDSLREARKV----LAAQMSSTLLTIAR 404 >gi|313212817|emb|CBY36735.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 47/135 (34%), Gaps = 19/135 (14%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D R+ ++ D S + R I+ G T DA+ + Sbjct: 216 ADATRIAIRTYSTNSDLDFSLN---DFKTRNIISEINNLVYASGGTNTADAITKGLNDFG 272 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + V K +V +TDG+++ D+ + A + K+ I + A ++ Sbjct: 273 NDRSESV----------KIMVTITDGQSSYDHVK--AAADLLKADPRNIQSFAIGIDGAN 320 Query: 412 QEKARYFLSNCASPN 426 + L A+ + Sbjct: 321 MAE----LQAIATTD 331 >gi|294997269|ref|NP_001025043.3| integrin alpha-D isoform 1 [Mus musculus] Length = 1202 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 38/89 (42%), Gaps = 3/89 (3%) Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNC 422 AKK ++++TDG+ +D E + +A+ GI I + + + + Sbjct: 290 KSAKKILIVITDGQKFRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGSA 349 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFE 451 S + F+ + L I + +I +IF Sbjct: 350 PSQDHVFKVGNFVALRSI-QRQIQEKIFA 377 >gi|156408868|ref|XP_001642078.1| predicted protein [Nematostella vectensis] gi|156229219|gb|EDO50015.1| predicted protein [Nematostella vectensis] Length = 257 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 38/106 (35%), Gaps = 8/106 (7%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A+ A ++ + ++ I++ TDG++ + + + K + Sbjct: 132 TNTQLALNLAEMIFFNNT------LGPLRPGRRRILIFTDGQSNVKEQMTLYRAFRLKKR 185 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 G+ I +A + + +S + + S + KI + Sbjct: 186 GVEIYVVAVGKYLYGMHEIIGLAT--SSSHHLYRVRSMKDFVKIVQ 229 >gi|74220882|dbj|BAE33629.1| unnamed protein product [Mus musculus] Length = 1202 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 38/89 (42%), Gaps = 3/89 (3%) Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNC 422 AKK ++++TDG+ +D E + +A+ GI I + + + + Sbjct: 290 KSAKKILIVITDGQKFRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGSA 349 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFE 451 S + F+ + L I + +I +IF Sbjct: 350 PSQDHVFKVGNFVALRSI-QRQIQEKIFA 377 >gi|158335198|ref|YP_001516370.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305439|gb|ABW27056.1| von Willebrand factor type A domain protein, putative [Acaryochloris marina MBIC11017] Length = 573 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 32/254 (12%), Positives = 74/254 (29%), Gaps = 29/254 (11%) Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + Q+ L K + + + + +++ Sbjct: 331 ETQQIGTTLGLRPGTPGVALGNKFTQTWGVEAQPKYDSFRVPKPEVVEAMLQSWQTAAKK 390 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 PS S S+ K V+ L + ++ + + V F+ V Sbjct: 391 PSQVVIVVDSSGSMTG--TKLAAVQSTLQTYLKGLGPKEKVT------LIDFDSVVRKPV 442 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 S + + F + G+T + D++ A + + Sbjct: 443 SVDGSPEGQSKGL--EFVVALKADGNTKLYDSILAAQTWLTQNLRPNAINA--------- 491 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQG------IRIMTIAFSVNKTQQEKARYFLSNCA- 423 +++LTDGE++ ++ + + K G I I T+ + + A L A Sbjct: 492 VIVLTDGEDSGSGQQLPQLLSALKKSGFEGEQRIAIFTVGYG---NAGDFAPDVLKQIAE 548 Query: 424 SPNSFFEANSTHEL 437 + ++ + Sbjct: 549 ANGGYYRQGDPASI 562 >gi|91215374|ref|ZP_01252345.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] gi|91186326|gb|EAS72698.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] Length = 689 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 65/188 (34%), Gaps = 21/188 (11%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + +DSS K ++A ++ ++ DN N F++ Sbjct: 268 NANTEVIEKNFVLIIDSSGSMRGGNKMAQAKEASEFIVNNLNIGDNFN------VIDFDN 321 Query: 305 RVISDPS--FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 ++ + + + F + +G+T I++++ TA + + ED+ + Sbjct: 322 NIVLFQPELVEYNIQNSNAALD--FIENIVALGATNISESLVTAINQFEAGAEDKAN--- 376 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT-IAFSVNKTQQEKARYFLSN 421 IV TDG T+ I A+ +I T I ++ L+ Sbjct: 377 -------IIVFFTDGGATEGETNTQNILQLAEDTVNQIETEIFLFTFGIGEDVTTDLLTL 429 Query: 422 CASPNSFF 429 A N+ F Sbjct: 430 LAVQNNGF 437 >gi|261820223|ref|YP_003258329.1| hypothetical protein Pecwa_0904 [Pectobacterium wasabiae WPP163] gi|261604236|gb|ACX86722.1| conserved hypothetical protein [Pectobacterium wasabiae WPP163] Length = 539 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 49/150 (32%), Gaps = 8/150 (5%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I G AL +L +VD + +K+A A + + E Sbjct: 28 FIHQEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQATINGE 87 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 E S N+ + EY+ N + K +++ V V T + + Y V + Sbjct: 88 EYSQEDINTLAY------EYVKSNL--GMNKALSEKLVAGDVSVTEGRNSATRKTYTVTV 139 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + L L + + S +I E Sbjct: 140 AFATKPSLLSLGARAQEVYSTSEVINRPTE 169 >gi|260785816|ref|XP_002587956.1| hypothetical protein BRAFLDRAFT_87344 [Branchiostoma floridae] gi|229273111|gb|EEN43967.1| hypothetical protein BRAFLDRAFT_87344 [Branchiostoma floridae] Length = 1412 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 41/137 (29%), Gaps = 25/137 (18%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 ++ + + S + + I + + G T A+ Sbjct: 103 FQIRVISYTCEAHTYFSLTPITMGMSYEI--EHLMRGDGGGETRTGHAIYH--------- 151 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI---------AF 405 H K E+ V+LTDG Q +++ IA A+ GI + + +F Sbjct: 152 --MRHTSKFGAESHHAAVILTDG---QSDDDAIAEAEDARDAGIDLYAVVVGNFQNRASF 206 Query: 406 SVNKTQQEKARYFLSNC 422 ++ C Sbjct: 207 PAMTNDPDRVFDTSQAC 223 >gi|87199928|ref|YP_497185.1| TadE-like [Novosphingobium aromaticivorans DSM 12444] gi|87135609|gb|ABD26351.1| TadE-like protein [Novosphingobium aromaticivorans DSM 12444] Length = 153 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 2/92 (2%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA-AQTAIIT 63 T+FI + +L G I A+L PV++ G V+V L+ A A+ I Sbjct: 2 TRFIPFLSRLRACRDGAIMIEVAVLTPVLVLFGLGTVEVSSLVARRSELQSALAEAVAIA 61 Query: 64 ASV-PLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 + P QS + ++ T + Sbjct: 62 LASKPDTQSKIDTIESVISASTGVSTANIDTA 93 >gi|149641369|ref|XP_001505343.1| PREDICTED: similar to matrilin-3, partial [Ornithorhynchus anatinus] Length = 354 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 65/173 (37%), Gaps = 13/173 (7%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 F+ SS ++ V+ L+ VI ++ D R+ + V + Sbjct: 153 VFIVDSSRSVRPREFEKVKTFLSQVIDTL---DIGETATRVAVVNYASTVKVEFHLQ--T 207 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H ++ + + T A++TA D + + E N+ K +V++TD Sbjct: 208 HSDKESLKQAVSRIAPLATGTMSGLAIRTAMDEVFTV-EAGARAPAFNIP--KVVVIVTD 264 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 G +E +A +A++ GI I + + + R S ++F+ Sbjct: 265 GRPQDQVQEAVA---QAQASGIEIYAVGVG--RADMQSLRQLASEPVETHAFY 312 >gi|57114202|ref|NP_001009169.1| complement factor B precursor [Pan troglodytes] gi|38502961|sp|Q864W0|CFAB_PANTR RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690185|gb|AAM10004.1| complement factor B precursor [Pan troglodytes] Length = 764 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 60/217 (27%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 263 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATH 317 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V S + + + + D T A+Q Y + ++ Sbjct: 318 PKIWVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDIPP--- 374 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 375 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 432 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 433 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 469 >gi|163849427|ref|YP_001637471.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527431|ref|YP_002571902.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670716|gb|ABY37082.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451310|gb|ACM55576.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 851 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 18/145 (12%), Positives = 42/145 (28%), Gaps = 13/145 (8%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK-----KIDNVNDTVRMGATFF 302 P S + + + ++ ++ ++A+ +I+ D +M +F Sbjct: 445 PFPDDPSCDGSPNFAWTPREERRIYVAKEAVRLLIKQTNMPGNPGYDPTRPIDQMAIVWF 504 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-------GSTAINDAMQTAYDTIISSNE 355 N V G + + G T A ++ Sbjct: 505 NQDVPESRIMLGGFSSDPAALDRAVLDAGKVNNDPYLSQGGTNTAGAF-YRVGQLLGRAP 563 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT 380 ++ ++ I+L+TDG Sbjct: 564 TGTTQLGREWTYRRAIILVTDGLAN 588 >gi|289679370|ref|ZP_06500260.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae FF5] Length = 120 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 32/92 (34%), Gaps = 13/92 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + +R + I +TA+ DA+ A + Sbjct: 42 RVGLILFGTQAFVQAPLTYD-RRTVRVWLDEARIGI-AGKNTALGDAIGLALKRLRMRPA 99 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + +VL+TDG N + I Sbjct: 100 TS-----------RALVLVTDGANNAGQIDPI 120 >gi|188590759|ref|YP_001922423.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] gi|188501040|gb|ACD54176.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] Length = 984 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 20/130 (15%), Positives = 43/130 (33%), Gaps = 18/130 (13%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + E V +S K +++A + IKKI N++ + ++ Sbjct: 84 KCNIPKKEIVLVLDTSGSMKDSKIKKMKNAAMEFVNKIKKIPNLD----IDIVTYSTSGY 139 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + + + G T + ++ A + ++ N A Sbjct: 140 TYLN-----NGNTEEDLLKIINSIKADGGTNTGEGLRKANYIL---------DLEKNKNA 185 Query: 368 KKYIVLLTDG 377 K IV ++DG Sbjct: 186 DKSIVFMSDG 195 >gi|257867801|ref|ZP_05647454.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257874128|ref|ZP_05653781.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] gi|257801884|gb|EEV30787.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257808292|gb|EEV37114.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] Length = 1191 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 47/121 (38%), Gaps = 15/121 (12%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND--RVISDPSFSWGV 316 VD S + + + V+ + + I+ + D+V MG ++ + + G Sbjct: 357 VDWSGSMNEMGRIAEVKKGVDRFLNQIEG-SGIQDSVYMGYVGYSSDGSNYQNKTCQLGK 415 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ +++ E G T ++ A D + + N KK IVLLTD Sbjct: 416 FSEVKETIRSM-TPETAAGGTFTQRGLRQAGDMLSTQNGH-----------KKVIVLLTD 463 Query: 377 G 377 G Sbjct: 464 G 464 >gi|327538509|gb|EGF25172.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 764 Score = 41.1 bits (94), Expect = 0.43, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 52/152 (34%), Gaps = 20/152 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + ++ ++ R+ F++R + + K F G T + A++ Sbjct: 371 LDHLNPNDEF-RV--IAFSNRTTAFQPDAIAATDANIQSAKQFVRGLRASGGTNLLPALK 427 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + ++ +Y+VL+TD ++ + + + Q R+ IA Sbjct: 428 LA---LGGEADESARP--------RYMVLMTD-ALVGNDHSILRYLRQPEFQDARVFPIA 475 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 F Y +S A F T++ Sbjct: 476 FGAAPND-----YLISRAAEMGRGFSMQVTNQ 502 >gi|296209627|ref|XP_002751626.1| PREDICTED: collagen alpha-1(XXVIII) chain [Callithrix jacchus] Length = 1125 Score = 41.1 bits (94), Expect = 0.43, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 59/154 (38%), Gaps = 16/154 (10%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFS 313 F+ SS I +D + + I ++ +++ A F+ V DP FS Sbjct: 50 VFIVDSSESSKIVLFDKQKDFVDGLSDKIFRLTPRRSVEYDIKLAALQFSSSVQIDPPFS 109 Query: 314 -WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 W + + VK+ + T A+ A + K ++ K + Sbjct: 110 SWKDLQTFKQKVKSMNLIGQ---GTFSYYAISNATKLLKREGR------KGSV---KVAL 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 L+TDG + N + +I + A+ GI +TI S Sbjct: 158 LMTDGIDHPKNPDVQSISDDARISGISFITIGLS 191 >gi|282892468|ref|ZP_06300802.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281497750|gb|EFB40114.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 373 Score = 41.1 bits (94), Expect = 0.43, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 55/163 (33%), Gaps = 19/163 (11%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA-MQTAYDTIISSNED 356 G F + +I + K + + T+I A +TA +I+S + Sbjct: 160 GLIGFARTAQVLSPLTLDHQAIIDQLNKFSIVKHQDEDGTSIGYAIFKTA--NLIASTKH 217 Query: 357 EVHRMKNNLEAK---KYIVLLTDG---------ENTQDNEEGIAICNKAKSQGIRIMTIA 404 +K ++++TDG E+ + E AK QG+R+ I Sbjct: 218 FAEELKEASPYTIKNSIMLIVTDGFQDPNPLDREDQYRSIELEDAAKYAKEQGVRVYIIN 277 Query: 405 FSVNKTQQEKA--RYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +E R + + F+ + EL I+ D Sbjct: 278 VEPRIASEEFGSERRVMQKVTEITGGKFYLLDHIEELKNIYAD 320 >gi|328947150|ref|YP_004364487.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328447474|gb|AEB13190.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 333 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 17/71 (23%), Positives = 25/71 (35%), Gaps = 11/71 (15%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + +A + S KK IVL+TDGEN A + Sbjct: 170 GTAIGTGLSSAIYHLEKSKS-----------PKKSIVLITDGENNSGAVHPHTAARLAVN 218 Query: 396 QGIRIMTIAFS 406 + I + + Sbjct: 219 KDISLYILGVG 229 >gi|260581678|ref|ZP_05849475.1| tellurium resistance protein [Haemophilus influenzae NT127] gi|260095271|gb|EEW79162.1| tellurium resistance protein [Haemophilus influenzae NT127] Length = 212 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 54/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR+ L ++ ++++ +T + F+ + + ++ Sbjct: 21 IESVRNGLQMLVSALRQDPYALETAYLSVITFDSKAKQVMPLT--------ELMSFQLPT 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T++ +A+ D I + +K N + + LL+DG T D ++GI Sbjct: 73 IEASGLTSMGEALSLLTDCINREVQKGSAEVKGNWKP--VVFLLSDGVPTDDLQKGINAL 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K T F L + + + F+ Sbjct: 131 RTVK-------TGTFVACAAGAGADTNVLKQITESVVSLDTADANSIKAFFK 175 >gi|308472823|ref|XP_003098638.1| hypothetical protein CRE_04225 [Caenorhabditis remanei] gi|308268238|gb|EFP12191.1| hypothetical protein CRE_04225 [Caenorhabditis remanei] Length = 417 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 58/156 (37%), Gaps = 12/156 (7%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA--MQTAYDTII 351 T R+G +N D L F + + +T A + A + + Sbjct: 83 TTRVGLVTYNSGAKLDADL-NKFQDLDGLYNGVFKDLSDVVDTTDSYLATGLNAAEELLQ 141 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S + + KK I++ + + + N+ K G+ I+T+A+ ++ Sbjct: 142 SQSLNT-----TRDHYKKVIIVYASEYKGSGELDPVPVANRLKGSGVVIVTVAY--DQGG 194 Query: 412 QEKARYFLSNCASPN-SFFEA-NSTHELNKIFRDRI 445 E L+N ASP ++ A N+ L +D + Sbjct: 195 DEGLLRDLANIASPGFAYSNAPNNAGNLVGQIQDSL 230 >gi|118092778|ref|XP_421647.2| PREDICTED: similar to tumor endothelial marker 8 [Gallus gallus] Length = 555 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 51/150 (34%), Gaps = 18/150 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 S S+R+ + + ++LA ++ +RM F+ R + + Sbjct: 52 LDKSGSVRNHWTEIYSFVESLAEKF--------ISPMLRMSFIVFSSRGTTIMKLTENRE 103 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + R + E G T +++ + A + I H + I+ LTDG Sbjct: 104 AIRRGL--EILQYEVPGGDTFMHEGFKRANEQIY-------HETYGGVRTASVIIALTDG 154 Query: 378 ENTQDN-EEGIAICNKAKSQGIRIMTIAFS 406 E N+A+S G + + Sbjct: 155 ELQDVQFYYAEQEANRARSFGAIVYCVGVK 184 >gi|85710455|ref|ZP_01041519.1| putative secreted protein [Erythrobacter sp. NAP1] gi|85687633|gb|EAQ27638.1| putative secreted protein [Erythrobacter sp. NAP1] Length = 576 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 41/277 (14%), Positives = 84/277 (30%), Gaps = 40/277 (14%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + M++Y R +P + + N + L Y + + Sbjct: 162 RTEEMINYFRYDYARPTTRDVPFTTNIDVAKTPWNEDTYLMRIGLRGYDIERD------- 214 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P + +D S K LV+ AL+ + + + D V+ V GA Sbjct: 215 ----ERPPANLVFLMDVSGSMGSPDKLPLVQTALSGLAGELGEQDRVSIVVYAGAAG--- 267 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + K+ ++ A +Q AY+ Sbjct: 268 ---LVLEPTNDTAKIRAALMSLSAGGSTAG-----GAGIQLAYNI-------AEDNFIEG 312 Query: 365 LEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + ++L TDG+ D + + + K + +GI + T+ F + + Sbjct: 313 GVNR--VILATDGDFNVGVSDRDALVEMVEKNRDRGITLTTLGFGTGNFNEA----MMEQ 366 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A+ + A L + + +E+ + I K Sbjct: 367 IANKGNGNYAYIDSALEA--KKVLSDEMSSTLFTIAK 401 >gi|254179953|ref|ZP_04886552.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184210493|gb|EDU07536.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 602 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 75 V 75 Sbjct: 79 P 79 >gi|163755380|ref|ZP_02162500.1| von Willebrand factor type A like domain [Kordia algicida OT-1] gi|161324800|gb|EDP96129.1| von Willebrand factor type A like domain [Kordia algicida OT-1] Length = 718 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 65/400 (16%), Positives = 134/400 (33%), Gaps = 54/400 (13%) Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + + + ++E + + N V +I+++ V ++ + V L+ Y Sbjct: 99 EKQEAQRVYDKALKEGKRAAKLDQHRPNVFQMNVGNIMKNDLVTIDIYYTEMLVPLAGNY 158 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 + + +G + T E K G + + +D + D Sbjct: 159 EFVFPG------VVGPRFTGENTSGETVFNQPYTEKGVGDTFNYNLDVQINAGIPIADVS 212 Query: 198 GQPLN--CFGQPADRTVKSYSSQNGKVGIRD-------------EKLSPYMVSCNKSLYY 242 N + S +S+N RD L Y + K Y Sbjct: 213 SNTHNIKVHYPNTRKAEISLTSENKNPSNRDFILKYSMRGNEIQSGLLLYEENGEKFFAY 272 Query: 243 MLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 M+ P ++ E FV S + + L +++ ++ + D+ N + G Sbjct: 273 MMEPPKASVNIKLTAKEYLFVVDVSGSMNGYPMEVSKKLLRNLLVNLPETDHYNILLFAG 332 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 V++ + + + I F +E G T + +A++TAY + ++ Sbjct: 333 ----GSSVLAPEPLACTKENIQKGIN--FLTNERGGGGTRLLNALKTAY-ALPRMDKTSA 385 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + +V++TDG + + + I + I VN RY Sbjct: 386 ----------RSMVVITDGYVSVERKAFEMIEQNLGQANVFTFGIGSGVN-------RYL 428 Query: 419 LSNCA--SPNSFFEANSTHELNKI---FRDRIGNEIFERV 453 L A S + F A +E N + FR+ I + + ++ Sbjct: 429 LEGMAKISNSETFIATEMNEANDVAEKFRNYIKSPLLTQI 468 >gi|319792023|ref|YP_004153663.1| von willebrand factor type a [Variovorax paradoxus EPS] gi|315594486|gb|ADU35552.1| von Willebrand factor type A [Variovorax paradoxus EPS] Length = 345 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 64/204 (31%), Gaps = 38/204 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A S I+ + + TV++G F + L+ I Sbjct: 107 RLVAAQEAAKSFIKDLPR------TVKVGIVAFAGSAQVAQLPTTNHDDLVTAIDSFQLQ 160 Query: 330 DENEMGS-------TAINDA------MQTAYDTIISSNEDEVHRMKNNLEA------KKY 370 G+ T DA + + ++K+ Sbjct: 161 RATATGNAIVVSLATLFPDAGIDVEQFSAPSRQRGTPIDQTEKKLKDFTPVAPGSFTSAA 220 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIM----------TIAFSVNKTQQEKARYFLS 420 I++LTDG+ T + + A +G+RI TI F + L Sbjct: 221 IIMLTDGQRTTG-VDPLDAAKAAADRGVRIYTVGVGTVDGETIGFEGWSMRVRLDEETLK 279 Query: 421 NCA--SPNSFFEANSTHELNKIFR 442 A + +F A + ++L K++ Sbjct: 280 AVANKTNAEYFYAGTANDLKKVYE 303 >gi|312068806|ref|XP_003137386.1| CUTiclin-Like family member [Loa loa] gi|307767445|gb|EFO26679.1| CUTiclin-Like family member [Loa loa] Length = 691 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 47/151 (31%), Gaps = 15/151 (9%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND-AMQTAYDTIISSNE 355 + ++ + F T V + T A+ AY ++ ++ Sbjct: 4 LALITYSGQAYVHFKF-NDPQIGNNTAVIGHLNALKSIKGTTSTHIALHQAYKLLMDTDS 62 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 R KK I++ TDG + + + K++ + I I + E Sbjct: 63 GNGAR----EGVKKMIIIFTDG---HSQQSPQDMALRLKNESVEIFAITLTPAPYADEG- 114 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 LS + + F T + F IG Sbjct: 115 -ELLSITQNTDHIF----TPDFETKFLPYIG 140 >gi|302632554|ref|NP_001181863.1| integrin alpha-M [Pan troglodytes] Length = 1153 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 47/124 (37%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 219 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 272 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 273 PEADREGVIRYVIGVG-DAFHSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 330 Query: 448 EIFE 451 +IF Sbjct: 331 KIFA 334 >gi|134282290|ref|ZP_01768995.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246328|gb|EBA46417.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 602 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 75 V 75 Sbjct: 79 P 79 >gi|33340728|gb|AAQ14925.1| Mac-1 alpha subunit [Pan troglodytes] Length = 1144 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 47/124 (37%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 210 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 263 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 264 PEADREGVIRYVIGVG-DAFHSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 321 Query: 448 EIFE 451 +IF Sbjct: 322 KIFA 325 >gi|299136327|ref|ZP_07029511.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298602451|gb|EFI58605.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 376 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 46/304 (15%), Positives = 106/304 (34%), Gaps = 35/304 (11%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + T + T S + + ++Q D +D L+ D Sbjct: 62 PTQPNSTSPTSPTGSTAAPQSSTPTLQPRSD------QDAQDGTTFTLHRSVNEVDLIFT 115 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 V ++ + + + + + L +D+SS + Sbjct: 116 VMGKDGHFVSNLQQQNFGLLDDGRPPQHVLRFAQQTNLPLRVGIMLDTSSSIRQRFEFEQ 175 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A +++ + D R F+ ++ F+ ++ T Sbjct: 176 -QAATDFLLQVLHPAD------RAFVEGFDVQINIAQDFTN-----RIDMLDTGIRRLRP 223 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAIC 390 G TA+ D ++ + +D++ ++ + E +K IVL++DG++ E I +C Sbjct: 224 GGGTALFD-------SLYRTCKDQMLTLQQDAEVRKAIVLVSDGDDDYSRVLETEAIKMC 276 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNE 448 +A++ + TI+ +V ++ +K L A+ F ++ FR+ I E Sbjct: 277 QRAET---IVYTISTNVGPSR-DKGDDVLQQISDATGGQSFYPQRIDDVAIGFRN-IEEE 331 Query: 449 IFER 452 + + Sbjct: 332 LRSQ 335 >gi|326435505|gb|EGD81075.1| hypothetical protein PTSG_11020 [Salpingoeca sp. ATCC 50818] Length = 552 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 56/182 (30%), Gaps = 22/182 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L + L +I+++ + D+ MG ++ V + + RT + Sbjct: 80 SKLDLAKATLEFLIKNLSQTDH------MGLVVYHSDVSVAFPLTRMDAEGKRTATAALS 133 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T ++ + + + ++L+TDG + Sbjct: 134 T-LRAQRCTNLSGGLFKGIEMMQGRERSAASVSS--------VLLMTDGIANEGVRGPNL 184 Query: 389 ICNKAKSQG----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I + G + T + N ++E + ++ + + + F D Sbjct: 185 ITATRQLMGDNPSYSLYTFGYGSNH-EEELLKDLSE--VGNGMYYYIENNDTIPESFGDC 241 Query: 445 IG 446 +G Sbjct: 242 LG 243 >gi|325473816|gb|EGC77004.1| BatA protein [Treponema denticola F0402] Length = 282 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 48/188 (25%), Gaps = 20/188 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + + + G T + + + + Sbjct: 61 TRIAAAKKIIRKFVAKY----PGDSF---GLTALSSSAALILPPTIDHKVFLSRLDSLSI 113 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + + + A+ +AY T N YIVLLTDGEN Sbjct: 114 GELGDGTAIGMGLAVSSAYMTRTKLNSS-------------YIVLLTDGENNTGEINPKT 160 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 ++ I I + + + S F EL KI + G Sbjct: 161 AAEVLVNKNIGFYVIGIGSSGYTTLEYTDRKTGKTYSGSIFSKFDEVELKKIAQYGNGKY 220 Query: 449 IFERVIRI 456 I Sbjct: 221 ASASSPEI 228 >gi|171914502|ref|ZP_02929972.1| von Willebrand factor type A domain protein [Verrucomicrobium spinosum DSM 4136] Length = 424 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 57/174 (32%), Gaps = 28/174 (16%) Query: 296 RMGA------TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+GA ++D V R VK GSTA+ + + Sbjct: 70 RLGAGDMVSVVAYDDAVSLISP---ATDLTDRDRVKAAIDRIQAGGSTALFSGISKGAEE 126 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI----CNKAKSQGIRIMTIAF 405 + R K + + +VLL+DG + + AK GI + T+ Sbjct: 127 L--------RRNKRPNQVNR-VVLLSDGMANVGPSSPQDLGRLGASLAKE-GITVTTLGL 176 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN--EIFERVIRIT 457 + + L S + ++ L IF+ G+ + + IR+ Sbjct: 177 GLGYNEDLMTELALR---SDGNHAFIENSQNLAGIFQTEFGDILSVVAQRIRVR 227 >gi|167894484|ref|ZP_02481886.1| hypothetical protein Bpse7_12104 [Burkholderia pseudomallei 7894] Length = 587 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 5 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 63 Query: 75 V 75 Sbjct: 64 P 64 >gi|326922309|ref|XP_003207392.1| PREDICTED: collagen alpha-1(VI) chain-like [Meleagris gallopavo] Length = 998 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 61/145 (42%), Gaps = 20/145 (13%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNED 356 GA ++D V+ + R +K N +G T + A++ + ++ Sbjct: 89 GALHYSDSVVIIKDLTAMPSG--RAELKNSVSAINYIGKGTHTDCAIKQGIERLLVGG-- 144 Query: 357 EVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIA-ICNKAKSQGIRIMTIAFSVNKTQQ 412 ++L+ KY++++TDG E ++ G+ N+AK GI++ ++A S Sbjct: 145 ------SHLKENKYLIVVTDGHPLEGYKEPCGGLDDAANEAKHLGIKVFSVAIS-----P 193 Query: 413 EKARYFLSNCASPNSFFEANSTHEL 437 L+ A+ +++ + L Sbjct: 194 HHLDQRLNIIATDHAYRRNFTATSL 218 >gi|126451635|ref|YP_001066065.1| hypothetical protein BURPS1106A_1796 [Burkholderia pseudomallei 1106a] gi|242315320|ref|ZP_04814336.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|126225277|gb|ABN88817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|242138559|gb|EES24961.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 602 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 75 V 75 Sbjct: 79 P 79 >gi|126440828|ref|YP_001058814.1| hypothetical protein BURPS668_1775 [Burkholderia pseudomallei 668] gi|126220321|gb|ABN83827.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 602 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 75 V 75 Sbjct: 79 P 79 >gi|332246079|ref|XP_003272177.1| PREDICTED: complement factor B-like [Nomascus leucogenys] Length = 764 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 60/217 (27%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 263 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 317 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V S + + + K D T A+Q Y + ++ Sbjct: 318 PRIWVKVSEQDSSNADWVTKQLNKINYEDHKLKSGTNTKKALQAVYSMMSWPDDIPP--- 374 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 375 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 432 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 433 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFFQMI 469 >gi|332288899|ref|YP_004419751.1| hypothetical protein UMN179_00824 [Gallibacterium anatis UMN179] gi|330431795|gb|AEC16854.1| conserved hypothetical protein [Gallibacterium anatis UMN179] Length = 212 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 55/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR+ L ++ ++++ +T + F+ + ++ D Sbjct: 21 IEAVRNGLQMLVSALRQDPYALETAYLSVITFDTDAKQVTPLT--------ELMSFQMPD 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G TA+ +A+ D I + +K + + + LL+DG T D ++GI Sbjct: 73 IQASGVTAMGEALSLLVDCINREVQKGSAEVKGDWKP--VVFLLSDGLPTDDLQKGINAI 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + K T F L + + + F+ Sbjct: 131 RQVK-------TGTFVACAAGAGADTNVLKQITETVVSLDTADANSIKAFFK 175 >gi|156975610|ref|YP_001446517.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156527204|gb|ABU72290.1| hypothetical protein VIBHAR_03343 [Vibrio harveyi ATCC BAA-1116] Length = 502 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 56/177 (31%), Gaps = 19/177 (10%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 +S G + AL + + G+ V+ R+ L+ AAQTA + + Sbjct: 15 RSQKGIAAVWFALSLVPVFGMTFFAVEGTRYIQETSRLRDAAQTAALA---------ITI 65 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ + I +Y+ ++ + VR T N K Y V + Sbjct: 66 DDKSNQADALATMYINDYVRDISHVDI------QTVRTYEEPTEDNDNTEKIQYSVQAVT 119 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + N ++ Y + + + V DFS SM + Sbjct: 120 TH----NSWFASNSIPSFETQEKLAGQAVAAKYPFYLGDKIIDLVLVTDFSGSMNNS 172 >gi|228471033|ref|ZP_04055877.1| BatB protein [Porphyromonas uenonis 60-3] gi|228307253|gb|EEK16276.1| BatB protein [Porphyromonas uenonis 60-3] Length = 342 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 37/125 (29%), Gaps = 15/125 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F + + RT + T + A+ E Sbjct: 132 RIAMIVFAGGAYIRLPLTPDLP-TARTFLADIQPGMVSNQGTNLGQAL-----------E 179 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-QEK 414 + A K ++LLTDGE+ + E + K QGI+ + + + Sbjct: 180 RSAQALSAPSRAGKAVILLTDGEDHEGGLE--EGIKRLKEQGIKAYVVTIGLPEGATIPI 237 Query: 415 ARYFL 419 L Sbjct: 238 GETLL 242 >gi|260786070|ref|XP_002588082.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] gi|229273239|gb|EEN44093.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] Length = 528 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 38/122 (31%), Gaps = 15/122 (12%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D R+G ++DR + V+K + V + G T A++ Sbjct: 22 VNTFDISPTATRVGVVQYSDRNSLVFNLGNKVNK--PSTVSAINGISYQSGGTNTGAALK 79 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + K I++LTDG+++ + G+ + I Sbjct: 80 YV----------RQYAAWREGNVPKVIIVLTDGKSSDSVSGPSR---DLVAAGVEVYAIG 126 Query: 405 FS 406 Sbjct: 127 VG 128 >gi|94498563|ref|ZP_01305118.1| TadE-like protein [Sphingomonas sp. SKA58] gi|94422006|gb|EAT07052.1| TadE-like protein [Sphingomonas sp. SKA58] Length = 193 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 15/122 (12%), Positives = 40/122 (32%), Gaps = 2/122 (1%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA-AQTAIITASVPL 68 + L K TG + A+++P +L + +++ L+ A + A I+ + Sbjct: 3 HFAPLAKDRTGVSTVEFAIILPALLTLMCGAIELGHMLLARVVLEGAMTEAARISTASLE 62 Query: 69 IQSLEEVSSRAKNSFTFPKQ-KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + ++ + + + + + NF+ A Sbjct: 63 TAEAQRTTLMEESIEQAMGNFPLADGAHVSVQTIVYGNFSSAHPETYEDANANGHYDLGE 122 Query: 128 AY 129 +Y Sbjct: 123 SY 124 >gi|315150339|gb|EFT94355.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0012] Length = 1103 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSSITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|118617151|ref|YP_905483.1| hypothetical protein MUL_1490 [Mycobacterium ulcerans Agy99] gi|166979868|sp|A0PNU3|Y1490_MYCUA RecName: Full=UPF0353 protein MUL_1490 gi|118569261|gb|ABL04012.1| membrane protein [Mycobacterium ulcerans Agy99] Length = 335 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 59/180 (32%), Gaps = 27/180 (15%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA +A+ TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNREATKAALDKLQFADR-----TATGEAIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E + A S + + A + ELN ++ + +IG E Sbjct: 242 ISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELNSVYVSLQQQIGYE 301 >gi|49225581|ref|NP_990438.1| collagen alpha-1(VI) chain precursor [Gallus gallus] gi|115314|sp|P20785|CO6A1_CHICK RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|62875|emb|CAA41062.1| collagen alpha 1 type VI [Gallus gallus] gi|63302|emb|CAA45788.1| collagen type VI alpha 1 subunit [Gallus gallus] gi|211354|gb|AAB59954.1| alpha-1 type VI collagen precursor [Gallus gallus] Length = 1019 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 61/145 (42%), Gaps = 20/145 (13%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNED 356 GA ++D V+ + R +K N +G T + A++ + ++ Sbjct: 89 GALHYSDSVVIIKDLTAMPSG--RAELKNSVSAINYIGKGTHTDCAIKQGIERLLLGG-- 144 Query: 357 EVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIA-ICNKAKSQGIRIMTIAFSVNKTQQ 412 ++L+ KY++++TDG E ++ G+ N+AK GI++ ++A S Sbjct: 145 ------SHLKENKYLIVVTDGHPLEGYKEPCGGLDDAANEAKHLGIKVFSVAIS-----P 193 Query: 413 EKARYFLSNCASPNSFFEANSTHEL 437 L+ A+ +++ + L Sbjct: 194 HHLDQRLNIIATDHAYRRNFTATSL 218 >gi|300869833|ref|YP_003784704.1| hypothetical protein BP951000_0196 [Brachyspira pilosicoli 95/1000] gi|300687532|gb|ADK30203.1| putative membrane protein containing von Willebrand factor (vWA) type A domain, BatB [Brachyspira pilosicoli 95/1000] Length = 338 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 40/116 (34%), Gaps = 14/116 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D + + F F+ + I+ + T I DA+ TA +T Sbjct: 125 NTDNLSVALVGFAGTSFVASPFTQDME-TFSYILNELNTKSVTLQGTRIADALVTAKNTF 183 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 N+ KK I+L+TDGE+ + I + K I + T+ Sbjct: 184 NV-----------NIPGKKSIILITDGEDHAGYFD--NILKELKDNDISVYTVGVG 226 >gi|239781743|pdb|2WIN|I Chain I, C3 Convertase (C3bbb) Stabilized By Scin gi|239781744|pdb|2WIN|J Chain J, C3 Convertase (C3bbb) Stabilized By Scin gi|239781745|pdb|2WIN|K Chain K, C3 Convertase (C3bbb) Stabilized By Scin gi|239781746|pdb|2WIN|L Chain L, C3 Convertase (C3bbb) Stabilized By Scin Length = 507 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 4 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 58 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T A+Q Y + ++ Sbjct: 59 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 115 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 116 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 173 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 174 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 210 >gi|194211145|ref|XP_001494734.2| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 1022 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 53/164 (32%), Gaps = 23/164 (14%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ R + ++ G T+I +++A+ + Sbjct: 345 GMVHFDSTAYIKSKLIQITSSNERNKL-LESLPTAASGGTSICRGIKSAFQVLTG----- 398 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 IVLLTDGE+ C ++ + G I IA + ++A Sbjct: 399 ----TYPQIDGSEIVLLTDGEDNTAGS-----CVDEVRQSGAIIHFIALGPSA---DQAV 446 Query: 417 YFLSNCASPNSFFEANSTHE--LNKIFRDRIG--NEIFERVIRI 456 +S + ++ L F + ++ ++ +++ Sbjct: 447 IEMSTITGGKHKYASDEAANNGLIDAFAALVSGNADLSQQSLQL 490 >gi|168983786|emb|CAQ06837.1| complement factor B [Homo sapiens] gi|168984885|emb|CAQ08426.1| complement factor B [Homo sapiens] Length = 589 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 263 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 317 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T A+Q Y + ++ Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 374 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 375 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 432 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 433 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 469 >gi|134105218|pdb|2OK5|A Chain A, Human Complement Factor B Length = 752 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 248 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 302 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T A+Q Y + ++ Sbjct: 303 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 359 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 360 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 417 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 418 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 454 >gi|57209925|emb|CAI41860.1| complement factor B [Homo sapiens] Length = 764 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 263 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 317 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T A+Q Y + ++ Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 374 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 375 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 432 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 433 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 469 >gi|13278732|gb|AAH04143.1| Complement factor B [Homo sapiens] gi|14124934|gb|AAH07990.1| Complement factor B [Homo sapiens] gi|62898361|dbj|BAD97120.1| complement factor B preproprotein variant [Homo sapiens] gi|119623955|gb|EAX03550.1| complement factor B [Homo sapiens] gi|123982996|gb|ABM83239.1| complement factor B [synthetic construct] gi|123997681|gb|ABM86442.1| complement factor B [synthetic construct] gi|307685187|dbj|BAJ20524.1| complement factor B [synthetic construct] Length = 764 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 263 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 317 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T A+Q Y + ++ Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 374 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 375 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 432 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 433 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 469 >gi|13560705|gb|AAK30167.1|AF349679_1 factor B [Homo sapiens] Length = 621 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 263 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 317 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T A+Q Y + ++ Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 374 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 375 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 432 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 433 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 469 >gi|291922|gb|AAA16820.1| complement factor B [Homo sapiens] gi|2347133|gb|AAB67977.1| complement factor B [Homo sapiens] Length = 764 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 263 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 317 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T A+Q Y + ++ Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 374 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 375 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 432 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 433 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 469 >gi|67782358|ref|NP_001701.2| complement factor B preproprotein [Homo sapiens] gi|584908|sp|P00751|CFAB_HUMAN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=Glycine-rich beta glycoprotein; Short=GBG; AltName: Full=PBF2; AltName: Full=Properdin factor B; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|4261689|gb|AAD13989.1|S67310_1 complement factor B [Homo sapiens] gi|297569|emb|CAA51389.1| complement factor B [Homo sapiens] gi|25070931|gb|AAN71991.1| B-factor, properdin [Homo sapiens] gi|55961819|emb|CAI17456.1| complement factor B [Homo sapiens] gi|123857994|emb|CAM25864.1| complement factor B [Homo sapiens] gi|168984418|emb|CAQ09274.1| complement factor B [Homo sapiens] gi|168985079|emb|CAQ07483.1| complement factor B [Homo sapiens] gi|168985957|emb|CAQ07113.1| complement factor B [Homo sapiens] Length = 764 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 30/217 (13%) Query: 249 LDPSLSEE-HFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDR 305 LDPS S + V S + L ++I + + R G + Sbjct: 263 LDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP-----RYGLVTYATY 317 Query: 306 ----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V + S + + + + D T A+Q Y + ++ Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPP--- 374 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------------QGIRIMTIAFSVN 408 + + I+L+TDG + + I + ++ + + + Sbjct: 375 EGWNRTRHVIILMTDGLHNMGG-DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-P 432 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q S + F+ L +F I Sbjct: 433 LVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMI 469 >gi|121599267|ref|YP_993091.1| hypothetical protein BMASAVP1_A1771 [Burkholderia mallei SAVP1] gi|124385206|ref|YP_001026132.1| hypothetical protein BMA10229_A0124 [Burkholderia mallei NCTC 10229] gi|126450090|ref|YP_001080596.1| hypothetical protein BMA10247_1041 [Burkholderia mallei NCTC 10247] gi|167002573|ref|ZP_02268363.1| conserved hypothetical protein [Burkholderia mallei PRL-20] gi|217423806|ref|ZP_03455307.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238563902|ref|ZP_00438064.2| membrane protein [Burkholderia mallei GB8 horse 4] gi|254177770|ref|ZP_04884425.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|254206222|ref|ZP_04912574.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|121228077|gb|ABM50595.1| conserved hypothetical protein [Burkholderia mallei SAVP1] gi|124293226|gb|ABN02495.1| conserved hypothetical protein [Burkholderia mallei NCTC 10229] gi|126242960|gb|ABO06053.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247] gi|147753665|gb|EDK60730.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|160698809|gb|EDP88779.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|217393664|gb|EEC33685.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238519717|gb|EEP83185.1| membrane protein [Burkholderia mallei GB8 horse 4] gi|243061787|gb|EES43973.1| conserved hypothetical protein [Burkholderia mallei PRL-20] Length = 602 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 1/61 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 75 V 75 Sbjct: 79 P 79 >gi|156742365|ref|YP_001432494.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233693|gb|ABU58476.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 412 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 62/189 (32%), Gaps = 37/189 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +++A VI ++ D V+ F+D V + ++ R +K Sbjct: 58 KLAALKEATRRVIDTLTPQDIVS------IVLFDDTVQTLVPATFATD---RDALKAQVD 108 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G TA++ M E H + A ++LLTDG+ D + A+ Sbjct: 109 AIEEAGGTAMSGGMAA------GIVELRKHHDPGRVSA---MLLLTDGQTWGDEDRCRAL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ------EKARYFLSNCASPNS---FFE---------- 430 + +R+ + ++ + A P+ FF+ Sbjct: 160 AQELARDHVRVTALGLGAEWNEKLLDDIADATGGLSDYIADPSQITTFFQHAVRMAQGTI 219 Query: 431 ANSTHELNK 439 A L + Sbjct: 220 AQDARLLLR 228 >gi|167045536|gb|ABZ10188.1| putative von Willebrand factor type A domain protein [uncultured marine microorganism HF4000_APKG10H12] Length = 356 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 20/150 (13%), Positives = 46/150 (30%), Gaps = 23/150 (15%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWGVHKLI 320 +S +K + AL + + ++ F+ + W V ++ Sbjct: 104 TSESMDGQKMAAAQRALDRFL--FDLLGPDDEIF---LYRFD---YTPELLQDWTVDRIR 155 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + D G+TA+ DA+ + + KK +++++DG + Sbjct: 156 ---LSRAIRDIRPRGNTALYDAVAESVPRVA-----------GGQHFKKALLIISDGNDN 201 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + + I I T Sbjct: 202 NSETDVRELRELIRESEALIYAIGIDGPST 231 >gi|239817564|ref|YP_002946474.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] gi|239804141|gb|ACS21208.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] Length = 1867 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 74/225 (32%), Gaps = 45/225 (20%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P+D +L + S + + ++A+ ++I + VR+ F+ Sbjct: 1418 APIDTNLMVILDLSGSMGQETPTRLSRAKEAIQNLIDGYDLYGD----VRVQLVTFST-- 1471 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE-DEVHRMKNNL 365 +W + +V+ + GST + A+ A + ++ + D + Sbjct: 1472 TGASQQAWMTAAEAKALVQ----NLQAAGSTNYDAALAAAMNGFSATGKLDGAQNVS--- 1524 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICN----KAKSQGI---------------RIMTIAFS 406 LTDGE T + + N +GI +I + A Sbjct: 1525 ------YFLTDGEPTLGDGNTAQLANSSNSSTADRGIQAGEEAIWTNFLNTHQINSFALG 1578 Query: 407 VNKTQQEKARYFLSNCASPNSFFE------ANSTHELNKIFRDRI 445 + + +A+ F+ A + T +LN + + I Sbjct: 1579 LGSSLNAEAQAFIDPIAYNGNTGANTNGQIITDTSQLNDVLQGTI 1623 >gi|37676262|ref|NP_936658.1| hypothetical protein VVA0602 [Vibrio vulnificus YJ016] gi|37200803|dbj|BAC96628.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 442 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 63/199 (31%), Gaps = 16/199 (8%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 IK G II ++P+++ + + + + + +AA+ A + + E+ Sbjct: 19 IKKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLALIASPKEGDEK 78 Query: 75 VSSRAKNSFTFP-KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 A+ E + R F + + + D V+ + + Sbjct: 79 NQEYAQKIVDHYIPDNKGEVVARVF--HRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDS 136 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 Y+ L+ MG T + + I ++ID S SM++ Sbjct: 137 WISYNEGEMGLTKDFEVMG-------------TSTSRKFLPQPLDIYFIIDMSGSMVNPW 183 Query: 194 RDSEGQPLNCFGQPADRTV 212 S + +R V Sbjct: 184 GGSGKTKYDVVADTINRIV 202 >gi|324514578|gb|ADY45916.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 432 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 61/156 (39%), Gaps = 11/156 (7%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+ F+ S F+ + + I++ E+ G+TA+ + ++ A Sbjct: 278 FTRVAFITFSSVGKSRTHFNLNRYDNAQQIIEAIRRVESTGGTTAVGEGIRIATQQ---- 333 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF----SVNK 409 +++ + AKK +++ TDG + + + + +AK+ G + +I + + Sbjct: 334 -QEKRMGGRPISIAKKAMLIFTDGWSNKGP-DPEEMSKEAKAAGFVLYSIGYEGNGRPDA 391 Query: 410 TQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDR 444 +Y + A + + EL + R R Sbjct: 392 EFAGLNQYTMDAIADTMHHVYSERNFSELVEELRRR 427 >gi|238750905|ref|ZP_04612402.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] gi|238710819|gb|EEQ03040.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] Length = 520 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 29/263 (11%), Positives = 67/263 (25%), Gaps = 23/263 (8%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + IK+ G + +P+ +G+ + ++ + + L A + A + +V Sbjct: 14 FQFFIKNENGTILMSFIFFLPIFIGLIFLSFEISCFIQKKAKLSDAMEQATLALTVENNN 73 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 E+ + N + + + + + +N S Sbjct: 74 IPSS----------------EQEVKNNILISSFAH--AYLPEETFSEPVITINSSASHMD 115 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 L+ + I + A A+ + + +V D+S SM Sbjct: 116 YHADITMSYPAKFLNKAFNLISISDIKLDESAIAKKNT--SITAIPTDVVFVTDYSGSMN 173 Query: 191 ---DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 D R K+ ++ + + K Sbjct: 174 RDFDGTDIDSTDISKVRIVALRRIFKNLHNEIQQNENINLVGFVPFTWGTKRTIDNNNTT 233 Query: 248 PLDPSLSEEHFVDSSSLRHVIKK 270 P S + + K Sbjct: 234 PTLLCHFPFVPKKHSPDGNYLTK 256 >gi|30687725|ref|NP_850306.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|330254526|gb|AEC09620.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 692 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 40/113 (35%), Gaps = 16/113 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R Sbjct: 267 TKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFPLT-KMSDAGRQRALQAV 319 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 G T I + ++ + R KN + + I+LL+DG +T Sbjct: 320 NSVVANGGTNIAEGLRKGVKVMED------RRDKNPVAS---IILLSDGRDTY 363 >gi|3928084|gb|AAC79610.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 689 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 40/113 (35%), Gaps = 16/113 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R Sbjct: 264 TKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFPLT-KMSDAGRQRALQAV 316 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 G T I + ++ + R KN + + I+LL+DG +T Sbjct: 317 NSVVANGGTNIAEGLRKGVKVMED------RRDKNPVAS---IILLSDGRDTY 360 >gi|157838288|pdb|1BHQ|1 Chain 1, Mac-1 I Domain Cadmium Complex gi|157838289|pdb|1BHQ|2 Chain 2, Mac-1 I Domain Cadmium Complex Length = 189 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 71 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 124 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 125 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 182 Query: 448 EIFE 451 +IF Sbjct: 183 KIFA 186 >gi|119572528|gb|EAW52143.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_a [Homo sapiens] Length = 1153 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 219 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 272 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 273 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 330 Query: 448 EIFE 451 +IF Sbjct: 331 KIFA 334 >gi|119572529|gb|EAW52144.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_b [Homo sapiens] Length = 1152 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 219 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 272 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 273 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 330 Query: 448 EIFE 451 +IF Sbjct: 331 KIFA 334 >gi|64654539|gb|AAH96347.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 219 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 272 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 273 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 330 Query: 448 EIFE 451 +IF Sbjct: 331 KIFA 334 >gi|64654595|gb|AAH96346.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 219 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 272 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 273 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 330 Query: 448 EIFE 451 +IF Sbjct: 331 KIFA 334 >gi|253722212|pdb|1IDN|1 Chain 1, Mac-1 I Domain Metal Free gi|253722213|pdb|1IDN|2 Chain 2, Mac-1 I Domain Metal Free gi|313507148|pdb|1BHO|1 Chain 1, Mac-1 I Domain Magnesium Complex gi|313507149|pdb|1BHO|2 Chain 2, Mac-1 I Domain Magnesium Complex Length = 190 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 72 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 125 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 126 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 183 Query: 448 EIFE 451 +IF Sbjct: 184 KIFA 187 >gi|224831239|ref|NP_001139280.1| integrin alpha-M isoform 1 precursor [Homo sapiens] gi|307148|gb|AAA59544.1| glycoprotein Mac-1 [Homo sapiens] Length = 1153 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 219 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 272 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 273 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 330 Query: 448 EIFE 451 +IF Sbjct: 331 KIFA 334 >gi|386975|gb|AAA59903.1| neutrophil adherence receptor alpha-M subunit [Homo sapiens] Length = 1145 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 211 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 264 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 265 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 322 Query: 448 EIFE 451 +IF Sbjct: 323 KIFA 326 >gi|307114|gb|AAA59491.1| leukocyte adhesion glycoprotein precursor [Homo sapiens] Length = 1152 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 219 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 272 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 273 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 330 Query: 448 EIFE 451 +IF Sbjct: 331 KIFA 334 >gi|88501734|ref|NP_000623.2| integrin alpha-M isoform 2 precursor [Homo sapiens] gi|1708572|sp|P11215|ITAM_HUMAN RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: Full=Neutrophil adherence receptor; AltName: CD_antigen=CD11b; Flags: Precursor gi|263049|gb|AAB24821.1| leukocyte integrin alpha chain [Homo sapiens] gi|64653358|gb|AAH96348.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|68563402|gb|AAH99660.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|168275740|dbj|BAG10590.1| integrin alpha-M precursor [synthetic construct] Length = 1152 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 219 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 272 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 273 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 330 Query: 448 EIFE 451 +IF Sbjct: 331 KIFA 334 >gi|31615654|pdb|1NA5|A Chain A, Integrin Alpha M I Domain Length = 197 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 76 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 129 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 130 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 187 Query: 448 EIFE 451 +IF Sbjct: 188 KIFA 191 >gi|157831557|pdb|1JLM|A Chain A, I-Domain From Integrin Cr3, Mn2+ Bound Length = 192 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 77 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +G+ I + + EK+R L+ AS + F+ N+ L I ++++ Sbjct: 131 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI-QNQLRE 188 Query: 448 EIFE 451 +IF Sbjct: 189 KIFA 192 >gi|67922256|ref|ZP_00515770.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] gi|67855959|gb|EAM51204.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] Length = 416 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 53/158 (33%), Gaps = 25/158 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V++A + ++ + D R+ F+ R VK Sbjct: 57 KPMKTVKEAASYLVEGLGPDD------RLSVVAFDHRAKVIVP---NQPVDEIDGVKDAI 107 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGI 387 G T+I++ M+ + ED V + I LLTDGEN DNE + Sbjct: 108 ASLKAEGGTSIDEGMKLGIKQVALGKEDRVSQ----------IFLLTDGENEHGDNERCL 157 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + A I + T+ F + L + A Sbjct: 158 KLAQVAGEYNITLNTLGF-----GNHWNQDVLESIADS 190 >gi|119896366|ref|YP_931579.1| hypothetical protein azo0074 [Azoarcus sp. BH72] gi|119668779|emb|CAL92692.1| conseved hypothetical exported protein [Azoarcus sp. BH72] Length = 563 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 64/213 (30%), Gaps = 27/213 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + VD S + K L+++AL + ++ D R+ + Sbjct: 193 PPANLVFLVDVSGSMNSPDKLPLLQNALKLFVAQLRPQD------RVALVTYASGTRVVL 246 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + G+TA + AY ++ H + Sbjct: 247 EP---TAGDRKAAITAAIDGLVPGGATAGAAGIDLAYR--MAEQGFVEHGINR------- 294 Query: 371 IVLLTDGENTQDNEEGIAICN---KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ + + + + GI + T+ F + A + Sbjct: 295 ILLATDGDFNVGITRFETLKDRVAERRKSGIALSTLGFGGGNYNDQLMEQLAD--AGDGA 352 Query: 428 FFEANSTHE----LNKIFRDRIGNEIFERVIRI 456 + +S E L F + + I++ Sbjct: 353 YRYIDSLAEAQKVLVDEFTSTLATVASDVKIQL 385 >gi|319902110|ref|YP_004161838.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417141|gb|ADV44252.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 342 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 44/139 (31%), Gaps = 18/139 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A V + + K+ N +G F + + + +++ + Sbjct: 112 LEKAKRLVAQLVDKMQNDK----VGMIVFAGDAFTQLPITNDYISA-KMFLESISPSLIS 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI A+ A + + I+++TDGEN + A Sbjct: 167 KQGTAIGAAISLATRSFT-----------PQEGIGRAIIVITDGENHEGGVA--EAAKTA 213 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 +GI++ + + Sbjct: 214 TEKGIQVNVLGVGMPDGAP 232 >gi|308462088|ref|XP_003093330.1| hypothetical protein CRE_03438 [Caenorhabditis remanei] gi|308250341|gb|EFO94293.1| hypothetical protein CRE_03438 [Caenorhabditis remanei] Length = 382 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 59/160 (36%), Gaps = 16/160 (10%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN- 340 I + +D T R+ +N ++ + + + ST + Sbjct: 62 IGTTNYLDP--RTTRVALVTYNSDSYTNADL--NQFQSTGDLFNNVFSALATLSSTDQSY 117 Query: 341 --DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 + TA + ++ + +++ N K+ +++ + + + ++ K G+ Sbjct: 118 LETGLSTA-EQLLKAGKNQ----FNRAHFKRVVIVYASAYEGEGERAPMPVADRLKGDGV 172 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 +I+T+A+ ++ L+ ASP F + +L Sbjct: 173 KIITVAY--DQRGDGALLDQLAKIASPRMNF--TNNRDLV 208 >gi|116624980|ref|YP_827136.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228142|gb|ABJ86851.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 331 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 45/155 (29%), Gaps = 22/155 (14%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 FN + FS + + TA+ DA+ A + Sbjct: 125 ALYTFNWEIREQQPFSRDLRAFDNRLKMMHGEA-----GTAMYDAVYLAAQRLE------ 173 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV---NKTQQEK 414 + +K IV++TDG +T A+ I I + + Sbjct: 174 ------PRDGRKVIVVVTDGGDTVSRLSVQKALEAAQLADAVIYAIVVVPITNDAGRNIG 227 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 + L A + F EL+K F D I Sbjct: 228 GEHALDFMAKGTGGRIFMPTLGAELDKAFADIITE 262 >gi|41407305|ref|NP_960141.1| hypothetical protein MAP1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118463234|ref|YP_882479.1| hypothetical protein MAV_3297 [Mycobacterium avium 104] gi|81414471|sp|Q740Y5|Y1207_MYCPA RecName: Full=UPF0353 protein MAP_1207 gi|41395657|gb|AAS03524.1| hypothetical protein MAP_1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118164521|gb|ABK65418.1| protein Nfa34780 [Mycobacterium avium 104] Length = 335 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 58/183 (31%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA + + TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR-----TATGEGIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDKPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E L A S + + A S EL ++ + +IG E Sbjct: 242 ISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAASLQELKSVYATLQQQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|15840942|ref|NP_335979.1| hypothetical protein MT1528 [Mycobacterium tuberculosis CDC1551] gi|13881148|gb|AAK45793.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551] Length = 335 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 59/183 (32%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA +A+ TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR-----TATGEAIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDTXPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E + A S + + A + EL ++ + +IG E Sbjct: 242 ISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|300856050|ref|YP_003781034.1| hypothetical protein CLJU_c28840 [Clostridium ljungdahlii DSM 13528] gi|300436165|gb|ADK15932.1| conserved hypothetical protein [Clostridium ljungdahlii DSM 13528] Length = 297 Score = 40.7 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 63/165 (38%), Gaps = 9/165 (5%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G+ II+ LL+ +LG ++D+ + L A + + A++ L + Sbjct: 2 RKLNDR-GNVAIISCLLITALLGFTAYVLDIGMIYIEKTKLTNAIDSGALAAALELPDN- 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV----RDTAVEMNPRKSA 128 EV +R K ++ L K+ EV+++ + + + K+ Sbjct: 60 -EVRARTAAVDYLQKNNVDPSLALITVGADHKSIQIEEVKNVKHLFAQIIGINSSNIKAK 118 Query: 129 YQVVLSSRYDLL--LNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + V++ + + P ++ + + + + S +Y Sbjct: 119 TKAVVAPAKSVTGGIRPFAVEVYKFSYGDLVTLKEDAGDGYSGNY 163 >gi|203284094|ref|YP_002221834.1| hypothetical protein BDU_172 [Borrelia duttonii Ly] gi|201083537|gb|ACH93128.1| uncharacterized conserved protein [Borrelia duttonii Ly] Length = 341 Score = 40.7 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 53/165 (32%), Gaps = 22/165 (13%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F + + + + +D +A+ + A + Sbjct: 142 IGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDL--GNGSALGLGISIALSHL------ 193 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA- 415 K++ KK +++LTDG D + N A+ ++I ++ ++ Sbjct: 194 -----KHSEAPKKSVIVLTDGVVNSDEVYKDQVINLAQGLNVKIYSVGIGSDEELNVGFK 248 Query: 416 -------RYFLSNCASPNSFFEANS-THELNKIFRDRIGNEIFER 452 + L P+ FE ++ T L D ++ + Sbjct: 249 LRSGKFYQGVLKEVYDPSMLFEISNKTGGLFYSVGDDFSFKLAIQ 293 >gi|84623314|ref|YP_450686.1| hypothetical protein XOO_1657 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|122879130|ref|YP_200396.6| hypothetical protein XOO1757 [Xanthomonas oryzae pv. oryzae KACC10331] gi|188577378|ref|YP_001914307.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|84367254|dbj|BAE68412.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188521830|gb|ACD59775.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 350 Score = 40.7 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 73/206 (35%), Gaps = 27/206 (13%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF--NDRVIS 308 P + VD S K L+R +L ++R ++K D R+ + N V+ Sbjct: 147 PPANLVFQVDVSGSMDAPDKLPLLRSSLKLLVRQLRKQD------RITLVTYAGNTAVVL 200 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 P+ +++ I GSTA ++ AY ++ + + N Sbjct: 201 PPTPGDQQGRIVEAIDS-----LQSGGSTAGASGIELAYK---AAQQGYLRGGINR---- 248 Query: 369 KYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 I+L TDG+ + + ++ + G+ + T+ F S A Sbjct: 249 --ILLATDGDFNVGVTDFDALKGMVSEKRRSGVALSTLGFGTGNYNDNLMEQ--SADAGD 304 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 ++ ++ E K+ +G + Sbjct: 305 GAYAYIDTPLEARKVLTHELGATLAT 330 >gi|317455060|pdb|2XWB|F Chain F, Crystal Structure Of Complement C3b In Complex With Factors B And D gi|317455061|pdb|2XWB|H Chain H, Crystal Structure Of Complement C3b In Complex With Factors B And D Length = 732 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 53/191 (27%), Gaps = 29/191 (15%) Query: 274 VRDALASVIRSIKK--IDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTF 327 + L ++I + + R G + V + S + + + + Sbjct: 255 AKKCLVNLIEKVASYGVKP-----RYGLVTYATYPKIWVKVSEADSSNADWVTKQLNEIN 309 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 D T A+Q Y + ++ + + I+L+TDG + + I Sbjct: 310 YEDHKLKSGTNTKKALQAVYSMMSWPDDVPP---EGWNRTRHVIILMTDGLHNMGG-DPI 365 Query: 388 AICNKAKS-------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + ++ + + + Q S + F+ Sbjct: 366 TVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-PLVNQVNINALASKKDNEQHVFKVKDM 424 Query: 435 HELNKIFRDRI 445 L +F I Sbjct: 425 ENLEDVFYQMI 435 >gi|251837060|pdb|3HRZ|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837064|pdb|3HS0|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837068|pdb|3HS0|I Chain I, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|317455073|pdb|2XWJ|I Chain I, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455074|pdb|2XWJ|J Chain J, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455075|pdb|2XWJ|K Chain K, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455076|pdb|2XWJ|L Chain L, Crystal Structure Of Complement C3b In Complex With Factor B Length = 741 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 53/191 (27%), Gaps = 29/191 (15%) Query: 274 VRDALASVIRSIKK--IDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTF 327 + L ++I + + R G + V + S + + + + Sbjct: 264 AKKCLVNLIEKVASYGVKP-----RYGLVTYATYPKIWVKVSEADSSNADWVTKQLNEIN 318 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 D T A+Q Y + ++ + + I+L+TDG + + I Sbjct: 319 YEDHKLKSGTNTKKALQAVYSMMSWPDDVPP---EGWNRTRHVIILMTDGLHNMGG-DPI 374 Query: 388 AICNKAKS-------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + ++ + + + Q S + F+ Sbjct: 375 TVIDEIRDLLYIGKDRKNPREDYLDVYVFGVG-PLVNQVNINALASKKDNEQHVFKVKDM 433 Query: 435 HELNKIFRDRI 445 L +F I Sbjct: 434 ENLEDVFYQMI 444 >gi|58425974|gb|AAW75011.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 365 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 73/206 (35%), Gaps = 27/206 (13%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF--NDRVIS 308 P + VD S K L+R +L ++R ++K D R+ + N V+ Sbjct: 162 PPANLVFQVDVSGSMDAPDKLPLLRSSLKLLVRQLRKQD------RITLVTYAGNTAVVL 215 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 P+ +++ I GSTA ++ AY ++ + + N Sbjct: 216 PPTPGDQQGRIVEAIDS-----LQSGGSTAGASGIELAYK---AAQQGYLRGGINR---- 263 Query: 369 KYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 I+L TDG+ + + ++ + G+ + T+ F S A Sbjct: 264 --ILLATDGDFNVGVTDFDALKGMVSEKRRSGVALSTLGFGTGNYNDNLMEQ--SADAGD 319 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 ++ ++ E K+ +G + Sbjct: 320 GAYAYIDTPLEARKVLTHELGATLAT 345 >gi|322418525|ref|YP_004197748.1| hypothetical protein GM18_0996 [Geobacter sp. M18] gi|320124912|gb|ADW12472.1| hypothetical protein GM18_0996 [Geobacter sp. M18] Length = 389 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 46/147 (31%), Gaps = 9/147 (6%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 +++ V L V G+ +D+ E L+ +A+ A +T + + Q + Sbjct: 15 IMLVVFLVVTGLAIDIGYMYVSEEDLQHSAEMAALTGAQTIKQRYLYQAQTDPARLPAIS 74 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL------SSRYDLLL 141 + RN +L + +V N + + S Y Sbjct: 75 SDPVQAPARNAAVDLV--TGKHDAAALVGLLNNNGNALTGDNDITVGFWNMSSRSYTPGG 132 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVS 168 P++ ++ ++ + + Sbjct: 133 TPVNA-MQVRTRRTAESSSVGLGTVGT 158 >gi|322437225|ref|YP_004219437.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321164952|gb|ADW70657.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 528 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 63/191 (32%), Gaps = 8/191 (4%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 ++ + G +I A M +LG + VDV + Y + L++AA A + + + + Sbjct: 74 RIFRDEDGSATLIAAFGMVAILGFLALAVDVGQLRYQKRGLQKAADAAALASVLEM-SYC 132 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + +E L + + ++ Y Sbjct: 133 DGTLACGVMQTAAKSAMVENGLTPDNIVTACGTVPSTGLTLMINHGPCAKGASDPNYGKT 192 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 S ++ ++F + +G+ + + +AEA T + + +D + S Sbjct: 193 SSVEVLVMQAQPTIFAKVLGLSTGTVGARAEASTTGGTNCI-------FALDPTGSGALT 245 Query: 193 QRDSEGQPLNC 203 + C Sbjct: 246 VQGLASITSPC 256 >gi|302555134|ref|ZP_07307476.1| secreted protein [Streptomyces viridochromogenes DSM 40736] gi|302472752|gb|EFL35845.1| secreted protein [Streptomyces viridochromogenes DSM 40736] Length = 415 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 40/121 (33%), Gaps = 16/121 (13%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 RT KT G T I A+ A + N K IVL++DGE+T Sbjct: 105 RTEAKTAVATLTPTGWTPIGPALLKAAGDLDGGN------------GSKRIVLISDGEDT 152 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + ++GI I T+ + A+ ++ EL Sbjct: 153 CAPLDPCEVAREIAAKGIGLTIDTLGLVPTAKLSRQLSCIAE--ATGGTYTSVEHQDELT 210 Query: 439 K 439 Sbjct: 211 D 211 >gi|224106794|ref|XP_002314287.1| predicted protein [Populus trichocarpa] gi|222850695|gb|EEE88242.1| predicted protein [Populus trichocarpa] Length = 688 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 62/204 (30%), Gaps = 18/204 (8%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S D + + + F V S + + K N++ + P Sbjct: 196 STQDAGDVNSVRTVEVFTYTEVSAVPKSVSYDNFTILIHLKAPLTSGRQNRNWNHAESPQ 255 Query: 248 PLDPSLSEEHFVD--SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 S + V S K L++ A+ VI+++ D R+ F+ Sbjct: 256 SSQDSRAPVDLVTVLDVSGSMSGTKLALLKRAMGFVIQNLGPSD------RLSVIAFSST 309 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 ++ G T I + ++ + ++ + KN + Sbjct: 310 ARRHFPLRRMTETGKLEALQA-VNSLVSSGGTNIAEGLRKGFKVVVD------RKWKNPV 362 Query: 366 EAKKYIVLLTDGENTQDNEEGIAI 389 + I+LL+DG++T Sbjct: 363 CS---IILLSDGQDTYTISGTSMT 383 >gi|219127467|ref|XP_002183956.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217404679|gb|EEC44625.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 694 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 59/167 (35%), Gaps = 17/167 (10%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + SQNG I + S + ++ P SSS+ + + L+ Sbjct: 162 FGSQNGAFRIYPARHSEECGQYDPTVRAWKIAADSGPKNVVLVLDTSSSMGNYN-RLGLL 220 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVI----SDPSFSWGVHKLIRTIVKTFAID 330 +DA ++ ++ D R+ F+ + +F W K + +KT+ D Sbjct: 221 QDAAIRIVETLSVGD------RIAIVQFSSQAKPFESKGQTFFWAT-KENKIALKTYVED 273 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 T DA + + S + E+H ++ LTDG Sbjct: 274 LELNEGTNTLDAFNKTFAVLDDSIDQELHNECITA-----VLFLTDG 315 >gi|73958268|ref|XP_848527.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 2 [Canis familiaris] Length = 1153 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 18/206 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ K N F++ Sbjct: 145 CPRQDSDIAFLIDGSGSINPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSE--DF 197 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + +G T ++ + S+ Sbjct: 198 QIHFTFNEFKKNPKPSFLVKSIKQLLGRTHTATGIRKVVRELFHSSSGARENAL------ 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D + + +A +GI I + K R L+ AS Sbjct: 252 KILVVITDGEKYGDPLDYKDVIPEADREGIIRYVIGVG-DAFNHLKNREELNIIASKPPR 310 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 + F N+ L I ++++ +IF Sbjct: 311 DHVFRVNNFEALKTI-QNQLQEKIFA 335 >gi|73958266|ref|XP_856286.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 3 [Canis familiaris] Length = 789 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 18/206 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ K N F++ Sbjct: 145 CPRQDSDIAFLIDGSGSINPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSE--DF 197 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + +G T ++ + S+ Sbjct: 198 QIHFTFNEFKKNPKPSFLVKSIKQLLGRTHTATGIRKVVRELFHSSSGARENAL------ 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D + + +A +GI I + K R L+ AS Sbjct: 252 KILVVITDGEKYGDPLDYKDVIPEADREGIIRYVIGVG-DAFNHLKNREELNIIASKPPR 310 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 + F N+ L I ++++ +IF Sbjct: 311 DHVFRVNNFEALKTI-QNQLQEKIFA 335 >gi|73958270|ref|XP_856370.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 4 [Canis familiaris] Length = 1036 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 18/206 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ K N F++ Sbjct: 145 CPRQDSDIAFLIDGSGSINPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSE--DF 197 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + +G T ++ + S+ Sbjct: 198 QIHFTFNEFKKNPKPSFLVKSIKQLLGRTHTATGIRKVVRELFHSSSGARENAL------ 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D + + +A +GI I + K R L+ AS Sbjct: 252 KILVVITDGEKYGDPLDYKDVIPEADREGIIRYVIGVG-DAFNHLKNREELNIIASKPPR 310 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 + F N+ L I ++++ +IF Sbjct: 311 DHVFRVNNFEALKTI-QNQLQEKIFA 335 >gi|73958264|ref|XP_547048.2| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 1 [Canis familiaris] Length = 1165 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 18/206 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ +++V+ K N F++ Sbjct: 145 CPRQDSDIAFLIDGSGSINPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSE--DF 197 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F++ K + +G T ++ + S+ Sbjct: 198 QIHFTFNEFKKNPKPSFLVKSIKQLLGRTHTATGIRKVVRELFHSSSGARENAL------ 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D + + +A +GI I + K R L+ AS Sbjct: 252 KILVVITDGEKYGDPLDYKDVIPEADREGIIRYVIGVG-DAFNHLKNREELNIIASKPPR 310 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 + F N+ L I ++++ +IF Sbjct: 311 DHVFRVNNFEALKTI-QNQLQEKIFA 335 >gi|183982301|ref|YP_001850592.1| membrane protein [Mycobacterium marinum M] gi|226701243|sp|B2HPD3|Y2288_MYCMM RecName: Full=UPF0353 protein MMAR_2288 gi|183175627|gb|ACC40737.1| membrane protein [Mycobacterium marinum M] Length = 335 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 59/180 (32%), Gaps = 27/180 (15%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA +A+ TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNREATKAALDKLQFADR-----TATGEAIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E + A S + + A + ELN ++ + +IG E Sbjct: 242 ISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELNSVYASLQQQIGYE 301 >gi|257880953|ref|ZP_05660606.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257892525|ref|ZP_05672178.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257816611|gb|EEV43939.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257828904|gb|EEV55511.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1107 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 51/333 (15%), Positives = 100/333 (30%), Gaps = 30/333 (9%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + I +++ +FP + N K + + + +D Sbjct: 67 DGYAYQVNSGKITLEISSNTKQTIDLSFPIDPALYHSQANKLIVDNKEYDIIDETENKKD 126 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 T V + P+ + S + ++P +L S+ S E + Sbjct: 127 TDVSV-PKPDEIEEESSKENENSVSPFTLPTLSLPAVSVPSNQTIPTEYTTDDQGTYPKA 185 Query: 178 SIQW-----VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S Q V+D + ++ +N + + SY G D + Y Sbjct: 186 SWQPTGNTNVLDHQG---NKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKY 242 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + +Y + VD S + + V+ + + ++ Sbjct: 243 AKETSTPGLFDVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLA 302 Query: 287 KIDNVNDTVRMGATFFNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D + MG ++ S + G ++ VK+ G T A++ Sbjct: 303 D-SGITDKINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSI-TPSRTNGGTFTQKALR 360 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 A + N KK IVLLTDG Sbjct: 361 DAGSMLSVPNGH-----------KKVIVLLTDG 382 >gi|254820232|ref|ZP_05225233.1| hypothetical protein MintA_09906 [Mycobacterium intracellulare ATCC 13950] Length = 327 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 37/175 (21%), Positives = 60/175 (34%), Gaps = 28/175 (16%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F + TI +D + TA A+ TA I ++ Sbjct: 132 LGLVGFAGTPYLLVPP---TPQHQATIDALKKLDFADS--TATGQAIFTALHAIGATA-- 184 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDN------EEGIAICNKAKSQGIRIMTIAFSVNKT 410 + IVLL+DG + + + AK +G+ I TI+F Sbjct: 185 ---VTGGDNPPPARIVLLSDGRENKPSNPSDPHDGVYTAARLAKDEGVPISTISFGTKGG 241 Query: 411 QQE-KARYF--------LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + E + + A S + A + ELNK + + I NEI R + Sbjct: 242 EIEMDGQRVAVPVSTDQMKTIARLSGGQPYTATNIGELNKSY-NAIENEIGYRTV 295 >gi|69244153|ref|ZP_00602689.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|293560613|ref|ZP_06677101.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|294621687|ref|ZP_06700851.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|314940439|ref|ZP_07847593.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] gi|314943384|ref|ZP_07850154.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|314953415|ref|ZP_07856334.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|314993087|ref|ZP_07858476.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|314997388|ref|ZP_07862342.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|68196610|gb|EAN11036.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|209491032|gb|ACI49667.1| putative pilus tip protein [Enterococcus faecium] gi|291598696|gb|EFF29749.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|291605436|gb|EFF34882.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|313588546|gb|EFR67391.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|313592412|gb|EFR71257.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|313594552|gb|EFR73397.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|313597919|gb|EFR76764.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|313640359|gb|EFS04940.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] Length = 1129 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 51/333 (15%), Positives = 100/333 (30%), Gaps = 30/333 (9%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + I +++ +FP + N K + + + +D Sbjct: 89 DGYAYQVNSGKITLEISSNTKQTIDLSFPIDPALYHSQANKLIVDNKEYDIIDETENKKD 148 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 T V + P+ + S + ++P +L S+ S E + Sbjct: 149 TDVSV-PKPDEIEEESSKENENSVSPFTLPTLSLPAVSVPSNQTIPTEYTTDDQGTYPKA 207 Query: 178 SIQW-----VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S Q V+D + ++ +N + + SY G D + Y Sbjct: 208 SWQPTGNTNVLDHQG---NKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKY 264 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + +Y + VD S + + V+ + + ++ Sbjct: 265 AKETSTPGLFDVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLA 324 Query: 287 KIDNVNDTVRMGATFFNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D + MG ++ S + G ++ VK+ G T A++ Sbjct: 325 D-SGITDKINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSI-TPSRTNGGTFTQKALR 382 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 A + N KK IVLLTDG Sbjct: 383 DAGSMLSVPNGH-----------KKVIVLLTDG 404 >gi|260800527|ref|XP_002595180.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] gi|229280424|gb|EEN51192.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] Length = 419 Score = 40.7 bits (93), Expect = 0.56, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 53/156 (33%), Gaps = 21/156 (13%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 ++ ++ + +G ++++ S+ H+ ++ + + Sbjct: 18 ERIKTFVSKAVTRFNIGPTQTQ---IGVIQYSNQPQSEILL--NDHQDAASLQQAISSIN 72 Query: 332 NEMGSTAINDAMQT-AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T A++ A + N K +++TDG + +++ + Sbjct: 73 YLQGGTNTGKALRYLANNAFSGKNGARA-------GVSKVAIVVTDG---RSSDDVVRPA 122 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 A +GI + + + QE L + AS + Sbjct: 123 LNAGKEGIVLYAVGIGGSVDYQE-----LRDIASSD 153 >gi|121583396|ref|YP_973827.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596650|gb|ABM40085.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 212 Score = 40.7 bits (93), Expect = 0.56, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 53/174 (30%), Gaps = 21/174 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + ++ ++++ +T + F+ + + F I Sbjct: 21 IEAVKNGVQVLVSTLRQDPYALETAFLSIITFDSEARQVVPLT---------ELANFQIP 71 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK-KYIVL-LTDGENTQDNEEGIA 388 T A+ +A + E EV R ++ K IV +TDG T D ++G+ Sbjct: 72 AIVATGTT---ALGSALSLLADKIEMEVGRTTAEVKGDWKPIVFIMTDGSPTDDWKKGLE 128 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 ++ G+ I L + F+ Sbjct: 129 RLKTVRT-GMVI------ACAAGPGADTTVLKQITEIVVQLNTADASTIKAFFK 175 >gi|332809376|ref|XP_003308229.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Pan troglodytes] Length = 682 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 45/116 (38%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+I ++ A+ I E+H + E ++LLTDGE+ + Sbjct: 139 PTYPRGGTSICSGIKYAFQVIG-----ELHSQLDGSE----VLLLTDGEDNTASS----- 184 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + IA + ++A +S + F+ ++ L F Sbjct: 185 CIDEVKQSGAIVHFIALGRDA---DEAVIEMSKITGGSHFYVSDEAQNNGLIDAFG 237 >gi|301778757|ref|XP_002924796.1| PREDICTED: integrin alpha-M-like [Ailuropoda melanoleuca] Length = 1153 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 43/124 (34%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + + S+ K +V++TDGE D + + Sbjct: 220 IQLYGRTHTATGILKVVRELFHSSSGARENAL------KILVVITDGEKFGDPLDYKDVI 273 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +GI I K R L+ AS + F N+ L I ++++ Sbjct: 274 PEADREGIIRYVIGVG-EAFDNPKHREELNTIASKPARDHVFRVNNFEALKTI-QNQLQE 331 Query: 448 EIFE 451 +IF Sbjct: 332 KIFA 335 >gi|281352695|gb|EFB28279.1| hypothetical protein PANDA_014199 [Ailuropoda melanoleuca] Length = 1110 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 43/124 (34%), Gaps = 11/124 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + + S+ K +V++TDGE D + + Sbjct: 175 IQLYGRTHTATGILKVVRELFHSSSGARENAL------KILVVITDGEKFGDPLDYKDVI 228 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGN 447 +A +GI I K R L+ AS + F N+ L I ++++ Sbjct: 229 PEADREGIIRYVIGVG-EAFDNPKHREELNTIASKPARDHVFRVNNFEALKTI-QNQLQE 286 Query: 448 EIFE 451 +IF Sbjct: 287 KIFA 290 >gi|297618081|ref|YP_003703240.1| hypothetical protein Slip_1922 [Syntrophothermus lipocalidus DSM 12680] gi|297145918|gb|ADI02675.1| protein of unknown function DUF2134, membrane [Syntrophothermus lipocalidus DSM 12680] Length = 310 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 12/57 (21%), Positives = 25/57 (43%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +KLI G ++ AL + +LG ++ D + + A +A++ + L Sbjct: 5 RKLIGGEGGTAAVLFALSLTALLGFTALVTDSGLLFLNQSRVANAVDSAVLAGAQEL 61 >gi|317057468|ref|YP_004105935.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315449737|gb|ADU23301.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 782 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 62/161 (38%), Gaps = 18/161 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 ID + R+G + F F+ + ++K ++ T A++ Sbjct: 320 SLIDKFDSDFRIGISKFTGTYTRMCGFT-DDRTALSDVIKRIRTEDEIFDGTHNQTALKR 378 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-ICNKAKSQGIRIMTIA 404 + + + + IV+L+DGE+ + N E I + A + + ++T+ Sbjct: 379 CIEEFTA---------TGDGKYVNIIVMLSDGESDETNAESIKNLARLANEKSVIVLTVG 429 Query: 405 FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +E R +L A + ++ A+ + L+ +++ Sbjct: 430 L-----GREIDRAWLQEMAYSTGGKYYSASEANALDDVYKQ 465 >gi|163815506|ref|ZP_02206879.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] gi|158449143|gb|EDP26138.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] Length = 550 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 28/128 (21%), Positives = 50/128 (39%), Gaps = 13/128 (10%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N N+ V G +++ V + + R+ + + G TA DA+ A Sbjct: 406 NDNNYV--GLVSYSNSVTIEVPIA-QFDLNQRSYFQGAVNNLIASGGTASYDAVVVAVKM 462 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 I + + +AK + LL+DG N + + I ++ GI + TI + + Sbjct: 463 ITEAKA-------QHPDAKCMLFLLSDGYANNGYSMDEITSA--LRTSGIPVYTIGYGDD 513 Query: 409 KTQQEKAR 416 E AR Sbjct: 514 ADTGELAR 521 >gi|2944425|gb|AAC05284.1| complement factor C2 [Mus musculus] Length = 470 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 41/136 (30%), Gaps = 8/136 (5%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 V ++I ++ D T + + Y + S + E + I+LL Sbjct: 32 DVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILL 91 Query: 375 TDGENTQDNEEGIAIC--------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 TDG++ + A+ + + + I I ++ S Sbjct: 92 TDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGER 151 Query: 427 SFFEANSTHELNKIFR 442 F L +IF Sbjct: 152 HAFILQDAKALQQIFE 167 >gi|259505645|ref|ZP_05748547.1| secreted Mg-chelatase subunit [Corynebacterium efficiens YS-314] gi|259166776|gb|EEW51330.1| secreted Mg-chelatase subunit [Corynebacterium efficiens YS-314] Length = 530 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 40/325 (12%), Positives = 93/325 (28%), Gaps = 33/325 (10%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 +A+++ + + + ++ ++ +L + + Sbjct: 173 SALVSVATAYAGTGQALT---LGDVDEVAPELRSFLSGQTMTSGSSGWLKDVFLRDPNRA 229 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 +N + + L + + + G+ S A + + + ++ Sbjct: 230 NALINYESVLHTINAEDNAGLRV-----VVPADGVVSADYPLTPLATADAETNQQVEALA 284 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 W++D + D+ +P++ +++ + G R + N Sbjct: 285 -DWMLDHPEHL----TDTFRRPVDPMAILPPELAQAFVIEQPFPGDRAVTDALISAYNND 339 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 LD S S + ++ +S+ + + N T+ Sbjct: 340 LRVPGDTTFVLDVSGSMAG---TRMELLRSTMLEMISGEASSLTGDVSLRERENVTI--- 393 Query: 299 ATFFNDRVISDPSFSWG-VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 FN + + V R + G T I DA+ AY+ + Sbjct: 394 -IPFNFSPGEPITATVDEVGGPQRQELVDGVTALQAEGGTGIYDALLRAYEQVEP----- 447 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQD 382 IVL+TDGE T Sbjct: 448 -----GASIPS--IVLMTDGEQTSG 465 >gi|307324435|ref|ZP_07603643.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890166|gb|EFN21144.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 543 Score = 40.4 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 27/127 (21%), Positives = 47/127 (37%), Gaps = 11/127 (8%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 ++ A G TAI ++Q AYD + + IVL+TDGENT Sbjct: 426 IRADAEALTADGDTAIFSSLQAAYDHLAQRRSALGDDRFTS------IVLMTDGENTTGA 479 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + +G F + ++++ L + A + F+A L+ F Sbjct: 480 TASDFDAYYRRLRGPERTAPVFPIVFGDSDRSQ--LQSIATLTGGRLFDATK-GSLDGAF 536 Query: 442 RDRIGNE 448 + G + Sbjct: 537 EEIRGYQ 543 >gi|159473306|ref|XP_001694780.1| flagellar associated protein [Chlamydomonas reinhardtii] gi|158276592|gb|EDP02364.1| flagellar associated protein [Chlamydomonas reinhardtii] Length = 4349 Score = 40.4 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 58/189 (30%), Gaps = 22/189 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + LVR+ +I + D +G +++ V D + R + T Sbjct: 990 RIELVRETCHFLIDQLTADD------YLGIVSYSNTVREDVPLLRMTPEARR-LAHTMIS 1042 Query: 330 DENEMGSTAINDAMQTAYDT---IISSNEDEVHRMKNNLEAKKYI---VLLTDGENTQDN 383 G TA+ ++ S + ++ + + L TDG+ T Sbjct: 1043 SLTLHGGTALYAGLEAGVKQQMAAASELKALAAAAGGGSDSSRIVHSCFLFTDGQATTGP 1102 Query: 384 ------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + Q I + T F + + E + A ++ + ++ Sbjct: 1103 CTVNEIMGQMTSLQSPADQNITVHTFGFGDDHS-VELLQGVAE--AQSGVYYYISCADDI 1159 Query: 438 NKIFRDRIG 446 F D +G Sbjct: 1160 PSGFGDALG 1168 >gi|25028093|ref|NP_738147.1| hypothetical protein CE1537 [Corynebacterium efficiens YS-314] gi|23493377|dbj|BAC18347.1| conserved hypothetical protein [Corynebacterium efficiens YS-314] Length = 531 Score = 40.4 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 40/325 (12%), Positives = 93/325 (28%), Gaps = 33/325 (10%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 +A+++ + + + ++ ++ +L + + Sbjct: 174 SALVSVATAYAGTGQALT---LGDVDEVAPELRSFLSGQTMTSGSSGWLKDVFLRDPNRA 230 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 +N + + L + + + G+ S A + + + ++ Sbjct: 231 NALINYESVLHTINAEDNAGLRV-----VVPADGVVSADYPLTPLATADAETNQQVEALA 285 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 W++D + D+ +P++ +++ + G R + N Sbjct: 286 -DWMLDHPEHL----TDTFRRPVDPMAILPPELAQAFVIEQPFPGDRAVTDALISAYNND 340 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 LD S S + ++ +S+ + + N T+ Sbjct: 341 LRVPGDTTFVLDVSGSMAG---TRMELLRSTMLEMISGEASSLTGDVSLRERENVTI--- 394 Query: 299 ATFFNDRVISDPSFSWG-VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 FN + + V R + G T I DA+ AY+ + Sbjct: 395 -IPFNFSPGEPITATVDEVGGPQRQELVDGVTALQAEGGTGIYDALLRAYEQVEP----- 448 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQD 382 IVL+TDGE T Sbjct: 449 -----GASIPS--IVLMTDGEQTSG 466 >gi|156408866|ref|XP_001642077.1| predicted protein [Nematostella vectensis] gi|156229218|gb|EDO50014.1| predicted protein [Nematostella vectensis] Length = 251 Score = 40.4 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 19/120 (15%), Positives = 34/120 (28%), Gaps = 7/120 (5%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + + G+T +A+ A + + I+ TDG Sbjct: 118 GTRKEAYRQLSKVPFIAGTTNTQEALNLAQRELFGKKNSGA---TPGAIGRVLII--TDG 172 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + K K G I +A + S ++ N F A S L Sbjct: 173 LSNVQRNLTLFNAYKLKMAGPEIYVVAVGQYLYGLHELVGLAS--STENHLFRAQSMRGL 230 >gi|326927638|ref|XP_003209998.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Meleagris gallopavo] Length = 881 Score = 40.4 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 54/176 (30%), Gaps = 19/176 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 N + P L F+ S ++ R+AL ++ IK+ D+ N Sbjct: 265 NGYFVHFFAPMNLPKLPKNVIFIIDISGSMSGREIQQTREALLKILDDIKEDDHFN---- 320 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F V + + F + G T + M D + +++E Sbjct: 321 --FILFGSDVHTWKETLIKATPENLDEARKFVRGIDTKGLTNLYGGMMKGIDMLNAAHE- 377 Query: 357 EVHRMKNNLEAKK---YIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFS 406 NL K+ I++LTDG+ I K + + F Sbjct: 378 ------GNLVPKRSASIIIMLTDGQPNVGISNTQDIQTHVKKAIEGKYTLYNLGFG 427 >gi|254293317|ref|YP_003059340.1| von Willebrand factor A [Hirschia baltica ATCC 49814] gi|254041848|gb|ACT58643.1| von Willebrand factor type A [Hirschia baltica ATCC 49814] Length = 563 Score = 40.4 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 52/135 (38%), Gaps = 18/135 (13%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P L+ VD S + K L + AL +I + + D+++ V GA Sbjct: 182 DRTEQPPLNLTLLVDVSGSMNHEDKLPLAKKALKLLIDKMDEDDHISVVVYAGAAG---T 238 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V+ ++ + + + GSTA + ++ AY ++ N D + Sbjct: 239 VLEP------TKGSEKSKIFAALDNLSAGGSTAGGEGLRLAY-SLAEQNYDAASVNR--- 288 Query: 366 EAKKYIVLLTDGENT 380 ++LLTDG+ Sbjct: 289 -----VMLLTDGDFN 298 >gi|145295537|ref|YP_001138358.1| hypothetical protein cgR_1465 [Corynebacterium glutamicum R] gi|140845457|dbj|BAF54456.1| hypothetical protein [Corynebacterium glutamicum R] Length = 354 Score = 40.4 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 48/299 (16%), Positives = 97/299 (32%), Gaps = 37/299 (12%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 A + ++ +S D + + P + R + ++ Sbjct: 74 SAPAAQKISDGVVSADYPLSTITGSDQGEQVAELAGWFAEHP-DALTDTYRRPTTANATL 132 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 ++ + +P+ S + + E FV S + ++ L++D + Sbjct: 133 PAELSSQTIIEAPFPGSKTVTDALIDAYTNQFRVPGETTFVLDVSGSMLGQRITLLKDTM 192 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV-----------KTF 327 + +I D N ++R +S FS+G H++I + + Sbjct: 193 SDLISGGATTDLANVSLR------GREKVSIIPFSFGPHEVISETLGAVGSPSRIDLQQR 246 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EE 385 G T I DA+ AY + IVL+TDGE T ++ Sbjct: 247 VEALQADGGTGIYDAVLAAY----------AESAGGDYIPS--IVLMTDGELTAGRTYDQ 294 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRD 443 + N S I +I V + A+ F+A + +L++ F++ Sbjct: 295 FLTEWNALPSN---IRSIPVFVILYGEANVADMEQLAATTGGKTFDAIN-GDLDEAFKE 349 >gi|326789709|ref|YP_004307530.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540473|gb|ADZ82332.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 593 Score = 40.4 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 65/214 (30%), Gaps = 30/214 (14%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ML D L +D+S L + + + + DN ++G + Sbjct: 23 MLMAATSDAQLDAILVIDASGSMKETDPNKLGLEGVKLFVDMLGLTDN-----QVGVVTY 77 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V S ++ + +K F ++ T I ++ A + Sbjct: 78 GSDVSQTYPMSLVKNQSDKENIKNFVDGITRDLEYTDITSGLKEAVKML------NQRNA 131 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICN----------KAKSQGIRIMTIAFSVNKTQ 411 N IV+ TDG N + +A+S+G I TI + N Sbjct: 132 SGNSPL---IVVFTDGNNAIGGVANRTPADIDKDLAAIISQAQSEGYPIYTIGLNDNGKL 188 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 E +L + + F EL I + Sbjct: 189 NE---AYLEKISVDTKAKAFATKDPAELPDILTE 219 >gi|149913213|ref|ZP_01901747.1| hypothetical protein RAZWK3B_04455 [Roseobacter sp. AzwK-3b] gi|149813619|gb|EDM73445.1| hypothetical protein RAZWK3B_04455 [Roseobacter sp. AzwK-3b] Length = 512 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 19/99 (19%), Positives = 40/99 (40%), Gaps = 7/99 (7%) Query: 11 SKKLIKSCTG---HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 S++++ G F I +++ L +GG+ +D Y+ LKQ A A +V Sbjct: 12 SRRILHQEDGAGSVFGIFAVVMI---LLLGGVALDATNLWRYQQMLKQTADVAAHAGTVQ 68 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNF 106 L + ++ +F + + + N N + + Sbjct: 69 LASG-GDATNAYNAAFALVEANMPQSWYGNLFANPQADI 106 >gi|17555634|ref|NP_498247.1| CUTiclin-Like family member (cutl-23) [Caenorhabditis elegans] gi|13592472|gb|AAK31565.1| Hypothetical protein Y37B11A.1 [Caenorhabditis elegans] Length = 789 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 59/154 (38%), Gaps = 9/154 (5%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + VR+ ++ + F+ ++ ++ + G+TA N A+ A+D + S Sbjct: 285 EAVRVALITYSGQAFVHFKFNSFLYGNNTSVQGFVKNIRSIKGTTATNVALMDAFDLLTS 344 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + R K +V LTDG ++ I K ++ GI ++ ++ + Sbjct: 345 KDPSIGVR---EGVPKMALV-LTDG---HSHKSPKDISEKMRAAGIIMIAVSVTPRPLVD 397 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 E ++ S F + H+ F +G Sbjct: 398 EAELRLIAG--SEKRAFTPPNLHDFESEFMKYVG 429 >gi|332531455|ref|ZP_08407359.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] gi|332039124|gb|EGI75546.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] Length = 346 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 13/84 (15%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIM----------TIAFSVNKTQQEKARYFLS 420 I+LLTDG+ T + + A +G+R+ T+ F L Sbjct: 222 IILLTDGQRTTG-IDPLEAAKLASERGVRVYTVGVGTVEGVTVGFEGWSMHARLDEESLK 280 Query: 421 NCA--SPNSFFEANSTHELNKIFR 442 + A + +F A S L +++ Sbjct: 281 HIAQQTRAEYFHAASAEALTQVYE 304 >gi|227518343|ref|ZP_03948392.1| pilus subunit protein [Enterococcus faecalis TX0104] gi|227074216|gb|EEI12179.1| pilus subunit protein [Enterococcus faecalis TX0104] Length = 1103 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 105/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPENTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGTQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSGITNN-INMGYVGYSSDSYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|268580761|ref|XP_002645363.1| Hypothetical protein CBG15420 [Caenorhabditis briggsae] Length = 862 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 52/132 (39%), Gaps = 10/132 (7%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 I+ ++ + D VR+G ++D ++ + S + I + T Sbjct: 57 IKIVRDLPVHEDAVRVGIVQYSDEAKTEFNLSR-YSERNDIITHLETLKFMPGEDTRTGV 115 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ A D I D R+K A + I++ TDG + ++ + +G++I Sbjct: 116 ALSKADDEIFDY--DGGARLK----ATRLIIVFTDGLSM---DKPTLAAKALRRKGVKIY 166 Query: 402 TIAFSVNKTQQE 413 TI+ + E Sbjct: 167 TISVNSIGFVPE 178 >gi|2689175|emb|CAA06010.1| hypothetical protein [Borrelia burgdorferi] Length = 328 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 58/171 (33%), Gaps = 21/171 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ + I ++ DN +G F + + + + + Sbjct: 114 RLEFSKELIRGFISQ-RENDN------IGLVAFAKDASIVVPITTDREFFNKKLDDIYIM 166 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D +A+ + A + K++ K+ IV+LTDG D + + Sbjct: 167 DL--GNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEIDKDQV 213 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 N A+ ++I +I ++ + + S SF E L +I Sbjct: 214 INLAQGLNVKIYSIGIGSSEEFSVEFK-LRSGKFYQGSFKEVYDPSMLVEI 263 >gi|221104447|ref|XP_002170122.1| PREDICTED: similar to tyrosine kinase receptor, partial [Hydra magnipapillata] Length = 898 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 61/193 (31%), Gaps = 31/193 (16%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S ++ ++ + KI N G F+ ++ Sbjct: 101 GSLRNQYRQEVEFLKSLARTF-----KISNNGAHA--GVVTFSSIAELSIKL----NQYY 149 Query: 321 RTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 A+D+ T I+ A++ + + N + + LLTDGE Sbjct: 150 DQEQFERAVDDIPYMGYVTRIDLALRKSLEMFDEINGAR-------KSIPQILFLLTDGE 202 Query: 379 NTQD----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANS 433 +E ++I + +GI I I + L++ A S F A + Sbjct: 203 QYAGKGVVDENPVSIAKLLRDKGIVIFAIGI-----GSAVRQSQLNDIAGSSEKAFLAKN 257 Query: 434 THELNKI-FRDRI 445 +EL F +I Sbjct: 258 FNELVNSDFLKKI 270 >gi|254458848|ref|ZP_05072272.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084614|gb|EDZ61902.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 629 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 44/151 (29%), Gaps = 24/151 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + L + +I N R+G F S+ K + ++ Sbjct: 104 YPNRLELAKKKALELI-------NKATKDRVGVIAFAKNSYLVSPISFDT-KTVSFLLSK 155 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 T I + T N KKY+++L+DG D + Sbjct: 156 LDTSSITQKGTNILTMLGTV-------------EKTNTSTDKKYLLILSDG---GDETDF 199 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 A + AK +GI + + K + Sbjct: 200 SAEIDFAKEKGIIVFVLGIGTEVGASIKNKD 230 >gi|118617118|ref|YP_905450.1| hypothetical protein MUL_1447 [Mycobacterium ulcerans Agy99] gi|118569228|gb|ABL03979.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length = 733 Score = 40.4 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 53/174 (30%), Gaps = 22/174 (12%) Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 SL + P +D S K R A ++ + D Sbjct: 251 EGTWSLTLVPPAEPSSAPRDVVVVLDRSGSMGGW-KMVAARRAAGRIVDMLDAGD----- 304 Query: 295 VRMGATFFNDRVISDPSFSWGV---HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R F+DR+ + P+ G+ R ++ G T + + A + + Sbjct: 305 -RFCVLAFDDRIETPPAMPDGLVPASDRNRFAASSWLGSLRSRGGTVMAQPLTNAVEMLA 363 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 S ED +VL+ DG+ + ++ ++ RI + Sbjct: 364 DSGEDRQAS----------VVLVADGQISGEDHLLRSLAPAVGRT--RIYCVGV 405 >gi|167757049|ref|ZP_02429176.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] gi|167703224|gb|EDS17803.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] Length = 965 Score = 40.4 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 47/313 (15%), Positives = 85/313 (27%), Gaps = 33/313 (10%) Query: 146 LFLRSMGIKSWLIQTKAEA--------ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 F ++ K+ + +E +S + + ++D S SM + S Sbjct: 574 GFFKATDEKTNYLPAGSETINNYEYNVSGNVKSTTVKAPQDVVLLLDKSGSMDESMNGS- 632 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 + + K Y S + + L + Sbjct: 633 SRLTHLKNNVIKFITKLYEHNPDSRVSVITFAYSADGSITNNNFVKLSDIKSGNETWYTY 692 Query: 258 FVDSSSLRHVIKKKHLVRDALA--SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 ++ IK + L V + N+ F D + F Sbjct: 693 LTKNNGGIKNIKASGGTQIDLGLYEVRNQLSSATGENNR---SVIVFTDGQPGNKGF--- 746 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 T D ++ G +A+ A S N ++ + KY Sbjct: 747 ---------NTSYNDYDDNGYRVGAEALNQADFIKFSGNLTGINNYIESSNGSKYYGHKN 797 Query: 376 DGE-----NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 D N N+ G K G I TI + N + FL+ AS + + Sbjct: 798 DDITKNRSNNNSNDAGNRTNRSGKGLGKTIFTIGLNSNNSS--LFDSFLTRLASEGHYTK 855 Query: 431 ANSTHELNKIFRD 443 AN++ + F Sbjct: 856 ANNSSAMENAFNS 868 >gi|126306104|ref|XP_001362407.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 870 Score = 40.4 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 67/225 (29%), Gaps = 26/225 (11%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S++++ + Q +KS + KS Sbjct: 237 MFMQSIDSVVEFCTEKNHNRFAPNLQNKMCYLKSTWEVIQDSEDYKKSTPMMAAEPPKSR 296 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + + G +D+S V + + +R A + I + + G Sbjct: 297 FSLKQIG----ERILVLVIDTSRSMKVGNRLNRLRQASQFFLLQIIEKGSW-----TGIV 347 Query: 301 FFNDRVISDPSFSWGVHKLIR-TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+ + R T++ G I ++TA+ + + Sbjct: 348 TFDSSATIQSELIQIESDVQRKTLISRLPNVTVAGGGAHICSGLRTAFMVV------KKK 401 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTI 403 + + E +VLLTDGE+ C + K G I TI Sbjct: 402 FLTDGSE----MVLLTDGEDN-----TTNTCFEEVKQSGAIIHTI 437 >gi|332519332|ref|ZP_08395799.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045180|gb|EGI81373.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 345 Score = 40.4 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 24/128 (18%), Positives = 47/128 (36%), Gaps = 14/128 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + +I N + R+G + + + + D TAIN+A++ Sbjct: 119 VTQIINNLASDRVGIIAYAGKAFPQLPITTDYASAKMFLQNMN-TDMLSSQGTAINEAIE 177 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + +V+++DGE+ + +A +A ++GIRI TI Sbjct: 178 LAKTYYDDDQQTN-----------RVLVIISDGEDHSEAAANVAE--EASNEGIRIFTIG 224 Query: 405 FSVNKTQQ 412 K Sbjct: 225 VGDAKGGP 232 >gi|281354485|gb|EFB30069.1| hypothetical protein PANDA_020540 [Ailuropoda melanoleuca] Length = 1096 Score = 40.4 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 43/305 (14%), Positives = 88/305 (28%), Gaps = 44/305 (14%) Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + ++ L +G S + +A + S + G Sbjct: 458 MTVHGLVFMGGILGSVSTEMSPRAHTAGTPKGPST------------ISSQPSFHSMENG 505 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 P T ++ E S K+++Y L F Sbjct: 506 SPEESGVWQGSHTWLLPMAELRAPPKCPEAWSAAAGGGCKTVHYDL------------VF 553 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 + +S + VR +A+++ + + + R+G ++D+ + G+ Sbjct: 554 LLDTSSSVGKEDFEKVRQWVANLVDTFEVGP---ERTRVGVVRYSDQPTTAFEL--GLFG 608 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T DA++ S + K+ +LL G Sbjct: 609 SREAVKAAARHLAYHGGNTNTGDALRFITRHSFSPQ---AGGRPGDRAFKQVAILLPAGR 665 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTH 435 + + A A GIRI + ++ L AS F + + Sbjct: 666 SQDLVLDAAA---AAHRAGIRIFAVGVGAALKEE------LEEIASEPKSAHVFHVSDFN 716 Query: 436 ELNKI 440 ++KI Sbjct: 717 AIDKI 721 >gi|240172225|ref|ZP_04750884.1| hypothetical protein MkanA1_23119 [Mycobacterium kansasii ATCC 12478] Length = 335 Score = 40.4 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 58/183 (31%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA +A+ TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNRDATKNALDKLQFADR-----TATGEAIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPIST 241 Query: 403 IAFSVNKT---------QQEKARYFLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F L A S + + A + EL ++ + +IG E Sbjct: 242 ISFGTPYGFVEINGQRQPVPVDDETLKKVAQLSGGNAYNAATLAELKSVYASLQQQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|302381356|ref|YP_003817179.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] gi|302191984|gb|ADK99555.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] Length = 416 Score = 40.4 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 10/50 (20%), Positives = 21/50 (42%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + G+ +I AL P ++ + V++ L+ A TA + + Sbjct: 16 RDERGNIALIFALSTPAVVLISVGAVELGSVQSNRAKLQDIADTAALAGA 65 >gi|254775742|ref|ZP_05217258.1| hypothetical protein MaviaA2_13890 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 40.4 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 58/183 (31%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA + + TA Sbjct: 130 ADELTPGINLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR-----TATGEGIFTAL 184 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 185 QAIATVG---AVIGGGDKPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPIST 241 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E L A S + + A S EL ++ + +IG E Sbjct: 242 ISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAASLQELKSVYATLQQQIGYE 301 Query: 449 IFE 451 + Sbjct: 302 TIK 304 >gi|254468097|ref|ZP_05081503.1| von Willebrand factor, type A [beta proteobacterium KB13] gi|207086907|gb|EDZ64190.1| von Willebrand factor, type A [beta proteobacterium KB13] Length = 326 Score = 40.4 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 44/164 (26%), Gaps = 30/164 (18%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F++ + + + + T T I + + + + ED Sbjct: 131 IGVITFSNSAMFVLPLTQNKSAITGAVNATAGNALF---QTNIGAGLS-SVSELFAKVED 186 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------- 408 + ++LL+DG D I + I + I Sbjct: 187 S---------GSRAVILLSDGAGRIDAPTQQKIRDWFDRFDIGLYWIVLRQPGGISIFDE 237 Query: 409 ---------KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 Q + + SP +EA L K +D Sbjct: 238 NLKIRDETQPPPQIELFDYFKTFRSPFQAYEAEDPASLEKAIKD 281 >gi|254445696|ref|ZP_05059172.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198260004|gb|EDY84312.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 923 Score = 40.4 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 55/172 (31%), Gaps = 23/172 (13%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 SL + L +D+S + +V ++ ++ + D R+ Sbjct: 567 SLQTAAHGRASSQPLHLTLAIDTSGSMSRPDRVDIVNSLATALQSNLTEKD------RLS 620 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ + G T + T A N G T + A+Q +Y T ++ Sbjct: 621 IVSFDRQ---PRLVLDGQSVTAETNLATLATQLNPQGGTDLESALQLSYQTAQRHFQENA 677 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRI--MTIAF 405 ++L+TDG N + + +GI + I F Sbjct: 678 INR---------VILITDGAANLGNTNAEQLRTTVTENRIRGIALDCFGIGF 720 >gi|198426873|ref|XP_002129255.1| PREDICTED: similar to Collagen alpha-1(XIV) chain [Ciona intestinalis] Length = 725 Score = 40.4 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 48/337 (14%), Positives = 94/337 (27%), Gaps = 55/337 (16%) Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL-----LNPLSLFLRSMGIKS--- 155 +NF + V T V + K VV ++ L LN +L R ++S Sbjct: 19 QNFVKVDTSKQVDQTLVAGSSGKP---VVTAATQKLEAVPGFLNDFNLMTRIALVESNYG 75 Query: 156 ----------WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 W I T +A S+Q +++ + +L +++ F Sbjct: 76 ENNKGSLGGIWEITTSIQATLRGSLSS-----SVQTIVNEAGCIL--GQENTNLTNTDFS 128 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP----------------L 249 P + VG ++ + S Y Sbjct: 129 TPIRSAFAARIFIQQSVGSSAIPITLTQQATWWSTVYRPGANATKFIELVSQVENVTVGC 188 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + FV S LAS+ + + VR+G + ++ Sbjct: 189 ATHKLDLWFVIDGSGSVGFSNFQDSLRFLASLTKRFTIGPDD---VRVGFSVYSSTSTIH 245 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + H + G T+ A+ + + + + Sbjct: 246 SH--FNQHMNNSALEAEILGTSYTGGGTSTGRAINDVLNNGFVER-NGARPASEGVP--R 300 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 +V++TDG Q + + K+ GI + + Sbjct: 301 ILVVMTDG---QSGDSVKTPSDNVKAAGITVFGVGIG 334 >gi|118359890|ref|XP_001013183.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89294950|gb|EAR92938.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2138 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 60/184 (32%), Gaps = 28/184 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L+++ L ++ ++ D R+ F+ K +K Sbjct: 1463 LDLLKETLLFLVDLLQTGD------RICLIQFSTNAQRLTPLLSIESKDNIKSIKNEINR 1516 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG---- 386 G T I MQ A+D + + R KN + + + LL+DG N + Sbjct: 1517 LVAKGGTNICQGMQLAFDVL------KQRRYKNPITS---VFLLSDGLNDGAENKIRDLL 1567 Query: 387 --IAICNKAKSQGIRIMTIAFSV--NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + I T F + +K + +F+ H +++ F Sbjct: 1568 KQLNFYQNYNEENFTIQTFGFGKDHDPNLMDKISQLM-----DGNFYYIGDIHRIDECFI 1622 Query: 443 DRIG 446 D +G Sbjct: 1623 DALG 1626 >gi|19552620|ref|NP_600622.1| hypothetical protein NCgl1349 [Corynebacterium glutamicum ATCC 13032] gi|62390288|ref|YP_225690.1| Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] gi|21324171|dbj|BAB98796.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] gi|41325625|emb|CAF21414.1| secreted Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] Length = 525 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 50/308 (16%), Positives = 102/308 (33%), Gaps = 43/308 (13%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVI------DFSRSMLDYQRDSEGQPLNCFGQPAD 209 + ++ + VS + + D + + P + Sbjct: 236 LHTMISEDGADITVVVPADGVVSADYPLSTITGSDQGEHVAELAGWFAEHP-DALTDTYR 294 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 R + ++ ++ + +P+ S + + E FV S + + Sbjct: 295 RPTTANATLPAELSSQTIIEAPFPGSKTVTDALIDAYTNQFRVPGETTFVLDVSGSMLGQ 354 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI--------- 320 + L++D ++ +I D N ++R + +S FS+G H++I Sbjct: 355 RITLLKDTMSDLISGGATTDLANVSLR------DREKVSIIPFSFGPHEVISETLGAVGS 408 Query: 321 --RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 RT ++ G T I DA+ AY + IVL+TDGE Sbjct: 409 PSRTDLQQRVEALQADGGTGIYDAVLAAY----------AESAGGDYIPS--IVLMTDGE 456 Query: 379 NTQDN--EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTH 435 T ++ + N S I +I V + A+ F+A + Sbjct: 457 LTAGRTYDQFLTEWNALPSN---IRSIPVFVILYGEANVADMEQLAATTGGETFDAIN-G 512 Query: 436 ELNKIFRD 443 +L++ F++ Sbjct: 513 DLDEAFKE 520 >gi|323529406|ref|YP_004231558.1| hypothetical protein BC1001_5117 [Burkholderia sp. CCGE1001] gi|323386408|gb|ADX58498.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1001] Length = 353 Score = 40.4 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 55/163 (33%), Gaps = 5/163 (3%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV---- 75 G ++TAL + ++G+ + VD+ R + L+ AA A + + L + + Sbjct: 16 GAVAVMTALCLTALVGITALAVDLGRAWVVRNELQNAADAAALAGAGSLGPNYKSPNWTQ 75 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ S + L+ N + A QV +S Sbjct: 76 AAAKAQSAITLNKTEGVALVTAQVQTGYWNVKGTPAGMQALPVPAPGAYDRPAVQVTVSR 135 Query: 136 RYDLLLNPLSLFLR-SMGIKSWLIQTKAEAETVSRSYHKEHGV 177 PLSL L +GI + I A A + Y + Sbjct: 136 AAGQNGGPLSLVLAPVLGITTMPISATAVAVISAPGYAGPGAL 178 >gi|256962322|ref|ZP_05566493.1| von Willebrand factor [Enterococcus faecalis Merz96] gi|256952818|gb|EEU69450.1| von Willebrand factor [Enterococcus faecalis Merz96] Length = 1154 Score = 40.4 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 180 KNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 239 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 240 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 292 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 349 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 350 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 407 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 408 RDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|167911128|ref|ZP_02498219.1| hypothetical protein Bpse112_11583 [Burkholderia pseudomallei 112] Length = 579 Score = 40.4 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 24/56 (42%), Gaps = 1/56 (1%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G F ++ A+ M V + V G VD+ + L++ A A + + + + Sbjct: 2 GSFALVAAIWMLVAIAVLG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQP 56 >gi|47211020|emb|CAF94689.1| unnamed protein product [Tetraodon nigroviridis] Length = 2225 Score = 40.4 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 51/349 (14%), Positives = 108/349 (30%), Gaps = 50/349 (14%) Query: 125 RKSAYQVVLSSRYDLLLN-PLSLFLRSMGIK-SWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + Y V +++ Y + PLS R++ + S L+ + ++ + + S + Sbjct: 1108 SGTEYIVTVTASYSSGASQPLSGRARTLFLGVSHLLSYQVRTTSLCVQWQPQRHASTYRL 1167 Query: 183 IDFSRSMLDYQRDSEGQPLN--CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC---- 236 + S Q + CF Q T S S + S Sbjct: 1168 LTESLPSGQKQEVRLSGATSRHCFNQLEPNTRYSVSVHAQLADGSEGPAVTVTESTREFR 1227 Query: 237 --NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA----------------- 277 + P P L +S + + A Sbjct: 1228 ATAWDRPPVAVPTPRPTRLPSTTPAPTSPTPKDVCRAARADLAFLVDGSWSIGDDNFLKI 1287 Query: 278 ---LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 L S + ++ +I V + F+D ++ ++ +++ + + Sbjct: 1288 TRFLYSAVGALDRIGPEGTQVAI--VQFSDEPRTEVQLK--SYRKKERLLEAISSISYKG 1343 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T A+Q D++ + K +VLLTDG + ++ + + + Sbjct: 1344 GNTKTGRAIQHMKDSVFTEEGGAR------TAVPKVLVLLTDG---RSQDDVSKVSKELQ 1394 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 QG + I F+ + S +S + FF + F++ Sbjct: 1395 KQGFVVFAIGFA--DADYGELVNVASKPSSTHVFF-VDD----LDAFKE 1436 >gi|256965511|ref|ZP_05569682.1| von Willebrand factor [Enterococcus faecalis HIP11704] gi|256956007|gb|EEU72639.1| von Willebrand factor [Enterococcus faecalis HIP11704] Length = 1154 Score = 40.4 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 180 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 239 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 240 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 292 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 349 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 350 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 407 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 408 RDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|237735881|ref|ZP_04566362.1| predicted protein [Mollicutes bacterium D7] gi|229381626|gb|EEO31717.1| predicted protein [Coprobacillus sp. D7] Length = 965 Score = 40.4 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 44/286 (15%), Positives = 77/286 (26%), Gaps = 25/286 (8%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 +S + + ++D S SM + S + + K Y Sbjct: 601 SGNVKSTTVKAPQDVVLLLDKSGSMDESMNGS-SRLTHLKNNVIKFITKLYEHNPDSRVS 659 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA--SVI 282 S + + L + ++ IK + L V Sbjct: 660 VITFAYSADGSITNNNFVKLSDIKSGNETWYTYLTKNNGGIKNIKASGGTQIDLGLYEVR 719 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + N+ F D + F T D ++ G +A Sbjct: 720 NQLSSATGENNR---SVIVFTDGQPGNKGF------------NTSYNDYDDNGYRVGAEA 764 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQG 397 + A S N ++ + KY D N N+ G K G Sbjct: 765 LNQADFIKFSGNLTGINNYIESSNGSKYYGHKNDDITKNRSNNNSNDAGNRTNRSGKGLG 824 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I TI + N + FL+ AS + +AN++ + F Sbjct: 825 KTIFTIGLNSNNSS--LFDSFLTRLASEGHYTKANNSSAMENAFNS 868 >gi|114775649|ref|ZP_01451217.1| rubisco activation protein cbbO [Mariprofundus ferrooxydans PV-1] gi|114553760|gb|EAU56141.1| rubisco activation protein cbbO [Mariprofundus ferrooxydans PV-1] Length = 745 Score = 40.4 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 34/97 (35%), Gaps = 19/97 (19%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G + VK+ + ST + AM+ A H +K KK +++L Sbjct: 620 GFSERWGDEVKSRLAEMEAGFSTRMGGAMRHA-----------AHYLKAQQADKKLMLIL 668 Query: 375 TDGENTQDNE--------EGIAICNKAKSQGIRIMTI 403 TDGE + + + N+ GI I Sbjct: 669 TDGEPSDIDVDDEQLLIADARQAVNELDRDGIYSYCI 705 >gi|156358436|ref|XP_001624525.1| predicted protein [Nematostella vectensis] gi|156211311|gb|EDO32425.1| predicted protein [Nematostella vectensis] Length = 1323 Score = 40.4 bits (92), Expect = 0.71, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 64/182 (35%), Gaps = 14/182 (7%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 SC + ++ P P + +S ++ + L ++ +K N Sbjct: 1116 EFYSCPGEIPHLAKPCPKSLDIGIALDRSTSVGP---TNFNIAKTFLKILVERMKISTNG 1172 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + G ++ SF + + I + E G T + A+Q A + Sbjct: 1173 SHF---GLIAYSSSASRVISFRF--SQKAADINRQIDAIEFTGGKTRTDFALQVAITDLF 1227 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +++ + + + ++++T+G +Q + + K + + ++ + + + Sbjct: 1228 TNSAGDRENVTD------VLIVMTNGRTSQGSLPYKDVMKPLKEKKVDVIAVGIGPDVNE 1281 Query: 412 QE 413 E Sbjct: 1282 AE 1283 >gi|319777804|ref|YP_004134234.1| hypothetical protein Mesci_6053 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171523|gb|ADV15060.1| hypothetical protein Mesci_6053 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 343 Score = 40.4 bits (92), Expect = 0.71, Method: Composition-based stats. Identities = 26/219 (11%), Positives = 61/219 (27%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 + ++ ++ G VD R +QA A+++ + S Sbjct: 4 LFVFMLAPIITAIGFSVDYTRAVQTRSNEQQALDAAVLSITGMDTTSTLAQRQTMLQDTF 63 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + + +F + T + + T R + V + S + Sbjct: 64 IANHGLGTPTLNSFVVSANGTATAQAMASYSMPTVFMQIARINTVPVAVGSAASKTPALV 123 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 + + W +T T + + +SI++ + Y + N Sbjct: 124 QTTFKVSKVSGWWNKTMTLYGTTFGATVAKPLMSIEYTYNGFGDPKGYGTTNVYTITNNG 183 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 G V+S V + ++ + + Y Sbjct: 184 GADIKTLVQSQVCTTAGVSTFTGLTADAILQTSGTKKYS 222 >gi|228991799|ref|ZP_04151737.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] gi|228767939|gb|EEM16564.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] Length = 453 Score = 40.4 bits (92), Expect = 0.71, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 79/193 (40%), Gaps = 22/193 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-----WGVHKLIRTIV 324 K + ++A+ + + + NV+ V G ND S + + K +T + Sbjct: 178 KMDIAKEAIQQFVSDLPEAVNVSLRVY-GHKGSNDEKDKTASCGAIENVYTLQKYNQTTL 236 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + +G T + +A++ + +T S+ ++ + M Y+V +DG T Sbjct: 237 RQSLDGFQPVGWTPLAEAIKRSTETFQSAKANDKNIM--------YVV--SDGVETCGG- 285 Query: 385 EGIAICNKAKSQGIR-IMTI-AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + K + I+ IM I F V+ +++ + S + AN+ EL F+ Sbjct: 286 NPVEEAQKVSNSNIKPIMNIIGFQVDHEAEKQLKEIAE--VSKGKYVLANNAKELQDQFK 343 Query: 443 DRIGNEIFERVIR 455 + G +I R ++ Sbjct: 344 ET-GKDITSRRLK 355 >gi|294011132|ref|YP_003544592.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] gi|292674462|dbj|BAI95980.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] Length = 157 Score = 40.4 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 18/71 (25%), Positives = 34/71 (47%), Gaps = 1/71 (1%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ-TAIITASVPLIQ 70 K+L+ G+ + A++MPV++ + M DV + L++AA+ TA + A+ Sbjct: 3 KRLLSDRYGNSTVELAIIMPVLVLLTCMAGDVAMAFKAKIGLQRAAERTAQLAAAGGYTN 62 Query: 71 SLEEVSSRAKN 81 + S N Sbjct: 63 DTTDTSKAYNN 73 >gi|257086444|ref|ZP_05580805.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] gi|256994474|gb|EEU81776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] Length = 1154 Score = 40.4 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 58/335 (17%), Positives = 107/335 (31%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP------LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P + V+S K+ E I EN + N Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYEDNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 180 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 239 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 240 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 299 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 300 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 348 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 349 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 406 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 407 LRDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|77456864|ref|YP_346369.1| hypothetical protein Pfl01_0636 [Pseudomonas fluorescens Pf0-1] gi|77380867|gb|ABA72380.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 659 Score = 40.4 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 12/70 (17%), Positives = 24/70 (34%), Gaps = 1/70 (1%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ-SLEEVSSR 78 G ++ AL + + L ++VD R L+Q A A + A+ ++ Sbjct: 15 GAIGLMAALTLGMALVFILVVVDSGRLYLERRHLQQIADVAALEAATRGGNCGAGATANA 74 Query: 79 AKNSFTFPKQ 88 + Sbjct: 75 YAQASVVRNN 84 >gi|258616219|ref|ZP_05713989.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium DO] Length = 1095 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 51/333 (15%), Positives = 100/333 (30%), Gaps = 30/333 (9%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + I +++ +FP + N K + + + +D Sbjct: 89 DGYAYQVNSGKITLEISSNTKQTIDLSFPIDPALYHSQANKLIVDNKEYDIIDETENKKD 148 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 T V + P+ + S + ++P +L S+ S E + Sbjct: 149 TDVSV-PKPDEIEEESSKENENSVSPFTLPTLSLPAVSVPSNQTIPTEYTTDDQGTYPKA 207 Query: 178 SIQW-----VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S Q V+D + ++ +N + + SY G D + Y Sbjct: 208 SWQPTGNTNVLDHQG---NKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKY 264 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + +Y + VD S + + V+ + + ++ Sbjct: 265 AKETSTPGLFDVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLA 324 Query: 287 KIDNVNDTVRMGATFFNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D + MG ++ S + G ++ VK+ G T A++ Sbjct: 325 D-SGITDKINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSI-TPSRTNGGTFTQKALR 382 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 A + N KK IVLLTDG Sbjct: 383 DAGSMLSVPNGH-----------KKVIVLLTDG 404 >gi|167763115|ref|ZP_02435242.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] gi|167699455|gb|EDS16034.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] Length = 342 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 19/139 (13%), Positives = 48/139 (34%), Gaps = 18/139 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A V + + K++N +G F + + + +++ Sbjct: 112 LQKAKRLVAQLVDKMENDK----VGMIVFAGDAFTQLPITSDYISA-KMFLESIDPSLIS 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI A+ A + + ++++TDGEN + + + A Sbjct: 167 KQGTAIGAAINLASRSFT-----------PQEGVGRAVIVITDGENHEGDA--VEAAKDA 213 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 +GI++ + + + Sbjct: 214 AEKGIQVNVLGVGMPEGAP 232 >gi|257078107|ref|ZP_05572468.1| von Willebrand factor [Enterococcus faecalis JH1] gi|256986137|gb|EEU73439.1| von Willebrand factor [Enterococcus faecalis JH1] Length = 1154 Score = 40.0 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 180 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 239 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 240 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 299 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 300 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 348 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 349 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 406 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 407 LRDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|255976231|ref|ZP_05426817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] gi|255969103|gb|EET99725.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] Length = 1154 Score = 40.0 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 180 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 239 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 240 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 292 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 349 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 350 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 407 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 408 RDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|327439430|dbj|BAK15795.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 986 Score = 40.0 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 50/124 (40%), Gaps = 18/124 (14%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + + +NE ST I A++TA + N K IVL+TDG + + Sbjct: 770 IDSLLTYKNENRSTNIVKALETA-----------IGNFTTNQYTSKAIVLVTDGYSNSNG 818 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIF 441 E + AK +GI I T++ T EK L + +S ++ S L+ Sbjct: 819 LE--QVLRDAKLKGIAIHTVSVGSYTTVNEK---LLKDISSETNGTYQNITSIENLHGSL 873 Query: 442 RDRI 445 + I Sbjct: 874 QAII 877 >gi|254436533|ref|ZP_05050027.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] gi|198251979|gb|EDY76293.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] Length = 613 Score = 40.0 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 44/137 (32%), Gaps = 18/137 (13%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P L+ +D+S K L+R + ++ +++ D V G+T Sbjct: 252 PPLNLVFLIDTSGSMESADKLPLLRQSFRLMLDNLRPEDEVAIVTYAGSTSIALEPTQAS 311 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 R + N GST ++ AY MK + + Sbjct: 312 E---------RATIIAALNALNAGGSTNGQGGLEQAY--------ALAETMKTAGDVSR- 353 Query: 371 IVLLTDGENTQDNEEGI 387 ++L TDG+ + Sbjct: 354 VILATDGDFNVGLSDPR 370 >gi|157831431|pdb|1IDO|A Chain A, I-Domain From Integrin Cr3, Mg2+ Bound Length = 189 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 21/110 (19%), Positives = 40/110 (36%), Gaps = 10/110 (9%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 77 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 +A +G+ I + + EK+R L+ AS + F+ N+ L Sbjct: 131 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEAL 179 >gi|31615649|pdb|1N9Z|A Chain A, Integrin Alpha M I Domain Mutant Length = 192 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 21/110 (19%), Positives = 40/110 (36%), Gaps = 10/110 (9%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 76 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 129 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 +A +G+ I + + EK+R L+ AS + F+ N+ L Sbjct: 130 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEAL 178 >gi|31615583|pdb|1MF7|A Chain A, Integrin Alpha M I Domain Length = 194 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 21/110 (19%), Positives = 40/110 (36%), Gaps = 10/110 (9%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 76 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 129 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 +A +G+ I + + EK+R L+ AS + F+ N+ L Sbjct: 130 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEAL 178 >gi|22219356|pdb|1M1U|A Chain A, An Isoleucine-Based Allosteric Switch Controls Affinity And Shape Shifting In Integrin Cd11b A-Domain Length = 195 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 21/110 (19%), Positives = 40/110 (36%), Gaps = 10/110 (9%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T ++ + + K +V++TDGE D + Sbjct: 83 TQLLGRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVI 136 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 +A +G+ I + + EK+R L+ AS + F+ N+ L Sbjct: 137 PEADREGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEAL 185 >gi|113460773|ref|YP_718840.1| hypothetical protein HS_0628 [Haemophilus somnus 129PT] gi|112822816|gb|ABI24905.1| conserved hypothetical protein, with von Willebrand factor (vWF) domain [Haemophilus somnus 129PT] Length = 212 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 51/172 (29%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR L +++ ++++ +T + F+ + ++ Sbjct: 21 IESVRSGLQTLVSALRQDPYALETAYLSVITFDSSARQVTPLT--------DLMSFQLPS 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G TA+ +A+ D + +K + + + LLTDG T D + GI Sbjct: 73 IEASGLTAMGEALGLLADCVNREVNKGSAEVKGDWKP--VVFLLTDGIPTDDLQSGIFAL 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K T F L + + + F+ Sbjct: 131 KNVK-------TGTFVACAAGAGADTEELKKITETVVSLDTADANSIKAFFK 175 >gi|293569888|ref|ZP_06680975.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] gi|291587636|gb|EFF19513.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] Length = 1277 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 51/333 (15%), Positives = 101/333 (30%), Gaps = 30/333 (9%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + I +++ +FP + N K + + + +D Sbjct: 89 DGYAYQVNSGKITLEISSNTKQTIDLSFPIDPALYHSQANKLIVDNKEYDIIDETENKKD 148 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 T V + P+ + S + ++P +L S+ S E + Sbjct: 149 TDVSV-PKPDEIEEESSKENENSVSPFTLPTLSLPAVSVPSNQTIPTEYTTDDQGTYPKA 207 Query: 178 SIQW-----VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S Q V+D + ++ +N + + SY G D + Y Sbjct: 208 SWQPTGNTNVLDHQG---NKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKY 264 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + +Y + VD S + + V+ + + ++ Sbjct: 265 AKETSTPGLFDVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLA 324 Query: 287 KIDNVNDTVRMGATFFNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D + MG ++ S + G ++ VK+ G T A++ Sbjct: 325 D-SGITDKINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSI-TPSWTNGGTFTQKALR 382 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 A + + N KK IVLLTDG Sbjct: 383 DAGNMLSVPNGH-----------KKVIVLLTDG 404 >gi|291405306|ref|XP_002719067.1| PREDICTED: integrin, alpha E [Oryctolagus cuniculus] Length = 1187 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 24/104 (23%), Positives = 37/104 (35%), Gaps = 10/104 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A+Q D I + + K IV+LTDG+ +D + N K Q Sbjct: 295 TRTASAIQHVLDDIFTQRHGSRKKA------SKVIVVLTDGDTFEDPLNLTVVINSPKMQ 348 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 GI I + ++ + L AS F+ + L Sbjct: 349 GIERFAIGVG-DAFKKHQTEQELKLIASDPDETHAFKVTNYSAL 391 >gi|27366553|ref|NP_762080.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] gi|27358119|gb|AAO07070.1| Protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] Length = 426 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 47/368 (12%), Positives = 103/368 (27%), Gaps = 54/368 (14%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 IK G II ++P+++ + + + + + +AA+ A + + E+ Sbjct: 3 IKKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLALIASPKEGDEK 62 Query: 75 VSSRAKNSFTFP-KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 A+ E + R F + + + D V+ + + Sbjct: 63 NQEYAQKIVDHYIPDNKGEVVARVF--HRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDS 120 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 Y+ L+ MG T + + I ++ID S SM++ Sbjct: 121 WISYNDGEMGLTKDFEVMG-------------TSTSRKFLPQPLDIYFIIDMSGSMVNPW 167 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S + +R V R + + + Sbjct: 168 GGSGKTKYDVVADTINRIVDDLREFKTDRKSRVAVIGFH------------HTAVKQVGR 215 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 F SS + ++ + KI + ND N + D + Sbjct: 216 QRTAFDYSSYRTPSAT--------VNNMFTA-PKIHSRND-------SSNIKTFEDIPLT 259 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + + ++ II + + +L ++ +L Sbjct: 260 EDYDAFLTKFNSSNYYA--------SRYGLTESWQGIIGAAQMAEQA--TDLNPEQVFIL 309 Query: 374 LTDGENTQ 381 L+DG + Sbjct: 310 LSDGRDGD 317 >gi|87306384|ref|ZP_01088531.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] gi|87290563|gb|EAQ82450.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] Length = 1030 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 14/81 (17%), Positives = 35/81 (43%), Gaps = 6/81 (7%) Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-N 421 N A K++++++DG+ + N ++ GI++ T+A + Sbjct: 541 PNEVAVKHMIIISDGDPSPANPFTLS---AIAKAGIKVTTVAIG--THGPANSLELKKIA 595 Query: 422 CASPNSFFEANSTHELNKIFR 442 A+ ++E + L +I++ Sbjct: 596 SATGGKYYEVTNPKALPRIYQ 616 >gi|224054053|ref|XP_002190891.1| PREDICTED: collagen, type VI, alpha 2 [Taeniopygia guttata] Length = 1016 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 76/218 (34%), Gaps = 23/218 (10%) Query: 233 MVSCNKSLYYMLYPGPLDPSLS----EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + C+ Y G D + FV SS L ++ + +V+ + I Sbjct: 583 LTDCDVMTYVRETCGCCDCEKRCGALDIMFVIDSSESIGYTNFTLEKNFVINVVSRLGSI 642 Query: 289 --DNVNDT-VRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D + T R+G ++ + + + E G T A+Q Sbjct: 643 AKDPKSLTGARVGVVQYSHEGTFEAIKLDDERIDSLSSFKEAVKRLEWIAGGTWTPSALQ 702 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRIMT 402 AY+ +I + E A+ + V++TDG + D++ A+C + + + T Sbjct: 703 FAYNKLIKESRREK--------AQVFAVVITDGRYDPRDDDKNLGALC----GRDVVVNT 750 Query: 403 IAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNK 439 I Q E++ +S C P + +L Sbjct: 751 IGIGDMFDQPEQSETLVSIACNEPQRVQKMRLFSDLVA 788 >gi|331006836|ref|ZP_08330094.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] gi|330419347|gb|EGG93755.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] Length = 693 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 29/115 (25%), Positives = 49/115 (42%), Gaps = 20/115 (17%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG------ENTQDNEEGIAIC 390 T I A++ A D +MK + + + +++LLTDG +N + +E I Sbjct: 105 TNIGAALEKA-------AYDHKQQMKADQDYQTHVILLTDGMVDIDRDNRLNKKERQRIL 157 Query: 391 NKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 N + GI + TIA S N ++ + L A+ A + EL +F Sbjct: 158 NNVLPMYQQSGITLHTIALSDNADKKLLNKLAL---ATDGKVSVAKNAEELMNVF 209 >gi|330448513|ref|ZP_08312161.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492704|dbj|GAA06658.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 899 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 33/278 (11%), Positives = 88/278 (31%), Gaps = 30/278 (10%) Query: 168 SRSYHKEHGVSIQWVIDFSRSM----LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + ++ V+I +VID S SM + + + D +S K+ Sbjct: 365 TPPQAEQQPVNINYVIDISGSMYYGRMLTLDAVKDHVAKSYEVYVDSNSTLTASDGTKLS 424 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 R ++ L Y + + + + +V+ + ++ + Sbjct: 425 DRPGWVTVSYDQLKAGLQYDAGSRAGIDIKASDGEYFHTKFDALPYLMDIVKQSYQTLTQ 484 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH-----KLIRTIVKTFAIDENEMGSTA 338 I + + F++ V + +F + ++ + D G T Sbjct: 485 EILHNIDDKSKLEFNIVTFSNAVRGNTTFHYDDSTHQFVNKQNVTIENYIHDLTAGGGTQ 544 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 ++ A I S++ + + L+DG+ +E+ + +G Sbjct: 545 FEWPLKDASAHITDSSK------------RNVVYFLSDGK----DEDKLDTTGIHFLKGT 588 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 I++I + ++ A + ++ ++ + Sbjct: 589 EIVSIGVGPSADAKQMGE-----IAQMGTGYDKDNPNA 621 >gi|149725809|ref|XP_001495640.1| PREDICTED: similar to leukocyte adhesion glycoprotein [Equus caballus] Length = 1140 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 73/206 (35%), Gaps = 18/206 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+ F+ S +++ + V+ KK + ++ TF Sbjct: 144 CPQQESDIAFLIDGSGSIYENDFQKMKEFVTIVMNQFKKSKTLFSLMQYSDTFQTHFTFK 203 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + + L+R I + G T ++ + S Sbjct: 204 EFANNPNPGSLVRPINQL-------GGRTHTATGIRKVVRELFHSRNGARKNAL------ 250 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D E + +A +GI I + + EK+R L+ AS Sbjct: 251 KILVVITDGEKFGDRLEYEDVIPEADQEGIIRYVIGVGIAFS-IEKSREELNTIASKPAR 309 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 + F N+ L I ++++ +IF Sbjct: 310 DHVFRVNNFEALKTI-QNQLQEKIFA 334 >gi|203287632|ref|YP_002222647.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] gi|201084852|gb|ACH94426.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] Length = 341 Score = 40.0 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 53/165 (32%), Gaps = 22/165 (13%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F + + + + +D +A+ + A + Sbjct: 142 IGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDL--GNGSALGLGISIALSHL------ 193 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA- 415 K++ KK +++LTDG D + N A+ ++I ++ ++ Sbjct: 194 -----KHSEAPKKSVIVLTDGVVNSDEVYKDQVINLAQGLNVKIYSVGIGSDEELNVGFK 248 Query: 416 -------RYFLSNCASPNSFFEANS-THELNKIFRDRIGNEIFER 452 + L P+ FE ++ T L D ++ + Sbjct: 249 LRSGKFYQGVLKEVYDPSMLFEISNKTGGLFYSVGDDFSFKLAIQ 293 >gi|170743237|ref|YP_001771892.1| cell wall anchor domain-containing protein [Methylobacterium sp. 4-46] gi|168197511|gb|ACA19458.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium sp. 4-46] Length = 761 Score = 40.0 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 44/138 (31%), Gaps = 15/138 (10%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +R A AS++ ++ ++ + R F+D + + R + F Sbjct: 390 MRQAKASLLVALDRLGPAD---RFNVIRFDDTMDLLFPAPVPADEAHRDAARRFVAALEA 446 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T + ++ A D V + IV LTDG NEE I A Sbjct: 447 RGGTEMLPPLRAALADPHPEEGDRV----------RQIVFLTDGA--IGNEEQIFSAISA 494 Query: 394 KSQGIRIMTIAFSVNKTQ 411 R+ I Sbjct: 495 GRGRSRLFMIGIGSAPNG 512 >gi|18700173|gb|AAL77698.1| At2g38970/T7F6.14 [Arabidopsis thaliana] Length = 692 Score = 40.0 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 39/113 (34%), Gaps = 16/113 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R Sbjct: 267 TKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFPLT-KMSDAGRQRALQAV 319 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 G T I + ++ + R KN + + I+LL+DG T Sbjct: 320 NSVVANGGTNIAEGLRKGVKVMED------QRDKNPVAS---IILLSDGRATY 363 >gi|315181058|gb|ADT87972.1| hypothetical protein vfu_A02859 [Vibrio furnissii NCTC 11218] Length = 406 Score = 40.0 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 25/214 (11%), Positives = 65/214 (30%), Gaps = 11/214 (5%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + I+ G +I + M ++ V +D+ L+ + A + ++ Sbjct: 8 RGIRKQRGLVVVIVTIAMLALIAVAAFAIDINHAMMNRTKLQNSVDAA--ALAAAIVLDK 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR-----DIVRDTAVEMNPRKS 127 + ++A K+ + + EV+ + D+ +P Sbjct: 66 DGTEAQADTIARSTLTKMSTAAGNAELTLDVSDVVNVEVQFSNDPTVFPDSGYSSSPDGD 125 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y V+ ++ DL F R++G+ L + + + +++ D + Sbjct: 126 RYVRVVINQLDLE---SFFFARALGVTKRLTASAVAGPSPGGNACNIVPMAVCEG-DDAG 181 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 + ++ P + G Sbjct: 182 TNGYDSGVVYALKISSQSDPTMGSGNFQLLDFGS 215 >gi|313238993|emb|CBY13979.1| unnamed protein product [Oikopleura dioica] Length = 676 Score = 40.0 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 49/134 (36%), Gaps = 21/134 (15%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + + ++ +K + T R+G F ++ WG+ + MG T Sbjct: 235 VRNFLKEVKSY--MGSTSRIGVVRF----ATNSELIWGLDDGLDGFTNAVTNMAWTMGGT 288 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 + AYD ++ ++ A K IVL+TDG A+ + Sbjct: 289 YTAKGLDLAYDHML---------LRGRRSATKTIVLMTDG------YTSNQSAYDAEIKK 333 Query: 398 IRIMTIAFSVNKTQ 411 I+ +++F++ Sbjct: 334 IKGSSVSFNLQAIG 347 >gi|332665830|ref|YP_004448618.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332334644|gb|AEE51745.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 630 Score = 40.0 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 72/204 (35%), Gaps = 23/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + VD S K LV+ + + ++ D V V GA Sbjct: 253 PVENLPAANLVFLVDVSGSMSAANKLPLVQASYKLLAEQLRPQDRVAIVVYAGAAG---- 308 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 ++ + + K+ I K GSTA + + AY ++ E+ + N Sbjct: 309 LVLESTTGNNKTKIKEAIDK-----LQAGGSTAGGEGILLAYK---TAKENFIKGGNNR- 359 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 ++L +DG+ + E + I + + G+ + + + + + K + Sbjct: 360 -----VILASDGDFNVGVSSDGELVRIIEEERKSGVYLTILGYGMGNYKDNKMQKLAD-- 412 Query: 423 ASPNSFFEANSTHELNKIFRDRIG 446 + + ++ E ++ G Sbjct: 413 SGNGNHAYIDNLDEARRVLVSEFG 436 >gi|256618671|ref|ZP_05475517.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] gi|256598198|gb|EEU17374.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] Length = 1154 Score = 40.0 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 180 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 239 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 240 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 299 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 300 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 348 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 349 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 406 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 407 LRDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|307274607|ref|ZP_07555787.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] gi|306508759|gb|EFM77849.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] Length = 1103 Score = 40.0 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 47/328 (14%), Positives = 99/328 (30%), Gaps = 28/328 (8%) Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + S+ + QS + + K + + + + + K T+ E + +T Sbjct: 68 VQLSLAVEQSSIQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKET 127 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA----ETVSRSYHKEH-- 175 + + + + + NP + + + A S +Y + Sbjct: 128 NKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQ 187 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 Q V++ + + N SY G D + Y Sbjct: 188 PTGNQNVLNHQGNKDGSAQWDGQTSWNGD---PTNRTNSYIEYGGTGDQADYAIRKYARE 244 Query: 236 CNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + +Y E VD S + + V+ + + ++ Sbjct: 245 TTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSG 304 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N+ + MG ++ ++ + G ++ +K + G T A++ A D Sbjct: 305 ITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKALRDAGDM 362 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + N KK IVLLTDG Sbjct: 363 LATPNGH-----------KKVIVLLTDG 379 >gi|12583699|dbj|BAB21479.1| integrin alpha Hr1 precursor [Halocynthia roretzi] Length = 1332 Score = 40.0 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 71/195 (36%), Gaps = 11/195 (5%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 PS + V K V+D + ++ + D + VR+G ++ V Sbjct: 201 PSSGVDVLFVLDGSGSVGKNFDKVKDWVKNITAKL---DIGKEIVRVGVVQYSHYVEGKS 257 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA--K 368 K I T + + A++ Y T +V R ++ K Sbjct: 258 I---NKQKYITTEISIGEFKLLDNFENAVDRIQLQGYTTYTGRALQKVIRDFDDAYIGNK 314 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 + ++LLTDG+ +DN+ + N+ +++GI + + + + S S + Sbjct: 315 QVLLLLTDGQA-KDNKLILPNANRLRNKGIATFAVGVG--EYDISELKLIASGTDSTDRV 371 Query: 429 FEANSTHELNKIFRD 443 F EL+ I + Sbjct: 372 FTVTDFGELDSIVKS 386 >gi|315028044|gb|EFT39976.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2137] Length = 1103 Score = 40.0 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 58/335 (17%), Positives = 107/335 (31%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP------LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P + V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYEDNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 189 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 248 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 249 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 298 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 355 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 356 LRDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|312899566|ref|ZP_07758892.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] gi|311293245|gb|EFQ71801.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] Length = 1103 Score = 40.0 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|257089500|ref|ZP_05583861.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] gi|256998312|gb|EEU84832.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] Length = 1154 Score = 40.0 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 180 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPRYTTDNSGTYPTANWQP 239 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 240 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 299 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 300 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 348 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 349 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 406 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 407 LRDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|115749084|ref|XP_001197592.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115959785|ref|XP_001193076.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 966 Score = 40.0 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 55/167 (32%), Gaps = 15/167 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ I N V + FN I D + + R + + + G+T I + Sbjct: 342 IQSIVPNNSYVAI--VEFNYESIVDSYMTELTSVISRKDLASL-LPTLADGATCIGCGIV 398 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA ++ D Y++LL+DGE + + G+ + +IA Sbjct: 399 TAIQVAQYNDMDSRGV---------YLILLSDGEENHGT-PIADTMDDIEGSGVIVHSIA 448 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 F TQ E ++ + + S + F I Sbjct: 449 FYEADTQLEDLAQMTGGISATCA--DGGSAQCVISAFVSIIAQRPQS 493 >gi|320537259|ref|ZP_08037219.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145887|gb|EFW37543.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 332 Score = 40.0 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 34/241 (14%), Positives = 68/241 (28%), Gaps = 41/241 (17%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 K+ Y + + ++ + + +D + +++ + Sbjct: 80 KTEYVYTETANSIMFVIDISPSMAAKDINEKTRIQAAKDIITDFVQTY----PADAF--- 132 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G T + + + + TA+ + A Sbjct: 133 GLTALASTAALVIPPTIQHEQFFARLNSLQIGEL--GEGTALGMGLAVA----------A 180 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS---------VN 408 H KN ++ + I+LLTDGE+ KS+ I I V+ Sbjct: 181 AHFAKNTVKTQS-IILLTDGESNTGEIHPNLAAELIKSKKIGFYIIGIGKDGYANLEYVD 239 Query: 409 KTQQEKARYFLSNCASP-----------NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + EK L + + A S L +IF++ I I R + Sbjct: 240 PSTGEKREGTLQTIFNERELRELAHRGNGIYVSAKSFASLQEIFKN-ISQNISPTPARFS 298 Query: 458 K 458 + Sbjct: 299 E 299 >gi|315650876|ref|ZP_07903919.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] gi|315486855|gb|EFU77194.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] Length = 526 Score = 40.0 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 70/212 (33%), Gaps = 23/212 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + +D S+ K LV++A + +K+ D ++ G Sbjct: 157 AKNTDTTTKPSNLVFLIDVSASMDEPDKLPLVKNAFLLLCDELKENDTISIVTYAGTDSV 216 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 G + + + D GSTA +D ++TAY + E + Sbjct: 217 VLE---------GAKGSDKKSIMSAIEDLTAGGSTAGSDGIKTAYKIAEKYFKTEGNNR- 266 Query: 363 NNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +VL TDG+ E I + K K I + + F + + K + Sbjct: 267 --------VVLATDGDLNVGITSEGELIKLIKKEKESNIFLSVLGFGTDNIKDNKMQSLA 318 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 N ++ +S E K+ D +G F Sbjct: 319 DN--GDGNYSYIDSRFEAKKVLSDELGANFFT 348 >gi|255973237|ref|ZP_05423823.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257418922|ref|ZP_05595916.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] gi|255964255|gb|EET96731.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257160750|gb|EEU90710.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] Length = 1154 Score = 40.0 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 180 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 239 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 240 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 299 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 300 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 348 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 349 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 406 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 407 LRDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|119899150|ref|YP_934363.1| hypothetical protein azo2860 [Azoarcus sp. BH72] gi|119671563|emb|CAL95476.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 343 Score = 40.0 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 14/94 (14%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------LS 420 IVLLTDGE T A A G+R+ T+ + + L Sbjct: 219 IVLLTDGERTTGPPLDFAT-RLAADHGVRVYTVGVGTAEGGVVGYEGWSMRVRLDEAALK 277 Query: 421 NCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + A + +F A S L I+R ++G + + Sbjct: 278 SIADETRGEYFHAQSAEALRTIYR-KLGTRLTLQ 310 >gi|16126967|ref|NP_421531.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|221235756|ref|YP_002518193.1| hypothetical protein CCNA_02820 [Caulobacter crescentus NA1000] gi|13424325|gb|AAK24699.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|220964929|gb|ACL96285.1| conserved hypothetical protein [Caulobacter crescentus NA1000] Length = 629 Score = 40.0 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 55/374 (14%), Positives = 98/374 (26%), Gaps = 66/374 (17%) Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE-TVSRSYHKEHGVSIQWVI 183 + Y V + LN + L+S I+ ++ + + HG Sbjct: 276 NGAIYTVTRLDSNVVYLNGIDTRLKSNSPSGGTIRKCLTSDCNLVVTTSAAHGF------ 329 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + + + LN + S +C +S Sbjct: 330 -DTGDQISFANLGGLTTLNGQTYSITDLTSNTFDTGVPGLGTAAFTSGGTATCEQSTTPG 388 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + S+ DA S + A ++ Sbjct: 389 CRRLAYVSNWGTNEVRALSTCVSERTGADAYTDAAPS------TAFVGTNYPSTSADSYS 442 Query: 304 DR---VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS------N 354 S L I GSTA + + + + + Sbjct: 443 PNPCPSAKITPLSSDKTALKAQINNYSV-----GGSTAGQIGLAWGWYMVAPNFGYIWPS 497 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGE------------------------------NTQDNE 384 + + + K ++++TDG N Sbjct: 498 ASQRPAAYKSKDLMKVVIMMTDGAFNTPYCNGVIAANAGIGSGSDEDHINCNATNGDPFA 557 Query: 385 EGIAICNKAKS--QGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKI 440 + A+C K+ I + T+ F+V A+ FL++CAS +S FF A + EL Sbjct: 558 QARALCTVIKNSANDITLYTVGFAV--GSDYTAKTFLTDCASDSSKAFFPATGS-ELKAS 614 Query: 441 FRDRIGNEIFERVI 454 F I EI I Sbjct: 615 FT-AIAREISSLRI 627 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 46/332 (13%), Positives = 93/332 (28%), Gaps = 45/332 (13%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 ++++L + G I ALL + + ++DV R S ++ A A + A+ Sbjct: 18 FARRLRRDDRGAIAIQFALLALPLSILLFGVLDVSRLSLQRRQMQDALDAATLMAARSAA 77 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + ++ D + + + S + Sbjct: 78 TASADL----------------------------DTTGDAAFLAEIAGMNLGLTASSSTF 109 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V +R + + ++ A +E V S + E V+D + SM Sbjct: 110 SVGTGNRVIGTATATLKPIIANLWQAGDFTVTATSEVVRSSKNLEVA----LVLDITGSM 165 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + R ++ L PY N Y + GP+ Sbjct: 166 SGTRIADLKVAASDLVDIVIRDTQT-------PFYSKVALVPYAAGVNVDTYADMARGPI 218 Query: 250 DPS--LSEEHFVDSSSLRHVIKKKHLV--RDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +S+R V K + D V I ++ V N Sbjct: 219 PVRNISNVAWLATGTSIRGVTKALPALLWSDNHGLVTGDRVFISGISGGVLTSMASLNGA 278 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + + + + + T + G T Sbjct: 279 IYTVTRL--DSNVVYLNGIDTRLKSNSPSGGT 308 >gi|58616382|ref|YP_195512.1| tellurium resistance protein [Azoarcus sp. EbN1] gi|56315844|emb|CAI10488.1| tellurium resistance protein [Aromatoleum aromaticum EbN1] Length = 212 Score = 40.0 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 54/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ L ++ +++ +TV + F+ + + +V+ D Sbjct: 21 IEAVKVGLQTMTSALRTDPYALETVHLSVITFSQQATQSVPLT--------ELVQFQVPD 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+TA+ +A+ I + K + + + LLTDG+ T D GIA Sbjct: 73 IHASGTTALGEALALLAQCIERDVKKTTPDSKGDWKPVCF--LLTDGQATDDLNRGIAAL 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K L + ++ L F+ Sbjct: 131 RNVKMG-------TLVACAAGPSANTAELRKITESVVSLDTADSNTLKAFFQ 175 >gi|145493674|ref|XP_001432832.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124399947|emb|CAK65435.1| unnamed protein product [Paramecium tetraurelia] Length = 618 Score = 40.0 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 62/199 (31%), Gaps = 23/199 (11%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 ++ NGK + + DS S+ Sbjct: 393 TKEKFTTFWKNNGKEFCQYRFSREGKEFKFVENQVEIQAAQQPSFHYIILLDDSGSMSG- 451 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI-SDPSFSWGVHKLIRTIVKT 326 A +I S+ + + +R+ FND S + + + +V Sbjct: 452 ----DRFNQAQNGLISSLSSAKDNQN-IRVTIIIFNDNARCVVDSQTINMQTIKNAVV-- 504 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL-LTDGENTQDNEE 385 G T+ A Q AY I + +KN + K+++ TDG ++ + Sbjct: 505 -----CNGGGTSFQSAFQLAYQKIAA--------VKNFEQFNKHVIFFYTDGGDSYPTQA 551 Query: 386 GIAICNKAKSQGIRIMTIA 404 N ++Q ++I IA Sbjct: 552 LNQFANLPQAQRMKIDLIA 570 >gi|119512060|ref|ZP_01631154.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119463286|gb|EAW44229.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 418 Score = 40.0 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 50/157 (31%), Gaps = 25/157 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A ++ +K D R+ F+ R +K Sbjct: 59 LETVKQAAIGLVDKLKPGD------RLSVVAFDHRATVLVP---NQTITNPGQIKKQINS 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAI 389 G TAI++ ++ + + ++ V + LLTDGEN DN+ + Sbjct: 110 LTADGGTAIDEGLRLGIEELAKGKKETVSQAF----------LLTDGENEHGDNQRCLKF 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 A + + T+ F + + L A Sbjct: 160 AQLATGYNLTLNTLGF-----GDKWNQDVLEKIADAG 191 >gi|303241097|ref|ZP_07327606.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302591357|gb|EFL61096.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 689 Score = 40.0 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 63/183 (34%), Gaps = 26/183 (14%) Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 NGKV + + K + + +Y +D ++ + + + + + A+ Sbjct: 287 NGKVNFSASQPIIHGRVIAKEIGFYMYKLTVDSRGTDSYLLPFLKYVEYETRLYKEKTAV 346 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST- 337 S I S +D ++G + ++ + + +I+K + + Sbjct: 347 KSFIDSFSS----DDRTKVGVVSYAEKAKIVSGLTSTLDTTNVSIMKANIDALSPSNAIV 402 Query: 338 ------------AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 I DAM+ AY + +S + KYI++LTD + + Sbjct: 403 SNVDGSYKSSRRNIGDAMRRAYYMLKNSTDAGA---------SKYIIVLTDEISNSWTWD 453 Query: 386 GIA 388 I Sbjct: 454 SIT 456 >gi|164687484|ref|ZP_02211512.1| hypothetical protein CLOBAR_01125 [Clostridium bartlettii DSM 16795] gi|164603258|gb|EDQ96723.1| hypothetical protein CLOBAR_01125 [Clostridium bartlettii DSM 16795] Length = 261 Score = 40.0 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 56/165 (33%), Gaps = 18/165 (10%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y L P+ FV +S KK + + +I + + N ++ Sbjct: 12 YDPLEAKPITKKSMVIFFVIDTSESMKGKKIDDLNRVMREIIPQLAGVGGCNTELKYAVL 71 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ W + + D + G T + A++ + S + Sbjct: 72 SFSSGC------KWITSEPMIVEDNNSWKDLSANGITD----LGMAFEELTSMLSRKKFL 121 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-----GIRI 400 +L I L+TDG T + ++G+ K K+ G++I Sbjct: 122 KSPSLSYAPVIFLMTDGYPTDNYKKGL---EKLKNNNWYKYGMKI 163 >gi|294780961|ref|ZP_06746313.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|307268799|ref|ZP_07550167.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|307286933|ref|ZP_07567011.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|294451907|gb|EFG20357.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|306501991|gb|EFM71279.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|306514927|gb|EFM83474.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|315031654|gb|EFT43586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0017] gi|315034905|gb|EFT46837.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0027] gi|315165584|gb|EFU09601.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1302] Length = 1103 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 189 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 248 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 249 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 298 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 355 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 356 LRDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|307271265|ref|ZP_07552544.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] gi|306512014|gb|EFM81005.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] Length = 1103 Score = 40.0 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|225352478|ref|ZP_03743501.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156985|gb|EEG70354.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 810 Score = 40.0 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 54/165 (32%), Gaps = 17/165 (10%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 S S + G D+ S + S + P D L + + K+ Sbjct: 60 NSKSIAKIEGGDGDQYALHLTASGDSSSSTVTTAVPADIVLVLDKSGSMKNSNRDTNAKN 119 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + ++ + V+M F+DR + F+ + A+ Sbjct: 120 AATALASKLLTAANAALPAEQQVQMAVVTFSDRARTTSQFTTSPGAI------GTAVSAW 173 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 G T DA++TA + + S +K+IV L+DG Sbjct: 174 PNGGTNWEDALKTA-NDLSS----------GRSGVQKHIVFLSDG 207 >gi|145594605|ref|YP_001158902.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303942|gb|ABP54524.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 436 Score = 40.0 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 46/113 (40%), Gaps = 12/113 (10%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K G T + A+++A + + A++ IVL+TDGE+T Sbjct: 125 KAAVATLRPTGYTPVGLALRSAAEDLG-----------TGSTARR-IVLITDGEDTCAPP 172 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + + +QG +++ + ++ + + A+ ++ A S EL Sbjct: 173 DPCEVARELAAQGTKLVVDTLGLAPDEKVRQQLLCIAGATGGTYTAAQSADEL 225 >gi|34540039|ref|NP_904518.1| von Willebrand factor type A domain-containing protein [Porphyromonas gingivalis W83] gi|34396350|gb|AAQ65417.1| von Willebrand factor type A domain protein [Porphyromonas gingivalis W83] Length = 1226 Score = 40.0 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 69/228 (30%), Gaps = 38/228 (16%) Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 YQ SE L R + S + + SP +V Y + P+DP Sbjct: 63 YQSVSEHSDLVISPVNEIRPANRFPSHRKSFFAENLRASPPVVPVAVDKYAVPVANPMDP 122 Query: 252 SLSEEHFVD------------------SSSLRHVIKKKHLVRDALASVIRSIKKIDNV-- 291 V S + ++ A+AS R +KK+ Sbjct: 123 ENPNAWDVTLKITTKAVTVPVDVVMVIDQSSSMGGQNIARLKSAIASGQRFVKKMLPKGT 182 Query: 292 -NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + VR+ ++ F+ L + I + T ++ A + + Sbjct: 183 ATEGVRIALVSYDHEPHRLSDFTKDTAFLCQKI-----RALTPIWGTHTQGGLKMARNIM 237 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENT-QDNEEGIAICNKAKSQG 397 +S + H I+L++DG T Q + + + G Sbjct: 238 ATSTAVDKH-----------IILMSDGLATEQYPVKNVTTADFIGKTG 274 >gi|218460583|ref|ZP_03500674.1| hypothetical protein RetlK5_14236 [Rhizobium etli Kim 5] Length = 309 Score = 40.0 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 43/329 (13%), Positives = 100/329 (30%), Gaps = 70/329 (21%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + A+V I ++E+ + K + +++ + +I DTA Sbjct: 5 AALIAAVKQIDNVEDADTLKKKVSDWFHAQVDN---------------SYSLGEIEIDTA 49 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + + ++ + + F++ I+S + + + + SY +++ Sbjct: 50 --------NHNITATASGTVP----TTFMKIANIESVDVSVASAVKGPATSY-----LNV 92 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 VID S SML S + Sbjct: 93 YIVIDTSPSMLLAATTSGQATM-------------------------------YAGIGCQ 121 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + SS + + + + DA+ V+ I + D ++ +++G Sbjct: 122 FACHTGDAHTIGKKKYANNYEYSSEKKIKLRADVAGDAVREVLDMIDESDANHERIKVGL 181 Query: 300 TFFNDRVIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 D + P+ S + +T + ++ T + + + Sbjct: 182 YSLGDTLTEVLTPTLSTDTARTRLADASYGLTSSTSKAATYFDVSLATLKQKVGAGGDGS 241 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + K ++LLTDG +Q Sbjct: 242 -----TSGSPLKLVLLLTDGVQSQREWVT 265 >gi|212702323|ref|ZP_03310451.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] gi|212674201|gb|EEB34684.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] Length = 1151 Score = 40.0 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 42/126 (33%), Gaps = 6/126 (4%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIA 388 D +G T +D + A D + + ++ +TDGE N + + + Sbjct: 667 DMEAIGGTVYSDGYEAAKDWFGGKTSPDSLQNNGGEN---IVIFVTDGEPNNEWSAKNAY 723 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 A I + T+ ++ L+ + + F +L +F + + + Sbjct: 724 NQLVAAVDNITVETVGIAITDKDATDLLNGLTT-NNNGAHF-IEDASKLGDVFGEIVSDI 781 Query: 449 IFERVI 454 V Sbjct: 782 TTSTVT 787 >gi|189536038|ref|XP_693697.3| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Danio rerio] Length = 1157 Score = 40.0 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 22/220 (10%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P L + FV S + K + A+ S++ +++ D N Sbjct: 275 DGYFVHYFAPRGLPVVPKDVIFVIDISGSMIGTKIKQTKAAMVSILSDLREGDYFN---- 330 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTI------VKTFAIDENEMGSTAINDAMQTAYDTI 350 F+D V +W + +R K F G T IN A+ +A + Sbjct: 331 --LITFSDDVH-----TWKKDRTVRATRQNVRDAKEFVRKIIAAGWTNINAALLSA-AKL 382 Query: 351 ISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGIAICNKA-KSQG-IRIMTIAFS 406 ++ + +++ I+ LTDGE T E I + A KS G + + +AF Sbjct: 383 LNPSTRSSSSTGRAPSSQRVPMIIFLTDGEATIGETETDVILHNAQKSLGLVSLFGLAFG 442 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + R L N +E + K F D + Sbjct: 443 DDADFPMLRRLALENRGVARMVYEDDDAAIQLKGFYDEVA 482 >gi|302343421|ref|YP_003807950.1| TadE family protein [Desulfarculus baarsii DSM 2075] gi|301640034|gb|ADK85356.1| TadE family protein [Desulfarculus baarsii DSM 2075] Length = 138 Score = 40.0 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 26/160 (16%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L G + AL +PV L V ++++ Y + L A + Sbjct: 4 FRRLAADGRGSVAVEFALFLPVFLLVIFSIIELGAAWYQKQMLVNA------------SR 51 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + S Q+++E + N + + + +V+ V+ S Sbjct: 52 EGARLGALFSTSGGLTAQEVQERV------NQYLSDSGFPSQAVVQAVGVD---GASGDP 102 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 V ++ D LS F+ ++ A TV R Sbjct: 103 VTVNVSADYEFPVLSAFIGAV-----PGTISLSATTVMRH 137 >gi|327534722|gb|AEA93556.1| von Willebrand factor type A domain protein [Enterococcus faecalis OG1RF] Length = 1103 Score = 40.0 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYTEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|307278884|ref|ZP_07559945.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] gi|306504433|gb|EFM73642.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] Length = 1103 Score = 40.0 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|164687487|ref|ZP_02211515.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] gi|164603261|gb|EDQ96726.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] Length = 273 Score = 40.0 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 47/149 (31%), Gaps = 10/149 (6%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + L P+ F+ +S KK + + ++ ++ + +++ Sbjct: 21 FDPLEVKPISKKNLVIFFLVDTSGSMSGKKIGTLNTTMEELLPELRGLGGATTDIKLAVM 80 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ W + + + G T + A+ + + + Sbjct: 81 TFSSGCE------WITKEPMSVDDYQYWTRLKAEGLTD----LGEAFTELSNKLSRKEFL 130 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +L I LLTDG T D EG+ Sbjct: 131 NAPSLSYAPVIFLLTDGYATDDALEGLKT 159 >gi|295093271|emb|CBK82362.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Coprococcus sp. ART55/1] Length = 612 Score = 40.0 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 54/345 (15%), Positives = 102/345 (29%), Gaps = 42/345 (12%) Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 +V + DTA+ + S Y + + + F + + T A + Sbjct: 95 DVATVAGDTAMVTDTSNSMYTEIAYDTREYDSVAENGF---VSTADRPLSTFAADRDTAS 151 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + + S + + LN F R K I E Sbjct: 152 YSNVRSYI-------ESGCLPPDGAVRIEEMLNYFTYDYRRK----PEDGEKFSIYTEYS 200 Query: 230 SPYMVSCNKSLYYMLYPGPLD----PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 K + + +D + +D+S + K LV+ + A + ++ Sbjct: 201 DCPWNKATKLMMVGINTDEIDFGDKKPSNLVFLIDTSGSMYEDNKLPLVQQSFAMLAENL 260 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + D V+ G D G + + G T DA+ T Sbjct: 261 DENDKVSIVTYAG---------EDTVVLSGTSGSEQYTINEALSSMTAEGCTNGGDAIIT 311 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT----QDNEEGIAICNKAKSQGIRIM 401 AY E + ++L TDG+ +++ I + K I + Sbjct: 312 AY-------ELAEKNFIEGGNNR--VILATDGDLNVGLTSESDLVDLITEEKKENNIFLS 362 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + F + + K N S+ +S +E K+ D +G Sbjct: 363 VLGFGTDNLKDNKLEALADN--GDGSYAFIDSAYEAKKVLVDEMG 405 >gi|225873423|ref|YP_002754882.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] gi|225793805|gb|ACO33895.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] Length = 339 Score = 40.0 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 49/118 (41%), Gaps = 16/118 (13%) Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + +W + + + D + G TA+ DA+ +A + ++ + + Sbjct: 163 NFILNWSNN---LDTLSSAIQDLHPGGGTALYDAVYSA------CRDKLLNAASGPIYVR 213 Query: 369 KYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + I+L++DG++ Q + + I C +A++ I ++ + T + L A Sbjct: 214 RAIILVSDGDDNQSHAYLTDAIKECQRAQTA---IYAVSTDTDPT-PDPGDDILRKMA 267 >gi|308472943|ref|XP_003098698.1| hypothetical protein CRE_04174 [Caenorhabditis remanei] gi|308268298|gb|EFP12251.1| hypothetical protein CRE_04174 [Caenorhabditis remanei] Length = 416 Score = 40.0 bits (91), Expect = 0.91, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 58/156 (37%), Gaps = 12/156 (7%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA--MQTAYDTII 351 T R+G +N D L F + + +T A + A + + Sbjct: 82 TTRVGLVTYNSAAKLDADL-NKFQDLDGLYNGVFKDLSDVVDTTDSFLATGLNAAEELLQ 140 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S + + KK I++ + + + N+ K G+ I+T+A+ ++ Sbjct: 141 SQSLNT-----TRDHYKKVIIVYASEYKGSGELDPVPVANRLKGSGVVIVTVAY--DQGG 193 Query: 412 QEKARYFLSNCASPN-SFFEA-NSTHELNKIFRDRI 445 E L+N ASP ++ A N+ L +D + Sbjct: 194 DEGLLRDLANIASPGFAYSNAPNNAGNLVGQIQDSL 229 >gi|42526759|ref|NP_971857.1| batA protein, putative [Treponema denticola ATCC 35405] gi|41817074|gb|AAS11768.1| batA protein, putative [Treponema denticola ATCC 35405] Length = 332 Score = 40.0 bits (91), Expect = 0.91, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 48/188 (25%), Gaps = 20/188 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + + + G T + + + + Sbjct: 111 TRIAAAKKIIRKFVAKY----PGDSF---GLTALSSSAALILPPTIDHKVFLSRLDSLSI 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + + + A+ +AY T N YIVLLTDGEN Sbjct: 164 GELGDGTAIGMGLAVSSAYMTRTKLNSS-------------YIVLLTDGENNTGEINPKT 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 ++ I I + + + S F EL KI + G Sbjct: 211 AAKVLVNKNIGFYVIGIGSSGYTTLEYTDRKTGKTYSGSIFSKFDELELKKIAQYGNGKY 270 Query: 449 IFERVIRI 456 I Sbjct: 271 ASASSPEI 278 >gi|329849363|ref|ZP_08264209.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841274|gb|EGF90844.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 590 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 49/323 (15%), Positives = 98/323 (30%), Gaps = 26/323 (8%) Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + Y+ P + + S + T+ T S +++ +++ + Sbjct: 114 SAAPAYEAREAPNTEKYNGESVSSVMRVTETPVSTFSVDVDTGAYANVRRMLND-GTTPT 172 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + LN F + + + L P P Sbjct: 173 EAAVRTEELLNYFRYDYPLPQDRSKPFSITTDVAQTPWNAQTRLMRVGLRAYDVPRSERP 232 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + VD S + K LV+ AL+ + +++ D V+ V GA Sbjct: 233 AANLVFLVDVSGSMNDPDKLPLVKTALSMLSDNLRPDDKVSIVVYAGAAG---------- 282 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + VK + GSTA M AY E + +K + + Sbjct: 283 MVLAPTHEGKY-VKQALECLSAGGSTAGGQGMALAY------ATAEANFIKGGINR---V 332 Query: 372 VLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +L TDG+ E A+ + + G+ + + F + ++ Sbjct: 333 ILATDGDFNVGISSIGEVEALVKQNRESGVTLTALGFGTGNYNEALMEKMAD--VGNGNY 390 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 +S E K+ D + + +F Sbjct: 391 AYIDSAMEARKVLDDELSSTLFT 413 >gi|315172113|gb|EFU16130.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1342] Length = 1103 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 47/328 (14%), Positives = 99/328 (30%), Gaps = 28/328 (8%) Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + S+ + QS + + K + + + + + K T+ E + +T Sbjct: 68 VQLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKET 127 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA----ETVSRSYHKEH-- 175 + + + + + NP + + + A S +Y + Sbjct: 128 NKNDSAPEKTEQPTAAEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQ 187 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 Q V++ + + N SY G D + Y Sbjct: 188 PTGNQNVLNHQGNKDGGAQWDGQTSWNGD---PTNRTNSYIEYGGTGDQADYAIRKYARE 244 Query: 236 CNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + +Y E VD S + + V+ + + ++ Sbjct: 245 TTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSG 304 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N+ + MG ++ ++ + G ++ +K + G T A++ A D Sbjct: 305 ITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKALRDAGDM 362 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + N KK IVLLTDG Sbjct: 363 LATPNGH-----------KKVIVLLTDG 379 >gi|315148427|gb|EFT92443.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4244] Length = 1103 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 189 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 248 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 249 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 298 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 355 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 356 LRDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|293383944|ref|ZP_06629845.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387429|ref|ZP_06631983.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312908057|ref|ZP_07767039.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312910783|ref|ZP_07769620.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078704|gb|EFE16068.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083151|gb|EFE20114.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310625945|gb|EFQ09228.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311288927|gb|EFQ67483.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 1103 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|256762099|ref|ZP_05502679.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] gi|256683350|gb|EEU23045.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] Length = 1154 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 120 QLSLAVKQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 180 KNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWRP 239 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 240 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 292 YASETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 349 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 350 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 407 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 408 RDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|254292588|ref|YP_003058611.1| hypothetical protein Hbal_0212 [Hirschia baltica ATCC 49814] gi|254041119|gb|ACT57914.1| conserved hypothetical protein [Hirschia baltica ATCC 49814] Length = 447 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 60/171 (35%), Gaps = 4/171 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K KS G+ ++ AL + +++ V +D+ + S L+ A + A+ + S Sbjct: 5 KFCKSTQGNVSVMAALFIGLLIAVIAGTIDISQKSSLNRELQSVVDAAALAAAREMAVSS 64 Query: 73 EEVSSRAKNSFTFPKQK-IEEYLIRNFENNLKKNFTDREVRDIVRDTAV-EMNPRKSAYQ 130 + + + ++ E + ++ K ++ + + +++ + Sbjct: 65 ADQTRVQSVASSYVDANWTGEQATTHAVLDVSKGIITVSSTAPKTIASILKKDQKETNFY 124 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA--EAETVSRSYHKEHGVSI 179 + N + L + +Q +A AE + + S+ Sbjct: 125 AEAVAEVSGGGNVCLIGLSDHEQGTINLQQRARITAENCQVYSNSDDRYSM 175 >gi|291455286|ref|ZP_06594676.1| von Willebrand factor [Streptomyces albus J1074] gi|291358235|gb|EFE85137.1| von Willebrand factor [Streptomyces albus J1074] Length = 422 Score = 40.0 bits (91), Expect = 0.93, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 49/139 (35%), Gaps = 11/139 (7%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 V + RT KT + G T I A+ A + + + IVL++ Sbjct: 103 VGPIDRTEAKTAVATLSPTGWTPIGPALLGA-----------ADDLDGDEGGSRRIVLIS 151 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DGE+T + + + ++G+ ++ + + + + A+ ++ Sbjct: 152 DGEDTCGPLDPCEVAREIAAKGVDLVIDTLGLVPNAKIRQQLSCIAGATGGTYTAVQHKE 211 Query: 436 ELNKIFRDRIGNEIFERVI 454 +L+ + + V Sbjct: 212 DLSDKVKQLVDRAADPVVT 230 >gi|257085650|ref|ZP_05580011.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] gi|256993680|gb|EEU80982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] Length = 1154 Score = 40.0 bits (91), Expect = 0.93, Method: Composition-based stats. Identities = 51/312 (16%), Positives = 93/312 (29%), Gaps = 28/312 (8%) Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V + V AK S K + E + KN + E + T NP Sbjct: 147 VSVTSEKITVEDSAKESTEPEKITVPENMK-----ETNKNDSAPEKTEQPTATEEVTNPF 201 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 A + R +L L ++ + ++ + H+ + Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWRPTGNQNVLNHQGN--------KD 253 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + D Q G P N + + E +P + L Sbjct: 254 GGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDV--YLNVRGN 311 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 L VD S + + V+ + + ++ N+ + MG ++ Sbjct: 312 VQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNN-INMGYVGYSSD 370 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 ++ + G ++ +K + G T A++ A D + + N Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKALRDAGDMLATPNGH--------- 420 Query: 366 EAKKYIVLLTDG 377 KK IVLLTDG Sbjct: 421 --KKVIVLLTDG 430 >gi|239983463|ref|ZP_04705987.1| hypothetical protein SalbJ_28780 [Streptomyces albus J1074] Length = 423 Score = 40.0 bits (91), Expect = 0.93, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 49/139 (35%), Gaps = 11/139 (7%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 V + RT KT + G T I A+ A + + + IVL++ Sbjct: 104 VGPIDRTEAKTAVATLSPTGWTPIGPALLGA-----------ADDLDGDEGGSRRIVLIS 152 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DGE+T + + + ++G+ ++ + + + + A+ ++ Sbjct: 153 DGEDTCGPLDPCEVAREIAAKGVDLVIDTLGLVPNAKIRQQLSCIAGATGGTYTAVQHKE 212 Query: 436 ELNKIFRDRIGNEIFERVI 454 +L+ + + V Sbjct: 213 DLSDKVKQLVDRAADPVVT 231 >gi|315168359|gb|EFU12376.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1341] Length = 1103 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 104/334 (31%), Gaps = 31/334 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 D + TA E NP A + R +L L ++ + ++ Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + H+ + + D Q G P N + + Sbjct: 189 TGNQNVLNHQGN--------KDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 E +P + L L VD S + + V+ + + Sbjct: 241 YARETTTPGLFDV--YLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVD 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 ++ N+ + MG ++ ++ + G ++ +K + G T A+ Sbjct: 299 TLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKAL 356 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + A D + + N KK IVLLTDG Sbjct: 357 RDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|145491137|ref|XP_001431568.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398673|emb|CAK64170.1| unnamed protein product [Paramecium tetraurelia] Length = 591 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 62/181 (34%), Gaps = 20/181 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +V+ LA ++ + + D R F + + K Sbjct: 184 KIEMVKQTLALLLDFLNEND------RYQLITFESQAQRLTPLKRVTDGNKQYF-KQVIQ 236 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N G T I A + A+ + + + +NN+ + I LL+DG++ Q N+ Sbjct: 237 QINSGGGTTIGTATEIAFKQL------QERKYRNNVTS---IFLLSDGQDGQANQRIQEQ 287 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ + T F + Q + SF+ L++ F D +G I Sbjct: 288 IKTV-NEVFTLHTFGFGEDHDAQMMTQLCNLK---SGSFYFVQDVTLLDEFFADALGGLI 343 Query: 450 F 450 Sbjct: 344 S 344 >gi|186470651|ref|YP_001861969.1| putative transmembrane protein [Burkholderia phymatum STM815] gi|184196960|gb|ACC74923.1| putative transmembrane protein [Burkholderia phymatum STM815] Length = 372 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 49/149 (32%), Gaps = 12/149 (8%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G +ITA+ M +LG+ + +D+ + L+ AA A + + L + + Sbjct: 11 GSVAVITAVSMVSLLGLAALAIDIGNLLVSRNELQNAADAAALAGAPCLY----QRAQCG 66 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + T P +F N V+ + +++ S Y + S L Sbjct: 67 NTTATEPDWTTATQKASSFATASTSN--------TVQGSVIKVAQTGSGYWNITGSPGTL 118 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 P + + I A S Sbjct: 119 ETVPFTPGTNDLPAIQVTITKSAANANGS 147 >gi|300860089|ref|ZP_07106177.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|295112663|emb|CBL31300.1| Cna protein B-type domain./von Willebrand factor type A domain. [Enterococcus sp. 7L76] gi|300850907|gb|EFK78656.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|315144603|gb|EFT88619.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2141] Length = 1103 Score = 40.0 bits (91), Expect = 0.95, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 189 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 248 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 249 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 298 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 355 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 356 LRDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|47223676|emb|CAF99285.1| unnamed protein product [Tetraodon nigroviridis] Length = 628 Score = 40.0 bits (91), Expect = 0.95, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 50/152 (32%), Gaps = 17/152 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 RM F+ + + + K+ + + + G T ++ + A + I + Sbjct: 80 RMSFIVFSTDGRTLMALTEDRDKIRAGLEELRMV--QPGGDTYMDRGLHRASEQIYYAAG 137 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-ICNKAKSQGIRIMTIAFSVNKTQQEK 414 D I+ LTDGE +D + +A+ G + + Q Sbjct: 138 DGYRAAS-------VIIALTDGELREDQFDTAQREAGRARQLGASVYCVGLKDFNETQ-- 188 Query: 415 ARYFLSNCA-SPNSFFEANSTHELNKIFRDRI 445 LS A S + F + E + D I Sbjct: 189 ----LSTIADSKDHVFPVHDGFEALQSVIDSI 216 >gi|47168593|pdb|1Q0P|A Chain A, A Domain Of Factor B Length = 223 Score = 39.6 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 55/186 (29%), Gaps = 22/186 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTFAIDEN 332 A S++ I+K+ + R G + V + S + + + + D Sbjct: 36 AKKSLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHK 95 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A+Q Y + ++ + + I+L+TDG + + I + ++ Sbjct: 96 LKSGTNTKKALQAVYSMMSWPDDVPP---EGWNRTRHVIILMTDGLHNMGG-DPITVIDE 151 Query: 393 AKS-------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + + Q S + F+ L Sbjct: 152 IRDLLYIGKDRKNPREDYLDVYVFGVG-PLVNQVNINALASKKDNEQHVFKVKDMENLED 210 Query: 440 IFRDRI 445 +F I Sbjct: 211 VFYQMI 216 >gi|226326039|ref|ZP_03801557.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] gi|225205581|gb|EEG87935.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] Length = 823 Score = 39.6 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 43/337 (12%), Positives = 98/337 (29%), Gaps = 31/337 (9%) Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT---DREVRDIVRDTAVEMNPR 125 ++ + ++ + T + +E F N FT ++ + T ++ + Sbjct: 250 VKEVVDMVADETKEATVETTETKEVEKAAFVTNSFSTFTITWKKDSATYFKLTLHHVDEK 309 Query: 126 KSAYQVVLSSRYDLLLNPLSLF------LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + LSS L N F + + K + + S + G Sbjct: 310 GNELPTTLSSDATLNYNSDISFSDYVEIIDNYDYKYAKVDKISGKTVTSAVTNGSSGKRA 369 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + + N + ++ S S + Y+ + + Sbjct: 370 KLTLYNGKDNKIETLSYSTTTKNIYLVYTAKSTGSTGSGTIEDTAPQLSHQKYIKKKDGN 429 Query: 240 LYY-------MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 Y +D+S+ K + A+ +I ++ V+ Sbjct: 430 NYDLTLNVSSKKGTTSSKKKFDIVLIMDTSTSMSNNNKWRNSKTAVNKLIDTLSSQTTVD 489 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R F + W + +++ + ++I E++ T D + + + S Sbjct: 490 VNYR--LVTFGTTAQIQTN--WTTGETVKSTLSNYSIKEDQ--GTNYEDGLVKTKEALSS 543 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + +K IV LTDG+ T Sbjct: 544 GTRADA---------EKIIVFLTDGQPTFYKSGTSYA 571 >gi|212715236|ref|ZP_03323364.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] gi|212661917|gb|EEB22492.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] Length = 1192 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 65/211 (30%), Gaps = 30/211 (14%) Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 Q+ G T + N V + K + Y + D Sbjct: 421 TQQGGGNDAYAETGPFTVGTTSHVTIVNSNVKPSNNKSIVKTGGGDGDQYTLYLTASGDS 480 Query: 252 SLSEEHFVDSSSLRHVIKKK--------HLVRDALASVIRSI----KKIDNVNDTVRMGA 299 + S + + V+ K + ++A ++ + + V+M Sbjct: 481 TSSTVTTTTPADIVLVMDKSGSMKGELDNNAKEAANALAKKLLTDKNSTLPSEQQVQMAV 540 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+ + + +F+ V K+ A++ + G T A++ A + Sbjct: 541 VTFSTKATIEQNFTTDVLKINN------AVEGDPDGGTNWEAALKQA------------N 582 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + KK+I+ L+DG+ T C Sbjct: 583 ILSGRSNVKKHIIFLSDGDPTFRTSSYGGSC 613 >gi|296473276|gb|DAA15391.1| integrin, alpha D [Bos taurus] Length = 1165 Score = 39.6 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 53/158 (33%), Gaps = 18/158 (11%) Query: 302 FNDRVISD---PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ S+ F++ R+ G T ++T + S Sbjct: 192 FSLMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLNGLTFTATGIRTVVRELFHSKNGAR 251 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 +K I+++TDGE +D E + +A+ I I + Q AR Sbjct: 252 KSA------RKIIIVITDGEKYKDPLEYKDVIPEAEKANIIRYAIGVG-DAFQAHAAREE 304 Query: 419 LSNCA---SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L S + F+ +S L I ++ E++ Sbjct: 305 LKIIGSVPSEDHVFKVDSFAAL-----SSIQKQLQEKI 337 >gi|156523104|ref|NP_001095966.1| integrin alpha-D [Bos taurus] gi|151556938|gb|AAI49717.1| ITGAD protein [Bos taurus] Length = 1165 Score = 39.6 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 53/158 (33%), Gaps = 18/158 (11%) Query: 302 FNDRVISD---PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ S+ F++ R+ G T ++T + S Sbjct: 192 FSLMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLNGLTFTATGIRTVVRELFHSKNGAR 251 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 +K I+++TDGE +D E + +A+ I I + Q AR Sbjct: 252 KSA------RKIIIVITDGEKYKDPLEYKDVIPEAEKANIIRYAIGVG-DAFQAHAAREE 304 Query: 419 LSNCA---SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L S + F+ +S L I ++ E++ Sbjct: 305 LKIIGSVPSEDHVFKVDSFAAL-----SSIQKQLQEKI 337 >gi|315160966|gb|EFU04983.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0645] Length = 1103 Score = 39.6 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 189 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 248 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 249 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 298 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 355 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 356 LRDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|116622495|ref|YP_824651.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225657|gb|ABJ84366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 28/136 (20%), Positives = 47/136 (34%), Gaps = 19/136 (13%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++T + GSTA+ DA+ A I + + ++ ++++TDG + Sbjct: 136 KELETAISSFDVGGSTALYDAILLAQSHI-----------RGGVYGRRILLVITDGGDNS 184 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCASPNSFFEANSTHELNK 439 + G+ I I Q K L+ +FF L+ Sbjct: 185 SKATLEEAVDAVAKAGVVIYAIGIYDPNDKDQNPKVLAHLAEVTGGEAFF----PTALSD 240 Query: 440 IFRDRIGNEIFERVIR 455 I RI EI V R Sbjct: 241 I--TRICEEIAADVRR 254 >gi|315158311|gb|EFU02328.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0312] Length = 1103 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 189 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 248 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 249 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 298 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 355 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 356 LRDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|312070072|ref|XP_003137977.1| hypothetical protein LOAG_02391 [Loa loa] gi|307766862|gb|EFO26096.1| hypothetical protein LOAG_02391 [Loa loa] Length = 647 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 39/124 (31%), Gaps = 12/124 (9%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D V + + + + S ++ R I G T A+ A + Sbjct: 104 DKDQVHIAMIQYAETPTIEFSL--DTYRNPRDITNHIMTINFHSGGTRTGKALLAAKVEL 161 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNK 409 S + K IVL TDG ++ + + + + ++I + + Sbjct: 162 FSEKKGARKNA------SKIIVLFTDG---LSVDDPVKHAQQLREVEKVKIYVVYVGSDG 212 Query: 410 TQQE 413 + E Sbjct: 213 FEYE 216 >gi|327463764|gb|EGF10080.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1057] Length = 462 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 69/229 (30%), Gaps = 34/229 (14%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 +Y +S Y + ++ + GK GI Sbjct: 133 TEGAYQDNRLISYNLS-------GKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGI-- 183 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV--IKKKHLVRDALASVIRS 284 Y + + D S S + ++ + ++R +I+ Sbjct: 184 --AIAYRTDPIQGQMNIAVSFVFDTSGSMNWDLQGRNVEKTGNESRMDILRKKSVIMIKD 241 Query: 285 IKKIDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + +I NV+ + F+ + + G + +I TI K + N G T Sbjct: 242 LAEIGNVS----VNLVGFSTSAKYIQQNFSNLDNGTNTIIATITK--PENLNPDGVTNPG 295 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D ++ ++ S KYIVLLTDG + A+ Sbjct: 296 DGLRYGMISLQSQPAQL-----------KYIVLLTDGIPNAYLVDSRAL 333 >gi|78776856|ref|YP_393171.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497396|gb|ABB43936.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 595 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 58/180 (32%), Gaps = 24/180 (13%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 K ++ D ++ + + K+ + + ++ R+ Sbjct: 75 KEAKTLIKAKSADIMIALDISDSMLAEDVYPKRLESAKRKALAFLKE-------AKDERV 127 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F S+ H + + + E GS + ++I + E Sbjct: 128 GVVAFAKDSYLVSPLSFDKHSVSFLLEQLDTTSITEQGSD---------FLSVIGAVEKI 178 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 K KK +++L+DG + D E IA+ AK GI I + + + K Sbjct: 179 QKDEK-----KKVLLILSDGGDKSDFSEEIAL---AKKSGITIFILGIATKQGAPIKRED 230 >gi|312903619|ref|ZP_07762795.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|310632972|gb|EFQ16255.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|315577195|gb|EFU89386.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0630] Length = 1103 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 189 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 248 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 249 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 298 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 355 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 356 LRDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|163758555|ref|ZP_02165642.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283845|gb|EDQ34129.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 549 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 67/184 (36%), Gaps = 32/184 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD---PSFSWGVHKLIRTIVKT 326 K + ++ + +I N +D+V +G T + R D K+ R + Sbjct: 35 KITIAKEVMTDLIT------NWDDSVDLGLTVYGHRRKGDCADIEVVAMPGKVDRQALID 88 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I+ + A ++ K +VL++DG T + + Sbjct: 89 KVQSITPRGKTPISKTLSLAALSV------------GFFSGKSSVVLVSDGLETCNADP- 135 Query: 387 IAICNKAKSQGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 C +AKS GI + I F V + + + + + + FF AN+ EL Sbjct: 136 ---CAQAKSLGIINPGFDVHVIGFDVTEEEFKSLQCIATE--TGGKFFRANNAEELKDAL 190 Query: 442 RDRI 445 R + Sbjct: 191 RQTV 194 >gi|29375667|ref|NP_814821.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis V583] gi|29343128|gb|AAO80891.1| von Willebrand factor type A domain protein [Enterococcus faecalis V583] Length = 1103 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 189 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 248 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 249 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 298 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 355 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 356 LRDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|227552873|ref|ZP_03982922.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229550416|ref|ZP_04439141.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257422992|ref|ZP_05599982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|312952910|ref|ZP_07771770.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|227178004|gb|EEI58976.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229304444|gb|EEN70440.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257164816|gb|EEU94776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|310629158|gb|EFQ12441.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|315154335|gb|EFT98351.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0031] gi|315156569|gb|EFU00586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0043] gi|315573683|gb|EFU85874.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309B] gi|315582702|gb|EFU94893.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309A] gi|323480330|gb|ADX79769.1| Endocarditis and Biofilm-Associated Pilus subunitA [Enterococcus faecalis 62] Length = 1103 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 69 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 128 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 129 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 188 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 189 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 248 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 249 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 297 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 298 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 355 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 356 LRDAGDMLATPNGH-----------KKVIVLLTDG 379 >gi|306828192|ref|ZP_07461454.1| collagen adhesion protein [Streptococcus pyogenes ATCC 10782] gi|304429615|gb|EFM32662.1| collagen adhesion protein [Streptococcus pyogenes ATCC 10782] Length = 1036 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 41/306 (13%), Positives = 87/306 (28%), Gaps = 62/306 (20%) Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 K+ + I V+D S SM + E + G + + Sbjct: 430 KKQPLDILVVVDRSGSMQEGIGSIEKYKYW---KYKYDEYYHIWRNAGTIYFDN------ 480 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL---ASVIRSIKKID 289 L P + + + S+ IK+ V+DAL +++ I+ Sbjct: 481 ----------YLGPRYQPDTYTYYDYQSKESVPFGIKRDQAVKDALIGSTGLLQKFLDIN 530 Query: 290 NVNDTVRMGA---TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM-QT 345 N +G + W + + D + T+ N + Sbjct: 531 PQNQLAVVGFQGSVAYRYYDEKPERTPWNTIMYQPSKSTSKDADVLKDWETSSNLSRDSL 590 Query: 346 AYD---------TIISSNEDEVHRMKNNLEAKKYIVLLTDG--------ENTQDNEEGIA 388 +Y ++ ++E N +K +V ++DG +N + ++ Sbjct: 591 SYQDRNGTNYHAALLKADEMLQKVANNGH--RKIMVFISDGVPTFYFGADNYRSGNGTVS 648 Query: 389 ICNKAKSQG---------------IRIMTIAFSVNKTQQEKARY--FLSNCASPNSFFEA 431 N SQ + I ++ S + + L + + + Sbjct: 649 DGNIINSQKGSKLAIDEFKNKYPNLSIYSLGVSKDINSDTSSSSPVVLKYLSGDDYYSGI 708 Query: 432 NSTHEL 437 T +L Sbjct: 709 TDTEQL 714 >gi|297243668|ref|ZP_06927599.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] gi|296888419|gb|EFH27160.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] Length = 560 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 70/181 (38%), Gaps = 35/181 (19%) Query: 272 HLVRDALASVIRSIKKID-NVNDTVR--------MG-ATFFNDRVISDPSFSWGVHKLIR 321 ++ D S+ SI + ++D VR G F+ ++ +++ +L+ Sbjct: 202 EILLDKSGSMEDSIDTMKRTMSDFVRNLNYKVGDTGELISFDSYLMYMATYTNDKDRLLT 261 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I G TA+ DA+ Y I +++ N+ I TDG++ + Sbjct: 262 GIDNM-----TPYGMTALYDAL---YTGITNAS--------NHPGFNCVIAF-TDGQDNE 304 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 + + AK +GI + I S + L N A + F++ NS ++ + Sbjct: 305 STHTADEVISLAKEKGIPVYLIGTSEADSST------LQNIANETNGYFWDMNSISDMQE 358 Query: 440 I 440 + Sbjct: 359 V 359 >gi|309812068|ref|ZP_07705828.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] gi|308433947|gb|EFP57819.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] Length = 597 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 70/196 (35%), Gaps = 32/196 (16%) Query: 269 KKKHLVRDALASVIRS-IKKIDNVNDTVRMGATFFNDRVISD----PSFSWGVHKLIRTI 323 K + + + + I +D + T R+GA F+ + + + G ++ Sbjct: 407 TKIDGGQSRIELMESTAIAALDVLPKTTRLGAWAFSSNLQKNHVDYLPLTNGEQPILDDT 466 Query: 324 VKTFAIDEN--------EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + I + + G TA+ D + AY ++ + + +V+LT Sbjct: 467 YRNGLIAKAHTLPGLAAKNGDTALYDTIAAAYKSVTDTYDPNYVNS---------VVVLT 517 Query: 376 DGENTQDNEEG-------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 DG N N + + ++I+TI+ T + + A+ Sbjct: 518 DGTNDDPNGGLALDQLLARLKSQYSADKPVKIVTISLG-TGTDPDALKRIAK--ATDGLS 574 Query: 429 FEANSTHELNKIFRDR 444 ++ + +++ +F D Sbjct: 575 YQTKTPEQISGVFVDA 590 >gi|77735553|ref|NP_001029472.1| calcium-activated chloride channel regulator 4 [Bos taurus] gi|74268230|gb|AAI03388.1| Chloride channel accessory 4 [Bos taurus] gi|296489202|gb|DAA31315.1| chloride channel accessory 4 [Bos taurus] Length = 933 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 49/146 (33%), Gaps = 26/146 (17%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 R ++++ G T+I +++ + I R + IVLLTD Sbjct: 366 SSERRKLLESLPTA--ASGGTSICSGIESGFQAI---------RNADFQIDGSEIVLLTD 414 Query: 377 GENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 GE + C K K G I IA + Q K A+ + A+ Sbjct: 415 GE-----DSTAKSCIEKVKQSGAIIHFIALGPDAAQAVKEMS----IATGGKYIYASDEG 465 Query: 436 E---LNKIFRDRIGN--EIFERVIRI 456 + L F + ++ +++ Sbjct: 466 QNNGLIDAFAALASENIDASQQSLQL 491 >gi|149197810|ref|ZP_01874859.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] gi|149139031|gb|EDM27435.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] Length = 833 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 59/173 (34%), Gaps = 29/173 (16%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L R+A + + D ++G F+ + V + Sbjct: 429 IVLAREASKAAAELLSSRD------QVGVIAFDGSAKLVTDL---TSAANKGEVLSQIDG 479 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + AM D + ++ H +++L+DG++ + EGI+ Sbjct: 480 IGAGGGTNLYPAMVMGRDMLGIASAKIKH-----------MIVLSDGQSQGGDFEGISS- 527 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIF 441 + G+ I T++ Q A ++ A + N+ E+ +IF Sbjct: 528 -ELAQMGVTISTVSL-----GQGAAVDLMAAIAQIGNGRAYVTNNAEEMPRIF 574 >gi|117928940|ref|YP_873491.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] gi|117649403|gb|ABK53505.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] Length = 177 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 40/126 (31%), Gaps = 3/126 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G ++ V+LG ++VD+ + L+ A A +TA+ + + Sbjct: 27 RDDGGQISLLIVFFGLVILGFTTVIVDLSTVFLAQRVLQATADGAALTAAQHVSLAGAYT 86 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + A+ P E Y + TA ++ V LS Sbjct: 87 TELAEW---LPLSDAEVYAAVADYVGEPGRAPQSCRSGTLSITAATLDATDRTVSVSLSC 143 Query: 136 RYDLLL 141 L + Sbjct: 144 TVSLPI 149 >gi|329850448|ref|ZP_08265293.1| hypothetical protein ABI_33520 [Asticcacaulis biprosthecum C19] gi|328840763|gb|EGF90334.1| hypothetical protein ABI_33520 [Asticcacaulis biprosthecum C19] Length = 395 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 68/243 (27%), Gaps = 38/243 (15%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 +I L +++ GG +D+ S H ++ A A++T Q S Sbjct: 1 MIYGLSATLVIAAGGGALDLFNASNLRHDMQNALDAAVLTGVRASSQMGTSAS------- 53 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 +N +++ D + + S L+ L Sbjct: 54 --------NAFKQNVADDM-----DGASQSYSSAVSSSSASSSSYVTTTLTGTASLESP- 99 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML------------- 190 + F++ +G+ + K+ A+ + +V+D S S Sbjct: 100 -TYFMKLLGLNDLTVSVKSVAQGTTTIAPAGKPC--IYVLDPSGSQALLVNSGANVQALS 156 Query: 191 -DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + S G P F + K Q + + + C S P Sbjct: 157 CEIHVKSTGNPAAIFNSGSSLNFKKLCVQGTNIIKNSVTVPNLVTGCAASGDPYAGTLPT 216 Query: 250 DPS 252 S Sbjct: 217 PAS 219 >gi|157962337|ref|YP_001502371.1| cell wall anchor domain-containing protein [Shewanella pealeana ATCC 700345] gi|157847337|gb|ABV87836.1| LPXTG-motif cell wall anchor domain [Shewanella pealeana ATCC 700345] Length = 789 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 47/357 (13%), Positives = 104/357 (29%), Gaps = 29/357 (8%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 ++A + SV L S + +S +S + +Y + K + ++ D V Sbjct: 221 ESAHLKESVYLKDSAKALSDEIDSSVKPAE--YAQYYAK-------KQLSSFDIHDQVSA 271 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + A Q + + + AE+ S +++ Sbjct: 272 PERIFDEAPMAEQHNNRVALKVAFDSEMPIEHIRSPYHQINVDMAESGGAIVSLNRDAIA 331 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + +V+ ++ +P +T S +S + + + S Sbjct: 332 NKDFVL-------TWKPIQGNEPTAAVFSQIGKTHTSQAS-SSEASTEPQTASEKYGLVM 383 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 P +D+S + AL + ++ D N Sbjct: 384 LMPPQGAEQQPSSIHRELILVIDTSGSMSGDAIIQA-KTALKYALAGLRPTDKFN----- 437 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 FN V + + + G T ++ A+ A + ++++ Sbjct: 438 -IVQFNSDVDKWSGMAMSATPYNLAQAQNYINRLEANGGTEMSIAINAALNIETVTDKET 496 Query: 358 VHRMKNNLEAK---KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + NN + ++ +TDG NE + +A+ R+ TI Sbjct: 497 GTELDNNDLGSNLLRQVLFITDGA--VSNESMLFELIEAQLGDSRLFTIGIGSAPNA 551 >gi|261338458|ref|ZP_05966342.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] gi|270276443|gb|EFA22297.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] Length = 493 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 42/246 (17%), Positives = 76/246 (30%), Gaps = 37/246 (15%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 S S+ + S+ + S + ++A+ + + +++ ND Sbjct: 53 YSLTVSVSSTDMDTAQQQTESDVVVLMDVSGSMTTTDMKVAKNAVNGLANQL--LNDEND 110 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIR-TIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 TVRM F+ FS G +V G+T + A+Q A + Sbjct: 111 TVRMSIVRFSSEA-KTLEFSNGSEWTHSPALVAQALNTLTSRGNTNWDGALQNASALVQG 169 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE--------------------EGIAICNK 392 + K Y+VL++DG N + I N Sbjct: 170 DSAR-----------KSYVVLMSDGYPNTINSCYPAVANCTDTSWSEPNAVPKAIEAANT 218 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYF-LSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + I ++ S +++ +E P LN F D I + I + Sbjct: 219 MPNTQIYAVSTRTSASESMKELVDGINAKAPKYPAQIMYGTDQQSLNNAF-DTIADAIRK 277 Query: 452 RVIRIT 457 R +T Sbjct: 278 RFTDVT 283 >gi|239624983|ref|ZP_04668014.1| predicted protein [Clostridiales bacterium 1_7_47_FAA] gi|239521369|gb|EEQ61235.1| predicted protein [Clostridiales bacterium 1_7_47FAA] Length = 878 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 46/151 (30%), Gaps = 23/151 (15%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS---TAINDAMQTAY 347 V +R+G +F D V S + + V +F ++ T I A+ A Sbjct: 84 VGSDIRIGGVYFADHVYQRCSLTSLTGEEDTKKVMSFLNFTDKDDGNRDTNIGSALSEAL 143 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ----------DNEEGIAICNKAKSQG 397 + ++ K+ IVL +DG N N + Sbjct: 144 KLFENQ----------DISRKRIIVLFSDGINEDYEGTGSYTARANNMTSQAAGEINEAQ 193 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 I + + ++ + R ++ + Sbjct: 194 IALYCVFLEKDRADEAYLRNLVNYFKEDGQY 224 >gi|87308939|ref|ZP_01091077.1| hypothetical protein DSM3645_19318 [Blastopirellula marina DSM 3645] gi|87288282|gb|EAQ80178.1| hypothetical protein DSM3645_19318 [Blastopirellula marina DSM 3645] Length = 346 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 41/154 (26%), Gaps = 25/154 (16%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ + FF + + P W K + + G T Sbjct: 214 ILETVSSLPREAQFQ---VIFFQSQAVPFPQKGWRHPKRDFNALSEWLKTVGPAGGTNPL 270 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A + A +K + + +TDG + + N +++ Sbjct: 271 PAFEIA--------------LKFSPRPD-AVFFMTDGLFDDNVVGEVKRQNDLSEPKVKV 315 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEAN 432 I+F + + A S + Sbjct: 316 HAISFMDRSAEP-----LMRQIAGESGGEYRHVQ 344 >gi|257415703|ref|ZP_05592697.1| von Willebrand factor [Enterococcus faecalis AR01/DG] gi|257157531|gb|EEU87491.1| von Willebrand factor [Enterococcus faecalis ARO1/DG] Length = 1154 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 60/191 (31%), Gaps = 15/191 (7%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + D Q G P N + + E +P + L Sbjct: 255 GAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDV--YLNVRGNV 312 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 L VD S + + V+ + + ++ N+ + MG ++ Sbjct: 313 QKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNN-INMGYVGYSSDG 371 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 ++ + G ++ +K + G T A++ A D + + N Sbjct: 372 YNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 367 AKKYIVLLTDG 377 KK IVLLTDG Sbjct: 421 -KKVIVLLTDG 430 >gi|262171974|ref|ZP_06039652.1| protein BatA [Vibrio mimicus MB-451] gi|261893050|gb|EEY39036.1| protein BatA [Vibrio mimicus MB-451] Length = 335 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 64/199 (32%), Gaps = 26/199 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S E ++ + + + L + ++ D R G F D Sbjct: 103 DLSGSMEEKDFATKSGEQLSRLTAAKKVLRDFVTQ-RQGD------RFGLILFGDAAFIQ 155 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + E M T + DA+ S Sbjct: 156 TPFT----ADQDVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQSPSTSQ--------- 202 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCAS- 424 + +++LTDG +T + A ++GIRI IA + +Q +S +S Sbjct: 203 DQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVSRVSSL 262 Query: 425 -PNSFFEANSTHELNKIFR 442 F A +LN+ ++ Sbjct: 263 TQARSFVAIDQSQLNEAYQ 281 >gi|154496349|ref|ZP_02035045.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] gi|150274432|gb|EDN01509.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] Length = 1896 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 34/265 (12%), Positives = 79/265 (29%), Gaps = 55/265 (20%) Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQP--------------LNCFGQPADRTVKSYSSQ 218 ++ + + +V+D S SM + +C+ + + + Sbjct: 86 EQQPLEVTFVLDVSGSMAWCAEEHRHSADCGYNCGLEEHRHGDDCYTKCTRQNHPDHWEN 145 Query: 219 NGKVGIRDEKL-------SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS-------- 263 G V + S Y ++C K+ + + H S Sbjct: 146 WGLVWVHKLGTDCSLSWGSYYYLTCTKAEHTHSSSCYACGKVEHNHDRASGGEPCSMVQS 205 Query: 264 ---LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + V++ + ++ + V + + + V S Sbjct: 206 GKAQSRLKIAVSAVKNMVGTLREQLGG-KLTAKFVVFSSEGYKNGVDKRAS--------A 256 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + I + +G T ++ + D SS+ A++ +V++ DG+ Sbjct: 257 KVITEAQLDQLTAVGGTDLSAGVALGVDQFKSSS------------ARQVLVVVADGD-- 302 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAF 405 D+ N K++ I T+ F Sbjct: 303 SDDGYPNRTANNFKNKDGIIYTVGF 327 >gi|308497646|ref|XP_003111010.1| CRE-CUTL-23 protein [Caenorhabditis remanei] gi|308242890|gb|EFO86842.1| CRE-CUTL-23 protein [Caenorhabditis remanei] Length = 801 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 29/135 (21%), Positives = 53/135 (39%), Gaps = 10/135 (7%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + VR+ ++ + F+ ++ ++ + G+TA N A+ A+D + S Sbjct: 298 EAVRVALITYSGQAFVHFKFNSFLYGNNTSVQGFVKNIRSIKGTTATNVALMDAFDLLTS 357 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + R K +V LTDG ++ AI K ++ GI + IA SV + Sbjct: 358 KDPSTGVR---EGVPKMALV-LTDG---HSHKSPKAISEKMRAAGIIM--IAVSVTP-RP 407 Query: 413 EKARYFLSNCASPNS 427 L A + Sbjct: 408 LVDEAELRLIAGDDK 422 >gi|323342275|ref|ZP_08082507.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322463387|gb|EFY08581.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 1466 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 49/345 (14%), Positives = 105/345 (30%), Gaps = 33/345 (9%) Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDL-LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + + V S+ + +++ + +N + + + E K Sbjct: 30 INTSFVSAEGNSSSSEKTITNSIQIDNMNEGEVRVFKTAKPIPNSINRWEISIDVFGRLK 89 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG--------------QPADRTVKSYSSQN 219 I V+D S SM + ++ A + SY ++ Sbjct: 90 REPSDIVLVLDTSGSMDPQKNPQGIDRISKAKREAIHFVNEIFERDASARVALVSYGTKV 149 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 ++ S +++ KSL + ++ SS + K L+ D Sbjct: 150 SSNSFHTKQESNLLINEIKSLKAEGGTFTQGALYEAKMLLNQSSAPN--KTIVLLSDGQP 207 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDP-SFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + +K N + G R F G+ + + + Sbjct: 208 TYRYPLKAKVNQDLLRYDGNVIVQKRYNGQQRPFDIGITSSSNQAIPGYRFKSRPNTNVF 267 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +AM +E R + N Y V ++ + I N+ + I Sbjct: 268 DYNAMVYGTGNEYYLDELGELRSQGNQN---YFVYMS------SADAAIIESNQIHQEQI 318 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-LNKIFR 442 + I F + + L S N++++A+S+ + L+ IF+ Sbjct: 319 HLYAIGFDTD----ARGTDILKRI-SNNNYYDASSSRDNLDDIFK 358 >gi|50119743|ref|YP_048910.1| hypothetical protein ECA0798 [Pectobacterium atrosepticum SCRI1043] gi|49610269|emb|CAG73712.1| putative membrane protein [Pectobacterium atrosepticum SCRI1043] Length = 543 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 45/136 (33%), Gaps = 12/136 (8%) Query: 3 FDTKFIFYSKK----LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 +DT + ++ + G AL +L +VD + +K+A Sbjct: 13 YDTGITLFLRERLSAFVHQEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRATD 72 Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + + EE S N + EY+ N + K +++ V V T Sbjct: 73 AAALAVGHQATINGEEYSQEDTNKLAY------EYVKNNL--GMNKALSEKLVASDVSVT 124 Query: 119 AVEMNPRKSAYQVVLS 134 + + Y V + Sbjct: 125 EGRNSATRKTYTVTAA 140 >gi|171742038|ref|ZP_02917845.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283456833|ref|YP_003361397.1| hypothetical protein BDP_2000 [Bifidobacterium dentium Bd1] gi|171277652|gb|EDT45313.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283103467|gb|ADB10573.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium dentium Bd1] Length = 967 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 28/246 (11%), Positives = 79/246 (32%), Gaps = 17/246 (6%) Query: 48 YYEHALKQAAQTAIITASVPLIQ---SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK 104 E +A +I + P + ++ P + + + Sbjct: 111 QSEEQKAGSADEPVIELATPSEAQPATASATPTQKPTGTENPTTVERSVQSDDDDADTVA 170 Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQ-VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 N + + + + + ++Y+ ++ S+ L + + G +++++ Sbjct: 171 NQNEAKDDETKNNADKTVRLGIASYRGMLKSASSGLSTPEHTKSIEYQGNGAYILKLNVI 230 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLD--------YQRDSEGQPLNCFGQPADRTVKSY 215 + S S + I V+D S SM D + + +N F +T + Sbjct: 231 GKDASTSTTDTTPIDIALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTI 290 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 N KV + K + + + + + S + + ++ + L++ Sbjct: 291 EDDNDKVKVALVKYANQIGTATGADGC-----RISNSRQSDTGNCTQIVQELTTDAGLLK 345 Query: 276 DALASV 281 ++ + Sbjct: 346 TSVNGL 351 >gi|322433071|ref|YP_004210320.1| hypothetical protein AciX9_4225 [Acidobacterium sp. MP5ACTX9] gi|321165298|gb|ADW71002.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 457 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 55/164 (33%), Gaps = 14/164 (8%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI------ITASVP 67 L+ + +G + A+ + V+LG G+ +DV + L+ A TA I Sbjct: 7 LVSNESGQVLVAAAVSLLVILGFLGISIDVGHRQLSKLRLQSATDTAATAAALEIRVCGS 66 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L+ S A S L+ N + +A ++ Sbjct: 67 LVSCPAMQS--AVQSSFIENGYPATPLLLNCATSSDDLTLVLNSPPCAMGSADPNYGKRG 124 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 +V ++ + F++ +GI + I ++EA Sbjct: 125 YVEVQVAQHV------PTYFMKMLGISQFNISARSEAARNPGGA 162 >gi|198426173|ref|XP_002130506.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 384 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 67/201 (33%), Gaps = 18/201 (8%) Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH-FVDSSSLRHVIKKKHLVRDALAS 280 V + + S P P+ + + FV +S K V + S Sbjct: 164 VSPNEADVRTCTESGTWEGIKSCCARPCPPNGNIDVVFVLHASDPGNWKTGLDVIATIQS 223 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI---VKTFAIDENEMGST 337 + + D R+G + ++ +++ A + G++ Sbjct: 224 LFKM------GKDFARVGFVGDIGHPFESTAMHLNDFSERASLSLAIRSQAFNNTANGAS 277 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 +I + ++ ++ S+ R N A+ IVL+TD N+ A + + G Sbjct: 278 SIAEVLRYVKRSMFST------RNGNRKNAENIIVLVTDQNNSAGGMTSAAY--ELRRSG 329 Query: 398 IRIMTIAFSVNKTQQEKARYF 418 I+ + F ++ ++ Sbjct: 330 IKTFVVNFGETESNKDLRNLL 350 >gi|188994155|ref|YP_001928407.1| hypothetical protein PGN_0291 [Porphyromonas gingivalis ATCC 33277] gi|188593835|dbj|BAG32810.1| conserved hypothetical protein [Porphyromonas gingivalis ATCC 33277] Length = 1228 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 69/228 (30%), Gaps = 38/228 (16%) Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 YQ SE L R + S + + SP +V Y + P+DP Sbjct: 63 YQSVSEHSDLVISPVNEIRPANRFPSHRKSFFAENLRASPPVVPVAVDKYAVPVANPMDP 122 Query: 252 SLSEEHFVD------------------SSSLRHVIKKKHLVRDALASVIRSIKKIDNV-- 291 V S + ++ A+AS R +KK+ Sbjct: 123 ENPNAWDVTLKITTKAVTVPVDVVMVIDQSSSMGGQNIARLKSAIASGQRFVKKMLPKGM 182 Query: 292 -NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + VR+ ++ F+ L + I + T ++ A + + Sbjct: 183 ATEGVRIALVSYDHEPHRLSDFTKDTAFLCQKI-----RALTPIWGTHTQGGLKMARNIM 237 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENT-QDNEEGIAICNKAKSQG 397 +S + H I+L++DG T Q + + + G Sbjct: 238 ATSTAVDKH-----------IILMSDGLATEQYPVKNVTTADFIGETG 274 >gi|163816539|ref|ZP_02207903.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] gi|158448239|gb|EDP25234.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] Length = 465 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 73/202 (36%), Gaps = 25/202 (12%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D + +D SS K A ++ I N V G F+ + Sbjct: 142 DIAQETMLVIDDSSSMKTSDKNDRRLTAANELLEHID----GNRKV--GLIRFSKDIHCY 195 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + K+ ++ + ++ + G T INDA+ Y + + Sbjct: 196 IPMDY--LKVNKSTLNHELENKAKEGGTDINDAL---YAVL------NAFDKVGTATGSR 244 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 ++LLTDG++T N + + N+A S I+I I+ + F+ S Sbjct: 245 SVILLTDGKST-TNVDEEYLINRANSMNIQINVISL-----GNHTDKAFIKRITSSTGGK 298 Query: 430 EANSTHE--LNKIFRDRIGNEI 449 A ++ + L+ + +G+ I Sbjct: 299 AAKTSSDFYLDAAYGVFLGSHI 320 >gi|197117534|ref|YP_002137961.1| hypothetical protein Gbem_1146 [Geobacter bemidjiensis Bem] gi|197086894|gb|ACH38165.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 383 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 36/106 (33%), Gaps = 2/106 (1%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 +++ V L + G+ +D+ + L+ +A+ A +T + L Q L + ++ Sbjct: 8 IMLVVFLVLTGLAIDIGYMYVSDEDLQHSAEMAALTGAESLKQRLLFQAQQSPEKLAQVS 67 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + R+ + +V N + + Sbjct: 68 ADPLQSAARSTAVD--NATGKHSASALVALMNDNGNALSENNDITV 111 >gi|167845909|ref|ZP_02471417.1| hypothetical protein BpseB_11520 [Burkholderia pseudomallei B7210] Length = 579 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 23/56 (41%), Gaps = 1/56 (1%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 2 GSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQP 56 >gi|187918047|ref|YP_001883610.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] gi|119860895|gb|AAX16690.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] Length = 341 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 55/180 (30%), Gaps = 32/180 (17%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F + R + + +D +A+ + A + Sbjct: 142 IGLVAFAKEASLIVPLTIDRDFFSRKLDDIYIMDL--GNGSALGLGVSIALSHL------ 193 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA------------ 404 K++ K+ +++LTDG D + N A+ ++I +I Sbjct: 194 -----KHSEAPKRSVIVLTDGVVNSDEVYKDQVINLAQGLNVKIYSIGIGSGEELSVGFK 248 Query: 405 -----FSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F ++ L AS F+ + +D E ER +RIT Sbjct: 249 LRSGKFYQGTLREVYDPSMLLEIASKTGGLFYSVSDDFSFKLAIQDFSKKENVERKVRIT 308 >gi|317133199|ref|YP_004092513.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] gi|315471178|gb|ADU27782.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] Length = 535 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 41/116 (35%), Gaps = 16/116 (13%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIA 388 G T I + TA + ++ + + ++L+TDG+ NT + Sbjct: 431 KLQAGGGTDIYTPVMTALQQLAGADVSQCNPA---------VILMTDGQSNTGRTFTNVQ 481 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 K+ + I I +I F Q K LS A F+ +L F+ Sbjct: 482 STYKSIGKDIPIFSIEFGAADPTQLKQFGTLSKAA----LFDGRK--DLVAAFKQA 531 >gi|118372013|ref|XP_001019204.1| hypothetical protein TTHERM_00997640 [Tetrahymena thermophila] gi|89300971|gb|EAR98959.1| hypothetical protein TTHERM_00997640 [Tetrahymena thermophila SB210] Length = 996 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 51/139 (36%), Gaps = 10/139 (7%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+G FN+ V K + G +I++A+ A+ I Sbjct: 824 KKFDRIGYYLFNESVKEVFGLQR-ADKNWCFLENMLQNMPTVSGKASISNALNEAF-KIY 881 Query: 352 SSNEDEVHRM---KNNLEAKKYIVLLTD-----GENTQDNEEGIAICNKAKSQGIRIMTI 403 N+ H + K +L KK+++L TD E Q ++ I + I ++ Sbjct: 882 DKNQYYAHTLHMKKVDLNYKKFVILFTDEADILQEGKQIKKQWQKIKENFRFSNIILVIF 941 Query: 404 AFSVNKTQQEKARYFLSNC 422 AF+ N + + + C Sbjct: 942 AFTDNHEVMKIFKELVDYC 960 >gi|241760758|ref|ZP_04758849.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] gi|241318655|gb|EER55207.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] Length = 1065 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 7/75 (9%) Query: 373 LLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSF 428 TDG + N + + +K Q ++ T+ F T + +L AS P + Sbjct: 260 FKTDGTDAAGKSWNGDPKDPADYSK-QLVQTFTVGFGQGIT--PTGKRYLQLAASRPEYY 316 Query: 429 FEANSTHELNKIFRD 443 +EA+ L+K+F D Sbjct: 317 YEADKPESLSKVFND 331 >gi|167738778|ref|ZP_02411552.1| hypothetical protein Bpse14_11973 [Burkholderia pseudomallei 14] Length = 578 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 23/56 (41%), Gaps = 1/56 (1%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 1 GSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQP 55 >gi|332365023|gb|EGJ42788.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK355] Length = 462 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 69/229 (30%), Gaps = 34/229 (14%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 +Y +S Y + ++ + GK GI Sbjct: 133 TEGAYQDNRLISYNLS-------GKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGI-- 183 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV--IKKKHLVRDALASVIRS 284 Y + + D S S + ++ + ++R +I+ Sbjct: 184 --AIAYRTDPIQGQMNIAVSFVFDTSGSMNWDLQGRNVEKTGNESRMDILRKKSVIMIKD 241 Query: 285 IKKIDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + +I NV+ + F+ + + G + +I TI K + N G T Sbjct: 242 LAEIGNVS----VNLVGFSTSAKYIQQNFSNLDNGTNTIIATITK--RENLNPDGVTNPG 295 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D ++ ++ S KYIVLLTDG + A+ Sbjct: 296 DGLRYGMISLQSQPAQL-----------KYIVLLTDGIPNAYLVDSRAL 333 >gi|301768024|ref|XP_002919431.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Ailuropoda melanoleuca] Length = 922 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 42/116 (36%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T+I M++A+ I R IVLLTDGE+ + Sbjct: 375 PKAANGGTSICAGMRSAFQVI---------REVYPQIDGSEIVLLTDGEDNSAKD----- 420 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ G I IA + ++A +S N FF ++ L F Sbjct: 421 CIDEVTQSGAIIHLIALGPSA---DQAVIEMSAMTGGNHFFASDEAQNNGLIDAFG 473 >gi|218261918|ref|ZP_03476586.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] gi|218223694|gb|EEC96344.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] Length = 339 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 54/172 (31%), Gaps = 36/172 (20%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + + + + TAI A+ A + Sbjct: 130 KVGLIVFAGDAFTQLPITSDYVSA-KMFLSSINPSMVSTQGTAIGAAINLAVRSFT---- 184 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + + K I+L+TDGEN +D+ + A +GI + + K Sbjct: 185 -------PSETSDKAIILITDGENHEDDA--VKAAAAAAEKGIHVNIVGMGDPKGSPIPV 235 Query: 413 EKARYFLS-------------------NCASPNSFFEANSTHELNKIFRDRI 445 + + ++ A ++ A++T+ + + I Sbjct: 236 DGSNNYMKDKDGNVVITKLNEEMCQEIAAAGHGTYVRADNTNSALRALQKEI 287 >gi|291227856|ref|XP_002733898.1| PREDICTED: inter-alpha trypsin inhibitor, heavy chain 3-like, partial [Saccoglossus kowalevskii] Length = 627 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 67/224 (29%), Gaps = 23/224 (10%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK------VGIRDEKLSPYMV 234 W ID S + + P K+ SS+ + + + + Sbjct: 221 WNIDQSADSPATIDRLSDTRAHVYYYPTRDEQKNQSSEGISGDFVIRYDVSHDLSAGQIQ 280 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 N + P L P FV S K ++AL ++ ++ D N Sbjct: 281 VLNGYFVHYFAPTGLPPVQKNVLFVIDVSGSMDGAKMGQTKEALRVILDDMRSFDRFNIL 340 Query: 295 VRMGATFF---NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 F N +++ K F + G T N + + + Sbjct: 341 TFSYEVSFWKENMMILATQE--------NILEAKNFVNNLRASGGTNFNGGLVEGVEMLR 392 Query: 352 SSNEDEVHRMKNNLEAKKYIVLL-TDGENTQDNEEGIAICNKAK 394 +D N E ++V++ TDG+ T + I AK Sbjct: 393 RVTDDAE-----NTERSAFLVIMLTDGQPTSGETQLTKIQENAK 431 >gi|146296332|ref|YP_001180103.1| Ig domain-containing protein [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409908|gb|ABP66912.1| Ig domain protein, group 2 domain protein [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 1831 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 71/206 (34%), Gaps = 27/206 (13%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM---LYPGPLDPSLSEEHFVDSSSLRHVIK 269 K+ +++ K + N Y L G + PS + + + Sbjct: 798 KATYNESSKTITVETDHFSVYYLINLKKYLDITGLKSGTVSPSGQADIVFVIDTTGSMSD 857 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKT 326 + V+ + + + +K D +V +G + D P+ + G Sbjct: 858 EIDAVKQNINNFVDKLKTKD---ISVNLGLVTYKDITCDGPNSTVGHGFFSSADDFKNAL 914 Query: 327 FAIDENEMGSTAIN--DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 +I + G T DA++TA ++ E+ K+IV+LTD +N Sbjct: 915 GSIKVDGGGDTPETLIDALETA--RLLGFRENS----------TKFIVVLTDANYKLENR 962 Query: 385 EGIAICNKA----KSQGIRIMTIAFS 406 GI ++ KS I + ++ Sbjct: 963 FGIKSADEIIERLKSDNIIVSVVSTM 988 >gi|281352223|gb|EFB27807.1| hypothetical protein PANDA_008059 [Ailuropoda melanoleuca] Length = 907 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 42/116 (36%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T+I M++A+ I R IVLLTDGE+ + Sbjct: 376 PKAANGGTSICAGMRSAFQVI---------REVYPQIDGSEIVLLTDGEDNSAKD----- 421 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ G I IA + ++A +S N FF ++ L F Sbjct: 422 CIDEVTQSGAIIHLIALGPSA---DQAVIEMSAMTGGNHFFASDEAQNNGLIDAFG 474 >gi|149067641|gb|EDM17193.1| rCG39970 [Rattus norvegicus] Length = 1163 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 47/119 (39%), Gaps = 9/119 (7%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++T + + S + + AKK ++++TDG+ +D E + A Sbjct: 225 GLTYTATGIRTVVEELFHS------KNGSRKSAKKILLVITDGQKYRDPLEYSDVIPAAD 278 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFE 451 GI I + + + ++P + F+ + L I + ++ +IF Sbjct: 279 KAGIIRYAIGVGDAFQEPTALKELNTIGSAPPQDHVFKVGNFAALRSI-QRQLQEKIFA 336 >gi|221109528|ref|XP_002169888.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 366 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 51/151 (33%), Gaps = 16/151 (10%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 S SLR K +AS D + + G F+ R Sbjct: 29 LDSSGSLRRDYKNLKEFLKTIASFF------DIKINGSQAGVITFSHRSEHSIKL-NDFS 81 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 I T I+ A++ + + + ++ LEA K ++LLTDG Sbjct: 82 DGDSFEKAVDKIPLMGST-TRIDKALRHSKNVMFTNQ------NGGRLEATKLLILLTDG 134 Query: 378 ENTQDN--EEGIAICNKAKSQGIRIMTIAFS 406 T E+ I ++ ++ G+ I+ I Sbjct: 135 SQTFSAKQEDPSIIADEIRNDGVLIIAIGIG 165 >gi|260813598|ref|XP_002601504.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] gi|229286801|gb|EEN57516.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] Length = 400 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 47/287 (16%), Positives = 90/287 (31%), Gaps = 37/287 (12%) Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 +E ++ S V +D D L+ + + Sbjct: 143 SELGNIASSAQTAASVGNFAALD----------DLRTSLLSSVCTDCSQNYYAIRKVKIL 192 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL--RHVIKKKHL-VRDAL 278 +G +S Y + Y + SL + VD S + +K L ++ A+ Sbjct: 193 LGNAQSTVSAYAIYMLCHDTYPYHSLLCGNSLDIIYLVDGSGSVGANNFEKVKLFIKKAV 252 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + + I V G ++ ++ + S + + ++ E G T Sbjct: 253 SGFV-----IGPAATQV--GVIQYSSKIRQEFSM--NSFQTVSGLLGAIDAMEYMQGGTL 303 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ A S + + K +V++TDG ++E +A+ QGI Sbjct: 304 TGRAIRYASKYGFSVFDGARRGVP------KVLVVVTDG---VSSDEVAIPALEAQRQGI 354 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + I S +Q L AS N L R+ + Sbjct: 355 FVYAIGVSNYDAEQ------LQKIASTNESSAMVDNFNLLDSVRNTL 395 >gi|296208409|ref|XP_002751081.1| PREDICTED: calcium-activated chloride channel regulator 4 [Callithrix jacchus] Length = 931 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 26/116 (22%), Positives = 46/116 (39%), Gaps = 20/116 (17%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T+I +++A+ I E++ + E IVLLTDGE+ + Sbjct: 376 PTRAQGGTSICSGIKSAFQVIG-----ELYSQLDGSE----IVLLTDGEDNTASS----- 421 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFR 442 C ++ K G + IA + +KA +SN + + ++ L F Sbjct: 422 CIDEVKQSGAIVHFIALGKDA---DKAVIEMSNITGGSHLYASDEAENNGLIDAFG 474 >gi|323143197|ref|ZP_08077893.1| HemX [Succinatimonas hippei YIT 12066] gi|322417038|gb|EFY07676.1| HemX [Succinatimonas hippei YIT 12066] Length = 483 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 46/122 (37%), Gaps = 4/122 (3%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVD---VVRWSYYEHALKQAAQTAIITASVPLI 69 K IKS G FI+T+L + + +G+ + + + +A Q I+ L Sbjct: 111 KKIKSLKGSNFILTSLSVVIFIGLAAFAIYSVKTTDVLQAKDDMLEANQQEILNVKNQLQ 170 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV-EMNPRKSA 128 S + +S ++ K+ E N L T E + + + ++ R Sbjct: 171 DSAQSLSQASEQINLLLKRNGELEDALNSYKGLDGKLTGIESANKTNASNIASLSARLKR 230 Query: 129 YQ 130 Y+ Sbjct: 231 YE 232 >gi|311743549|ref|ZP_07717355.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] gi|311312679|gb|EFQ82590.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] Length = 445 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 17/151 (11%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G IITA++M +L +G + +D S+ + AA T + ++ Sbjct: 12 IHEDRGATAIITAVIMTSLLSIGALTIDYGAASHVRRQTQNAADA---TTRSIVENCAKQ 68 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++ +N + + ++ N V + V ++ Sbjct: 69 AAAAGQNVLD--GACVSATGTADASTIVQGNAPGSAPDAPV------IGGAGREVSVTVA 120 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 D L + +G S ++++ A AE Sbjct: 121 EPVDYRL------AQLLGKDSDVVRSSATAE 145 >gi|324991934|gb|EGC23857.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK405] gi|332363536|gb|EGJ41317.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1059] Length = 462 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 72/231 (31%), Gaps = 38/231 (16%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI-- 224 +Y +S Y + ++ + GK GI Sbjct: 133 TEGAYQDNRLISYNLS-------GKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGIAL 185 Query: 225 --RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 R + + M ++ D + S +++KK ++ +I Sbjct: 186 AYRTDPIQGQMNIAVSFVFDTSGSMNWDLQGRNVEKTGNESRMDILRKKSVI------MI 239 Query: 283 RSIKKIDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + + +I NV+ + F+ + + G + +I TI K + N G T Sbjct: 240 KDLAEIGNVS----VNLVGFSTSAKYIQQNFSNLDNGTNTIIATITK--RENLNPDGVTN 293 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D ++ ++ S KYIVLLTDG + A+ Sbjct: 294 PGDGLRYGMISLQSQPAQL-----------KYIVLLTDGIPNAYLVDSRAL 333 >gi|289606823|emb|CBI60997.1| unnamed protein product [Sordaria macrospora] Length = 599 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 36/125 (28%), Gaps = 44/125 (35%) Query: 326 TFAIDENEMGST-------------AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ G T + ++A + + KKY++ Sbjct: 114 SYLNGLVARGGTYHDIGMIWGARFLSTTGLFKSATPETNDVTDPDNPAKIRGFSVKKYMI 173 Query: 373 LLTDGEN----TQDNEEGIAI---------------------------CNKAKSQGIRIM 401 +TDG+ + GI CN AK++GI I Sbjct: 174 FMTDGDMSPTWNDYSAYGIEYLDGRVNGSPTTDNAALLARHLQRFRMACNAAKAKGIDIW 233 Query: 402 TIAFS 406 IAFS Sbjct: 234 VIAFS 238 >gi|289442929|ref|ZP_06432673.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] gi|289415848|gb|EFD13088.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] Length = 246 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 59/183 (32%), Gaps = 27/183 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + + +G + S + + K D TA +A+ TA Sbjct: 41 ADELTPGINLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR-----TATGEAIFTAL 95 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----NTQDNEEGIAICNKAKSQGIRIMT 402 I + + IVL +DG+ N + + AK QG+ I T Sbjct: 96 QAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPIST 152 Query: 403 IAFSVNKTQQEKARY---------FLSNCA--SPNSFFEANSTHELNKIF---RDRIGNE 448 I+F E + A S + + A + EL ++ + +IG E Sbjct: 153 ISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQIGYE 212 Query: 449 IFE 451 + Sbjct: 213 TIK 215 >gi|254776724|ref|ZP_05218240.1| hypothetical protein MaviaA2_18936 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 59/201 (29%), Gaps = 33/201 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + N +G F + R+ VK Sbjct: 118 RLAAAKEAGKQFADQLTPAIN------LGLVEFAANATLLVPPT-----TNRSAVKAGID 166 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-----QDNE 384 TA + + TA I + V + IVL +DG + Sbjct: 167 SLQPAPKTATGEGIFTALQAIATVG--SVMGGGEGPPPAR-IVLESDGAENVPLDPNAPQ 223 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQ----------QEKARYFLSNC-ASPNSFFEANS 433 AK++G++I TI+F + C + F A+S Sbjct: 224 GAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADS 283 Query: 434 THELNKIF---RDRIGNEIFE 451 L ++ + +IG E + Sbjct: 284 LDSLKNVYSTLQRQIGYETVK 304 >gi|15594518|ref|NP_212307.1| hypothetical protein BB0173 [Borrelia burgdorferi B31] gi|3915348|sp|O51195|Y173_BORBU RecName: Full=Uncharacterized protein BB_0173 gi|2688067|gb|AAC66565.1| predicted coding region BB0173 [Borrelia burgdorferi B31] Length = 341 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 57/171 (33%), Gaps = 21/171 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++ + I ++ DN +G F + + + + + Sbjct: 122 RLEFSKELIRGFISQ-RENDN------IGLVAFAKDASIVVPITTDREFFNKKLDDIYIM 174 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D +A+ + A + K++ K+ IV+LTDG D + Sbjct: 175 DL--GNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEIXKDQV 221 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 N A+ ++I +I ++ + + S SF E L +I Sbjct: 222 INLAQGLNVKIYSIGIGSSEEFSVEFK-LRSGKFYQGSFKEVYDPSMLVEI 271 >gi|84385695|ref|ZP_00988726.1| hypothetical protein V12B01_26214 [Vibrio splendidus 12B01] gi|84379675|gb|EAP96527.1| hypothetical protein V12B01_26214 [Vibrio splendidus 12B01] Length = 436 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 47/327 (14%), Positives = 96/327 (29%), Gaps = 39/327 (11%) Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFL------RSMGIKSWL 157 KN D V D + D +++ + Y R + L+P + F I Sbjct: 46 KNLVDLYVTDNINDVEIDVTTTRCEYADGCVQR-NHELSPFTDFTVSARTEHKSWITHDE 104 Query: 158 IQTKAEAETVSRSYHKE---HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 I + E + RS ++ V I +++D S+SM + + Sbjct: 105 IGVEPEFKVSGRSVTRKFLPQPVDIYFILDTSQSMSNRWSGEKNNKTQMDVVKETIVKVV 164 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + K G + + Y Y D ++ D + R Sbjct: 165 EDLKTFKTGDEKKSQISLVT-------YNAYNAKFDKKSNQVKLYDYT--RDFKHTTETF 215 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + + + A +N + I+ +VK+ + Sbjct: 216 EGIVDKMFDQDWPAETTSY-----AYMYNTSQD--IPLTDDYDNFIK-LVKSDKLKPATG 267 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + +I++ ++ N ++ + L+DG + + N K + Sbjct: 268 GGTMS-------WLGLIAAAKEANKVEDVNRNPEQVFIFLSDGADNKVNYPKDLYLEKTR 320 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSN 421 S +V+ T L Sbjct: 321 S-----YRSKHNVDWTHYSDGSTVLQY 342 >gi|327403933|ref|YP_004344771.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319441|gb|AEA43933.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 375 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 16/131 (12%), Positives = 42/131 (32%), Gaps = 21/131 (16%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + + + A+ ++ +K R+ F + + + + + + Sbjct: 148 NDVSRLTAAKQAIGELLNQLKGE-------RIAVVIFANDAYTQLPLTMD-YGAAKLFIP 199 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T + A++ A + E+ K I+++TDGE+ + + Sbjct: 200 DIETSMISDQGTNVGRALEIAQEQFKD------------TESGKAILVITDGEDHEALWK 247 Query: 386 GIAICNKAKSQ 396 I K Sbjct: 248 E-QIAELKKKN 257 >gi|326332116|ref|ZP_08198400.1| putative von Willebrand factor type A domain protein [Nocardioidaceae bacterium Broad-1] gi|325950087|gb|EGD42143.1| putative von Willebrand factor type A domain protein [Nocardioidaceae bacterium Broad-1] Length = 338 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 17/117 (14%), Positives = 41/117 (35%), Gaps = 15/117 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD-PSFSWGVHKLIRTIVKTFA 328 + ++ L ++ ++ VR GA F V+ F++ + +A Sbjct: 103 RMTQLKKDLKALSAALPS-------VRFGAITFGGEVVRTEMPFTYDTTAFNAWVDGLYA 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + + ++ + E R ++ +V +DGENT++ + Sbjct: 156 ERAFDGSGSMVDAPRDEVISALERDQERFPER-------RRIVVFASDGENTREGVD 205 >gi|256420216|ref|YP_003120869.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035124|gb|ACU58668.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 211 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 55/173 (31%), Gaps = 13/173 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + V++ L ++ +++ ++V + F+ V + ++ I Sbjct: 20 RIEAVKNGLQVLVSKLRQDPFALESVWISIITFDREVKQLLPLT------ALESLQLPEI 73 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G T + A++ + + K + + L+TDG+ D + Sbjct: 74 TTPESGPTNMGAALEMLCSKLDAEVAKGSDTQKGDWRPLLF--LMTDGK-PSDLAAFREV 130 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K KS+ + A E FL + + L F+ Sbjct: 131 VPKVKSKNLA----ALVACAAGAEAQDSFLKELTDNVVHLDTADSSTLMSFFK 179 >gi|83720829|ref|YP_443052.1| hypothetical protein BTH_I2535 [Burkholderia thailandensis E264] gi|167582052|ref|ZP_02374926.1| hypothetical protein BthaT_28172 [Burkholderia thailandensis TXDOH] gi|167620215|ref|ZP_02388846.1| hypothetical protein BthaB_28164 [Burkholderia thailandensis Bt4] gi|83654654|gb|ABC38717.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 602 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 17/162 (10%), Positives = 48/162 (29%), Gaps = 7/162 (4%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G F + A+ M V + G+ VD+ + L++ A A + + + + Sbjct: 21 ARERGSFAVAAAIWMLVAIAALGV-VDIGNVFFVRRDLQRVADMAALAGAQRMDDQCSQP 79 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ A + + + + + T+ +N V +++ Sbjct: 80 NAAAAANASSNGFDRAASGNTLTVSCGRWDTQSNAAPSYFSTTSTPLNA------VQVTA 133 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 ++ L + + + + G+ Sbjct: 134 TQNVPYFFLGPSRTVSATSTAKATNVDQFTIGTTLASLQGGL 175 >gi|222530086|ref|YP_002573968.1| Ig domain-containing protein group 2 domain-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222456933|gb|ACM61195.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor bescii DSM 6725] Length = 1831 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 74/230 (32%), Gaps = 32/230 (13%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM--- 243 + + F P K+ +++ K + N Y Sbjct: 777 GQLSHFAVVCYDTKGGKFSLP-----KATYNESSKTITVETNHFSVYYLINLKKYLDITG 831 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 L G + PS + + + + V+ + + + +K D +V +G + Sbjct: 832 LKSGTVSPSGQADIVFVIDTTGSMSDEIDAVKQNINNFVDKLKTKD---ISVNLGLVTYK 888 Query: 304 DRVISDPSFSWG---VHKLIRTIVKTFAIDENEMGSTAIN--DAMQTAYDTIISSNEDEV 358 D + + G +I + G T DA++TA ++ E+ Sbjct: 889 DITCDGLNSTVGHGFFSSADDFKNALGSIKVDGGGDTPETLIDALETA--RLLGFRENS- 945 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA----KSQGIRIMTIA 404 K+IV+LTD +N GI ++ KS I + ++ Sbjct: 946 ---------TKFIVVLTDANYKLENRFGIKSADEIIERLKSDNIIVSVVS 986 >gi|329573764|gb|EGG55354.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1467] Length = 1103 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 47/328 (14%), Positives = 99/328 (30%), Gaps = 28/328 (8%) Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + S+ + QS + + K + + + + + K T+ E + +T Sbjct: 68 VQLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKIIVPENTKET 127 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA----ETVSRSYHKEH-- 175 + + + + + NP + + + A S +Y + Sbjct: 128 NKNDSAPEKTEQPTATEEVTNPFAEARMAPANLRANLALPLIAPQYTTDNSGTYPTANWQ 187 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 Q V++ + + N SY G D + Y Sbjct: 188 PTGNQNVLNHQGNKDGSAQWDGQTSWNGD---PTNRTNSYIEYGGTGDQADYAIRKYARE 244 Query: 236 CNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + +Y E VD S + + V+ + + ++ Sbjct: 245 TTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSG 304 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N+ + MG ++ ++ + G ++ +K + G T A++ A D Sbjct: 305 ITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKALRDAGDM 362 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + N KK IVLLTDG Sbjct: 363 LATPNGH-----------KKVIVLLTDG 379 >gi|116748933|ref|YP_845620.1| hypothetical protein Sfum_1496 [Syntrophobacter fumaroxidans MPOB] gi|116697997|gb|ABK17185.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB] Length = 427 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 49/152 (32%), Gaps = 1/152 (0%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 K +SK + G II AL + ++LG G VD+ + L+ AA + Sbjct: 2 KKHAIFSKSRLSERDGATVIIVALALVMLLGFGAFAVDIGYLYVVRNELQNAADAGALAG 61 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + L + N + + +R + R P Sbjct: 62 AAALYNNDGTAVQPTANVIG-QEAAMRNTAVRTAVEVTLNGNSGDVQRGHYSFATGTFTP 120 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW 156 S V L + L+ + F+ ++ + + Sbjct: 121 NASLLPVSLWNVSTEELDANTDFINAVKVTTH 152 >gi|224051388|ref|XP_002199708.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 417 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 64/168 (38%), Gaps = 17/168 (10%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S ML S++ +D SS I + ++ + +++V ++ + D + Sbjct: 219 SHEQMLCSKTCYNSVNIGFLIDGSSSIGEINFRLML-EFVSNVAKAFEISDIGSKV---A 274 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 A F + F+ H ++ + G TA DA+ T+ +D Sbjct: 275 AVQFTYNQRKEFGFT--DHVTKEKVLSAIHNIQYMSGGTATGDAISFTTRTVFGPVKDG- 331 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 K ++++LTDG++ D A AK GI + ++ + Sbjct: 332 -------PNKNFLIVLTDGQSYDDVTGPAA---AAKKAGITVFSVGVA 369 >gi|260797475|ref|XP_002593728.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] gi|229278956|gb|EEN49739.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] Length = 186 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 21/117 (17%), Positives = 45/117 (38%), Gaps = 17/117 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G ++D + G H + V + + G TA DAM+ A + + Sbjct: 39 TRVGVVQYSDFNTLACNL--GDHPDEASFVTAINTMQYQGGGTATGDAMEYARVKLQA-- 94 Query: 355 EDEVHRMKNNLEAKKY-----IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + +K+ +++LTDG + ++ +A + G+ + I + Sbjct: 95 -----VWRPAPTPRKFPLFQIMIVLTDG---KSGDDVVAAAQALAADGVTVYAIGVA 143 >gi|183222779|ref|YP_001840775.1| putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|189912810|ref|YP_001964365.1| hypothetical protein LBF_3320 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777486|gb|ABZ95787.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781201|gb|ABZ99499.1| Hypothetical protein; putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 550 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 62/181 (34%), Gaps = 24/181 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V DA +++ + + D V+ ++ V H + V Sbjct: 58 KMEAVIDASCALVNWLTRHDAVS------IVAYSADVQLIQPV---THLTEKVSVTDKIR 108 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + ST ++ +A ++ S + K ++LLTDG T ++ A+ Sbjct: 109 NIQVATSTNLSGGWLSALKSLNQSKIPNAY---------KRVLLLTDGNPTSGIKDKEAL 159 Query: 390 CNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 A GI TI + ++ A +F+ ++ + IF + G Sbjct: 160 VTIAADHLSMGISTTTIGVGNDFNEEMLVEI---AKAGGGNFYYIDNPENASDIFFEEFG 216 Query: 447 N 447 + Sbjct: 217 D 217 >gi|220923701|ref|YP_002499003.1| hypothetical protein Mnod_3796 [Methylobacterium nodulans ORS 2060] gi|219948308|gb|ACL58700.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 439 Score = 38.8 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 39/115 (33%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 S+ L + +G +I AL PV++G G+ + W + L+ AA + A V Sbjct: 15 SRALKRDVSGTAAVIAALAFPVVIGGMGLGAETGYWYLTQRKLQHAADLSAHAAGVRKRA 74 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + A + L N+ + + E+ PR Sbjct: 75 GDPKSQIDAAALNIALNSGMSSSLGNMLANSPPTSGIKAGDTSSLEVILTEVRPR 129 >gi|160880107|ref|YP_001559075.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] gi|160428773|gb|ABX42336.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] Length = 513 Score = 38.8 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 40/120 (33%), Gaps = 20/120 (16%) Query: 306 VISDPSFSW---GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + W + + + + D G T I ++ A + + + Sbjct: 390 IPFSSELKWVDSAISGIDTANLISRIKDTEAHGKTNIYAPVEHAIEIL---------KDF 440 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS-VNKTQQEKARYFLSN 421 + K IVL+TDGE+ + +G + I + +I F N Q ++ Sbjct: 441 DADVYTKSIVLMTDGESAGNFRKGTSY-------DIPVFSIMFGEANPKQLDEISRLTKG 493 >gi|312878214|ref|ZP_07738140.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor lactoaceticus 6A] gi|311795008|gb|EFR11411.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor lactoaceticus 6A] Length = 1831 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 71/204 (34%), Gaps = 27/204 (13%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM---LYPGPLDPSLSEEHFVDSSSLRHVIK 269 K+ +++ K + N Y L G + PS + + + Sbjct: 798 KATYNESSKTITVETNHFSVYYLINLKKYLDITGLKSGTVSPSGQADIVFVIDTTGSMSD 857 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVKT 326 + V+ + + + +K D +V +G + D P+ + G Sbjct: 858 EIDAVKQNINNFVDKLKTKD---ISVNLGLVTYKDITCDGPNSTVGHGFFSSADDFKNAL 914 Query: 327 FAIDENEMGSTAIN--DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 +I + G T DA++TA ++ E+ K+IV+LTD +N Sbjct: 915 GSIKVDGGGDTPETLIDALETA--RLLGFRENS----------TKFIVVLTDANYKLENR 962 Query: 385 EGIAICNKA----KSQGIRIMTIA 404 GI ++ KS I + ++ Sbjct: 963 FGIKSADEIIERLKSDNIIVSVVS 986 >gi|296331311|ref|ZP_06873783.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305676297|ref|YP_003867969.1| hypothetical protein BSUW23_18125 [Bacillus subtilis subsp. spizizenii str. W23] gi|296151426|gb|EFG92303.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305414541|gb|ADM39660.1| putative exported protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 227 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 47/139 (33%), Gaps = 10/139 (7%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 N + V+ ++F N +G T + + NE + + Sbjct: 90 NSGKVQSCESIRNVYGFQSFNEQSFLNSLNAIGPTGWTPIAK-------ALNEAKSSFDQ 142 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + + +K + LLTDGE T I + I + I F + + + Sbjct: 143 LDTKGEKVVYLLTDGEETCGG-NPIKTAKELHKDNITVNVIGFDYKEGYKGQLNAIAK-- 199 Query: 423 ASPNSFFEANSTHELNKIF 441 +F A + ++ KIF Sbjct: 200 VGGGEYFPAYTQKDVEKIF 218 >gi|256958585|ref|ZP_05562756.1| von Willebrand factor [Enterococcus faecalis DS5] gi|256949081|gb|EEU65713.1| von Willebrand factor [Enterococcus faecalis DS5] Length = 666 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 60/335 (17%), Positives = 110/335 (32%), Gaps = 33/335 (9%) Query: 52 ALKQAAQTAIITASVP---LIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + + + + P L ++ E V+S K+ E I EN + N Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETN 179 Query: 106 FTDREVRDIVRDTAVE--MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT-KA 162 D + TA E NP A + R +L L ++ + ++ + Sbjct: 180 KNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQP 239 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 ++ S QW S + R + G AD ++ Y+ + Sbjct: 240 TGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTP 299 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G+ D L+ L VD S + + V+ + + Sbjct: 300 GLFDVYLN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFV 348 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ N+ + MG ++ ++ + G ++ +K + G T A Sbjct: 349 DTLADSGITNN-INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNI-TPSSTRGGTFTQKA 406 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ A D + + N KK IVLLTDG Sbjct: 407 LRDAGDMLATPNGH-----------KKVIVLLTDG 430 >gi|322434934|ref|YP_004217146.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162661|gb|ADW68366.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 373 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 56/160 (35%), Gaps = 24/160 (15%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 D +S +DSS + K++ V A +++ +D F+ Sbjct: 134 KHQDLPVSIGLLIDSSGSMYD--KRNAVDKASIDLVKLSNPMDEEF------LVDFSTEA 185 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 D F+ V KL + + G TA DA+ + D + KN Sbjct: 186 FIDTDFTTSVDKLSQGL-----SYIKSSGGTAAYDALVASADYLT----------KNAKN 230 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAF 405 K+ ++++TDGE+ + + + G I + Sbjct: 231 TKQVLIIITDGEDNASSATLEQSIRRIQDLDGPVIYCVGL 270 >gi|290970562|ref|XP_002668176.1| predicted protein [Naegleria gruberi] gi|284081406|gb|EFC35432.1| predicted protein [Naegleria gruberi] Length = 518 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 48/144 (33%), Gaps = 16/144 (11%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ + + T + + G T+ +AMQ+ I S+ Sbjct: 383 IMYDHSAYNCF-----TTNPLSTSLVSTMSSYVAGGGTSFTNAMQSVSSLISSTYP---- 433 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 N+ K ++ ++DGE++ D I S I + TI + + Sbjct: 434 ---NHQSYKIVVLFMSDGEDSADEAVSITG-QLVSSHDIILHTIQLGGSSDNTGLRQM-- 487 Query: 420 SNCASPNSFFEANSTHELNKIFRD 443 F AN + L I+++ Sbjct: 488 -AATGRGQFKRANDSASLAGIYQE 510 >gi|118353091|ref|XP_001009816.1| hypothetical protein TTHERM_00160860 [Tetrahymena thermophila] gi|89291583|gb|EAR89571.1| hypothetical protein TTHERM_00160860 [Tetrahymena thermophila SB210] Length = 1187 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 54/164 (32%), Gaps = 16/164 (9%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 S+ I K D ++G FN+ V + K +I + G + Sbjct: 1007 RSIFDKIDKKD------KIGFVQFNELVHENFPLQEKELYNDLLEKKIRSIPMSTGGKSN 1060 Query: 339 INDAMQTAY---------DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + A+ + I + + +NN ++I L T+G N ++ AI Sbjct: 1061 LYKALDVTTGLFDKCQQMNNIEDNQKKSTLDQENNERL-RFICLFTEGNNQIKEDQLEAI 1119 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 K K + + ++ I RY + FF+ Sbjct: 1120 KEKMKKKQVNLIIINICNQNANMLYLRYLAKEISELGQFFQQGD 1163 >gi|119952971|ref|YP_945180.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] gi|119861742|gb|AAX17510.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] Length = 341 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 39/114 (34%), Gaps = 13/114 (11%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F + + + + +D +A+ + A + Sbjct: 142 IGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDL--GNGSALGLGISIALSHL------ 193 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 K++ K+ +++LTDG D + N A+ ++I +I ++ Sbjct: 194 -----KHSEAPKRSVIVLTDGVVNSDEVYKDQVINLAQGLNVKIYSIGIGSSEE 242 >gi|227552253|ref|ZP_03982302.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895164|ref|ZP_05674817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293378024|ref|ZP_06624201.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227178583|gb|EEI59555.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257831729|gb|EEV58150.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292643342|gb|EFF61475.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 498 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 61/198 (30%), Gaps = 21/198 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + Q + LN + SY G+ D + + + LY Sbjct: 241 HQGNRQGQQQWDGLNGWDGNPTNRNNSYIEYGGEKEDADYAIRKFAKETATPGLFDLYLN 300 Query: 248 PLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + VD S + + V+ + + ++ + D + MG Sbjct: 301 VRGNTQKNITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVG 359 Query: 302 FNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ S + G ++ VK+ G T A++ A + + N Sbjct: 360 YSIEGYSYSNGAVQMGSFDSVKNQVKSI-TPSWTNGGTFTQKALRDAGNMLSVPNGH--- 415 Query: 360 RMKNNLEAKKYIVLLTDG 377 KK IVLLTDG Sbjct: 416 --------KKVIVLLTDG 425 >gi|255578117|ref|XP_002529928.1| protein binding protein, putative [Ricinus communis] gi|223530558|gb|EEF32436.1| protein binding protein, putative [Ricinus communis] Length = 731 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 41/118 (34%), Gaps = 16/118 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + Sbjct: 295 TKLALLKRAMGFVIQNLGPSD------RLSVIAFSSTARRLFPLRCMTEAGRQEAL-LSV 347 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I + ++ I+ + KN + + I+LL+DG++T Sbjct: 348 NSLVSNGGTNIAEGLRKGAKVIVD------RKWKNPVAS---IILLSDGQDTYTVTSP 396 >gi|170741047|ref|YP_001769702.1| hypothetical protein M446_2843 [Methylobacterium sp. 4-46] gi|168195321|gb|ACA17268.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 439 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 49/159 (30%), Gaps = 25/159 (15%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K + G +I L +P+++ G ++ R L++A TA + A+ L + Sbjct: 5 KTFLADRAGAAALILGLCLPMLVAGSGAALEYARIHKRRTELQKAVDTAALAAAGELTIA 64 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + Y+ + + V+ ++ Sbjct: 65 GSDT-----------------YVRNLAVRTAYEAAGGTDPAVTRATARVQNRRGWVQVEI 107 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 + R SLF R + + + + +A E + Sbjct: 108 RETVR--------SLFGRLLTMPTMELSAQATGELSGST 138 >gi|16080727|ref|NP_391555.1| hypothetical protein BSU36740 [Bacillus subtilis subsp. subtilis str. 168] gi|221311634|ref|ZP_03593481.1| hypothetical protein Bsubs1_19866 [Bacillus subtilis subsp. subtilis str. 168] gi|221315962|ref|ZP_03597767.1| hypothetical protein BsubsN3_19787 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221320873|ref|ZP_03602167.1| hypothetical protein BsubsJ_19730 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325158|ref|ZP_03606452.1| hypothetical protein BsubsS_19896 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313224|ref|YP_004205511.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] gi|8928525|sp|P70960|YWMC_BACSU RecName: Full=Uncharacterized protein ywmC; Flags: Precursor gi|1648853|emb|CAB03680.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636199|emb|CAB15691.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019498|gb|ADV94484.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] Length = 227 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 68/197 (34%), Gaps = 13/197 (6%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF--FND 304 P + ++ + + + K + + ++ S+ + V +V G+ N Sbjct: 33 APANVAVLLDASGSMAKRIDGVSKFNSAKKEISKFASSLPEGTQVKMSV-FGSEGNNKNS 91 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + V+ ++F N +G T + + NE + + + Sbjct: 92 GKVQSCEAIRNVYGFQSFNEQSFLNSLNTIGPTGWTPIAK-------ALNEAKSSFDQLD 144 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 + +K + LLTDGE T I + + I + I F + + + Sbjct: 145 AKGEKVVYLLTDGEETCGG-NPIKTAKELQKDNITVNVIGFDYKEGYKGQLNAIAK--VG 201 Query: 425 PNSFFEANSTHELNKIF 441 +F A + ++ KIF Sbjct: 202 GGEYFPAYTQKDVEKIF 218 >gi|332710564|ref|ZP_08430509.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] gi|332350619|gb|EGJ30214.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] Length = 579 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 77/200 (38%), Gaps = 23/200 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + +D S + K L++DA ++ +++ D V+ V GA V+ P Sbjct: 213 PPSNLVFLLDVSGSMNTPNKLPLLKDAFRMLVNELREEDQVSIVVYAGAAG----VVLPP 268 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + K++ I N GSTA ++ AY + N + + Sbjct: 269 TPGNEKDKILTAIEN-----LNAGGSTAGGAGIKLAY-KLAQDNFIKSGNNR-------- 314 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ + E + + + +++G+ + + F Q K + + Sbjct: 315 VILATDGDFNVGVSSDTELVKLIEQKRNKGVFLTVLGFGSGNLQDSKMEKIANK--GNGN 372 Query: 428 FFEANSTHELNKIFRDRIGN 447 + ++ E K+ + IG Sbjct: 373 YAYIDNELEAKKVLVNEIGA 392 >gi|329963582|ref|ZP_08301061.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528571|gb|EGF55542.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 342 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 44/139 (31%), Gaps = 18/139 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A V + + K+ N +G F + + + +++ Sbjct: 112 LEKAKRLVAQLVDKMQNDK----VGMIVFAGDAFTQLPITSDYISA-KMFLESIDPSLIS 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI A+ A + + I+++TDGEN + + A Sbjct: 167 KQGTAIGAAINLATRSFT-----------PQEGVGRAIIVITDGENHEG--GAVEAAKAA 213 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 +GI++ + + Sbjct: 214 TEKGIQVNVLGVGMPDGAP 232 >gi|91775988|ref|YP_545744.1| membrane protein-like protein [Methylobacillus flagellatus KT] gi|91709975|gb|ABE49903.1| membrane protein-like protein [Methylobacillus flagellatus KT] Length = 542 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 21/117 (17%), Positives = 33/117 (28%), Gaps = 3/117 (2%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 +KK + G F ++T L+ + + VD R + L+ A A I A + Sbjct: 4 NTKKKQQGAIGLFGVLTLLMA---VLFVAVAVDSGRLWMEKRKLQNIADMAAIAAGGQVG 60 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + SS A + Y T N R Sbjct: 61 GCAQNNSSEAYKAAAQAAAAANGYQGNLLAAPNAVQLGGYHTDSDGIRTFAANNERS 117 >gi|221108025|ref|XP_002157834.1| PREDICTED: similar to proximal thread matrix protein 1, partial [Hydra magnipapillata] Length = 299 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 46/123 (37%), Gaps = 10/123 (8%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ R + + + +T I+ A++ A + +S Sbjct: 182 RVGVITFSYRAELSVKL--NSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTSA- 238 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+LLTDG T D E+ I ++ ++ G+ I+ + + E Sbjct: 239 -----NGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETE 293 Query: 414 KAR 416 + Sbjct: 294 LSH 296 >gi|229822438|ref|YP_002883964.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229568351|gb|ACQ82202.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 399 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 12/90 (13%), Positives = 29/90 (32%), Gaps = 7/90 (7%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R ++ + + + TA++ + + + + E Sbjct: 111 RYSIIGWDSQATRQLPLTTDARAVRSWADTLRQEVSAYSAGTAVDRPL----EALRDALE 166 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 R N+ + + L+DGENT ++ Sbjct: 167 GAAERNPGNV---RLVFFLSDGENTNGDDS 193 >gi|118464548|ref|YP_883428.1| hypothetical protein MAV_4290 [Mycobacterium avium 104] gi|118165835|gb|ABK66732.1| conserved hypothetical protein [Mycobacterium avium 104] Length = 335 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 58/201 (28%), Gaps = 33/201 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + N +G F + R VK Sbjct: 118 RLAAAKEAGKQFADQLTPAIN------LGLVEFAANATLLVPPT-----TNRAAVKAGID 166 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-----QDNE 384 TA + + TA I + V + IVL +DG + Sbjct: 167 SLQPAPKTATGEGIFTALQAIATVG--SVMGGGEGPPPAR-IVLESDGAENVPLDPNAPQ 223 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQ----------QEKARYFLSNC-ASPNSFFEANS 433 AK++G++I TI+F + C + F A+S Sbjct: 224 GAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADS 283 Query: 434 THELNKIF---RDRIGNEIFE 451 L ++ + +IG E + Sbjct: 284 LDSLKNVYSTLQRQIGYETVK 304 >gi|308472927|ref|XP_003098690.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] gi|308268290|gb|EFP12243.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] Length = 411 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 51/145 (35%), Gaps = 8/145 (5%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G +N + + + + + + + + T D+ + Sbjct: 86 TTRVGLITYN----FNATLNANLSQFQSYDDLSNGVFHSLSNVTNSTDSFIGTGLAMAEQ 141 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + KK I++ ++E I ++ K G++I+T+ + + + Sbjct: 142 LLRRQNFNTTRDHYKKVIIVYASAFQRNEDETPEWIADRLKGSGVKIITVGYGNSHGLIK 201 Query: 414 KARYFLSNCASPNSFFEANSTHELN 438 LSN ASP F ++ L Sbjct: 202 S----LSNIASPGLSFNSSGDGNLI 222 >gi|255068089|ref|ZP_05319944.1| neisseria PilC protein [Neisseria sicca ATCC 29256] gi|255047687|gb|EET43151.1| neisseria PilC protein [Neisseria sicca ATCC 29256] Length = 1097 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 14/72 (19%), Positives = 24/72 (33%), Gaps = 6/72 (8%) Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DG+ D + Q ++ T+ F N +Y + + +F A Sbjct: 303 DGDPNIDPKGV-----DYSKQTVQTFTVGFG-NGISPAGEQYLIRGASHDGWYFNAAKPD 356 Query: 436 ELNKIFRDRIGN 447 +L K F I Sbjct: 357 DLYKAFEKIISQ 368 >gi|41409533|ref|NP_962369.1| hypothetical protein MAP3435c [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570936|sp|Q73UD3|Y3435_MYCPA RecName: Full=UPF0353 protein MAP_3435c gi|41398364|gb|AAS05985.1| hypothetical protein MAP_3435c [Mycobacterium avium subsp. paratuberculosis K-10] Length = 335 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 58/201 (28%), Gaps = 33/201 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + N +G F + R VK Sbjct: 118 RLAAAKEAGKQFADQLTPAIN------LGLVEFAANATLLVPPT-----TNRAAVKAGID 166 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-----QDNE 384 TA + + TA I + V + IVL +DG + Sbjct: 167 SLQPAPKTATGEGIFTALQAIATVG--SVMGGGEGPPPAR-IVLESDGAENVPLDPNAPQ 223 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQ----------QEKARYFLSNC-ASPNSFFEANS 433 AK++G++I TI+F + C + F A+S Sbjct: 224 GAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADS 283 Query: 434 THELNKIF---RDRIGNEIFE 451 L ++ + +IG E + Sbjct: 284 LDSLKNVYSTLQRQIGYETVK 304 >gi|227833165|ref|YP_002834872.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] gi|227454181|gb|ACP32934.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 521 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 48/361 (13%), Positives = 102/361 (28%), Gaps = 35/361 (9%) Query: 31 PVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKI 90 P+ LGV + W+ + ++ A + I S S+ + + F Sbjct: 122 PIALGVQSSVAQAQGWTSKQPTWQEIADSGITFGMTDPSTSNSGFSALSAATTAFADTG- 180 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 +K+ + + S + + + Sbjct: 181 --------RALTEKDIKQSAGKVQKLFGNQTLTSGSSGWLADRFREHPEQADA------I 226 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 +S L Q K E + + +S + + S D +++ Q L + Sbjct: 227 FNYESVLYQLKDEGADLEVVIPSDGVISADYPLSSLASSSDKDTEAKVQALAEWLAERPD 286 Query: 211 TVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + S+ + PY + + V +S + Sbjct: 287 KLTSFHLRVDNSDLPGTVFELPYPANEQTVDALEAAFAHELRNPGNTALVLDTSGSMEGE 346 Query: 270 KKHLVRDALASVIR-SIKKIDNVNDTV------RMGATFFNDRVISDPSFSWGVHK-LIR 321 + L++ +L +I S + + V ++ ++ K Sbjct: 347 RMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYSSEPQQPTRARVDKDKPATT 406 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + G TA +A+ A+D + +S D +VL+TDGE T+ Sbjct: 407 KELADRVERLVADGDTATFEAVLNAFDEVDTSGGDIG-----------TVVLMTDGEVTR 455 Query: 382 D 382 Sbjct: 456 G 456 >gi|113867117|ref|YP_725606.1| von Willebrand factor type A (vWA) domain-containing protein [Ralstonia eutropha H16] gi|113525893|emb|CAJ92238.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ralstonia eutropha H16] Length = 566 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 48/159 (30%), Gaps = 17/159 (10%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ + + G K T + GSTA + AY Sbjct: 227 RITLVTYASGTRVALPPTPGSDK---TAISAAIDQLVAGGSTAGASGIALAYQA------ 277 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + ++L TDG+ + ++ + + G+ + T+ F + Sbjct: 278 -AQQSFIAGGINR--VLLATDGDFNVGVTDFRQLKSMVEEKRKSGVSLSTLGFGTGNYNE 334 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + A ++ ++ E NK+ I + + Sbjct: 335 QLMEQLAD--AGDGAYSYIDNLMEGNKVLVSEISSTLAT 371 >gi|298483471|ref|ZP_07001648.1| two-component system sensor histidine kinase [Bacteroides sp. D22] gi|298270419|gb|EFI12003.1| two-component system sensor histidine kinase [Bacteroides sp. D22] Length = 869 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 41/290 (14%), Positives = 100/290 (34%), Gaps = 22/290 (7%) Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFT-DREVRDIVRDTAVEM-----NPRKSAYQVVLSS 135 +F K E ++ K N ++ + T + PR S ++V++ Sbjct: 170 DNSFLSNKGLEDFEEEWKIFQKDNPDYRMKIYNTQNHTTSHIIAAICYPRNSYERLVVAP 229 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ-- 193 ++ L+ + ++ + + V +Y + S + S+L Sbjct: 230 KWSPFLSFVGKNSKAPVFSTQNV---GLTNGVFSAYDADSYTSASLAAQRAASVLKGTSP 286 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 RD + + + + + KV ++ K L+ +LYP L + Sbjct: 287 RDIGVTEITQGFIFDYKQLDFFHVDSDKVSSSGTIVNEPYWEKYKYLFILLYPSILALLI 346 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDAL--ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + ++ ++ R ++ L ++ + DNV ++R G ++ + Sbjct: 347 ASIVWLMRANRRESKRRIQAQTRLLVQNKLVEQRNEFDNVFHSIRDGVITYDTDLH---- 402 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 +H R++++ + G M + I + +D +H+M Sbjct: 403 ----IHFTNRSLLQMLHLPYEA-GGRFYEGMMAGSIFKIYYNGQDILHKM 447 >gi|295398785|ref|ZP_06808791.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] gi|294972971|gb|EFG48792.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] Length = 516 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 48/304 (15%), Positives = 90/304 (29%), Gaps = 42/304 (13%) Query: 174 EHGVSIQWVIDFSRSM--LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + GVS++ + + S+ L G + + K R Sbjct: 5 KGGVSLKKIYNKIISLFVLFTLLLGYGGSVVSHVSAESTGGLQDTIGVDKTAKRTPGCRT 64 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + V+ N + P + L ++ + + + A + + N Sbjct: 65 FEVNLNITGELQQAPVDVVLVLDRSGSMNFVETPNSPTRLDYGKLAAINFAERV-LGPNG 123 Query: 292 NDTVRMGATFFN-----DRVISDPSFSWGVHKLIRT---------IVKTFAIDENEMGST 337 R+ F+ V ++P +G T V G T Sbjct: 124 IPGSRVSVVSFSGPAYATGVRNNPQRHYGQQNQATTDLDLSSDLRAVTDSINRITAFGGT 183 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK----- 392 + I + ++ N + K +++LTDG T N A Sbjct: 184 NTEAGFEQGRSVIEGTTSNQ------NPNSNKVVIMLTDGLPTASNGNPYAETTDINHVH 237 Query: 393 ---AKSQGIRIM--------TIAF--SVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 A + G I TI +N T++ A L+ A ++ A S +L+ Sbjct: 238 IQRAINAGKNIYQNDIADVFTIGLTTGMNATEKALADNILTQ-AQNKGYYPAPSATDLDA 296 Query: 440 IFRD 443 IF + Sbjct: 297 IFEE 300 >gi|253701737|ref|YP_003022926.1| hypothetical protein GM21_3141 [Geobacter sp. M21] gi|251776587|gb|ACT19168.1| conserved hypothetical protein [Geobacter sp. M21] Length = 383 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 15/111 (13%), Positives = 37/111 (33%), Gaps = 12/111 (10%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 +++ V L + G+ +D+ + L+ +A+ A +T + E + R Sbjct: 8 IMLVVFLVLTGLAIDIGYMYVSDEDLQHSAEMAALTGA-------ESLKKRLLLQAQHSP 60 Query: 88 QKIEEYLIRNFENNLKKNFTDR-----EVRDIVRDTAVEMNPRKSAYQVVL 133 K+ + L ++ + D +V N + + Sbjct: 61 GKLAQVLADPLQSAARSVAVDTATGKHSASALVALMNDNGNALTENNDITV 111 >gi|299138185|ref|ZP_07031365.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600115|gb|EFI56273.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 382 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 53/166 (31%), Gaps = 3/166 (1%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS-IKKIDNVNDTVRMGATFFNDRVIS 308 D L+ VD+S + R A + + + ++ + F V Sbjct: 81 DVPLTLGLLVDTSQSQR--TVIDSERSASGTFLDKMLAPASANRESDKAFVVQFAREVEL 138 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + KL + + + + ST + T+ + + + + + Sbjct: 139 LQDPTDSRSKLKQALKELDTTAPSTSSSTGDDSGHAHGGTTLYDAVFLSADEVTSKQKGR 198 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + ++LLTDG + E A+ I I F + ++ Sbjct: 199 RALILLTDGVDRNSKESIAEAIEAAQRADTIIYAIYFKGEEPHRDN 244 >gi|294645005|ref|ZP_06722736.1| PAS domain S-box [Bacteroides ovatus SD CC 2a] gi|294806981|ref|ZP_06765803.1| PAS domain S-box [Bacteroides xylanisolvens SD CC 1b] gi|292639654|gb|EFF57941.1| PAS domain S-box [Bacteroides ovatus SD CC 2a] gi|294445816|gb|EFG14461.1| PAS domain S-box [Bacteroides xylanisolvens SD CC 1b] Length = 862 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 41/290 (14%), Positives = 100/290 (34%), Gaps = 22/290 (7%) Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFT-DREVRDIVRDTAVEM-----NPRKSAYQVVLSS 135 +F K E ++ K N ++ + T + PR S ++V++ Sbjct: 163 DNSFLSNKGLEDFEEEWKIFQKDNPDYRMKIYNTQNHTTSHIIAAICYPRNSYERLVVAP 222 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ-- 193 ++ L+ + ++ + + V +Y + S + S+L Sbjct: 223 KWSPFLSFVGKNSKAPVFSTQNV---GLTNGVFSAYDADSYTSASLAAQRAASVLKGTSP 279 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 RD + + + + + KV ++ K L+ +LYP L + Sbjct: 280 RDIGVTEITQGFIFDYKQLDFFHVDSDKVSSSGTIVNEPYWEKYKYLFILLYPSILALLI 339 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDAL--ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + ++ ++ R ++ L ++ + DNV ++R G ++ + Sbjct: 340 ASIVWLMRANRRESKRRIQAQTRLLVQNKLVEQRNEFDNVFHSIRDGVITYDTDLH---- 395 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 +H R++++ + G M + I + +D +H+M Sbjct: 396 ----IHFTNRSLLQMLHLPYEA-GGRFYEGMMAGSIFKIYYNGQDILHKM 440 >gi|257052324|ref|YP_003130157.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691087|gb|ACV11424.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 592 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 60/201 (29%), Gaps = 19/201 (9%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 TV S+ + R ++ + S+ D + + S K Sbjct: 172 TVGLNSTLDTSTFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEGDSR----SK 227 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + +DAL ++ + D R+G FN+ + I D Sbjct: 228 MAVAKDALVALTEQLHPDD------RVGVVLFNNEPTVAKPLRDVETTDMDAIRGHIRED 281 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I D M A D + + + + +++TD + A+ Sbjct: 282 IEAGGGTNIADGMAEAADMLGEYADSDPTEAETRQ------IVITDAMPNTGQTDDQALQ 335 Query: 391 NKA---KSQGIRIMTIAFSVN 408 ++ GI + V+ Sbjct: 336 DRLAGYAEDGIHTSFVGVGVD 356 >gi|326506938|dbj|BAJ91510.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 378 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 30/86 (34%), Gaps = 7/86 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ V H + ++ Sbjct: 282 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTVRRLFHLRRMSHSGRQQALQA-V 334 Query: 329 IDENEMGSTAINDAMQTAYDTIISSN 354 G T I DA++ A I + Sbjct: 335 NSLGAGGGTNIADALKKAAKVIEDRS 360 >gi|225174961|ref|ZP_03728958.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] gi|225169601|gb|EEG78398.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] Length = 357 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 9/43 (20%), Positives = 20/43 (46%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL 53 + L K G+ +I A+ + +LG +++DV ++ Sbjct: 10 FRYLFKDERGNVLVIFAVALIALLGFAAIVIDVGGMYVERRSM 52 >gi|221111402|ref|XP_002161005.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 1100 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 59/180 (32%), Gaps = 25/180 (13%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 ++ +K L++ + S+ I + G F+ + Sbjct: 80 KNYKNEKDLLKT-----LASLFSIKPNGS--QAGVITFSFYTEHSIKL--NQFSDQDSFN 130 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA--KKYIVLLTDGENTQD 382 +T I+ ++ A + +++N K +VLLTDG TQ Sbjct: 131 DAVDRIPLMGHTTRIDKGLRLAQKEMF--------KVENGGRPGVSKLLVLLTDGSQTQG 182 Query: 383 N--EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + I ++ + QG+ I+ I + E + + + A+ +L + Sbjct: 183 KGVIDPAIIADEIRKQGVPIIAIGIGKEINKNE----LIKIGGGEANTYSADDFEKLKES 238 >gi|167951278|ref|ZP_02538352.1| von Willebrand factor, type A [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 269 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 67/203 (33%), Gaps = 23/203 (11%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P P VD S H K L++ +L + RS+ D V+ V GA+ V Sbjct: 77 SPKCPPPIWVFLVDVSGSMHSPDKLPLLKRSLRLLSRSLDADDRVSLVVYAGASG---VV 133 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R ++ + GST ++ AY + E + N Sbjct: 134 LEPTP------GNKRATIEQALQQLSAGGSTNGGAGIRLAYAK---AREAFIEGGINR-- 182 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCA 423 ++L TDG+ A+ + K Q GI + T+ F Sbjct: 183 ----VILATDGDFNVGTVNHQALIDLIKQQRQAGIALTTLGFGGGNYNDHLMEQLADQ-- 236 Query: 424 SPNSFFEANSTHELNKIFRDRIG 446 ++ +S E + +R G Sbjct: 237 GDGNYAYIDSLMEARQGVGERAG 259 >gi|327490425|gb|EGF22209.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1058] Length = 462 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 25/125 (20%), Positives = 47/125 (37%), Gaps = 21/125 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIV 324 + ++R +I+ + +I N++ + F+ + + G + +I TI Sbjct: 226 SRMDILRKKSVIMIKDLAEIGNIS----VNLVGFSTSAKYIQQNFSNLDNGTNTIIATIN 281 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K + N G T D ++ ++ S KYIVLLTDG Sbjct: 282 K--RENLNPDGVTNPGDGLRYGMISLQSQPAQL-----------KYIVLLTDGIPNAYLV 328 Query: 385 EGIAI 389 + A+ Sbjct: 329 DSRAL 333 >gi|221111394|ref|XP_002160866.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 419 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 46/123 (37%), Gaps = 10/123 (8%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ R + + + +T I+ A++ A + +S Sbjct: 55 RVGVITFSYRAELSVKL--NSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTSA- 111 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+LLTDG T D E+ I ++ ++ G+ I+ + + E Sbjct: 112 -----NGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETE 166 Query: 414 KAR 416 + Sbjct: 167 LSH 169 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 46/123 (37%), Gaps = 10/123 (8%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ R + + + +T I+ A++ A + +S Sbjct: 271 RVGVITFSYRAELSVKL--NSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTSA- 327 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+LLTDG T D E+ I ++ ++ G+ I+ + + E Sbjct: 328 -----NGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETE 382 Query: 414 KAR 416 + Sbjct: 383 LSH 385 >gi|332882611|ref|ZP_08450223.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332679411|gb|EGJ52396.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 547 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 73/255 (28%), Gaps = 33/255 (12%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 M++Y P P + +P + L Sbjct: 129 RIEEMVNYFDYDYPAPEEGSASPLRV----------SPELAPAPWNPNHLLLRIGLQAKK 178 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P + +D S K L++ + ++ ++ D V Sbjct: 179 IDLAKAPPSNIVFLIDVSGSMDEENKLPLLQSSFKMLLGQLRPDDKVAIVTYAN----GT 234 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 +V + K+I+ + + G T+ +Q AY+ S +KN Sbjct: 235 KVALPSTSVKDKEKIIKVLDNLY-----ASGGTSGGKGIQLAYEQAQKS------FIKNG 283 Query: 365 LEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 I+L TDG+ + + K + GI + + F + + + A Sbjct: 284 NNR---IILATDGDFNIGINNTTDLEKFIEKQRESGIYMSVLGFGMGNYRDDMAETIADK 340 Query: 422 CASPNSFFEANSTHE 436 ++ ++ E Sbjct: 341 --GNGNYAYIDNITE 353 >gi|281420094|ref|ZP_06251093.1| BatB protein [Prevotella copri DSM 18205] gi|281405894|gb|EFB36574.1| BatB protein [Prevotella copri DSM 18205] Length = 345 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 49/183 (26%), Gaps = 36/183 (19%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 +DN N+ ++G F + + G T I A+ Sbjct: 121 NLVDNFNND-KIGLIVFAGDAFVQLPITTDYVSAKMFLQNITPGLIQTQG-TNIGAAIDL 178 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A + + I+++TDGEN + + A A +GI + + Sbjct: 179 ASKSFTQQ-----------ENVGRAIIVITDGENHEPGAQEAAA--AANKKGINVFILGI 225 Query: 406 SVNKTQQ-----------EKARYFL-----SNC-----ASPNSFFEANSTHELNKIFRDR 444 K + C A + ++T + + D Sbjct: 226 GNTKGAPIPMGDGSYLKDNAGNTVMTALNEQMCKELAQAGKGQYIHVDNTSDAERALNDD 285 Query: 445 IGN 447 I Sbjct: 286 IAK 288 >gi|123232279|emb|CAM16354.1| novel protein similar to vertebrate inter-alpha (globulin) inhibitor H family (plasma Kallikrein-sensitive glycoprotein) (ITIH) [Danio rerio] Length = 860 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 68/425 (16%), Positives = 134/425 (31%), Gaps = 45/425 (10%) Query: 51 HALKQAAQTAI---ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT 107 + L A + A +++S + + + + + + + Sbjct: 71 NQLHIAKEAAFEVDLSSSAFISNFTITSNHKVYVAQVRRRTDARKIYDNAKKQGKTAGLV 130 Query: 108 DREVRDIVR-DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM-----GIKSWLIQTK 161 + R++ + AV + P +V S Y+ LL+ G + + Sbjct: 131 ATKEREMEKFRVAVNVPPGA---RVSFSLSYEELLSRRLGRYELSLGLRPGQPVQNLSLE 187 Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 + V ++ID Q C +++ + + K Sbjct: 188 VSISERTGISFIRALVLFLFLIDLLADAEAPPSTKVKQNAYCAHVRYTPSIQQQRNVSPK 247 Query: 222 VGIRDEKLSPYMVSC----------NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 G+ + + Y V + + P L + FV S + K Sbjct: 248 -GLSADFIIQYDVELKDPMGDIQVDDGYFVHYFAPRGLPVVPKDVIFVIDISGSMIGTKI 306 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI------VK 325 + A+ S++ +++ D N F+D V +W + +R K Sbjct: 307 KQTKAAMVSILSDLREGDYFN------LITFSDDVH-----TWKKDRTVRATRQNVRDAK 355 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDN 383 F G T IN A+ +A +++ + +++ I+ LTDGE T Sbjct: 356 EFVRKIIAAGWTNINAALLSA-AKLLNPSTRSSSSTGRAPSSQRVPMIIFLTDGEATIGE 414 Query: 384 EEGIAICNKA-KSQG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 E I + A KS G + + +AF + R L N +E + K F Sbjct: 415 TETDVILHNAQKSLGLVSLFGLAFGDDADFPMLRRLALENRGVARMVYEDDDAAIQLKGF 474 Query: 442 RDRIG 446 D + Sbjct: 475 YDEVA 479 >gi|51597046|ref|YP_071237.1| hypothetical protein YPTB2727 [Yersinia pseudotuberculosis IP 32953] gi|51590328|emb|CAH21965.1| putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 472 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 62/186 (33%), Gaps = 27/186 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + R+A + + D ++ +++ + K ++ + Sbjct: 112 RIEKAREAAILAVNMLNTTDTLS------VVAYDNHAEVIIPATKVTDKPA--LIASIQQ 163 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G TA+ + + + + H + + I+L++DG+ + Sbjct: 164 HIHPRGMTALFAGVS------MGIGQVDKHLNREQVNR---IILISDGQANTGPTSISEL 214 Query: 390 CNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + A+ +GI I TI + + ++ A S + ++ +L K F Sbjct: 215 SDLARMAAKKGIAITTIGLGQDYNE-----DLMTAIAGYSDGNHTFVANSADLEKAFTKE 269 Query: 445 IGNEIF 450 + + Sbjct: 270 FQDVMS 275 >gi|239814248|ref|YP_002943158.1| von Willebrand factor type A [Variovorax paradoxus S110] gi|239800825|gb|ACS17892.1| von Willebrand factor type A [Variovorax paradoxus S110] Length = 345 Score = 38.8 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 33/84 (39%), Gaps = 13/84 (15%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIM----------TIAFSVNKTQQEKARYFLS 420 +++LTDG+ T + + A +G+RI TI F + L Sbjct: 221 VIMLTDGQRTTG-VDPLDAAKAAADRGVRIYTVGVGTVDGETIGFEGWSMRVRLDEETLK 279 Query: 421 NCA--SPNSFFEANSTHELNKIFR 442 A + +F A + +L K++ Sbjct: 280 AVANKTQAEYFYAGTAADLKKVYE 303 >gi|257897779|ref|ZP_05677432.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835691|gb|EEV60765.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 819 Score = 38.8 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 61/198 (30%), Gaps = 21/198 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + Q + LN + SY G+ D + + + LY Sbjct: 241 HQGNRQGQQQWDGLNGWDGNPTNRNNSYIEYGGEKEDADYAIRKFAKETATPGLFDLYLN 300 Query: 248 PLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + VD S + + V+ + + ++ + D + MG Sbjct: 301 VRGNTQKNITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVG 359 Query: 302 FNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ S + G ++ VK+ G T A++ A + + N Sbjct: 360 YSIEGYSYSNGAVQMGSFDSVKNQVKSI-TPSWTNGGTFTQKALRDAGNMLSVPNGH--- 415 Query: 360 RMKNNLEAKKYIVLLTDG 377 KK IVLLTDG Sbjct: 416 --------KKVIVLLTDG 425 >gi|254229333|ref|ZP_04922750.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262393419|ref|YP_003285273.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] gi|151938141|gb|EDN56982.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262337013|gb|ACY50808.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] Length = 422 Score = 38.4 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 32/95 (33%), Gaps = 14/95 (14%) Query: 35 GVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 GV +D+ + L+ A TA + +V + E+V+ E Sbjct: 30 GVAAFGIDLNHQVLNKTRLQNAVDTAALAGAV-VADKTEDVNQ-------------AETA 75 Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 +R + + E+ +TAV + + Sbjct: 76 VRTTLGGISTEPGNSELTFSDSNTAVTFSHDMQTF 110 >gi|82703475|ref|YP_413041.1| hypothetical protein Nmul_A2358 [Nitrosospira multiformis ATCC 25196] gi|82411540|gb|ABB75649.1| putative membrane protein [Nitrosospira multiformis ATCC 25196] Length = 437 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 10/41 (24%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G II AL + V++G G+ +D+ + + L+ + Sbjct: 22 RQEKGVVAIIVALSLVVLVGFAGLALDLGKLYVAKSELQNS 62 >gi|302391027|ref|YP_003826847.1| hypothetical protein Acear_0232 [Acetohalobium arabaticum DSM 5501] gi|302203104|gb|ADL11782.1| Protein of unknown function DUF2134, membrane [Acetohalobium arabaticum DSM 5501] Length = 307 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 49/162 (30%), Gaps = 32/162 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 LI S G ++ AL+M V + +++D+ L A + L + Sbjct: 3 LINSQKGTVIVVVALMMTVFISFLALVIDIGSLYLERIRLVNTLDAAALAGVQDLPDDSQ 62 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + A +Y RN +N V + +Q+ L Sbjct: 63 QAETVAL-----------DYASRNGLDN-----------------NVTVEITDDDHQIGL 94 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 S + +N F GI + ++A + Sbjct: 95 SGSKQVGMN----FAVIFGIDQVEVAASSKARVGHVTAVTGA 132 >gi|239817420|ref|YP_002946330.1| hypothetical protein Vapar_4453 [Variovorax paradoxus S110] gi|239803997|gb|ACS21064.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 589 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 55/163 (33%), Gaps = 6/163 (3%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G + TA+ + +++ + ++ Y + L++A A + + + + S++ Sbjct: 19 GSILVNTAIALSLIVITL-IGTELGYLFYMKRELQKATDLAALAGAKEISYAGSCPSAKT 77 Query: 80 KNSFTFPKQKIEEYLIRNFEN--NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + RN +L+ + D + T+ + Q + R Sbjct: 78 AAKLS-ANGTGSTDRNRNLPISFSLEDAEIECGQWDPAKTTSDHFDSAPPDQQ--NAIRI 134 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 L P +L G ++ + A + ++ S+ Sbjct: 135 TLNRTPATLLSFFEGNRTIQTKAVATNDPIAAFSIGTGVASLD 177 >gi|332664650|ref|YP_004447438.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333464|gb|AEE50565.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 345 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 45/116 (38%), Gaps = 15/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + ++++ D T+I +A+ A SN Sbjct: 130 RLGVILFAGEAYMQVPLTTD-YEAVSLLLQSANPDMISSQGTSIGEALAIAQTNTSKSNG 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + ++++TDGE+ + E A +A G++I TI + Sbjct: 189 N------------RVVLVITDGEDHEARAEAQAR--QAARAGMKIFTIGIGSEEGG 230 >gi|262184150|ref|ZP_06043571.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 500 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 48/361 (13%), Positives = 102/361 (28%), Gaps = 35/361 (9%) Query: 31 PVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKI 90 P+ LGV + W+ + ++ A + I S S+ + + F Sbjct: 101 PIALGVQSSVAQAQGWTSKQPTWQEIADSGITFGMTDPSTSNSGFSALSAATTAFADTG- 159 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 +K+ + + S + + + Sbjct: 160 --------RALTEKDIKQSAGKVQKLFGNQTLTSGSSGWLADRFREHPEQADA------I 205 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 +S L Q K E + + +S + + S D +++ Q L + Sbjct: 206 FNYESVLYQLKDEGADLEVVIPSDGVISADYPLSSLASSSDKDTEAKVQALAEWLAERPD 265 Query: 211 TVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + S+ + PY + + V +S + Sbjct: 266 KLTSFHLRVDNSDLPGTVFELPYPANEQTVDALEAAFAHELRNPGNTALVLDTSGSMEGE 325 Query: 270 KKHLVRDALASVIR-SIKKIDNVNDTV------RMGATFFNDRVISDPSFSWGVHK-LIR 321 + L++ +L +I S + + V ++ ++ K Sbjct: 326 RMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYSSEPQQPTRARVDKDKPATT 385 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + G TA +A+ A+D + +S D +VL+TDGE T+ Sbjct: 386 KELADRVERLVADGDTATFEAVLNAFDEVDTSGGDIG-----------TVVLMTDGEVTR 434 Query: 382 D 382 Sbjct: 435 G 435 >gi|154089852|emb|CAO81741.1| collagen type VI alpha 6 [Homo sapiens] Length = 840 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 48/146 (32%), Gaps = 15/146 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D + +R+G +++ S S G++K + V + + A Sbjct: 249 GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGINK---SEVLQHIQNLSPRTGKA 305 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A + + + N + VL+T + + + +G+ Sbjct: 306 YTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVT---HRDSEDNVTKAAVNLRREGV 359 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS 424 I T+ Q L AS Sbjct: 360 TIFTLGIEGASDTQ------LEKIAS 379 >gi|221109526|ref|XP_002169886.1| PREDICTED: similar to coagulation factor C homolog, cochlin [Hydra magnipapillata] Length = 336 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 46/123 (37%), Gaps = 10/123 (8%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ R + + + +T I+ A++ A + +S Sbjct: 55 RVGVITFSYRAELSVKL--NSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTSA- 111 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+LLTDG T D E+ I ++ ++ G+ I+ + + E Sbjct: 112 -----NGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETE 166 Query: 414 KAR 416 + Sbjct: 167 LSH 169 >gi|153950207|ref|YP_001400285.1| von Willebrand factor type A domain-containing protein [Yersinia pseudotuberculosis IP 31758] gi|152961702|gb|ABS49163.1| von Willebrand factor type A domain protein [Yersinia pseudotuberculosis IP 31758] Length = 460 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 62/186 (33%), Gaps = 27/186 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + R+A + + D ++ +++ + K ++ + Sbjct: 100 RIEKAREAAILAVNMLNTTDTLS------VVAYDNHAEVIIPATKVTDKPA--LIASIQQ 151 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G TA+ + + + + H + + I+L++DG+ + Sbjct: 152 HIHPRGMTALFAGVS------MGIGQVDKHLNREQVNR---IILISDGQANTGPTSISEL 202 Query: 390 CNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + A+ +GI I TI + + ++ A S + ++ +L K F Sbjct: 203 SDLARMAAKKGIAITTIGLGQDYNE-----DLMTAIAGYSDGNHTFVANSADLEKAFTKE 257 Query: 445 IGNEIF 450 + + Sbjct: 258 FQDVMS 263 >gi|221111396|ref|XP_002160892.1| PREDICTED: similar to proximal thread matrix protein 1 [Hydra magnipapillata] Length = 315 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 46/123 (37%), Gaps = 10/123 (8%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ R + + + +T I+ A++ A + +S Sbjct: 165 RVGVITFSYRAELSVKL--NSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTSA- 221 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+LLTDG T D E+ I ++ ++ G+ I+ + + E Sbjct: 222 -----NGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETE 276 Query: 414 KAR 416 + Sbjct: 277 LSH 279 >gi|126282074|ref|XP_001368568.1| PREDICTED: similar to Coch-5B2 gene product [Monodelphis domestica] Length = 549 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 39/111 (35%), Gaps = 13/111 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++ A F P FS+ + ++ G TA DA+ + Sbjct: 404 KIAAVQFT--YDQRPEFSFTDYTTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIR 461 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 D K ++V++TDG Q ++ A A GI + +I + Sbjct: 462 DG--------PNKNFLVIVTDG---QSYDDVRAPAAAAHKAGITVYSIGVA 501 >gi|328951307|ref|YP_004368642.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451631|gb|AEB12532.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 744 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 68/170 (40%), Gaps = 18/170 (10%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A+A + ++ + R+G F+ + + KT Sbjct: 354 LALAVAGALSLVETARPED---RLGIVTFSSGPRWLFPPRPMTAR-GKLEAKTLLDRLRP 409 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 GST + +A + A + + + K I++LTDG+ +D +A+ A Sbjct: 410 GGSTRMLEAYRQAIEALEAL-----------ELETKQILVLTDGQVEEDPAALVALAEAA 458 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++QGIR ++A + + AR +S F++ + +L ++F + Sbjct: 459 RAQGIRTNSVALGGDADRALLAR--MSR-VGEGRFWDVPTPEDLPRLFLE 505 >gi|51891532|ref|YP_074223.1| hypothetical protein STH394 [Symbiobacterium thermophilum IAM 14863] gi|51855221|dbj|BAD39379.1| hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 252 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 9/55 (16%), Positives = 17/55 (30%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + G + + V + G +DV R L+ A + A + Sbjct: 3 RFRTEQRGSTGALFVFIWAVAVLAIGFALDVGRVFVLREQLRTAEEAAALAGVRQ 57 >gi|260810653|ref|XP_002600071.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] gi|229285356|gb|EEN56083.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] Length = 1096 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 46/118 (38%), Gaps = 7/118 (5%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T I A+ A + + +D R + + ++L+TDG + + N I + + Sbjct: 247 GTNITRAVTLAVQILGPAVQD---RKLGDSTGPRQMILITDGRDRRLNNSVIFMLQNDTA 303 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 +G+ I TIA E+ LS + FF + + D + + + V Sbjct: 304 KGVVIDTIALG---DGAEEGLPLLSE-VTGGQFFFSPDSDAGGSALDDALTATVLKYV 357 >gi|258627123|ref|ZP_05721919.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] gi|258580641|gb|EEW05594.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] Length = 335 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 65/199 (32%), Gaps = 26/199 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S E ++ + + + L + + ++ D R G F D Sbjct: 103 DLSGSMEEKDFATESGEQLSRLTAAKKVLRNFVTQ-RQGD------RFGLILFGDAAFIQ 155 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + E M T + DA+ S Sbjct: 156 TPFT----ADQNVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQSPSTSQ--------- 202 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCAS- 424 + +++LTDG +T + A ++GIRI IA + +Q ++ +S Sbjct: 203 DQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVSSL 262 Query: 425 -PNSFFEANSTHELNKIFR 442 F A +LN+ ++ Sbjct: 263 TQARSFVAIDQPQLNEAYQ 281 >gi|197118197|ref|YP_002138624.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087557|gb|ACH38828.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 318 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 49/136 (36%), Gaps = 21/136 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A+ + +K R+G F + L ++K Sbjct: 101 SRLAAAKEAVRQAMAGLKGE-------RVGLVVFAGSAFLVCPLTTDYA-LFDQVLKEAG 152 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + T++ A++ A + ++ K +VLL+DGE+ + +A Sbjct: 153 EETLPLPGTSLAAALKEARRALQGEGDEP-----------KVVVLLSDGEDHEGEY--VA 199 Query: 389 ICNKAKSQGIRIMTIA 404 + G+++ +A Sbjct: 200 AARALNAAGVKLYAVA 215 >gi|17533687|ref|NP_496745.1| C-type LECtin family member (clec-65) [Caenorhabditis elegans] gi|3876683|emb|CAB03057.1| C. elegans protein F35C5.8, confirmed by transcript evidence [Caenorhabditis elegans] Length = 372 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 56/168 (33%), Gaps = 20/168 (11%) Query: 284 SIKKIDNVNDTVRMGATFFNDRV----ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 I + + R+G +N SW L I + ++ ++ + Sbjct: 62 QIGTGYDDPRSTRVGFITYNWNATDVADFYKLQSW--ADLNSQIQRLQYTPQSSSPASRM 119 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 + + A I ++ KK +++ T + + + + KS+GI Sbjct: 120 DTGLNAAIGMIDATAGFR-------DNYKKIVIVFTSVHGSYKSNQPRDVSKILKSRGIP 172 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS---THELNKIFRDR 444 ++T+ N + +L AS N F T E+ K D Sbjct: 173 VVTV----NTGSSSDTQAYLKQIASDNMSFAIADGNVTQEILKAMTDT 216 >gi|22125371|ref|NP_668794.1| hypothetical protein y1474 [Yersinia pestis KIM 10] gi|45442407|ref|NP_993946.1| hypothetical protein YP_2631 [Yersinia pestis biovar Microtus str. 91001] gi|149365130|ref|ZP_01887165.1| putative membrane protein [Yersinia pestis CA88-4125] gi|218930054|ref|YP_002347929.1| hypothetical protein YPO3007 [Yersinia pestis CO92] gi|21958254|gb|AAM85045.1|AE013750_5 hypothetical [Yersinia pestis KIM 10] gi|45437272|gb|AAS62823.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|115348665|emb|CAL21610.1| putative membrane protein [Yersinia pestis CO92] gi|149291543|gb|EDM41617.1| putative membrane protein [Yersinia pestis CA88-4125] Length = 509 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 33/85 (38%), Gaps = 10/85 (11%) Query: 371 IVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SP 425 I+L++DG+ + + A+ +GI I TI + + ++ A S Sbjct: 233 IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNE-----DLMTAIAGYSD 287 Query: 426 NSFFEANSTHELNKIFRDRIGNEIF 450 + ++ +L K F + + Sbjct: 288 GNHTFVANSADLEKAFTKEFQDVMS 312 >gi|253701050|ref|YP_003022239.1| von Willebrand factor A [Geobacter sp. M21] gi|251775900|gb|ACT18481.1| von Willebrand factor type A [Geobacter sp. M21] Length = 315 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 45/144 (31%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A+ +K R+G F + L ++K Sbjct: 100 SRLDAAKEAVRKATAGLKGE-------RVGLVAFAGSAFLVCPLTTDYA-LFDQVLKEAG 151 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + T++ A++ A + + K +VLL+DGE+ + A Sbjct: 152 EQTLPLPGTSLAAALKEARRALQGEGGEP-----------KVVVLLSDGEDHEGEYAAAA 200 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 + G R+ +A + Sbjct: 201 R--ALDAAGARLYAVAAGTSPGGP 222 >gi|194211147|ref|XP_001917810.1| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 909 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 49/132 (37%), Gaps = 22/132 (16%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T+I +++A++ I + IVLLTDGE+ Sbjct: 376 PTEAVGGTSICAGIKSAFEVITDMY---------SQIDGSEIVLLTDGEDNTAGS----- 421 Query: 390 C-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--LNKIFRDRIG 446 C ++ K G I IA + ++A +S + ++ L F + Sbjct: 422 CVDEVKQSGAIIHFIALGPSA---DQAVIEMSTITGGKHKYASDEAANNGLIDAFAALVS 478 Query: 447 --NEIFERVIRI 456 ++ ++ +++ Sbjct: 479 GNADLSQQSLQL 490 >gi|89098949|ref|ZP_01171829.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] gi|89086353|gb|EAR65474.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] Length = 940 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 59/175 (33%), Gaps = 22/175 (12%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + K+ S K + + S K P+D + + +K Sbjct: 40 SAKASQSVIVKPQNSNAEGSIDFHLTPKGKATNANRDPIDVVFVFDKSGSMNDSGKNPQK 99 Query: 271 KHLVRDALASVIRSIKK-IDNVNDTVRMGATFFNDRVIS----DPSFSWGVHKLIRTIVK 325 +DA+ + + K+ + R G F+D V + + + + L ++ Sbjct: 100 FQSAKDAMTAAVNFFKENAGPND---RFGFVPFDDDVETGKVVNFAPENNMASL--NLIN 154 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + + + +G T ++ A M N KY++ +TDGE T Sbjct: 155 SNSNSLSALGGTNYTQSLDAAL------------GMFGNSTNNKYVLFMTDGEPT 197 >gi|237714981|ref|ZP_04545462.1| two-component system sensor histidine kinase [Bacteroides sp. D1] gi|262409069|ref|ZP_06085614.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|229444814|gb|EEO50605.1| two-component system sensor histidine kinase [Bacteroides sp. D1] gi|262353280|gb|EEZ02375.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] Length = 869 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 41/290 (14%), Positives = 99/290 (34%), Gaps = 22/290 (7%) Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFT-DREVRDIVRDTAVEM-----NPRKSAYQVVLSS 135 +F K E ++ K N ++ + T + PR S ++V++ Sbjct: 170 DNSFLSNKGLEDFEEEWKIFQKDNPDYRMKIYNTQNHTTSHIIAAICYPRNSYERLVVAP 229 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ-- 193 ++ L+ + ++ + + V +Y + S + S+L Sbjct: 230 KWSPFLSFVGKNSKAPVFSTQNV---GLTNGVFSAYDADSYTSASLAAQRAASVLKGTSP 286 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 RD + + + + KV ++ K L+ +LYP L + Sbjct: 287 RDIGVTEITQGFIFDYKQLDFFHIDPDKVSSSGTIVNEPYWEKYKYLFILLYPSILALLI 346 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDAL--ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + ++ ++ R ++ L ++ + DNV ++R G ++ + Sbjct: 347 ASIVWLMRANRRESKRRIQAQTRLLVQNKLVEQRNEFDNVFHSIRDGVITYDTDLH---- 402 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 +H R++++ + G M + I + +D +H+M Sbjct: 403 ----IHFTNRSLLQMLHLPYEA-GGRFYEGMMAGSIFKIYYNGQDILHKM 447 >gi|157961836|ref|YP_001501870.1| hypothetical protein Spea_2015 [Shewanella pealeana ATCC 700345] gi|157846836|gb|ABV87335.1| conserved hypothetical protein [Shewanella pealeana ATCC 700345] Length = 445 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 64/207 (30%), Gaps = 2/207 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ +G ++ + + +L + +D + L+ A A + A+ L Q Sbjct: 15 KRNQSGAILVMFTIGLFSILAFAALALDGGHILLSKGRLQNAVDAAALNAAKELQQGATL 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-VVL 133 +R ++T Q + N++ + D + VE + + + +++ Sbjct: 75 FEARQA-AYTILLQNLGFSENGELNNSISLSSPDFNTTQVTARLHVEFSEQPDPFNPILV 133 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + ++ L + K A V+ V+ + Sbjct: 134 EGSEYVRVRVENVNLSNFLADILNFDKKIRASAVAGRSQDLLCVNRVAPLLVCGDENSTA 193 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNG 220 D+ G + + S + Sbjct: 194 EDNYGLSEDLYLMKIGANQPSANGAGN 220 >gi|125719088|ref|YP_001036221.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain-containing protein [Streptococcus sanguinis SK36] gi|125499005|gb|ABN45671.1| Nitric oxide reductase NorD / Von Willebrand factor type A (vWA) domain protein, putative [Streptococcus sanguinis SK36] gi|324989618|gb|EGC21563.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK353] Length = 444 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 57/171 (33%), Gaps = 23/171 (13%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 +N + + ++ + S+ Y L S + +D+ + R + ++++ Sbjct: 187 RNDPIEGQVNTAVSFVFDTSGSMAYGLRNEGKRNSQGKWGPLDADNPR---ARMNILKKK 243 Query: 278 LASVIRSIKKIDNVN-DTVR-MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 ++ +K+I NV+ + VR G+ + K+ I Sbjct: 244 ANLLVDDLKEIGNVSVNLVRFSGSASY--IQEDFVELDKDTGKIKEKIKSLPTSWI---- 297 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 T D ++ ++ + KY+VLLTDG Sbjct: 298 -TNPGDGLRYGLVSLQRNPAQL-----------KYVVLLTDGIPNAYTGSP 336 >gi|1098898|gb|AAC46961.1| CTRP [Plasmodium falciparum] Length = 2098 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 38/300 (12%), Positives = 84/300 (28%), Gaps = 24/300 (8%) Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 ++ L +G+ + + + + +W + S S + ++ + + Sbjct: 704 ENVKLLVVGVSTASENKLKMLVGCAPNVVCPFVIKTEWGLLKSVSEVFVKKICDNGVVLP 763 Query: 204 FGQPADRTVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 G P++ T S S S G N S P P E S Sbjct: 764 PGSPSESTPGSPSESTPGNPSESTPGSPSESTPGNPSESTPGSPSESTPGSPSESTPCSG 823 Query: 263 SLRHVIKKKHLV---------------RDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + L ++ + I ++ V MG F+D++ Sbjct: 824 TECLCHNTYDLTLIIDESASIGYSNWEKEVVPFTIGLASNLEISEKKVNMGILLFSDKIR 883 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + +++ + + + + L Sbjct: 884 EFIK----YGQKESYDKNNLVRRIHDLKKYYKSGGFSYIVEALKYGLYSYAKSTSSRLNV 939 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 K +LLTDG NT ++ + + K + ++++ I + L C + Sbjct: 940 PKVNILLTDGNNTDTSDFILTEVSSLYKKENVKLLLIGIGGPTIHK---LRLLGGCDKSD 996 >gi|325285570|ref|YP_004261360.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321024|gb|ADY28489.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 235 Score = 38.4 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 44/141 (31%), Gaps = 22/141 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V+ +I +IK + + + FN+ V + RT F Sbjct: 96 TKLGAVK---RKLIPAIKGLPDGKKFL---VFSFNNNVTKQATEFRVASNTTRTSSNIFV 149 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T + + A T IVL++DG + + Sbjct: 150 QNLKASGGTNTLEGLLEALSTADVQE----------------IVLMSDGLPNSGPKAVLE 193 Query: 389 ICNKAKSQGIRIMTIAFSVNK 409 K + I I TIAF + Sbjct: 194 EIKKVNTSNIIIHTIAFGEDA 214 >gi|221193689|gb|ACM07876.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 896 Score = 38.4 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 43/312 (13%), Positives = 101/312 (32%), Gaps = 46/312 (14%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +++ + ++ ++Y + + K+++ V++ + + +E K+ Sbjct: 121 TIQNSDDKKSIIEQRQEELDKQYPLTGAYEDTKESYNLEHVKNSIPNGKLE---AKAVNP 177 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWL-----IQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + +L R + I+ +++ ++ +K+ + + +V+D Sbjct: 178 YSSEGEHIREIPEGTLSKRISEVNDLDHNKYKIELTVSGKSIIKTINKDEPLDVVFVLDN 237 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SM + ++++ + + + +K N + + ++ Sbjct: 238 SNSMKNNGKNNKAKKAG---EAVETIIKDVLGANVENRAALVTYGSDIFDGRTVK--VIK 292 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 DP E + + KK + A +I+ I K Sbjct: 293 GFKEDPYHGLETSFTVQTNDYSYKKFT---NIAADIIKKIPKEAPEAK------------ 337 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 WG L T K D +++G T A A + S Sbjct: 338 --------WGGTSLGLTPEKKREYDLSKVGETFTMKAFMEADTLLSSIQRKS-------- 381 Query: 366 EAKKYIVLLTDG 377 +K IV LTDG Sbjct: 382 --RKIIVHLTDG 391 >gi|149176866|ref|ZP_01855476.1| BatB [Planctomyces maris DSM 8797] gi|148844303|gb|EDL58656.1| BatB [Planctomyces maris DSM 8797] Length = 798 Score = 38.4 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 43/127 (33%), Gaps = 19/127 (14%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE--MGSTAIN 340 + IK + + R+G F A+ + G + + Sbjct: 118 QQIKDMVDEMSGDRVGLVVFAGETRQSVPL---TSHYEDFKQSLDAVGPHSVRRGGSLLG 174 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIR 399 DA+++A I D K IV+ TDGE+ + + + +A GIR Sbjct: 175 DAIRSATAGFIDKTNDH-----------KAIVVFTDGEDQES--KPVEAAKEAFTKNGIR 221 Query: 400 IMTIAFS 406 I T+ Sbjct: 222 IFTVGLG 228 >gi|221067364|ref|ZP_03543469.1| conserved hypothetical protein [Comamonas testosteroni KF-1] gi|220712387|gb|EED67755.1| conserved hypothetical protein [Comamonas testosteroni KF-1] Length = 419 Score = 38.4 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 39/149 (26%), Gaps = 3/149 (2%) Query: 35 GVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 G G+ +D R L+ A + + A+ L + + +S Sbjct: 43 GFMGIALDFGRLFIVRTELQTALDSCALAAARELNKQPDAISRAVSAGAAAGNLNGVNLQ 102 Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK 154 N+ + D RD + + S + L Sbjct: 103 SANWSGQGQITAADISFRDASYLSTTSATAAVYSQCTHTQSNIGMWLLKALGAFSGNSAS 162 Query: 155 ---SWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + T A A S V++Q Sbjct: 163 FPATGNVGTYAVATRASAQSACPIPVALQ 191 >gi|262380414|ref|ZP_06073568.1| tfp pilus assembly protein [Acinetobacter radioresistens SH164] gi|262297860|gb|EEY85775.1| tfp pilus assembly protein [Acinetobacter radioresistens SH164] Length = 1264 Score = 38.0 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 64/214 (29%), Gaps = 19/214 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 +V+ ++Y ++ L L L M ++ + SI ++D S SM Sbjct: 1 MVMKTKYQIMALSLILLLSGMNWS-------HASDLTIYQGNSSGQTSIFMMLDTSGSMG 53 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + + + D +P S+ +Y G + Sbjct: 54 WGAGYGGSSMSLQNDYGVCVVKNNGKGGSDSIDSEDSTTTPSYKRYYCSVTKSIYDGLDN 113 Query: 251 PSLSEEHFVDSSSLRHV--IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 P S + + ++DA+ +++ S +++GA +++ Sbjct: 114 PYKQRVRNDCSVNGTGYKCYDRLTRLKDAMFTLLNSRAL-----SGIKLGAGYYSYEGEG 168 Query: 309 DPSFSWGVHKLIR-----TIVKTFAIDENEMGST 337 K + T +K F G T Sbjct: 169 QKGIVSIPAKPMSDPTHVTDLKNFVAGLKATGGT 202 >gi|319952927|ref|YP_004164194.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421587|gb|ADV48696.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 212 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 55/171 (32%), Gaps = 17/171 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +I S+++ ++ + F+ + + D Sbjct: 21 IEAVKNGVQVMISSLRQNPQAIESAFLSIITFDSSARQIVPLT--------DLASFQMPD 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A++ I + K + + +I +TDG T D ++G++ Sbjct: 73 IQATGTTSLGEALELVSTCIDNEVASTTSESKGDWKPLVFI--MTDGIPTDDMQKGLS-- 128 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 K + T L + + ++K F Sbjct: 129 ELKKRR-----TAYIVACAAGNGADSTLLKQITENVVSLDTADSQSISKFF 174 >gi|154492260|ref|ZP_02031886.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] gi|154087485|gb|EDN86530.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] Length = 339 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 54/172 (31%), Gaps = 36/172 (20%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + + + + TAI A+ A + Sbjct: 130 KVGLIVFAGDAFTQLPITSDYVSA-KMFLSSINPSMVSTQGTAIGAAINLAMRSFT---- 184 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 + + K I+L+TDGEN +D+ + A +GI + + K Sbjct: 185 -------PSETSDKAIILITDGENHEDDA--VKAAAAAAEKGIHVNIVGMGDPKGSPIPV 235 Query: 413 EKARYFLS-------------------NCASPNSFFEANSTHELNKIFRDRI 445 + + ++ A ++ A++T+ + + I Sbjct: 236 DGSNNYMKDKDGNVVITKLNEEMCQEIAAAGHGTYVRADNTNSALRALQKEI 287 >gi|21233895|ref|NP_640193.1| hypothetical protein Rts1_232 [Proteus vulgaris] gi|21203079|dbj|BAB93795.1| hypothetical protein [Proteus vulgaris] Length = 270 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 50/136 (36%), Gaps = 16/136 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 A+ S++ ++ + V A F + + +R + Sbjct: 111 ITEAEAAVVSLLYALDNLPG----VTTSAYHFPHTTRNSVGKLKDRKQTLRQAIAANHFG 166 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + GST ++ A+ A + D ++ +++ TDGE E+ I + Sbjct: 167 IHTTGSTPLSGALWPAIVDLAVEKAD-----------QRILIVCTDGE-PDSKEDVIQMI 214 Query: 391 NKAKSQGIRIMTIAFS 406 N AKS G+ ++ I F Sbjct: 215 NDAKSDGMVVIGIGFG 230 >gi|87200511|ref|YP_497768.1| TadE-like [Novosphingobium aromaticivorans DSM 12444] gi|87136192|gb|ABD26934.1| TadE-like protein [Novosphingobium aromaticivorans DSM 12444] Length = 193 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 35/106 (33%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL+ G I AL++PV L +D+ + Y L A + A +A++ + Sbjct: 6 KLVAEDCGVTTIEFALVLPVFLLAIVGCLDLGQMVYAVGVLDGAVEKAARSAALETGDTT 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 + +++ N N +R T Sbjct: 66 AADAEVEDVMSRILPGSTLATSRKSYANYSDINRPERWNDADNNGT 111 >gi|218517234|ref|ZP_03514074.1| hypothetical protein Retl8_28685 [Rhizobium etli 8C-3] Length = 176 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 55/171 (32%), Gaps = 33/171 (19%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEE 92 ML G D +R ++ A+I +V I + + + + ++E Sbjct: 3 MLVAVGASFDYIRSYNVRQKMQSDLDAALIA-AVKQINNTGDTDALKLKVTDWFHAQVEN 61 Query: 93 YLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 T E++ + + + ++ + + F++ Sbjct: 62 -----------------------SYTLGEIDIDTTNHNITATASGTVP----TTFMKIAN 94 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 I + + + + + SY +++ VID S SML + + Sbjct: 95 IDTVPVSVASAVKGPATSY-----LNVYIVIDTSPSMLLAATTAGQATMYS 140 >gi|241842676|ref|XP_002415420.1| CUB and sushi multiple domains (CSMD) protein, putative [Ixodes scapularis] gi|215509632|gb|EEC19085.1| CUB and sushi multiple domains (CSMD) protein, putative [Ixodes scapularis] Length = 609 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 21/103 (20%), Positives = 39/103 (37%), Gaps = 3/103 (2%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ + +KI+ +D R+GA FN+ S + E Sbjct: 456 LKTGIGLAKAITRKINVTDDGHRVGAVRFNNTAKLMLSPMNVKSTDAALLALDAVATEKT 515 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 G TAI +A++ ++ + H K+ + K I +TD Sbjct: 516 AGGTAIREAVKAIRTSLEIVDG---HLKKDGFKLKFSIFFITD 555 >gi|162419860|ref|YP_001607152.1| von Willebrand factor type A domain-containing protein [Yersinia pestis Angola] gi|162352675|gb|ABX86623.1| von Willebrand factor type A domain protein [Yersinia pestis Angola] Length = 472 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 33/85 (38%), Gaps = 10/85 (11%) Query: 371 IVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SP 425 I+L++DG+ + + A+ +GI I TI + + ++ A S Sbjct: 196 IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNE-----DLMTAIAGYSD 250 Query: 426 NSFFEANSTHELNKIFRDRIGNEIF 450 + ++ +L K F + + Sbjct: 251 GNHTFVANSADLEKAFTKEFQDVMS 275 >gi|301164324|emb|CBW23882.1| conserved exported hypothetical protein [Bacteroides fragilis 638R] Length = 610 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 71/205 (34%), Gaps = 23/205 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + +D S + LV+ +L ++ +++ D V G N Sbjct: 242 PTDNLPASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKDKVAIVTYAG----NAG 297 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V + + K+ I + GSTA + + AY I N + Sbjct: 298 VKLEATPGSNKQKIREAIDE-----LEASGSTAGGEGIMLAY-KIAQKNFISGGNNR--- 348 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ ++E + + + GI + + + + + K + Sbjct: 349 -----IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEK- 402 Query: 423 ASPNSFFEANSTHELNKIFRDRIGN 447 + ++ E N++ + G Sbjct: 403 -GNGNHAYIDNLQEANRVLVNEFGA 426 >gi|221193539|gb|ACM07802.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193577|gb|ACM07821.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193657|gb|ACM07861.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193697|gb|ACM07880.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 896 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 43/310 (13%), Positives = 101/310 (32%), Gaps = 42/310 (13%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +++ + ++ ++Y + + K+++ V++ + + +E + Y Sbjct: 121 TIQNSDDKKSIIEQRQEELDKQYPLTGAYEDTKESYNLEHVKNSIPNGKLEAKA-VNPYS 179 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWL---IQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 ++ LS + + I+ +++ ++ +K+ + + +V+D S Sbjct: 180 SEGEHIREIQEGTLSKRISEVNDLDHNKYKIELTVSGKSIIKTINKDEPLDVVFVLDNSN 239 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + ++++ + + + +K N + + ++ Sbjct: 240 SMKNNGKNNKAKKAG---EAVETIIKDVLGANVENRAALVTYGSDIFDGRTVK--VIKGF 294 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 DP E + + KK + A +I+ I K Sbjct: 295 KEDPYYGLETSFTVQTNDYSYKKFT---NIAADIIKKIPKEAPEAK-------------- 337 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 WG L T K D +++G T A A + S Sbjct: 338 ------WGGTSLGLTPEKKREYDLSKVGETFTMKAFMEADTLLSSIQRKS---------- 381 Query: 368 KKYIVLLTDG 377 +K IV LTDG Sbjct: 382 RKIIVHLTDG 391 >gi|218512349|ref|ZP_03509189.1| hypothetical protein Retl8_00989 [Rhizobium etli 8C-3] Length = 222 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 66/219 (30%), Gaps = 36/219 (16%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 +++ VID S SML S + Sbjct: 13 LNVYIVIDRSPSMLLAATTSGQSTM-------------------------------YSGI 41 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + D S+ +++ + + DA+ V+ I + D+ ++ ++ Sbjct: 42 GCQFACHTGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLDMIDESDSNHERIK 101 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTI-VKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G D + + + + ++ + + D A I+ + Sbjct: 102 VGLYSLGDTTKEVLAPTLDTSNARKRLSDDSYGLTSATSMNYTYFDVALAALQKIVGTGG 161 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 D ++ K ++LLTDG +Q N K Sbjct: 162 DG----TSSANPLKLVLLLTDGVQSQRGWVVKNSSNLKK 196 >gi|77405751|ref|ZP_00782836.1| cell wall surface anchor family protein [Streptococcus agalactiae H36B] gi|77175608|gb|EAO78392.1| cell wall surface anchor family protein [Streptococcus agalactiae H36B] Length = 896 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 43/310 (13%), Positives = 101/310 (32%), Gaps = 42/310 (13%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +++ + ++ ++Y + + K+++ V++ + + +E + Y Sbjct: 121 TIQNSDDKKSIIEQRQEELDKQYPLTGAYEDTKESYNLEHVKNSIPNGKLEAKA-VNPYS 179 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWL---IQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 ++ LS + + I+ +++ ++ +K+ + + +V+D S Sbjct: 180 SEGEHIREIQEGTLSKRISEVNDLDHNKYKIELTVSGKSIIKTINKDEPLDVVFVLDNSN 239 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + ++++ + + + +K N + + ++ Sbjct: 240 SMKNNGKNNKAKKAG---EAVETIIKDVLGANVENRAALVTYGSDIFDGRTVK--VIKGF 294 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 DP E + + KK + A +I+ I K Sbjct: 295 KEDPYYGLETSFTVQTNDYSYKKFT---NIAADIIKKIPKEAPEAK-------------- 337 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 WG L T K D +++G T A A + S Sbjct: 338 ------WGGTSLGLTPEKKREYDLSKVGETFTMKAFMEADTLLSSIQRKS---------- 381 Query: 368 KKYIVLLTDG 377 +K IV LTDG Sbjct: 382 RKIIVHLTDG 391 >gi|223934102|ref|ZP_03626049.1| von Willebrand factor type A [Streptococcus suis 89/1591] gi|164454849|dbj|BAF96969.1| serum opacity factor [Streptococcus suis] gi|223897228|gb|EEF63642.1| von Willebrand factor type A [Streptococcus suis 89/1591] Length = 564 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 45/316 (14%), Positives = 101/316 (31%), Gaps = 39/316 (12%) Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 +E+ + + + + + + + E +S+ L +P +L Sbjct: 43 LEKTAETDSTSPIMATAVVEDSKVVQDNENKEGEVAESSSTSDLPKSNAESASPEALTNE 102 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + + + + + +E V + E + + + Sbjct: 103 NASEPTGQVAENTSSSEGKATEKEEPAVQY--------------VEKEVDDYSTNVEKPN 148 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 TV S + K + + ++ + P +D VD+S Sbjct: 149 ETVSSTVDKTPKELFTVSRTAEVGQDGVVTVTTQIVPKEIDKGAEIVVLVDTSKKMDE-- 206 Query: 270 KKHLVRDALASVIRSIKKI-DNVN-----DTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + A ++I+ ++K+ D N ++VR+ FN ++ + TI Sbjct: 207 --EAKKTAKDNIIKLVEKMTDPTNDHNSRNSVRV--IGFNRKLSESKEV--NKTNVKDTI 260 Query: 324 VKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 F+ E + A+ A + S + +K+IVLL+ GE+T Sbjct: 261 DNLFSNAEQNYNWGVDMQGAIHEARRILESEKSGK----------RKHIVLLSQGESTFS 310 Query: 383 NEEGIAICNKAKSQGI 398 + A + K + I Sbjct: 311 YDLTDAAKTETKYKTI 326 >gi|24375866|ref|NP_719909.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24350833|gb|AAN57353.1|AE015872_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 451 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 56/177 (31%), Gaps = 27/177 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + R+A I +K D V+ ++D + K ++K Sbjct: 87 RIEKAREAAIMAINMLKDDDIVS------VIAYSDNAYLIIPAT--KVKNKNEMIKIIND 138 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 GSTA+ + + +K N + I+LL+DG+ + Sbjct: 139 TIKPGGSTALFAGVSKGITEVN-------KFIKKNQVNR--IILLSDGQANIGPSTTKEL 189 Query: 390 CNK---AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + A QGI + TI ++ A S + ++ +L F Sbjct: 190 ADLGQVAGKQGIAVTTIGL-----GNGYNEDLMTALAGFSDGNHAYVENSADLETAF 241 >gi|166366808|ref|YP_001659081.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089181|dbj|BAG03889.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 456 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 70/196 (35%), Gaps = 21/196 (10%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM------- 297 + G ++ S D+ ++ +K +AL + I+ + N + + Sbjct: 121 FSGSMNCSQDLNTKCDAKAVAKGKRKLDAAINALGTFIK-LASERKGNTYLSIVPFGVEG 179 Query: 298 -----GATFFNDRVISDPSFSWGVHKLIRTIVKT-FAIDENEMGSTAINDAMQTAYDTII 351 GA + +V S+ ++ + + ++ D+ +T A++ Sbjct: 180 KNDKPGACDYYPKVTSETLDNFNLVQDVKLTNFLGSLADKTPCATTNFYQALKETVKFFK 239 Query: 352 SSNEDEVH-RMKNNLE----AKKYIVLLTDGENTQDNEEGIA--ICNKAKSQGIRIMTIA 404 + E + + K + I+LL+DG + N + + + N ++ I + T+ Sbjct: 240 NDKEGRFYPKDKEGKPLKPQPRLSIILLSDGFDNNSNYQEVQKTLANLQNNKDIVVHTLG 299 Query: 405 FSVNKTQQEKARYFLS 420 + + Q K Sbjct: 300 YGLTPQQLGKKYSLGK 315 >gi|108808190|ref|YP_652106.1| hypothetical protein YPA_2196 [Yersinia pestis Antiqua] gi|108811539|ref|YP_647306.1| hypothetical protein YPN_1376 [Yersinia pestis Nepal516] gi|145599390|ref|YP_001163466.1| hypothetical protein YPDSF_2114 [Yersinia pestis Pestoides F] gi|165926883|ref|ZP_02222715.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165935923|ref|ZP_02224493.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|166011266|ref|ZP_02232164.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166212622|ref|ZP_02238657.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167398885|ref|ZP_02304409.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422701|ref|ZP_02314454.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167424397|ref|ZP_02316150.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167467552|ref|ZP_02332256.1| hypothetical protein YpesF_06584 [Yersinia pestis FV-1] gi|170023658|ref|YP_001720163.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896129|ref|YP_001873241.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|270489996|ref|ZP_06207070.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294504758|ref|YP_003568820.1| membrane protein [Yersinia pestis Z176003] gi|108775187|gb|ABG17706.1| membrane protein [Yersinia pestis Nepal516] gi|108780103|gb|ABG14161.1| putative membrane protein [Yersinia pestis Antiqua] gi|145211086|gb|ABP40493.1| membrane protein [Yersinia pestis Pestoides F] gi|165916068|gb|EDR34675.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|165921234|gb|EDR38458.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165989944|gb|EDR42245.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206553|gb|EDR51033.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166958408|gb|EDR55429.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167051389|gb|EDR62797.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167056279|gb|EDR66048.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169750192|gb|ACA67710.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699155|gb|ACC89784.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|262362820|gb|ACY59541.1| membrane protein [Yersinia pestis D106004] gi|262366744|gb|ACY63301.1| membrane protein [Yersinia pestis D182038] gi|270338500|gb|EFA49277.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294355217|gb|ADE65558.1| membrane protein [Yersinia pestis Z176003] Length = 472 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 33/85 (38%), Gaps = 10/85 (11%) Query: 371 IVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SP 425 I+L++DG+ + + A+ +GI I TI + + ++ A S Sbjct: 196 IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNE-----DLMTAIAGYSD 250 Query: 426 NSFFEANSTHELNKIFRDRIGNEIF 450 + ++ +L K F + + Sbjct: 251 GNHTFVANSADLEKAFTKEFQDVMS 275 >gi|89068023|ref|ZP_01155440.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] gi|89046262|gb|EAR52319.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] Length = 987 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 54/175 (30%), Gaps = 32/175 (18%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + A +++ S+ D R F D + + ++L + I Sbjct: 117 QAAARALVGSLADGD------RAHILNFGDSINVAVGMTADRNRLDQAI-----SGLRAW 165 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T +NDA+ + + + IVLL++G + + +++ + Sbjct: 166 GATRLNDAVFASAGALAGAEGRGA------------IVLLSEGPDADPSGAPLSVVDSEA 213 Query: 395 ------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 G + + R A+ ++F L F D Sbjct: 214 ALAAVVESGAPVYAVGLG-PGADAALLRRLAE--ATGGAYFPVADAAALPATFSD 265 >gi|294788348|ref|ZP_06753591.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] gi|294483779|gb|EFG31463.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] Length = 212 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 55/172 (31%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR+ L ++ ++++ +T + F+ + + ++ + Sbjct: 22 IEAVRNGLQVLVSALRQDPYALETAYLSVITFDSQAKQVTPLT--------ELMNFQIPN 73 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+TA+ A+ D I +K + + + LL+DG T +GIA Sbjct: 74 IEANGATAMGGALTLLADCINREVVKGSAEVKGDWKP--VVFLLSDGSPTDSISKGIADI 131 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K+ GI F L + + + F+ Sbjct: 132 KAVKT-GI------FVACAAGPHADTSTLKQITETVVSLDTADANSIKAYFK 176 >gi|290991702|ref|XP_002678474.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284092086|gb|EFC45730.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 467 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 34/160 (21%), Positives = 57/160 (35%), Gaps = 16/160 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I+N+ + R+G F+D+ + + V L + +K ++ E GST MQ Sbjct: 128 IENLREFERLGIVLFDDKAETLLPLTI-VQDLDKKSLKETVLNIKEKGSTNFEAGMQRGI 186 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMTIAFS 406 D S + ++ I+ LTD + I K + G I + Sbjct: 187 DLFSSLDSSDLSNSNR-------IIYLTDACPNVGGTATLDILTKDANSGPYSIFSTFVG 239 Query: 407 VNKTQQEKARYFL---SNCASPNSFFEANSTHELNKIFRD 443 + K L C ++F ST E KI + Sbjct: 240 IGLDFNSKIVDELTRVRGC----NYFSVKSTEEFKKILNE 275 >gi|257878265|ref|ZP_05657918.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257812493|gb|EEV41251.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1107 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 50/332 (15%), Positives = 99/332 (29%), Gaps = 30/332 (9%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + I +++ +FP + N K + + + +D Sbjct: 67 DGYAYQVNSGKITLEISSNTKQTIDLSFPIDPALYHSQANKLIVDNKEYDIIDETENKKD 126 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 T V + P+ + S + ++P +L S+ S E + Sbjct: 127 TDVSV-PKPDEIEEESSKENENSVSPFTLPTLSLPAVSVPSNQTIPTEYTTDDQGTYPKA 185 Query: 178 SIQW-----VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S Q V+D + ++ +N + + SY G D + Y Sbjct: 186 SWQPTGNTNVLDHQG---NKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKY 242 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + +Y + VD S + + V+ + + ++ Sbjct: 243 AKETSTPGLFDVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLA 302 Query: 287 KIDNVNDTVRMGATFFNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D + MG ++ S + G ++ VK+ G T A++ Sbjct: 303 D-SGITDKINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSI-TPSRTNGGTFTQKALR 360 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 A + N KK IVLLTD Sbjct: 361 DAGSMLSVPNGH-----------KKVIVLLTD 381 >gi|167522505|ref|XP_001745590.1| hypothetical protein [Monosiga brevicollis MX1] gi|163775939|gb|EDQ89561.1| predicted protein [Monosiga brevicollis MX1] Length = 1927 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 54/144 (37%), Gaps = 18/144 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWGVHKLI 320 +S I + +++R + G FF+ + Sbjct: 1142 ASGAMSINDFTAAKTTALAILRRLALAQPDISV---GLIFFSQQAQVALPLLDINDETEF 1198 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + ++ + + +T + A+ +A D + +SN +++I+ +DG Sbjct: 1199 QLLLLVLQAGQYQGQATNLGSALSSAADLLETSNNGA----------RQFILF-SDG--- 1244 Query: 381 QDNEEGIAICNKAKSQGIRIMTIA 404 +++G + ++ GI+I+T+A Sbjct: 1245 SSDDQGTLVAQNIRATGIQILTVA 1268 >gi|221193495|gb|ACM07780.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 896 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 43/310 (13%), Positives = 101/310 (32%), Gaps = 42/310 (13%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +++ + ++ ++Y + + K+++ V++ + + +E + Y Sbjct: 121 TIQNSDDKKSIIEQRQEELDKQYPLTGAYEDTKESYNLEHVKNSIPNGKLEAKA-VNPYS 179 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWL---IQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 ++ LS + + I+ +++ ++ +K+ + + +V+D S Sbjct: 180 SEGEHIREIQEGTLSKRISEVNDLDHNKYKIELTVSGKSIIKTINKDEPLDVVFVLDNSN 239 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + ++++ + + + +K N + + ++ Sbjct: 240 SMKNNGKNNKAKKAG---EAVETIIKDVLGANVENRAALVTYGSDIFDGRTVK--VIKGF 294 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 DP E + + KK + A +I+ I K Sbjct: 295 KEDPYYGLETSFTVQTNDYSYKKFT---NIAADIIKKIPKEAPEAK-------------- 337 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 WG L T K D +++G T A A + S Sbjct: 338 ------WGGTSLGLTPEKKREYDLSKVGETFTMKAFMEADTLLSSIQRKS---------- 381 Query: 368 KKYIVLLTDG 377 +K IV LTDG Sbjct: 382 RKIIVHLTDG 391 >gi|258623679|ref|ZP_05718665.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258584045|gb|EEW08808.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 371 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 66/199 (33%), Gaps = 26/199 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S E S++ + + + L + + ++ D R G F D Sbjct: 139 DLSGSMEEKDFSTAAGEQLSRLTAAKRVLRNFVTQ-RQGD------RFGLILFGDAAFIQ 191 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + E M T + DA+ S Sbjct: 192 TPFT----ADQDVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQSPSTSQ--------- 238 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCAS- 424 + +++LTDG +T + A ++GIRI IA + +Q ++ +S Sbjct: 239 DQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVSSL 298 Query: 425 -PNSFFEANSTHELNKIFR 442 F A +LN+ ++ Sbjct: 299 TQARSFVAIDQPQLNEAYQ 317 >gi|221193481|gb|ACM07773.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193489|gb|ACM07777.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193493|gb|ACM07779.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193521|gb|ACM07793.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193533|gb|ACM07799.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193543|gb|ACM07804.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193547|gb|ACM07806.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193555|gb|ACM07810.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193575|gb|ACM07820.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193595|gb|ACM07830.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193623|gb|ACM07844.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193625|gb|ACM07845.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 896 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 43/310 (13%), Positives = 101/310 (32%), Gaps = 42/310 (13%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +++ + ++ ++Y + + K+++ V++ + + +E + Y Sbjct: 121 TIQNSDDKKSIIEQRQEELDKQYPLTGAYEDTKESYNLEHVKNSIPNGKLEAKA-VNPYS 179 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWL---IQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 ++ LS + + I+ +++ ++ +K+ + + +V+D S Sbjct: 180 SEGEHIREIQEGTLSKRISEVNDLDHNKYKIELTVSGKSIIKTINKDEPLDVVFVLDNSN 239 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + ++++ + + + +K N + + ++ Sbjct: 240 SMKNNGKNNKAKKAG---EAVETIIKDVLGANVENRAALVTYGSDIFDGRTVK--VIKGF 294 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 DP E + + KK + A +I+ I K Sbjct: 295 KEDPYYGLETSFTVQTNDYSYKKFT---NIAADIIKKIPKEAPEAK-------------- 337 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 WG L T K D +++G T A A + S Sbjct: 338 ------WGGTSLGLTPEKKREYDLSKVGETFTMKAFMEADTLLSSIQRKS---------- 381 Query: 368 KKYIVLLTDG 377 +K IV LTDG Sbjct: 382 RKIIVHLTDG 391 >gi|224368584|ref|YP_002602747.1| hypothetical protein HRM2_14740 [Desulfobacterium autotrophicum HRM2] gi|223691300|gb|ACN14583.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 222 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 63/192 (32%), Gaps = 28/192 (14%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-NDRVISDPSFSWGV 316 D+S V K + AL +I + +N + + F D + Sbjct: 17 LADTSGSMSVDGKIDAMNQALRDLIDTFSGESRLNAEIHLSVITFGGDGAKEHLPLT--- 73 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + F D G T + A + A + I + + + IVL++D Sbjct: 74 ---CAHTISGF-SDLQAHGMTPMGGAFRIAKELIEDKEK------IPSRAYRPVIVLVSD 123 Query: 377 G------ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 G E + G KA + I + + + F+++ +P F Sbjct: 124 GYPNDDWEAAFSSLRGSERAQKATRMAMAI------GSDADENMLKDFINDPETP--VFR 175 Query: 431 ANSTHELNKIFR 442 AN ++ + FR Sbjct: 176 ANGARDIIRFFR 187 >gi|149277251|ref|ZP_01883393.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] gi|149232128|gb|EDM37505.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] Length = 629 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 72/204 (35%), Gaps = 23/204 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 PS + +D S + K L+ + + ++K D V V G N ++ Sbjct: 248 PSSNLVFLIDVSGSMNDSNKLPLLVSSFKLLTDQLRKTDRVAIVVYAG----NSGLVLPS 303 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 +T +K + GSTA ++ AY+ + +K Sbjct: 304 -----TSGDQKTTIKDALNKLSAGGSTAGGAGIRLAYEVAAKN------YIKGGNNR--- 349 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ +E+ + + + G+ + + F + + K + Sbjct: 350 VILATDGDFNVGASSDEDMEKLIEEKRKSGVFLTVLGFGMGNLKDSKMEVLADK--GNGN 407 Query: 428 FFEANSTHELNKIFRDRIGNEIFE 451 + ++ +E K+ + G +F Sbjct: 408 YAYIDNINEARKVLVNEFGGTLFT 431 >gi|146276888|ref|YP_001167047.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] gi|145555129|gb|ABP69742.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] Length = 563 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 19/70 (27%), Positives = 34/70 (48%), Gaps = 8/70 (11%) Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKI 440 + + A+CN AKS + I IAF + + NC++ + +F+A+ E+ Sbjct: 494 DAQLQALCNLAKSNNVTIFGIAFEA----PANGKTQIQNCSTSRSSHYFDASGL-EIQTA 548 Query: 441 FRDRIGNEIF 450 FR I ++I Sbjct: 549 FR-AIASQIS 557 >gi|188990634|ref|YP_001902644.1| putative secreted protein [Xanthomonas campestris pv. campestris str. B100] gi|167732394|emb|CAP50588.1| putative secreted protein [Xanthomonas campestris pv. campestris] Length = 597 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 68/209 (32%), Gaps = 23/209 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P + VD S K L++ +L ++R ++K D R+ + Sbjct: 221 PTAALPPANLVFLVDVSGSMGAPDKLPLLQSSLKLLVRQLRKQD------RITLVTYAGS 274 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 +T + G TA ++ AY ++ + + N Sbjct: 275 TAVVLPP---TSGAQQTRIVEAIDSLQSGGGTAGASGIELAYK---AAQQAYLRGGINR- 327 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ D ++ + + + G+ + T+ F Sbjct: 328 -----ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLAD-- 380 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A ++ +S E K+ +G+ + Sbjct: 381 AGDGAYAYIDSALEARKVLTHELGSTLAT 409 >gi|77747911|ref|NP_638263.2| hypothetical protein XCC2915 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|77761138|ref|YP_242283.2| hypothetical protein XC_1194 [Xanthomonas campestris pv. campestris str. 8004] Length = 597 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 69/209 (33%), Gaps = 23/209 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + VD S K L++ +L ++R ++K D R+ + Sbjct: 221 PTAALPAANLVFLVDVSGSMGAPDKLPLLQSSLKLLVRQLRKQD------RITLVTYAGS 274 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 +T + G TA ++ AY ++ + + N Sbjct: 275 TAVVLPP---TSGAQQTRIVEAIDSLQSGGGTAGASGIELAYK---AAQQAYLRGGINR- 327 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ D ++ + + + G+ + T+ F Sbjct: 328 -----ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLAD-- 380 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A ++ +S E K+ +G+ + Sbjct: 381 AGDGAYAYIDSALEARKVLTHELGSTLAT 409 >gi|311028997|ref|ZP_07707087.1| hypothetical protein Bm3-1_00293 [Bacillus sp. m3-13] gi|311032266|ref|ZP_07710356.1| hypothetical protein Bm3-1_17297 [Bacillus sp. m3-13] Length = 245 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 26/58 (44%), Gaps = 1/58 (1%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 K I+L+TDG + + + IA+ AK QGI + I T E+ + A Sbjct: 7 KQILLITDGCSNSGD-DPIAMAALAKEQGITVNVIGVMDEDTIDERGMQEIEGIAMSG 63 >gi|172060502|ref|YP_001808154.1| hypothetical protein BamMC406_1450 [Burkholderia ambifaria MC40-6] gi|171993019|gb|ACB63938.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 648 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 38/135 (28%), Gaps = 10/135 (7%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE---------V 75 + A+ + V + V G+ +D+ + L++ A + A P+ Sbjct: 1 MAAIWVMVAIVVLGV-IDIANLYLQKRDLQRVVDLAALAAVQPMTSDPSGCLSDAKNNVT 59 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 SS N + I N + D + +A QV Sbjct: 60 SSANINDKGYAFTLISATATANPTAGNDQIAVSCGRWDSATAYVTPASASANAAQVTAYR 119 Query: 136 RYDLLLNPLSLFLRS 150 + + L L Sbjct: 120 QVNYFFLGLLSQLSG 134 >gi|295704248|ref|YP_003597323.1| hypothetical protein BMD_2120 [Bacillus megaterium DSM 319] gi|294801907|gb|ADF38973.1| conserved hypothetical protein [Bacillus megaterium DSM 319] Length = 396 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 48/266 (18%), Positives = 94/266 (35%), Gaps = 18/266 (6%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 + T + V+L + G + +V + + + A I ++ P + E S SF Sbjct: 86 VTTVIGSVVLLWIIGSFLSIVYYHQSKTEYINSVNQAAIESAFPNVHIRETTSD--TQSF 143 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 + ++E Y E FT ++ D + + VE Y + LS Y +P Sbjct: 144 FRMQSQMEAYKQIGREEQAIGTFTFHQLFD--KTSGVEKAFSGGKYDLNLSFVYP---DP 198 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 L L + K W +T E + + E +S D + S+ + Q+ Sbjct: 199 KKLKLHAKEYKQWTNETWKALENLPKGTVSEVAIS----FDDTYSLKEVQQKMNA----- 249 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 D + Y+ G K P + + + P D ++ H S+ Sbjct: 250 -IDSRDFSTSWYALNTGHEQKTSTKDEPLIDLTSTFGFPEFLDLPYDHKGNQTHSTSKSN 308 Query: 264 LRHVIKKKHLVRDALASVIRSIKKID 289 + +++K + + +R + D Sbjct: 309 VLNMMKLLADNEETVQK-VRQLDHSD 333 >gi|291486255|dbj|BAI87330.1| hypothetical protein BSNT_05611 [Bacillus subtilis subsp. natto BEST195] Length = 227 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 68/197 (34%), Gaps = 13/197 (6%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF--FND 304 P + ++ + + + K + + ++ S+ + V +V G+ N Sbjct: 33 APANVAVLLDASGSMAKRIDGVSKFNSAKKEISKFASSLPEGTQVKMSV-FGSEGNNKNS 91 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + V+ ++F N +G T + + NE + + + Sbjct: 92 GKVQSCEAIRNVYGFQSFNEQSFLNSLNTIGPTGWTPIAK-------ALNEAKSSFDQLD 144 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 + +K + LLTDGE T I + + + I + I F + + + Sbjct: 145 AKGEKVVYLLTDGEETCGG-NPIKTAKELQKENITVNVIGFDYKEGYKGQLNAIAK--VG 201 Query: 425 PNSFFEANSTHELNKIF 441 +F + ++ KIF Sbjct: 202 GGEYFPVYTQKDVEKIF 218 >gi|218710404|ref|YP_002418025.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] gi|218323423|emb|CAV19600.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] Length = 422 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 1/62 (1%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 ++TA L+ V L V + VDV + L+ A +A + A+ L S ++ + A+ Sbjct: 19 LVTAALL-VFLAVSALAVDVNHMLVNKTRLQNAVDSAALAAATILDNSKDKDAVDAEIGT 77 Query: 84 TF 85 Sbjct: 78 AL 79 >gi|149921504|ref|ZP_01909956.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] gi|149817707|gb|EDM77174.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] Length = 560 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 32/259 (12%), Positives = 72/259 (27%), Gaps = 43/259 (16%) Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 +E +++ D S SM S Q + ++ + + + Sbjct: 121 DEETPITLYLSPDDSNSMS-----SPVQVRDWVLNYGGNSLSGFPIRTWEFMNYYGFDYD 175 Query: 232 YMVSCNKSLYYMLYP---------------------GPLDPSLSEEHFVDSSSLRHVIKK 270 S+Y + P P + V +S Sbjct: 176 PAADGELSVYAAMNPIEGEGDEARFQMQIGVASELMTPEERPPMNVTLVLDTSGSMAGTP 235 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L+R+ ++ +K D V+ ++ + + V ++ D Sbjct: 236 IELLRETSRAIAAQLKLGDTVS------ICEWDTSNDWTLA-GYAVTGPNDELLLEKIND 288 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + +++ Y+ + + + +VL++DG + I Sbjct: 289 VVHGGGTNLYGGLESGYE--------LAQMVYDPDAINR-LVLISDGGANAGITDLDLIA 339 Query: 391 NKAKSQGIR-IMTIAFSVN 408 A G I + V+ Sbjct: 340 ENAAYGGSDGIYLVGVGVD 358 >gi|21114118|gb|AAM42187.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572853|gb|AAY48263.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 618 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 69/209 (33%), Gaps = 23/209 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + VD S K L++ +L ++R ++K D R+ + Sbjct: 242 PTAALPAANLVFLVDVSGSMGAPDKLPLLQSSLKLLVRQLRKQD------RITLVTYAGS 295 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 +T + G TA ++ AY ++ + + N Sbjct: 296 TAVVLPP---TSGAQQTRIVEAIDSLQSGGGTAGASGIELAYK---AAQQAYLRGGINR- 348 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ D ++ + + + G+ + T+ F Sbjct: 349 -----ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLAD-- 401 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A ++ +S E K+ +G+ + Sbjct: 402 AGDGAYAYIDSALEARKVLTHELGSTLAT 430 >gi|312135597|ref|YP_004002935.1| von willebrand factor type a [Caldicellulosiruptor owensensis OL] gi|311775648|gb|ADQ05135.1| von Willebrand factor type A [Caldicellulosiruptor owensensis OL] Length = 667 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 44/150 (29%), Gaps = 32/150 (21%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + K + N G T + A+ A + + +S KY++ +T Sbjct: 170 TNTSAINSAKNLIDNMNASGGTNMEAALNKAKNLLNASPSGN----------DKYVIFIT 219 Query: 376 DG----------------------ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 DG N E ++ G ++ + + Sbjct: 220 DGMPTFYLNGTHNGYPLVDGPGLQPNNTTKSETLSAVQSLSQSGTKLFVVGVDTTGADVD 279 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRD 443 K L + + +ST+ LN I +D Sbjct: 280 KTFIELMASTANGKSYYISSTNALNSILQD 309 >gi|296127472|ref|YP_003634724.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296019288|gb|ADG72525.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 338 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 36/94 (38%), Gaps = 13/94 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + I+ + + T I DA+ TA +T +KK IVL+TDG Sbjct: 151 ETFSYILDNLSTKSVTLQGTRIADALVTAKNTFNVDA-----------VSKKSIVLITDG 199 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 E+ + + + K I + TI ++ Sbjct: 200 EDHGGYFD--EVLKELKDMNISVYTIGVGTSQGA 231 >gi|220925364|ref|YP_002500666.1| LPXTG-motif cell wall anchor domain-containing protein [Methylobacterium nodulans ORS 2060] gi|219949971|gb|ACL60363.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium nodulans ORS 2060] Length = 725 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 47/331 (14%), Positives = 94/331 (28%), Gaps = 31/331 (9%) Query: 97 NFENNLKKNFTDREVR--DIVRDTAVEMNPRKSAYQV--VLSSRYDLLLNPLSLFLRSMG 152 N N N E I V ++ A ++ V++ RY+ P++ G Sbjct: 160 NLFTNSVANIGPGETVLVQIAYQQPVRLSGGTHALRIPLVVAPRYNPAPAPVTPAAEGAG 219 Query: 153 IKSWLIQTKAEAETVSRSYHK-EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + + + + H + V++ + + Q + + G + R Sbjct: 220 ADPVPDRARITPPVLDPAEHAPANPVTLTVTLQAGFPLGSVQSATHPIRVEETGPESRRV 279 Query: 212 VKSYSSQNGKVGIRDEKLSP-----------YMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 + I + V ++ L ++ P V Sbjct: 280 TLADGPVPADRDIELTWTAAPARAPAIGLFRERVGTDEYLLAVVTPPEGQNLARRPRDVT 339 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +R A AS++ ++ ++ + R F++ + + + Sbjct: 340 FVIDNSGSMAGASMRQAKASLLMALDRLAPAD---RFNVIRFDNTMDQLFPEAVPADERH 396 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + ++F G T + + A D V + IV LTDG Sbjct: 397 LAVARSFVAALEARGGTEMLAPLTAALADPTPERTDRV----------RQIVFLTDGA-- 444 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 NEE I A R+ I Sbjct: 445 IGNEEQIFSAIAAGRGRSRLFMIGIGSAPNA 475 >gi|238759127|ref|ZP_04620296.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] gi|238702675|gb|EEP95223.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] Length = 351 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 49/145 (33%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I+S G I ++++ + L ++ R Y +L A A +A Sbjct: 5 FIRSNRGAIHIEFSIVIILFLFTLLSCAEIARLLYISASLDLAVSEAAKSAKNKEKDDNT 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + ++ N + NF++ I ++ +N + Y + Sbjct: 65 TYQEAFQEKLISQQGVFGSFITNNNSAVIDINFSNSITGIINSNSNTSVNRVNTIYNNEI 124 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLI 158 +RY + +FL I + + Sbjct: 125 LARYTVNYTYQPVFLPFNSIFTDTL 149 >gi|116624267|ref|YP_826423.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227429|gb|ABJ86138.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 306 Score = 37.7 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 85/270 (31%), Gaps = 32/270 (11%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 ML + + +V + K + + + P Sbjct: 26 MLAQENAVTTFRTESNLVSLNVSVFDQEGRIVKGLPQSAFTVFEDNQKQEIKVFRQEDVP 85 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + L + S+ K+ V A +++++ ++ F++ Sbjct: 86 ISLGLVIDTSASMSN------KRDRVNSAALAMVKASN---PEDEVF---VISFSEEAFI 133 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F+ V +L ++ K G TA+ DA+ D + + + K Sbjct: 134 TQDFTSDVKQLESSLRK-----LGSKGETAMRDALSLGLDHLRAPARKD----------K 178 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS---VNKTQQEKARYF-LSNCAS 424 K +V++TDGE+ ++ + A + I I + Q + A+ Sbjct: 179 KVLVVITDGEDNSSIQKQENLIRAAHLSNVIIYGIGLLAAEAPASAQRAKASLDVLTLAT 238 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + ++ KI I +EI + + Sbjct: 239 GGRSWYPENVADIEKI-TPEIAHEIRNQYV 267 >gi|212693196|ref|ZP_03301324.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|237709938|ref|ZP_04540419.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725395|ref|ZP_04555876.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753590|ref|ZP_06088945.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664301|gb|EEB24873.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|229436082|gb|EEO46159.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456031|gb|EEO61752.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235304|gb|EEZ20828.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 340 Score = 37.7 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 19/144 (13%), Positives = 43/144 (29%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ ++ + ND V G F + + + ++T Sbjct: 110 SRLDKSKKLISRLVDTF-----NNDKV--GLIVFAGDAFTQLPITSDYVSA-KMFLETIN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T I A++ A + + I+++TDGEN + A Sbjct: 162 PSLITTQGTDIGTAIRLAMKSFT-----------PQEGVGRAIIVITDGENHEGGAVEAA 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 K G+++ + Sbjct: 211 QEAAEK--GMQVFVLGVGSPDGSP 232 >gi|150005796|ref|YP_001300540.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294776175|ref|ZP_06741664.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|149934220|gb|ABR40918.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294449998|gb|EFG18509.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] Length = 340 Score = 37.7 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 19/144 (13%), Positives = 43/144 (29%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ ++ + ND V G F + + + ++T Sbjct: 110 SRLDKSKKLISRLVDTF-----NNDKV--GLIVFAGDAFTQLPITSDYVSA-KMFLETIN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T I A++ A + + I+++TDGEN + A Sbjct: 162 PSLITTQGTDIGAAIRLAMKSFT-----------PQEGVGRAIIVITDGENHEGGAVEAA 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 K G+++ + Sbjct: 211 QEAAEK--GMQVFVLGVGSPDGSP 232 >gi|319783910|ref|YP_004143386.1| hypothetical protein Mesci_4225 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169798|gb|ADV13336.1| hypothetical protein Mesci_4225 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 393 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 32/78 (41%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + +S + I AL +PV++G G V+ W Y L+ A A ++ I + Sbjct: 1 MRRSTGANVATIFALTLPVVVGAAGFGVETSYWYYNSLRLQATADAAAYAGALEQISGSD 60 Query: 74 EVSSRAKNSFTFPKQKIE 91 + + A + + + Sbjct: 61 KPTIVAAATQSAASNGLG 78 >gi|291520528|emb|CBK75749.1| Gram positive anchor./von Willebrand factor type A domain [Butyrivibrio fibrisolvens 16/4] Length = 605 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 39/121 (32%), Gaps = 9/121 (7%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + L++ A+ ++I +I + ++V+ + F R + + + Sbjct: 7 DKTRIQLLKSAVDNMIDNIAEKEDVDAKWEV--IDFATRAAVRGGGWLNTSNVKQYVTTA 64 Query: 327 FAIDEN--EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D N T M A + AKK ++ LTDG+ T Sbjct: 65 INEDNNVDIGRGTNYQAGMDLAQKEFEKKQPESDRPN-----AKKIVLFLTDGQPTYYGS 119 Query: 385 E 385 Sbjct: 120 G 120 >gi|162312016|gb|ABX84114.1| hedgling [Nematostella vectensis] Length = 3480 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 52/144 (36%), Gaps = 18/144 (12%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEV 358 ++ + + H + +K G T DA+ A I Sbjct: 221 ITYSTWAQVEFNLK--AHHSSKAALKNAVNAIYYRSGWTYTADALDLAGRNIFQ----VA 274 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + M+ + K VLLTDG + +N + N ++ G+ + + ++ Sbjct: 275 NGMRPDKGIPKIAVLLTDGYSNGNN--PLGPANDLRAAGVNVFCVGIGNYYERE------ 326 Query: 419 LSNCASP---NSFFEANSTHELNK 439 L++ A+ + F+ + ++LN Sbjct: 327 LNDIATDPDKDHVFKLENFNDLNS 350 >gi|119470035|ref|ZP_01612840.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] gi|119446745|gb|EAW28018.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] Length = 1090 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 32/238 (13%), Positives = 73/238 (30%), Gaps = 38/238 (15%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + D S SM + + Q C+ + ++ + + G + + Sbjct: 44 MIFDTSGSMDWSVINGDNQ--VCYIKKITQSNGNGNGNGNGNGNGNGNGNGGGGQVYYEN 101 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRH---VIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 +E+ ++ + + ++ + ++A+ ++ N N + Sbjct: 102 VTCFASKDEYAEFNEQCYIGVNGVAVADCHDRRIDVAKNAMTQLV-------NDNSDIDF 154 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F V G K + + GST + + + AY I + D Sbjct: 155 GLMRFRSNVGGYVVAKLGTDKTS---LISDINSLPASGSTPMTETLWEAYRYITGQSLDY 211 Query: 358 VHRMK------------------NNLEAKKY-----IVLLTDGENTQDNEEGIAICNK 392 + NN + + ++L+TDG+ T D+ +I Sbjct: 212 AFNVSDRDKSADNSVVYTSPFKPNNGDPLRCDNSINVILMTDGDPTNDDGRDTSIAQT 269 >gi|261253067|ref|ZP_05945640.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] gi|260936458|gb|EEX92447.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] Length = 424 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 10/81 (12%), Positives = 30/81 (37%), Gaps = 1/81 (1%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 Y +L ++T ++ + + + +DV + + L+ A T + + Sbjct: 12 MRQYPNRLRNQKGLTLVVMTMSMV-AFITIAALSIDVSHFVVNKTRLQNAVDTIALAGAT 70 Query: 67 PLIQSLEEVSSRAKNSFTFPK 87 ++ E+ + ++ K Sbjct: 71 VANRTNEKGDTDTAIIESYKK 91 >gi|170743966|ref|YP_001772621.1| hypothetical protein M446_5903 [Methylobacterium sp. 4-46] gi|168198240|gb|ACA20187.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 417 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 43/118 (36%), Gaps = 7/118 (5%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+ S G F ++ A+ G + +D Y + L++ A A + A + L S Sbjct: 12 KQGGASIIGLFGMLIAV------GFAAVAIDSGNLYYSKLKLQKIADAAALGAVMALPTS 65 Query: 72 LEEVSSRAKN-SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 +++ S P + + + + + AV++ R+S+ Sbjct: 66 SSVMAAALDLVSKNTPVGFGTVSTSADIQIGVYDPSSKTFTPSAIGQNAVQVTTRRSS 123 >gi|254882022|ref|ZP_05254732.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|319640970|ref|ZP_07995678.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] gi|254834815|gb|EET15124.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|317387415|gb|EFV68286.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] Length = 340 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 19/144 (13%), Positives = 43/144 (29%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ ++ + ND V G F + + + ++T Sbjct: 110 SRLDKSKKLISRLVDTF-----NNDKV--GLIVFAGDAFTQLPITSDYVSA-KMFLETIN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T I A++ A + + I+++TDGEN + A Sbjct: 162 PSLITTQGTDIGAAIRLAMKSFT-----------PQEGVGRAIIVITDGENHEGGAVEAA 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 K G+++ + Sbjct: 211 QEAAEK--GMQVFVLGVGSPDGSP 232 >gi|229838599|ref|ZP_04458758.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229895667|ref|ZP_04510838.1| putative membrane protein [Yersinia pestis Pestoides A] gi|229899165|ref|ZP_04514308.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229901807|ref|ZP_04516929.1| putative membrane protein [Yersinia pestis Nepal516] gi|229681736|gb|EEO77830.1| putative membrane protein [Yersinia pestis Nepal516] gi|229687567|gb|EEO79640.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229694965|gb|EEO85012.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229701473|gb|EEO89501.1| putative membrane protein [Yersinia pestis Pestoides A] Length = 437 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 33/85 (38%), Gaps = 10/85 (11%) Query: 371 IVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SP 425 I+L++DG+ + + A+ +GI I TI + + ++ A S Sbjct: 161 IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNE-----DLMTAIAGYSD 215 Query: 426 NSFFEANSTHELNKIFRDRIGNEIF 450 + ++ +L K F + + Sbjct: 216 GNHTFVANSADLEKAFTKEFQDVMS 240 >gi|238790016|ref|ZP_04633794.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] gi|238721829|gb|EEQ13491.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] Length = 448 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 61/186 (32%), Gaps = 27/186 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + R+A + + D ++ +++ + ++ Sbjct: 88 RIEKAREAAILAVNMLDASDTLS------VVAYDNNAEVIIPAT--KVNNKPALIAKIQQ 139 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + MG TA+ + S +V + N + + I+LL+DG+ + Sbjct: 140 HIHPMGMTALFAGV--------SKGIGQVDKNLNPEQVNR-IILLSDGQANTGPTSISEL 190 Query: 390 CNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + A+ +GI I TI + + ++ A S + ++ +L F Sbjct: 191 SDLARMAAKKGIAITTIGLGEDYNE-----DLMTAIAGYSDGNHSFVANSADLESAFTKE 245 Query: 445 IGNEIF 450 + + Sbjct: 246 FQDVMS 251 >gi|182625113|ref|ZP_02952890.1| von Willebrand factor type A domain protein [Clostridium perfringens D str. JGS1721] gi|177909733|gb|EDT72159.1| von Willebrand factor type A domain protein [Clostridium perfringens D str. JGS1721] Length = 620 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 55/147 (37%), Gaps = 14/147 (9%) Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 N + + P + + + + + ++ ++ S++ + IDN+ ++ Sbjct: 67 DGNYEITLTVKGKPKKVTKPVDILLIMDASNSMYYNMDELKASMNSLVDKV--IDNIPNS 124 Query: 295 VRMGATFFNDRVISDPSFS----WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+ F V SF+ + + + +K G+T I + A Sbjct: 125 -RIAVVAFGTEVEEVFSFNDKNNFTSKEEYKNAIKDSYYYITGRGNTNIEGTWRRA---- 179 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDG 377 NE + + NN +KK ++ +DG Sbjct: 180 ---NEIFKNELNNNSNSKKDVIFFSDG 203 >gi|158294866|ref|XP_315864.4| AGAP005840-PA [Anopheles gambiae str. PEST] gi|157015764|gb|EAA11549.4| AGAP005840-PA [Anopheles gambiae str. PEST] Length = 1299 Score = 37.7 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 29/247 (11%), Positives = 74/247 (29%), Gaps = 16/247 (6%) Query: 45 RWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEE----YLIRNFEN 100 W + + LK + + P + + + + + + + N Sbjct: 412 NWLFRKRQLKTTESSIAMLVPSPTEEVKALIGDKNADEISDLSEAGSDCEGYESDGNVNG 471 Query: 101 NLKKNFTDREVRDIVRDT---AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS-- 155 + V +DT + + + + +L + +N L + ++ + Sbjct: 472 GTVPSVAPVVVASSSKDTTSEGISSSSKTQSLDEILPPDSLVSINSLPVNEEALSEATNS 531 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 L+ + S + +SI+ V D + N F + Sbjct: 532 QLLADQVAQNGASERLQVDDLISIEPVADR-------VETTNPTLTNPFLDDVFEANNNV 584 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + + K+ R E + + Y P+D + + D ++ ++ Sbjct: 585 ITDDVKMLQRTETGDNRSSATESNGYGDRKEIPIDRAQIATNGGDVPEVQGDSGEQDASV 644 Query: 276 DALASVI 282 DA ++I Sbjct: 645 DAPDTLI 651 >gi|308502682|ref|XP_003113525.1| hypothetical protein CRE_26515 [Caenorhabditis remanei] gi|308263484|gb|EFP07437.1| hypothetical protein CRE_26515 [Caenorhabditis remanei] Length = 861 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 29/116 (25%), Positives = 45/116 (38%), Gaps = 12/116 (10%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 ND R+G FND V + GV + TI E G T + A++ A D + Sbjct: 405 DNDFTRVGVLTFNDIVTEKLTLQKGVD--LATINAAIDSVEYLGGLTDVTAALKAAKD-L 461 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-EGIAICNKAKSQGIRIMTIAF 405 S D H K +++L+D T D + IA + G+ + + Sbjct: 462 FSKESDNAH--------SKVLIVLSDAVPTVDTYADEIAAGQALSAAGVATFFVGY 509 >gi|320014437|gb|ADV98008.1| putative membrane protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 437 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 33/85 (38%), Gaps = 10/85 (11%) Query: 371 IVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SP 425 I+L++DG+ + + A+ +GI I TI + + ++ A S Sbjct: 161 IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNE-----DLMTAIAGYSD 215 Query: 426 NSFFEANSTHELNKIFRDRIGNEIF 450 + ++ +L K F + + Sbjct: 216 GNHTFVANSADLEKAFTKEFQDVMS 240 >gi|239943867|ref|ZP_04695804.1| hypothetical protein SrosN15_22911 [Streptomyces roseosporus NRRL 15998] gi|291447330|ref|ZP_06586720.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] gi|291350277|gb|EFE77181.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] Length = 396 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 48/132 (36%), Gaps = 16/132 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 PS + S S++ ++ ++ AL + ++ + V + + + Sbjct: 212 PSRTVYVLDTSGSMKG--RRLAQLKSALNGLTGDFRE----REQVTLLPFGSTVKQVRTH 265 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + K ++ A + G TAI ++ AYD + E Sbjct: 266 TVDPADPKAGPAAIRADAAALSAEGDTAIYSSLAAAYDHLGPDTESAFTS---------- 315 Query: 371 IVLLTDGENTQD 382 IVL+TDGENT Sbjct: 316 IVLMTDGENTAG 327 >gi|221193633|gb|ACM07849.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 896 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 43/310 (13%), Positives = 101/310 (32%), Gaps = 42/310 (13%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +++ + ++ ++Y + + K+++ V++ + + +E + Y Sbjct: 121 TIQNSDDKKSIIEQRQEELDKQYPLTGAYEDTKESYNLEHVKNSIPNGKLEAKA-VNPYS 179 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWL---IQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 ++ LS + + I+ +++ ++ +K+ + + +V+D S Sbjct: 180 SEGEHIREIQEGTLSKRISEVNDLDHNKYKIELTVSGKSIIKTINKDEPLDVVFVLDNSN 239 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + ++++ + + + +K N + + ++ Sbjct: 240 SMKNNGKNNKAKKAG---EAVETIIKDVLGANVENRAALVTYGSDIFDGRTVK--VIKGF 294 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 DP E + + KK + A +I+ I K Sbjct: 295 KEDPYHGLETSFTVQTNDYSYKKFT---NIAADIIKKIPKEAPEAK-------------- 337 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 WG L T K D +++G T A A + S Sbjct: 338 ------WGGTSLGLTPEKKREYDLSKVGETFTMKAFMEADTLLSSIQRKS---------- 381 Query: 368 KKYIVLLTDG 377 +K IV LTDG Sbjct: 382 RKIIVHLTDG 391 >gi|239990323|ref|ZP_04710987.1| hypothetical protein SrosN1_23653 [Streptomyces roseosporus NRRL 11379] Length = 527 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 48/132 (36%), Gaps = 16/132 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 PS + S S++ ++ ++ AL + ++ + V + + + Sbjct: 343 PSRTVYVLDTSGSMKG--RRLAQLKSALNGLTGDFRE----REQVTLLPFGSTVKQVRTH 396 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + K ++ A + G TAI ++ AYD + E Sbjct: 397 TVDPADPKAGPAAIRADAAALSAEGDTAIYSSLAAAYDHLGPDTESAFTS---------- 446 Query: 371 IVLLTDGENTQD 382 IVL+TDGENT Sbjct: 447 IVLMTDGENTAG 458 >gi|94499792|ref|ZP_01306328.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] gi|94427993|gb|EAT12967.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] Length = 731 Score = 37.7 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 66/228 (28%), Gaps = 34/228 (14%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPG----PLDPSLSEEHFVDSSSLRHVIKKKHL 273 Q G G D + +L L PG P+ +D S K Sbjct: 312 QPGTPGGLDVLAYKESANKTGTLMMTLTPGDDLQPIQRGTDWTLLLDISGSMQG--KFQT 369 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN- 332 + + + ++ D R+ FND + + G + + + Sbjct: 370 LIEGVKKGLKRFNPQD------RVRVVLFNDY---ASNLTGGFLPATQKNIAEIIRKLDL 420 Query: 333 --EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + D ++ A + + I L+TDG + Sbjct: 421 VLPNGGTHLMDGVRFALSGLDADRTSA-------------IWLVTDGVTNVGETKQRKFV 467 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + K + IR+ T N + + AS +++ ++ Sbjct: 468 DLLKQKDIRVFT-FIMGNGANRPLLKAITK--ASNGFAINVSNSDDII 512 >gi|225873376|ref|YP_002754835.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] gi|225792625|gb|ACO32715.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] Length = 363 Score = 37.7 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 42/116 (36%), Gaps = 17/116 (14%) Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI--- 403 YD + + D++ +K ++LLTDGE+ + + A+ + I Sbjct: 211 YDAVAQAANDKLR----EQTGRKALILLTDGEDLGSATKPLQAIADAQKANTIVYVILIA 266 Query: 404 ------AFSVNKTQQEKARYFLSNCASPNSFFEA-NSTHELNKIFRDRIGNEIFER 452 ++ T + R A+ N+ +L F++ I E+ + Sbjct: 267 DRGFYGGYTFGYTGDAQMRRLAE--ATGGRMINVGNNGAKLTAAFKE-IARELRTQ 319 >gi|325698104|gb|EGD39985.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK160] Length = 464 Score = 37.7 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 64/222 (28%), Gaps = 29/222 (13%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 +Y +S Y + ++ + GK GI Sbjct: 132 TEGAYQDNRLISYNLT-------GKYPDSNNKLSIDTAISALNTKQVFSKVAKGKKGIAL 184 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRS 284 Y + + D S S ++ ++ + K + ++D ++R Sbjct: 185 A----YRNDPIEGQMNVAISFVFDKSGSMSWDLNGNNTNYWGPKSRMSILKDKATIMMRD 240 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +K I NV+ + + + G + +I G T D ++ Sbjct: 241 LKDIGNVSVNLVSFSILGSYVQKDFSELDKGTTTIEASI-----NALQTGGVTNPGDGLR 295 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 ++ N+ KY+VLLTDG + Sbjct: 296 YGMMSLQ-----------NHSAQLKYVVLLTDGIPNAYTVDT 326 >gi|325686522|gb|EGD28550.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK72] Length = 458 Score = 37.7 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 81/236 (34%), Gaps = 29/236 (12%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 +Y +S Y + ++ + GK GI Sbjct: 133 TEGAYQDNRLISYNLT-------GKYPDTNNKLGIDTAISALNTKQVFSKVAKGKKGI-- 183 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR-HVIKKKHLVRDALASVIRSI 285 Y + + D S S + ++ ++ + + ++RD +I + Sbjct: 184 --AIAYRTDPIQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINEL 241 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + NV+ + +T + + + + TI ++ +++ G T D ++ Sbjct: 242 QSVGNVSVNLTTFSTTGSYKQAAFSQLD----REAGTIKESIKNLKSDGGVTNPGDGLRY 297 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAICNKAKSQGIR 399 ++ + H KY+VLLTDG N++G A + K +GI+ Sbjct: 298 GMVSLQ-----KQHAQL------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQ 342 >gi|296168868|ref|ZP_06850540.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896485|gb|EFG76135.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 37.7 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 60/201 (29%), Gaps = 33/201 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + ++A + N +G F S + R VK Sbjct: 118 RLTAAKEAGKQFADELTPAIN------LGLVEFAANASLLVSPT-----TNRAAVKAAID 166 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-----QDNE 384 TA + + TA I + V + + IVL +DG + Sbjct: 167 SLKPAPKTATGEGLFTALQAIATVG--SVMGGGDGPPPAR-IVLESDGAENVPLDPNAPQ 223 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQ----------QEKARYFLSNC-ASPNSFFEANS 433 AK++G++I TI+F + C + F A+S Sbjct: 224 GAFTAARAAKAEGVQISTISFGTPYGTVEYEGATIPVPVDDQTLQKICEITDGQAFHADS 283 Query: 434 THELNKIF---RDRIGNEIFE 451 L ++ + +IG E + Sbjct: 284 LESLKNVYSTLQRQIGYETVK 304 >gi|332885553|gb|EGK05799.1| hypothetical protein HMPREF9456_02063 [Dysgonomonas mossii DSM 22836] Length = 580 Score = 37.3 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 74/202 (36%), Gaps = 23/202 (11%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D S + + LV+ +L + ++++ D V V G+ + Sbjct: 210 PASNFVFLIDVSGSMYGATRLDLVKSSLKLLTNNLREKDRVAIVVYAGSAG----EVLPS 265 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + +K + N GST +Q AY + + +K Sbjct: 266 -----TSGANKQKIKEALDNLNAGGSTTGGAGIQLAYKI------AKQNFIKGGNNR--- 311 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ N+ +A+ + + G+ + + + + + K + A + Sbjct: 312 IILCTDGDFNVGVSSNDGLLALIEQERKSGVFLSILGYGMGNYKDSKMQTLAQ--AGNGN 369 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 ++ E NK+ + G + Sbjct: 370 HAYIDNLQEANKVLVNEFGATM 391 >gi|90577458|ref|ZP_01233269.1| putative hemagglutinin/hemolysin-related protein [Vibrio angustum S14] gi|90440544|gb|EAS65724.1| putative hemagglutinin/hemolysin-related protein [Vibrio angustum S14] Length = 1679 Score = 37.3 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 32/274 (11%), Positives = 77/274 (28%), Gaps = 30/274 (10%) Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV-----SRSYHKEHG 176 + + + S+ + ++ + G + T + + + Sbjct: 1100 VTSDGANHTASASASHQYQVDGVV----IQGGNGDDVITGGNGSDLLIGDLTPPQADQQP 1155 Query: 177 VSIQWVIDFSRSM----LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 V+I +V+D S SM L +G + R + ++ +V + ++ Sbjct: 1156 VNINYVMDISGSMYFGRLLTLDAVKGHVAKSYEIYVGRNSQLTAADGKEVSHTEGWVTVS 1215 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 L Y + + S + + + + + ++ SI Sbjct: 1216 YDQLKAGLQYDAGSYDNIQIKASDGTEYSDNFNSLPSLIDMAKKSYQTLTSSIIDSVEDK 1275 Query: 293 DTVRMGATFFNDRVISDPSFSWG-VHKL----IRTIVKTFAIDENEMGSTAINDAMQTAY 347 + F+ V + SF + K + + G T A+ Sbjct: 1276 SKITFNMVTFSSDVKGNTSFHYDETSKTFVNDQHQTINNYIDSLVAGGGTQFEGALSDIS 1335 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I + + I L+DG++ Sbjct: 1336 RHITDPS------------MRNVIYFLSDGKDED 1357 >gi|330811037|ref|YP_004355499.1| hypothetical protein PSEBR_a4091 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327379145|gb|AEA70495.1| Hypothetical protein; putative exported protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 426 Score = 37.3 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 34/116 (29%), Gaps = 5/116 (4%) Query: 42 DVVRWSYYEHALKQAAQTAIITASVPLIQ---SLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 D + L+ A A + + L Q + S+ + + Sbjct: 21 DGGHMLLNKTRLQNAVDAAALGGAKTLSQVTGGMNMASTTRAAALDTLSRNASAVGNAEL 80 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMN-PRKSAYQVVLSSRYDLLLNPLS-LFLRSMG 152 + N ++ + P + + V S + LN F++SMG Sbjct: 81 ATAVAGNPGAFAAVELSSSVYGPFSYPGPTDAKYVRVSVANYQLNGFFWSFVQSMG 136 >gi|205374347|ref|ZP_03227145.1| hypothetical protein Bcoam_14574 [Bacillus coahuilensis m4-4] Length = 1083 Score = 37.3 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 65/195 (33%), Gaps = 28/195 (14%) Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + S +N P+ V ++ N K + + +++ Sbjct: 24 NLASASNNVTVNLSVTPSQSVVILPTTSNAKASLNLMLTPTGNPQTERDPIDLVFVFDKS 83 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 S+ + +SS +K+ + A + ++ D N + R G F+ +D Sbjct: 84 GSMDFKVASNSS-----VKRIDSAKSA---MTNALMFFDGQNTSDRFGFVPFSSNANTDV 135 Query: 311 -----SFSWGVHKLIRTIVKTFAI---DENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 S WG + ++T + G T +A+ A SS++D Sbjct: 136 VSLTDSSGWGSSSYTNSKLQTIHNKTMGLSASGGTNYTEALDVASKLFDSSSKD------ 189 Query: 363 NNLEAKKYIVLLTDG 377 K I+ LTDG Sbjct: 190 ------KNIIFLTDG 198 >gi|221193733|gb|ACM07898.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 896 Score = 37.3 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 43/310 (13%), Positives = 101/310 (32%), Gaps = 42/310 (13%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +++ + ++ ++Y + + K+++ V++ + + +E + Y Sbjct: 121 TIQNSDDKKSIIEQRREELDKQYPLTGAYEDTKESYNLEHVKNSIPNGKLEAKA-VNPYS 179 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWL---IQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 ++ LS + + I+ +++ ++ +K+ + + +V+D S Sbjct: 180 SEGEHIREIQEGTLSKRISEVNDLDHNKYKIELTVSGKSIIKTINKDEPLDVVFVLDNSN 239 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + ++++ + + + +K N + + ++ Sbjct: 240 SMKNNGKNNKAKKAG---EAVETIIKDVLGANVENRAALVTYGSDIFDGRTVK--VIKGF 294 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 DP E + + KK + A +I+ I K Sbjct: 295 KEDPYHGLETSFTVQTNDYSYKKFT---NIAADIIKKIPKEAPEAK-------------- 337 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 WG L T K D +++G T A A + S Sbjct: 338 ------WGGTSLGLTPEKKREYDLSKVGETFTMKAFMEADTLLSSIQRKS---------- 381 Query: 368 KKYIVLLTDG 377 +K IV LTDG Sbjct: 382 RKIIVHLTDG 391 >gi|307352559|ref|YP_003893610.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307155792|gb|ADN35172.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 1022 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 55/176 (31%), Gaps = 46/176 (26%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 +ND D + + T G T + + + + + D+ Sbjct: 614 YNDYATIDLNLTEDFSDY-----NTEVKALVPDGGTPMRKGLYYSIKHLRDNGRDDA--- 665 Query: 362 KNNLEAKKYIVLLTDGENTQDN-----------EEGIAICNK-----------------A 393 K +V+L+DG+ + + K A Sbjct: 666 ------VKAVVVLSDGDYNYYGDPLARGSGGTKWDWSDMQEKYYTFSDLNSSEQDMRIFA 719 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 K I+I +IA++ + + + L A + ++ A S +L +I+ D G Sbjct: 720 KDNDIKIFSIAYADGISS--EGKAVLQALAEGTGGKYYYAPSGEDLEEIYEDIAGE 773 >gi|116695554|ref|YP_841130.1| hypothetical protein H16_B1615 [Ralstonia eutropha H16] gi|113530053|emb|CAJ96400.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 352 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 56/206 (27%), Gaps = 44/206 (21%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A ++ ++ V G + S I + Sbjct: 116 SRIRAAQQAAKVLLDTLPAG------VSAGVVAMAGTAAVAQAPSRSKDAAATAIDR--- 166 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE--------------------AK 368 G TA+ + + A T++ + R+ N Sbjct: 167 --LKPQGGTALGNGLLIALTTLLPQTAGDAERLMNGDTTPLQKPDASHSGEAVTPGSYPS 224 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN----------KTQQEKARYF 418 IVL +DGE+ A A + G+R+ T+ + Sbjct: 225 GAIVLFSDGESNTGPAATQAA-QLAAAHGVRVYTVGVGTTDGVVLSVDGWSARVRLDEKV 283 Query: 419 LSNCASP--NSFFEANSTHELNKIFR 442 L A+ +F +L +++R Sbjct: 284 LKEVANATGAEYFPLADAAQLKRVYR 309 >gi|325282943|ref|YP_004255484.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] gi|324314752|gb|ADY25867.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] Length = 535 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 63/412 (15%), Positives = 112/412 (27%), Gaps = 59/412 (14%) Query: 56 AAQTAIITASVPLIQSLEEVSSR--AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 A A + LI +S + A + ++++ + + Sbjct: 164 ANPAASNSGMTALIGITAALSGKGDAITAGDVQAGALKQFFKGQVLTSGSSGWLA----- 218 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLN-------PLSLFLRSMGIKSWLIQTKAEAET 166 D V R ++ L LN PL L S G+ + + Sbjct: 219 ---DAYVADQGRSQLNGLINYESVLLSLNRGGRLQEPLKLIYPSDGLVTADYPLMLLNDA 275 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 Y ++D RS QR + A S G + Sbjct: 276 RRSEYQA--------LVDRLRSPQVQQRIMQETLRR---PAAPGVALSSEFPPGMLVELP 324 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 S + Y P + FV S K+ ++ AL ++ + Sbjct: 325 FPASAGTIDAILGSYLNDVRRPANT-----IFVLDVSGSMEGKRLEALKAALGNLSGADT 379 Query: 287 KI----DNVNDTVRMGATFFNDRVISDPSFSWGVHK--LIRTIVKTFAIDENEMGSTAIN 340 + D R+ F+ V + SF + G T I Sbjct: 380 SLGWRFAAFADRERVTLIPFSGDVEAVRSFQVNKASRAADLQAIAAAGGALQAGGGTNIY 439 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE----EGIAICNKAKSQ 396 A+ AY ++ +VL+TDGE T A ++ Sbjct: 440 GALSEAYRQAAAAPAGSYTS----------VVLMTDGEGTAGPSLNEFRDFYAALPAGAR 489 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 ++ T+ F + Q+ L+ F+ L F++ G + Sbjct: 490 SVKTFTVLFGDSDVQEMNEVAALTG----GRTFDGQ--QNLAAAFKEIRGYQ 535 >gi|253565978|ref|ZP_04843432.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] gi|251945082|gb|EES85520.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 610 Score = 37.3 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 71/205 (34%), Gaps = 23/205 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + +D S + LV+ +L ++ +++ D V G N Sbjct: 242 PTDNLPASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKDKVAIVTYAG----NAG 297 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V + + K+ I + GSTA + + AY I N + Sbjct: 298 VKLEATPGSDKQKIREAIDE-----LEASGSTAGGEGIMLAY-KIAQKNFISGGNNR--- 348 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ ++E + + + GI + + + + + K + Sbjct: 349 -----IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEK- 402 Query: 423 ASPNSFFEANSTHELNKIFRDRIGN 447 + ++ E N++ + G Sbjct: 403 -GNGNHAYIDNLQEANRVLVNEFGA 426 >gi|167719780|ref|ZP_02403016.1| hypothetical protein BpseD_12240 [Burkholderia pseudomallei DM98] Length = 577 Score = 37.3 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 22/55 (40%), Gaps = 1/55 (1%) Query: 21 HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 F ++ A+ M V + G VD+ + L++ A A + + + + Sbjct: 1 SFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQP 54 >gi|85374105|ref|YP_458167.1| hypothetical protein ELI_06390 [Erythrobacter litoralis HTCC2594] gi|84787188|gb|ABC63370.1| hypothetical protein ELI_06390 [Erythrobacter litoralis HTCC2594] Length = 195 Score = 37.3 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 45/120 (37%), Gaps = 4/120 (3%) Query: 13 KLIKSCTGH----FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +L+ G F ++ L+ ++LGV + ++ S + ++QAA+ + + + + Sbjct: 8 RLLDDKRGVTIVEFGMVAPTLIVLLLGVFDITYNMYSSSMLQGTVQQAARNSALEGADSM 67 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 Q+ ++ + + E ++F D I D + + + Sbjct: 68 AQNAAVETAVRQVAPNATFTHKRIAYTTFTEVGSAESFDDVNDDGICADGELFEDANGNG 127 >gi|210623514|ref|ZP_03293859.1| hypothetical protein CLOHIR_01809 [Clostridium hiranonis DSM 13275] gi|210153572|gb|EEA84578.1| hypothetical protein CLOHIR_01809 [Clostridium hiranonis DSM 13275] Length = 381 Score = 37.3 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 15/167 (8%), Positives = 43/167 (25%), Gaps = 8/167 (4%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 I+ + + + +E S+ M + + F + + + Sbjct: 41 VPIKNEVLVKGIKDKTAQESLNSMSIADAMIYIMGIDSTFKYNEEYDKFIEALAKGLDLD 100 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + ++ P D + + + + KK + Sbjct: 101 LKAYMGYMSNKYFEIGEYTDSLIYMKSLVNKYPEDVNGLYHYAIICQQIAEQYKKDEDAK 160 Query: 276 DALASVIRSIKK------IDNV--NDTVRMGATFFNDRVISDPSFSW 314 ++ ++ K ID ++G ++N +W Sbjct: 161 AVNDFLLDALDKLERVINIDPNFGLAYYQLGYHYYNQDQYIKSKLTW 207 >gi|116622792|ref|YP_824948.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225954|gb|ABJ84663.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 360 Score = 37.3 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 34/102 (33%), Gaps = 23/102 (22%) Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF------------------- 405 + +K +V+L+DG +T + A+ + ++ F Sbjct: 214 QKGRKALVILSDGVDTGSKVSLSSAVESAQRADTLVYSVLFEDREAYGVPGFGGMGRGRG 273 Query: 406 --SVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRD 443 T + L ++ FFE + L KI+++ Sbjct: 274 GRRPMPTAPANGKKVLERISTETGGRFFEVSKKEPLEKIYQE 315 >gi|146302265|ref|YP_001196856.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146156683|gb|ABQ07537.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] Length = 2588 Score = 37.3 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 19/130 (14%), Positives = 40/130 (30%), Gaps = 12/130 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +DA + + K + N R+ ++ ++ + I Sbjct: 88 FKTSMDYAKDAALAFLNQ-AKANPQN---RIAIVAYSTTASLKIGLTYLNATGVTQITNQ 143 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 ST I + + + ++ + + + I+LLTDG G Sbjct: 144 I-NALQATNSTNIYAGIVRSETELETNG-------RFDCSTARAIILLTDGVTNVTGTSG 195 Query: 387 IAICNKAKSQ 396 CN +K+ Sbjct: 196 NTNCNVSKTS 205 >gi|60682855|ref|YP_212999.1| hypothetical protein BF3393 [Bacteroides fragilis NCTC 9343] gi|60494289|emb|CAH09084.1| conserved exported hypothetical protein [Bacteroides fragilis NCTC 9343] Length = 610 Score = 37.3 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 71/205 (34%), Gaps = 23/205 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + +D S + LV+ +L ++ +++ D V G N Sbjct: 242 PTDNLPASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKDKVAIVTYAG----NAG 297 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V + + K+ I + GSTA + + AY I N + Sbjct: 298 VKLEATPGSDKQKIREAIDE-----LEASGSTAGGEGIMLAY-KIAQKNFISGGNNR--- 348 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ ++E + + + GI + + + + + K + Sbjct: 349 -----IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEK- 402 Query: 423 ASPNSFFEANSTHELNKIFRDRIGN 447 + ++ E N++ + G Sbjct: 403 -GNGNHAYIDNLQEANRVLVNEFGA 426 >gi|167399327|ref|ZP_02304851.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167051831|gb|EDR63239.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] Length = 253 Score = 37.3 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 71/215 (33%), Gaps = 38/215 (17%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT---- 300 P ++F+D + + I + + +I ID +T+ + Sbjct: 34 NEIPTKTRDQAKNFLDKTYKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILD 93 Query: 301 -FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 FF + + S ++ + ++ G T + + + S + Sbjct: 94 PFFCLKETNAKSLNFDPN---SKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNN-- 148 Query: 360 RMKNNLEAKKYIVLLTDGENT-------QDNEEGI----------AICNKAKSQGIRIMT 402 K +++L+DG++ D + GI +C K K GI+++ Sbjct: 149 ---------KLMIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVF 199 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I + +C +F+ A + HEL Sbjct: 200 IGIGYVPD--NNIIDWEKDCVGTGNFYLAKNAHEL 232 >gi|170732854|ref|YP_001764801.1| membrane protein [Burkholderia cenocepacia MC0-3] gi|169816096|gb|ACA90679.1| membrane protein [Burkholderia cenocepacia MC0-3] Length = 608 Score = 37.3 bits (84), Expect = 5.6, Method: Composition-based stats. Identities = 18/163 (11%), Positives = 47/163 (28%), Gaps = 5/163 (3%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G F ++ A++ ++ + V + +++ A A + A + + + ++ A Sbjct: 18 GAFSVM-AIIATLIAITTLGAIGVGNLFFQRRDVQRIADMAALAAVQRMDDACSQPTATA 76 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 ++ N E + T+ + V + Sbjct: 77 TSNAQSNGLDASNGDTINIECGRWDTSVNPAPSYYAAATSGTTQLNAAKVVVTRQVPFFF 136 Query: 140 LLNPLSL----FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + P ++ RS I ++ + A S Sbjct: 137 VGPPQTVSAVSTARSTNIDTFSVGATLAALGGVGCSGGSAPTS 179 >gi|262165253|ref|ZP_06032990.1| protein BatA [Vibrio mimicus VM223] gi|262024969|gb|EEY43637.1| protein BatA [Vibrio mimicus VM223] Length = 335 Score = 37.3 bits (84), Expect = 5.6, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 64/199 (32%), Gaps = 26/199 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S E ++ + + + L + ++ D R G F D Sbjct: 103 DLSGSMEEKDFATEAGEQLSRLTAAKKVLRDFVTQ-RQGD------RFGLILFGDAAFIQ 155 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + E M T + DA+ S Sbjct: 156 TPFT----ADQDVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQSPSTSQ--------- 202 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCAS- 424 + +++LTDG +T + A ++GIRI IA + +Q ++ +S Sbjct: 203 DQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVSSL 262 Query: 425 -PNSFFEANSTHELNKIFR 442 F A +LN+ ++ Sbjct: 263 TQARSFVAIDQPQLNEAYQ 281 >gi|323350757|ref|ZP_08086417.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] gi|322123037|gb|EFX94736.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] Length = 458 Score = 37.3 bits (84), Expect = 5.6, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 81/236 (34%), Gaps = 29/236 (12%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 +Y +S Y + ++ + GK GI Sbjct: 133 TEGAYQDNRLISYNLT-------GKYPDTNSKLGIDTAISALNTKQVFSKVAKGKKGI-- 183 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR-HVIKKKHLVRDALASVIRSI 285 Y + + D S S + ++ ++ + + ++RD +I + Sbjct: 184 --AIAYRTDPIQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINEL 241 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + NV+ + +T + + + + TI ++ +++ G T D ++ Sbjct: 242 QSVGNVSVNLTTFSTTGSYKQAAFSQLD----REAGTIKESIKNLKSDGGVTNPGDGLRY 297 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAICNKAKSQGIR 399 ++ + H KY+VLLTDG N++G A + K +GI+ Sbjct: 298 GMVSLQ-----KQHAQL------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQ 342 >gi|298370193|ref|ZP_06981509.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281653|gb|EFI23142.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1071 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 28/67 (41%), Gaps = 4/67 (5%) Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF---RDRIGNEIF 450 + Q ++ TI F + + Y + + + +F A S +L + F D I +E Sbjct: 275 RDQLVQTFTIGFGRDISS-AGRNYLTNGASRDDYYFSAESEDDLYRAFDTITDSIKDESQ 333 Query: 451 ERVIRIT 457 VI T Sbjct: 334 NVVIETT 340 >gi|221193687|gb|ACM07875.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 896 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 44/310 (14%), Positives = 101/310 (32%), Gaps = 42/310 (13%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +++ + ++ E+Y + + K+++ V++ + + +E + Y Sbjct: 121 TIQNSDDKKSIIEQRQEELDEQYPLTGAYEDTKESYNLEHVKNSIPNGKLEAKA-VNPYS 179 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWL---IQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 ++ LS + + I+ +++ ++ +K+ + + +V+D S Sbjct: 180 SEGEHIREIQEGTLSKRISEVNDLDHNKYKIELTVSGKSIIKTINKDEPLDVVFVLDNSN 239 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + ++++ + + + +K N + + ++ Sbjct: 240 SMKNNGKNNKAKKAG---EAVETIIKDVLGANVENRAALVTYGSDIFDGRTVK--VIKGF 294 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 DP E + + KK + A +I+ I K Sbjct: 295 KEDPYHGLETSFTVQTNDYSYKKFT---NIAADIIKKIPKEAPEAK-------------- 337 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 WG L T K D +++G T A A + S Sbjct: 338 ------WGGTSLGLTPEKKREYDLSKVGETFTMKAFMEADTLLSSIQRKS---------- 381 Query: 368 KKYIVLLTDG 377 +K IV LTDG Sbjct: 382 RKIIVHLTDG 391 >gi|75812639|ref|YP_320257.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] gi|75705395|gb|ABA25068.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] Length = 405 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 48/135 (35%), Gaps = 9/135 (6%) Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN--LEAKKYIVLLTDGENTQDN 383 + ST + + ++ A + ++++ + K++ + I+LL+DG + N Sbjct: 165 DYLASLTPCASTNLYEPLKKAVKFLANTSDSRFYLPKDSPQTPPRLSIILLSDGYHNFAN 224 Query: 384 --EEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCA----SPNSFFEANSTHE 436 ++ ++ K I + T+ + + Q + + EA + Sbjct: 225 EAQDFQSLTTLLKRNTNIIVHTLGYGLTPEQLAQKYKLGKPATRADINSGKVPEAEFVDQ 284 Query: 437 LNKIFRDRIGNEIFE 451 +I I E Sbjct: 285 QRLAEIAKITGGIAE 299 >gi|260467412|ref|ZP_05813583.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259028808|gb|EEW30113.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 354 Score = 37.3 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 12/86 (13%), Positives = 26/86 (30%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +S +G + L++ ++ G VD +QA A+++ + S Sbjct: 8 FARSRSGSMMPLFFLMLVPIISAVGFSVDYTSAVQTRSNQQQALDAALLSITTMDTTSTL 67 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFE 99 A + +F Sbjct: 68 AQRQAALQDSFIANGGQGTATLNSFV 93 >gi|324996174|gb|EGC28084.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK678] Length = 458 Score = 37.3 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 81/236 (34%), Gaps = 29/236 (12%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 +Y +S Y + ++ + GK GI Sbjct: 133 TEGAYQDNRLISYNLT-------GKYPDTNSKLGIDTAISALNTKQVFSKVAKGKKGI-- 183 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR-HVIKKKHLVRDALASVIRSI 285 Y + + D S S + ++ ++ + + ++RD +I + Sbjct: 184 --AIAYRTDPIQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINEL 241 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + NV+ + +T + + + + TI ++ +++ G T D ++ Sbjct: 242 QSVGNVSVNLTTFSTTGSYKQAAFSQLD----REAGTIKESIKNLKSDGGVTNPGDGLRY 297 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAICNKAKSQGIR 399 ++ + H KY+VLLTDG N++G A + K +GI+ Sbjct: 298 GMVSLQ-----KQHAQL------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQ 342 >gi|326430405|gb|EGD75975.1| hypothetical protein PTSG_00683 [Salpingoeca sp. ATCC 50818] Length = 762 Score = 36.9 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 57/165 (34%), Gaps = 11/165 (6%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A V S++ +D D +R+ F+ + +I V++F + Sbjct: 201 AKNFVADSVELMDVDPDVIRVAGMMFHANPLPQFDFDFSFDRDVIADAVRSFVYPTDRNW 260 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TA A+ ++ + N A + +TDG + + + + Sbjct: 261 GTATGAALNYIRKYLLVPS------AGNRDPADTIVYFITDGNSQEALSFVQDAADNIHA 314 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 G R++ I + + Q + S SP+ +L+++ Sbjct: 315 TGARVVAIGIT-DAIDQSQLEIIAS---SPDDVIIVEDFADLDEV 355 >gi|262402640|ref|ZP_06079201.1| protein BatA [Vibrio sp. RC586] gi|262351422|gb|EEZ00555.1| protein BatA [Vibrio sp. RC586] Length = 335 Score = 36.9 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 64/199 (32%), Gaps = 26/199 (13%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S E ++ + + + L + ++ D R G F D Sbjct: 103 DLSGSMEEKDFATESGEQLSRLTAAKKVLRDFVTQ-RQGD------RFGLILFGDAAFIQ 155 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 F+ + E M T + DA+ S Sbjct: 156 TPFT----ADQEVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQSPATSQ--------- 202 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCAS- 424 + +++LTDG +T + A ++GIRI IA + +Q ++ +S Sbjct: 203 DQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVSSL 262 Query: 425 -PNSFFEANSTHELNKIFR 442 F A +LN+ ++ Sbjct: 263 TQARSFVAIDQPQLNEAYQ 281 >gi|281349175|gb|EFB24759.1| hypothetical protein PANDA_006693 [Ailuropoda melanoleuca] Length = 3437 Score = 36.9 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 38/283 (13%), Positives = 90/283 (31%), Gaps = 13/283 (4%) Query: 35 GVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 G+ V+ SY + Q + + ++S+ SS+ + ++ + Sbjct: 1285 MTAGIFVEETTESYKRKTENEDNQCTDASRNTHNLESVGSDSSKNDTVYIHEEENSFPCI 1344 Query: 95 -IRNFENNLKKNFTDREVRDIVRD----TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 R+ + L F I T +E+ + + V S++ N + ++ Sbjct: 1345 DQRSIDLKLSSQFMKEGNTQIKEGLSDLTCLEVAKAEETFHVNTSNKQQFTANTMGQSIK 1404 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF-GQPA 208 + I T A + KE + ++D + + S+ G Sbjct: 1405 DLDIFDVSFHT---ASGKNIRVSKESLNKVINLLDQKWTEEELNNFSDSLNSELLSGTDI 1461 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 ++T S + +D+ + ++C K+ L P Sbjct: 1462 NKTDISSHEKTENTERKDKIMKESDLTCTKNKLPTLQQRPESEIKKITAPTMLGFHTASG 1521 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 KK + +++ + +K + + + G T F+ + Sbjct: 1522 KKIEIAKES----LDKVKNLFDEKEQDYSGITNFSHQEAKMSK 1560 >gi|301765702|ref|XP_002918269.1| PREDICTED: breast cancer type 2 susceptibility protein homolog [Ailuropoda melanoleuca] Length = 3459 Score = 36.9 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 38/283 (13%), Positives = 90/283 (31%), Gaps = 13/283 (4%) Query: 35 GVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 G+ V+ SY + Q + + ++S+ SS+ + ++ + Sbjct: 1307 MTAGIFVEETTESYKRKTENEDNQCTDASRNTHNLESVGSDSSKNDTVYIHEEENSFPCI 1366 Query: 95 -IRNFENNLKKNFTDREVRDIVRD----TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 R+ + L F I T +E+ + + V S++ N + ++ Sbjct: 1367 DQRSIDLKLSSQFMKEGNTQIKEGLSDLTCLEVAKAEETFHVNTSNKQQFTANTMGQSIK 1426 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF-GQPA 208 + I T A + KE + ++D + + S+ G Sbjct: 1427 DLDIFDVSFHT---ASGKNIRVSKESLNKVINLLDQKWTEEELNNFSDSLNSELLSGTDI 1483 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 ++T S + +D+ + ++C K+ L P Sbjct: 1484 NKTDISSHEKTENTERKDKIMKESDLTCTKNKLPTLQQRPESEIKKITAPTMLGFHTASG 1543 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 KK + +++ + +K + + + G T F+ + Sbjct: 1544 KKIEIAKES----LDKVKNLFDEKEQDYSGITNFSHQEAKMSK 1582 >gi|123509108|ref|XP_001329794.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121912842|gb|EAY17659.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 694 Score = 36.9 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 40/336 (11%), Positives = 103/336 (30%), Gaps = 28/336 (8%) Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN-PRKSAYQVVLSS 135 + K + +++ + K++ + I ++ ++++ ++ V S Sbjct: 59 AEEKQKASETYNEMKTNNKTALLVSQKQDRLSINLCAIPPNSDIDISFTMYTSLPTVFSP 118 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 +L N +SL L T +E + + ++++ I D Sbjct: 119 SNELFFNRVSLPLTLFPRYKLTPNTGSEQAPETVIGSTTYKFNLKFTIPKDSKFETKMED 178 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY-PGPLDPSLS 254 + + T + K + ++++ N + + Y S Sbjct: 179 YSIEGNVMTLKTIPTTDFNVDVFLNKNPVSVKEITGNTQVVNFKINPLNYLSNRKTDVKS 238 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN-DTVRMGATFFNDRVISDPSFS 313 +D S + + A+ + S++ V + VR G+ FN + Sbjct: 239 IVFLLDCSGSMTIDNRIENAIKAMDLFLHSLE--PGVKFEIVRFGS-TFNSLFD--FKLT 293 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + T + +G T I + ++ Y+ + + + Sbjct: 294 EYNDDSLNTALAFIKGTSANLGGTEIFNPIKQIYNELSPD----------------VLFV 337 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 LTDG + A+ + + +I ++ Sbjct: 338 LTDGA----VDNSQAVLDFVRDSSTKIFSLGLGAGA 369 >gi|53714874|ref|YP_100866.1| putative outer membrane protein [Bacteroides fragilis YCH46] gi|52217739|dbj|BAD50332.1| putative outer membrane protein [Bacteroides fragilis YCH46] Length = 610 Score = 36.9 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 69/205 (33%), Gaps = 23/205 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P P+ + +D S + LV+ +L ++ +++ D V G N Sbjct: 242 PTDNLPASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKDKVAIVTYAG----NAG 297 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + K+ I + GSTA + + AY I N + Sbjct: 298 EKLASTPGSDKQKIREAIDE-----LEASGSTAGGEGIMLAY-KIAQKNFISGGNNR--- 348 Query: 366 EAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 I+L TDG+ ++E + + + GI + + + + + K + Sbjct: 349 -----IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLSEK- 402 Query: 423 ASPNSFFEANSTHELNKIFRDRIGN 447 + ++ E N++ + G Sbjct: 403 -GNGNHAYIDNLQEANRVLVNEFGA 426 >gi|120553803|ref|YP_958154.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Marinobacter aquaeolei VT8] gi|120323652|gb|ABM17967.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter aquaeolei VT8] Length = 1056 Score = 36.9 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 43/298 (14%), Positives = 86/298 (28%), Gaps = 59/298 (19%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 ++D R+ +S++ N D + + N S Sbjct: 104 LIDRIDAVGELKSYRMAYKRGRSWRSFNENNNASRDVDCEADNDVSGVNWSNI----TAH 159 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S + +++ SS+ + +++D + ++ V +G FN Sbjct: 160 TYVSGNYRNWLSSSTETVRKTRMEIMQDVAKRLADTVTG-------VNIGLMAFNQSQNG 212 Query: 309 DPS-FSWGVHKLIRTIV--KTFAIDENEMGSTAINDAMQTAYDTIISSNE---------- 355 + V + K G T +++ + A Sbjct: 213 EGGRVLNNVSNVKDNASAFKAKVDGLYPSGQTPLSETLFGAMRYFQGGKPFLDRNPVSGT 272 Query: 356 -DEVHRMKNNLEAK---KYIVLLTDGENTQDNEEGI-------AIC-----NKAK----- 394 D + K+ +E + ++LLTDG T D C ++ Sbjct: 273 VDGSNNYKSPIELECQANNVILLTDGAPTSDTNHNSFIGSAIGKTCSGNCLDEIAGYMAT 332 Query: 395 ---------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 Q I+ T+ FS++ A A ++ AN+ +L F D Sbjct: 333 NDMSAAFSGDQTIKTYTVGFSIDDPLLGAAAT-----AGGGEYYVANNAQQLADAFDD 385 >gi|221133174|ref|XP_002171310.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 221 Score = 36.9 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 46/123 (37%), Gaps = 10/123 (8%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ R + + + +T I+ A++ A + +S Sbjct: 55 RVGVITFSYRAKLSVKL--NSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTSA- 111 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+LLTDG T D E+ I ++ ++ G+ I+ + + E Sbjct: 112 -----NGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETE 166 Query: 414 KAR 416 + Sbjct: 167 LSH 169 >gi|77457690|ref|YP_347195.1| VCBS [Pseudomonas fluorescens Pf0-1] gi|77381693|gb|ABA73206.1| putative secreted protein, hemolysin [Pseudomonas fluorescens Pf0-1] Length = 2887 Score = 36.9 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 46/409 (11%), Positives = 110/409 (26%), Gaps = 59/409 (14%) Query: 44 VRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLK 103 V S + + A AI+ + ++ + N + Sbjct: 1839 VNISVLGNDISGADGAAIVVGVRAGGNTATSAIGGLNSNINGNYGYLTLDAAGNAVYHSN 1898 Query: 104 KN------------FTDREVRDIVRDTAVEMNPR------KSAYQVVLSSRY-DLLLNPL 144 N +T R+ T + +N + V + + DL + Sbjct: 1899 PNSVSPPGATDTFTYTIRDSDGDESTTTITVNVADSKLVASTDQDVTVYEKALDLTQDGQ 1958 Query: 145 SLFLRSM-GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 L ++ G A V ++ V + + Q +++G Sbjct: 1959 DLAPGTVTGSDPGNTGETATGTLVGSVSGGSGAITYTLVGSATGTYGQIQLNADGTYTYT 2018 Query: 204 FGQPADRTVKSY-------------------SSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 T + +S + + P V+ +S+ + Sbjct: 2019 LTSAPKTTPNANDGPNTLSESFTYKATDALGNSTTSTIVVNIVDDVPKAVASERSVAAVE 2078 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + L + +S + + L + A+++++ + + V++ F+ Sbjct: 2079 IDSNILIVLDISGSMADASGVPGLSRLELAKQAISALLDKYDDLGD----VKVQLVTFSS 2134 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 S W + KT + G T + A+ T Y+ +S + + Sbjct: 2135 NATDRTSV-W----VDVATAKTLLAGLSAGGGTNYDAAVATMYNAFNTSGKLTGAQNVGY 2189 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKT 410 +DG+ + + + I+ I + Sbjct: 2190 --------FFSDGKPNEGDIGTADEATLKAFLDANNIKNYAIGLGSGVS 2230 >gi|163751746|ref|ZP_02158964.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] gi|161328398|gb|EDP99557.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] Length = 447 Score = 36.9 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 48/139 (34%), Gaps = 20/139 (14%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 F +S + G ++ + + +L V + +D + L+ A ++ + A+ Sbjct: 12 SFKQFSLAKLGKQGGAILVMFTIGLFSLLAVAALALDGGHLLLNKGRLQNAVDSSALYAA 71 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L +R + L +N E ++ +T+V+++ Sbjct: 72 KILQNGGSLFDAREAATLI-----------------LMQNLGFEENSEL--NTSVDLSSP 112 Query: 126 KS-AYQVVLSSRYDLLLNP 143 A QV + + L P Sbjct: 113 DYNATQVTANIFIEFSLWP 131 >gi|149181776|ref|ZP_01860267.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] gi|148850517|gb|EDL64676.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] Length = 949 Score = 36.9 bits (83), Expect = 7.6, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 63/211 (29%), Gaps = 33/211 (15%) Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 +P F T + G D +++P + N+ + D S S + Sbjct: 29 KPSVEFSVQPSATEYVKPANGDAQGRLDIEVTPKGQATNEERKPIDVVFVHDTSGSMKDS 88 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 ++ ++ ++++L ++ + D + + + G+ Sbjct: 89 F--GGVKKATSAENALKESLRFFNQNQQSKDKYFFVPFDSDVSYKNYGDKRIQPAEGLSD 146 Query: 319 L----------IRTIVKTFAIDE----------NEMGSTAINDAMQTAYDTIISSNEDEV 358 + VK ++ +G T +++ A + Sbjct: 147 ILPMAEHLDFSEAYWVKKYSWYYGYYWSQEIFDFSVGGTNYTQSLEYALSKFSGMRDS-- 204 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 K+YI+ LTDGE T N + Sbjct: 205 ---------KRYIIFLTDGEPTSLNHDNKQY 226 >gi|291166457|gb|EFE28503.1| hypothetical protein HMPREF0389_00418 [Filifactor alocis ATCC 35896] Length = 637 Score = 36.9 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 41/291 (14%), Positives = 90/291 (30%), Gaps = 32/291 (10%) Query: 110 EVRDIVRDTAVEMNPRKSA-YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 E + +T + ++ S Y V + L N K + + E S Sbjct: 127 ESKLKEEETLITISDGGSEHYFVQKIIAFSDLSNGKEFLYDLRFFKPTIKEGGGSRENNS 186 Query: 169 RSYHKEHGVS--------IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 +Y + +S + S+ Q G P + A RT Y + G Sbjct: 187 LNYEMKLTLSDGTSQTIDYGTIESKLSSLNALQVVDRGSPSDPATAIAYRTDNVYLKKEG 246 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 G+ ++ + S++ S + + + + L++ + Sbjct: 247 AHGV-----ISLVLDNSGSMHTRDLKDSHGNKESRINILKVETGK-------LLKLLSTN 294 Query: 281 VIRSIKKIDNVNDTV----RMGAT----FFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 ++ + N+ + R G F++ + +K Sbjct: 295 KAADVELVPFDNNVLVRSDRKGGYIKPTFYSASKEYREKII--GSNVYEGKLKESMDSLG 352 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 T + ++ A+ +I N D + R + + Y+++L DGE+ Sbjct: 353 AYSGTNTGEGLRYAFYSIDEKNNDLL-RERPEEHFRDYLIILVDGESNAAT 402 >gi|29826729|ref|NP_821363.1| hypothetical protein SAV_189 [Streptomyces avermitilis MA-4680] gi|29603825|dbj|BAC67898.1| putative membrane protein [Streptomyces avermitilis MA-4680] Length = 142 Score = 36.9 bits (83), Expect = 7.8, Method: Composition-based stats. Identities = 25/126 (19%), Positives = 47/126 (37%), Gaps = 11/126 (8%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA--IITASVPLIQ 70 + G I A++ P +L ++ V W Y + A A +TA+ Sbjct: 15 RRWGDDRGDTSIQMAIIFPFVLIATVAVIQVSMWYYAR---QIALTAAREGLTAARAYES 71 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + +++A+ ++ +R + + N + VR V TA+ M P Q Sbjct: 72 GPADGAAQAREVL----GRVAGDSLRGYSVSASSN--GQRVRVQVSGTAMSMIPGVPGLQ 125 Query: 131 VVLSSR 136 V S+ Sbjct: 126 VTQSAS 131 >gi|258515700|ref|YP_003191922.1| hypothetical protein Dtox_2493 [Desulfotomaculum acetoxidans DSM 771] gi|257779405|gb|ACV63299.1| hypothetical protein Dtox_2493 [Desulfotomaculum acetoxidans DSM 771] Length = 140 Score = 36.9 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 47/140 (33%), Gaps = 8/140 (5%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI-----QSLEEVSSRAKNSFTFPK 87 ML + +D R +H L+ A A + S + V+++ Sbjct: 1 MLMFSALGLDYGRAYVLKHQLQAACDAASLAGSSAVSAKLITDGTGSVTNKKLLLDPIIA 60 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS--AYQVVLSSRYD-LLLNPL 144 + + +++K + D +A++ + AY+ ++++ + P+ Sbjct: 61 EARATDVWNQNVSSMKFIDKGVTIVDTSNHSALDEDSDGYLDAYKWGVTAKIQSYIAGPI 120 Query: 145 SLFLRSMGIKSWLIQTKAEA 164 S + + I + Sbjct: 121 SGMGNHITVTRVAISKAKDT 140 >gi|320354173|ref|YP_004195512.1| Na-Ca exchanger/integrin-beta4 [Desulfobulbus propionicus DSM 2032] gi|320122675|gb|ADW18221.1| Na-Ca exchanger/integrin-beta4 [Desulfobulbus propionicus DSM 2032] Length = 2704 Score = 36.9 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 36/285 (12%), Positives = 93/285 (32%), Gaps = 23/285 (8%) Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL-FLRSMGIKSWLIQTKAEAETVSRSYHKE 174 + + + S+ +V+ + + N + G +L+ V + Sbjct: 1438 NSSKIAFKAQTSSATIVVPNAHYYTWNDTDGDGEQEAGEDVYLVTIPGSGYGVG-----D 1492 Query: 175 HGVSIQWVID-FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + + D S +++D +E + + +Q I + YM Sbjct: 1493 YSLHYYLFTDADSDNVVDNGELTEKTGAEVPTSVIPKRYDAAGNQ-----ITNASQLAYM 1547 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 V + + ++ Y + + + ++ V H + +++ + + D+ Sbjct: 1548 VRQDFADWFSFYRKRMLTTKAAIGLT-VEGMKGVELGLHTINRSVSEPLVEMTTADSAEK 1606 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID-----ENEMGSTAINDAMQTAYD 348 + +N R G++++ + K + D T +A Sbjct: 1607 VAFLSTI-YNIRASGSTPLRRGLYEVGKYFEKGTSGDYSGLLTTSGLGTNQGSTCTSADT 1665 Query: 349 TII---SSNEDEVHRMKNNLEAKK-YIVLLTDGENTQDNEEGIAI 389 ++ N+D + E ++ Y+V +TDG +D + I Sbjct: 1666 SVFWDADKNDDLDTCDDSGGECQRAYVVAMTDGYYNEDFYSIVNI 1710 >gi|309364927|emb|CAP23537.2| CBR-CLEC-143 protein [Caenorhabditis briggsae AF16] Length = 666 Score = 36.9 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 62/180 (34%), Gaps = 15/180 (8%) Query: 274 VRDALASVIRSIKKIDNVN-----DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 VR+ L V+ SI KI V + +G ++D + K + Sbjct: 330 VRNTLTQVLGSISKIGPVKYPADPRSTCVGIVTYDDNATTQSQL--DASKSFSDLYNVIQ 387 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +T + + A + +D +R K I D + + + Sbjct: 388 SSLISVDNTNTS-YLSLALLAAEKALKDGRNRTYRFNYKKVIIAFAADYQGHGTALDAMP 446 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH--ELNKIFRDRIG 446 I N+ K + I+T+A + N +Q + ASP F + +L F + G Sbjct: 447 IANRLKDNAVTIITVACTSNSDKQTA----IQGIASPG-FDLVDEMDTPKLPTAFAQKSG 501 >gi|218778178|ref|YP_002429496.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759562|gb|ACL02028.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 480 Score = 36.9 bits (83), Expect = 8.0, Method: Composition-based stats. Identities = 19/178 (10%), Positives = 53/178 (29%), Gaps = 23/178 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A+ ++ ++ D R +++ V + Sbjct: 109 KVRDAKAAVKGLVEGLRSQD------RFSLVTYSNSVNGGDGL--------HYLTADKRN 154 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N M A V R + ++L++DG+ Q + + Sbjct: 155 SLNWM--VDSIPAGGGTNLGGGLEKGVGVLRAYGAPDRMGKVILISDGQANQGVTDPNQL 212 Query: 390 CNKA--KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRD 443 A + G+ + ++ + Q+ ++ A ++ + + +F++ Sbjct: 213 AAMAALRDDGL-VYSV--TTVGIGQDFNEQLMATVADGGRGRYYYLENPGDFLAVFQE 267 >gi|327471789|gb|EGF17230.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK408] Length = 464 Score = 36.5 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 64/222 (28%), Gaps = 29/222 (13%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 +Y +S Y + ++ + GK GI Sbjct: 132 TEGAYQDNRLISYNLT-------GKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGIAL 184 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRS 284 Y + + D S S ++ ++ + K + ++D ++R Sbjct: 185 A----YRNDPIEGQMNVAISFVFDKSGSMSWDLNGNNTNYWGPKSRMSILQDKATIMMRD 240 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +K I NV+ + + + G + +I G T D ++ Sbjct: 241 LKDIGNVSVNLVSFSILGSYVQKDFSELDKGTTTIEASI-----NALQTGGVTNPGDGLR 295 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 ++ N+ KY+VLLTDG + Sbjct: 296 YGMMSLQ-----------NHSAQLKYVVLLTDGIPNAYTVDT 326 >gi|295093780|emb|CBK82871.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 549 Score = 36.5 bits (82), Expect = 8.2, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 44/121 (36%), Gaps = 13/121 (10%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N N+ V G ++ V + + R+ + + G TA DA+ A Sbjct: 405 NDNNYV--GLVSYSSSVTVEVPIA-QFDLNQRSYFQGSVNNLMASGGTASYDAVVVAMKM 461 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 I + + +AK + LL+DG N + + I + I + TI + + Sbjct: 462 ITDAKA-------EHPDAKCMLFLLSDGYANVGYSMDEITSA--LRQSNIPVYTIGYGGD 512 Query: 409 K 409 Sbjct: 513 A 513 >gi|313204752|ref|YP_004043409.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312444068|gb|ADQ80424.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 626 Score = 36.5 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 54/377 (14%), Positives = 114/377 (30%), Gaps = 33/377 (8%) Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S + T ++ +EE ++ + + + V D V + S++ V+ Sbjct: 89 KSEYIDVALTPDQKALEEVVVVGYASQRMISTVGAISVADVADVKVSNYNQPSSFMPVMI 148 Query: 135 SRYDLLLNPLSLFL--RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 R + R + K + T + + + +++ + D Sbjct: 149 KRVQPDAEEYGSYKENRFLSAKEQALSTFSLDVDAASYGNMRRMINM-----GQKPPKDA 203 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 R E + P S + + K+ P P+ Sbjct: 204 IRVEELINYFSYDYPKPTGKDPVSINTETSICPWDATHRLVKIGVKAR---EIPSENLPA 260 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + +D S V K LV+ ++ + +++K D V V GA Sbjct: 261 SNFVFLLDVSGSMDVPNKLELVKSSIKLLTNNLRKTDRVAIVVYAGAAGVVLE------- 313 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + + GSTA +Q AY + + + IV Sbjct: 314 --STEGTDKQKIMEAVDGLHAGGSTAGGAGIQLAYKIAEKNFIENGNNR---------IV 362 Query: 373 LLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 L TDG+ N E ++ + G+ + + + + + K + + Sbjct: 363 LCTDGDFNVGVSSNNELESLIESKRKTGVYLTVLGYGMGNYKDNKLQILAEK--GNGNHA 420 Query: 430 EANSTHELNKIFRDRIG 446 ++ E NK+ + G Sbjct: 421 YIDNIQEANKVLVNEFG 437 >gi|164454843|dbj|BAF96966.1| serum opacity factor [Streptococcus suis] Length = 1058 Score = 36.5 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 46/316 (14%), Positives = 100/316 (31%), Gaps = 39/316 (12%) Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 +E+ + + + + + + + E S+ L +P +L Sbjct: 43 LEKTAETDSASPIMPTAAAEDAKVVQANENKEGEVVDSSSISALPKSNAESASPEALTNE 102 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + + + + + +E V + E + + Sbjct: 103 NASEPTGQVAENTSSSEEKATEKEERAVQY--------------VEKEVDDYSTKVEKPT 148 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 TV S + K + + ++ + P +D VD+S Sbjct: 149 ETVASTVDKTSKDLFTVSRTAEVGQDGVVTVTTQIVPKEIDKGAEIVVLVDTSKKMD--- 205 Query: 270 KKHLVRDALASVIRSIKKID------NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + A A+++ +KK+ N ++VR+ FN ++ + + T+ Sbjct: 206 -VEAKKTAKANIVELVKKMTEPNDVYNSRNSVRV--IGFNRKLSEAQEVT--SANVETTV 260 Query: 324 VKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 K F E + A+ A + S N + +K+IVLL+ GE+T Sbjct: 261 EKLFTDAEQNYNWGVDMQGAIHEARRILESENSGK----------RKHIVLLSQGESTFS 310 Query: 383 NEEGIAICNKAKSQGI 398 + A + K + I Sbjct: 311 YDLTDAAKTETKYKTI 326 >gi|87124032|ref|ZP_01079882.1| hypothetical protein RS9917_10491 [Synechococcus sp. RS9917] gi|86168601|gb|EAQ69858.1| hypothetical protein RS9917_10491 [Synechococcus sp. RS9917] Length = 254 Score = 36.5 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 53/209 (25%), Gaps = 6/209 (2%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 A A++ + + ++ T P + L + + I Sbjct: 13 SAPLGALLPGTTGVALGTPALAQAPAMPSTAPSTSLTAAQANEAARTLLEAIKGKNGATI 72 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 + + S V L NP R I + T EA V + E Sbjct: 73 YNGLSDPLRNSTSIEAVQ----QRLNSNPRVSSYRISEISRGMDDTTVEAFAVVETRKGE 128 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + V+D S ++ ++ P+ + + + + E Sbjct: 129 VPL--LLVLDDSGKLVAWKWVGTTLPIEETALKFVNDLNAGRWIAARYYLDLEFQQELTP 186 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 K + L + V S Sbjct: 187 QDLKRKWTKLKRTLGGVKRVKSALVASQG 215 >gi|87310695|ref|ZP_01092823.1| BatB [Blastopirellula marina DSM 3645] gi|87286676|gb|EAQ78582.1| BatB [Blastopirellula marina DSM 3645] Length = 364 Score = 36.5 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 39/112 (34%), Gaps = 15/112 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ + + + G + + DA+ A I Sbjct: 131 RVGLIAFAGETRQSVPLT-SHYEDFKQTLDSVGPHTVRSGGSRLGDAIAAASKGFIDKTY 189 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFS 406 D K IV+ TDGE+ + + + + +GIR+ T+ Sbjct: 190 DH-----------KAIVVFTDGEDQES--KPVEVAKSLYAERGIRVFTVGLG 228 >gi|167624203|ref|YP_001674497.1| hypothetical protein Shal_2279 [Shewanella halifaxensis HAW-EB4] gi|167354225|gb|ABZ76838.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4] Length = 446 Score = 36.5 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 22/198 (11%), Positives = 63/198 (31%), Gaps = 5/198 (2%) Query: 5 TKFIFYSKKLIKS---CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 +K +Y + + S +G ++ + + ++ + +D + L+ A A Sbjct: 2 SKSHYYVSRGLSSIGRQSGAILVMFTIGIFAVIAFAALALDGGHMLLSKGRLQNAVDAAA 61 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + A+ L + + +R ++T Q + ++ + D + VE Sbjct: 62 LNAAKELQEGATLLEAREA-AYTILLQNLSFTENGELNTSVSLSSPDFNNTQVTPRLQVE 120 Query: 122 MNPRKSAYQ-VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + ++ + + ++ L + + A V+ V+ Sbjct: 121 FSELPDPFNPILAEGSEYVRVRVENVKLSNFLADILNFDKEIRASAVAGRSQDLACVNKI 180 Query: 181 WVIDFSRSMLDYQRDSEG 198 + D+ G Sbjct: 181 LPLLVCGKEGSTAEDNYG 198 >gi|254784280|ref|YP_003071708.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237684173|gb|ACR11437.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 593 Score = 36.5 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 50/162 (30%), Gaps = 17/162 (10%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++ F+ + ++ ++ L I ST + AM+ A + + Sbjct: 281 QVAIYSFDSEITLLTDYTSDINLLEAAIDTIPNSVLERGNSTNLLGAMEIAAERWNDQID 340 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGI--RIMTIAFSVNKTQQ 412 + Y VLLTDGE+ D+ I + G ++ IA N Sbjct: 341 LIA-------VERGYAVLLTDGEHNFDSRSPADIEADLTNFFGTRKKVYAIAVGNNVN-- 391 Query: 413 EKARYFLSNCASPNSFFEA---NSTHELNKIFRDRIGNEIFE 451 + AS S EL +F + E Sbjct: 392 --LENLEAITASSEQVLTVNSFESAEELEAVFTEVATTEAKA 431 >gi|319782171|ref|YP_004141647.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168059|gb|ADV11597.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 509 Score = 36.5 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 15/130 (11%), Positives = 44/130 (33%), Gaps = 2/130 (1%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL--I 69 + G I+ ++ +P ++G + +D+ R + + L++ A + + A+ L Sbjct: 6 RAFWNDQRGIALILVSITLPAIIGFSLLAIDMSRINNLHNDLQKGADSFALAAAAELDGS 65 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + RA + + + R + ++ P Sbjct: 66 SGAWVRAERAMDVLVRNESQFSTAGPRILLAGQPGGTQRCNSAGSISWCFLKAIPAADGI 125 Query: 130 QVVLSSRYDL 139 Q+ ++++ Sbjct: 126 QITVANQATY 135 >gi|254296485|ref|ZP_04963941.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157806387|gb|EDO83557.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] Length = 418 Score = 36.5 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G I+ AL++ V++G G+ +D+ + L+ + Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|126457247|ref|YP_001076991.1| hypothetical protein BURPS1106A_A2962 [Burkholderia pseudomallei 1106a] gi|242311800|ref|ZP_04810817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|254192526|ref|ZP_04898965.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|126231015|gb|ABN94428.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|169649284|gb|EDS81977.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|242135039|gb|EES21442.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 418 Score = 36.5 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G I+ AL++ V++G G+ +D+ + L+ + Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|254187133|ref|ZP_04893648.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254263081|ref|ZP_04953946.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|157934816|gb|EDO90486.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254214083|gb|EET03468.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 418 Score = 36.5 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G I+ AL++ V++G G+ +D+ + L+ + Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|153806292|ref|ZP_01958960.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] gi|149130969|gb|EDM22175.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] Length = 342 Score = 36.5 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 47/139 (33%), Gaps = 18/139 (12%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A + R + ++DN +G F + + + +++ + Sbjct: 112 LEKAKRLISRLVDELDNDK----IGMIVFAGDAFTQLPITSDYISA-KMFLESISPSLIS 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI +A+ A + + IV++TDGEN + A A Sbjct: 167 KQGTAIGEAINLAVRSFT-----------PQEGVGRAIVVITDGENHEGGAVEAAK--VA 213 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 +GI++ + + Sbjct: 214 AEKGIQVSVLGVGMPDGAP 232 >gi|254182581|ref|ZP_04889175.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184213116|gb|EDU10159.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 418 Score = 36.5 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G I+ AL++ V++G G+ +D+ + L+ + Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|167842450|ref|ZP_02469134.1| hypothetical protein Bpse38_37650 [Burkholderia thailandensis MSMB43] Length = 418 Score = 36.5 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G I+ AL++ V++G G+ +D+ + L+ + Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|170584326|ref|XP_001896952.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158595641|gb|EDP34180.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 945 Score = 36.5 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 46/122 (37%), Gaps = 13/122 (10%) Query: 285 IKKIDNV--NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 I++I N+ + + A F+ + + + I+ + G+T+ A Sbjct: 1 IEQIPPSAFNERIHVAAISFSSNAQINFQ--FNEFNSQKEILNALLSFIHTGGNTSSVSA 58 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + A I + + ++ IVL++DG + E+ + ++ + I Sbjct: 59 VNLAIKEI---------QERGRENVRRMIVLISDGYSQDRWEDLLDASDRLHAINAIIYA 109 Query: 403 IA 404 I+ Sbjct: 110 IS 111 >gi|126442905|ref|YP_001064078.1| hypothetical protein BURPS668_A3087 [Burkholderia pseudomallei 668] gi|126222396|gb|ABN85901.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 418 Score = 36.5 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G I+ AL++ V++G G+ +D+ + L+ + Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|154250681|ref|YP_001411505.1| TadE family protein [Parvibaculum lavamentivorans DS-1] gi|154154631|gb|ABS61848.1| TadE family protein [Parvibaculum lavamentivorans DS-1] Length = 187 Score = 36.5 bits (82), Expect = 9.5, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 27/74 (36%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+ +++C G + AL+ PVM+ ++ + A TA + S Sbjct: 17 KRFLRNCAGIAAVEFALIFPVMIAFYFGSIETTNMLTANRRVTSVAYTAADITAQATSIS 76 Query: 72 LEEVSSRAKNSFTF 85 +++ S Sbjct: 77 NSDLADIFAASSAI 90 >gi|226194158|ref|ZP_03789758.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|225933851|gb|EEH29838.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 418 Score = 36.5 bits (82), Expect = 9.6, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G I+ AL++ V++G G+ +D+ + L+ + Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|227819050|ref|YP_002823021.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] gi|227338049|gb|ACP22268.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] Length = 155 Score = 36.5 bits (82), Expect = 9.6, Method: Composition-based stats. Identities = 10/43 (23%), Positives = 22/43 (51%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK 54 + +K +G +I A+ +PV++G G+ + W + L+ Sbjct: 18 RGWLKDESGTVAVIAAVTLPVLVGAMGLGAETGYWYLKDRKLQ 60 >gi|237507530|ref|ZP_04520245.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] gi|234999735|gb|EEP49159.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] Length = 418 Score = 36.5 bits (82), Expect = 9.8, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G I+ AL++ V++G G+ +D+ + L+ + Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|134281810|ref|ZP_01768517.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246872|gb|EBA46959.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 418 Score = 36.5 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 21/41 (51%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + G I+ AL++ V++G G+ +D+ + L+ + Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|315636668|ref|ZP_07891900.1| conserved hypothetical protein [Arcobacter butzleri JV22] gi|315479050|gb|EFU69751.1| conserved hypothetical protein [Arcobacter butzleri JV22] Length = 1209 Score = 36.5 bits (82), Expect = 10.0, Method: Composition-based stats. Identities = 39/358 (10%), Positives = 93/358 (25%), Gaps = 33/358 (9%) Query: 42 DVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV--SSRAKNSFTFPKQKIEEY-LIRNF 98 + W + + + + T +E + + ++ + + Sbjct: 526 YIFNWQTNNNGMIEGSITDKKGGGTIQYNQMETIVFGDGSYIGEKPQEEAPAQTIFKVDI 585 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW-- 156 L + I+++ S Y+V +S + + Sbjct: 586 SAALTDTDGSETLSVIIKNVPASATLESSKYEVSKNSDGSYTVKVPQGETSISDKLTMKV 645 Query: 157 ------LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 I + EA+ +++ +++ ++ Sbjct: 646 PQEDAKNINLQIEAKATEARDNEDG--------------QNFKTATDSTTDKTPTLVVGS 691 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S + I ++ Y + +E ++ I + Sbjct: 692 NKDSVINGGAGKDILIGDTGGTELNVQAGKNYNIALVVDTSGSMKEASGSKTAWGTTISR 751 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L++DAL ++ S+K D + F+ +F+ K I ++ T Sbjct: 752 IDLLKDALKNLADSLKGHDGKINV---SIIDFDTNAKEPITFNDLTSKNISDLI-TKIDA 807 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T DA + + + LTDG+ T N + Sbjct: 808 LKAEGGTNYEDAFLKTTSWFDTQSVTYGKA----QGYENLTYFLTDGDPTFSNRDTKN 861 >gi|145491133|ref|XP_001431566.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398671|emb|CAK64168.1| unnamed protein product [Paramecium tetraurelia] Length = 636 Score = 36.5 bits (82), Expect = 10.0, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 62/183 (33%), Gaps = 25/183 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +V+ +L +++ + D R+ F++ ++ Sbjct: 176 KIEMVKASLIVLLQFLGDND------RLQLITFDNDAHRLTPLKTVTNQNKSYFT-QIIK 228 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEG-I 387 G I++A + A+ + S + NN+ + + LL+DG + T + I Sbjct: 229 QIKANGGNRISEATKMAFYQLKS------RKYINNVTS---VFLLSDGVDYTYPEVKNQI 279 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 N+ + T F + Q + SF+ L++ F D +G Sbjct: 280 QTVNEV----FTLHTFGFGEDHDAQMMTQLCNLK---SGSFYFVQDVTLLDEFFADALGG 332 Query: 448 EIF 450 I Sbjct: 333 LIS 335 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.308 0.109 0.253 Lambda K H 0.267 0.0330 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 5,922,788,521 Number of Sequences: 14124377 Number of extensions: 202576917 Number of successful extensions: 628404 Number of sequences better than 10.0: 3805 Number of HSP's better than 10.0 without gapping: 1188 Number of HSP's successfully gapped in prelim test: 2879 Number of HSP's that attempted gapping in prelim test: 623852 Number of HSP's gapped (non-prelim): 5794 length of query: 458 length of database: 4,842,793,630 effective HSP length: 143 effective length of query: 315 effective length of database: 2,823,007,719 effective search space: 889247431485 effective search space used: 889247431485 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 82 (36.5 bits)